How to use Watson Speech to Text utilities to increase accuracy - Artificial Intelligence

– Learn how to use Watson Speech to Text utilities to increase your accuracy

– We’ve included links so you can download S2T utilities

– Sample .wav files and Python code are also included

I thought I would take a moment to play with Watson Speech to Text and a utility that was released a few months ago.

The Speech to Text Utils allows you to train S2T using your existing conversational system. To give a quick demo, I got my son to ask about buying a puppy.

I set up some quick Python code to print out the results from both the broadband and narrowband models, then tested the audio with it.
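A minimal sketch of that kind of script, assuming the service's REST `/v1/recognize` endpoint with basic-auth service credentials. The URL, credentials and helper names here are placeholders for illustration, not the original listing:

```python
import base64
import json
import urllib.request

# Assumed values: replace with your own service URL and credentials.
S2T_URL = "https://stream.watsonplatform.net/speech-to-text/api"
USERNAME = "your-username"
PASSWORD = "your-password"


def best_transcript(response):
    """Join the top alternative of each result in a recognize response."""
    parts = [r["alternatives"][0]["transcript"].strip()
             for r in response.get("results", []) if r.get("alternatives")]
    return " ".join(parts)


def recognize(wav_path, model):
    """POST a WAV file to /v1/recognize and return the parsed JSON reply."""
    with open(wav_path, "rb") as f:
        audio = f.read()
    token = base64.b64encode(f"{USERNAME}:{PASSWORD}".encode()).decode()
    request = urllib.request.Request(
        f"{S2T_URL}/v1/recognize?model={model}",
        data=audio,  # supplying a body makes this a POST
        headers={"Content-Type": "audio/wav",
                 "Authorization": f"Basic {token}"},
    )
    with urllib.request.urlopen(request) as reply:
        return json.load(reply)


def print_results(wav_path):
    """Print one line per base model for the same recording."""
    for label, model in (("Broadband", "en-US_BroadbandModel"),
                         ("Narrowband", "en-US_NarrowbandModel")):
        print(f"{label}: {best_transcript(recognize(wav_path, model))}")
```

Calling `print_results("puppy.wav")` (a hypothetical file name) would then print one `Broadband:` line and one `Narrowband:` line for the recording.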

Of course, the recording is crystal clear, which is why the result is so good. So I added some ambient noise from SoundJay to the background, and now it sounds like it was recorded in a subway.

Running the code above again gets these results.

Broadband: Greg it appropriate
Narrowband: can I get a phone

The reason for asking about a puppy is that I have a sample conversation system about buying a dog. Using that conversation file, I did the following.

2. Before you begin you need to set up the connection to your S2T service (using service credentials).

It will prompt you for the username and password.

3. Once that was set up, I told it to create a customization.

Each customization has to be mapped to a particular base model. For testing, I created one against en-US_NarrowbandModel and one against en-US_BroadbandModel.

4. Once it had run, I got the IDs for the customizations.

Once I had the IDs, I tried the audio again:

Broadband: can I get a puppy
Narrowband: can I get a phone

So the broadband model now works. For narrowband, the audio quality is likely too poor to work with. There are also more specialised language models for children, built by others, to cope with this.
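Applying a customization comes down to one extra parameter on the recognition request, and the ID has to belong to the same base model you are calling. A small sketch of building that request URL, assuming the REST API's `language_customization_id` query parameter (older releases of the service called it `customization_id`); the base URL and the ID are placeholders:

```python
from urllib.parse import urlencode


def recognize_url(base_url, model, customization_id=None):
    """Build a /v1/recognize URL, optionally applying a custom language model."""
    params = {"model": model}
    if customization_id:
        # Current API name; older releases called this `customization_id`.
        params["language_customization_id"] = customization_id
    return f"{base_url}/v1/recognize?{urlencode(params)}"


# Placeholder ID: use the one returned when the customization was created.
url = recognize_url(
    "https://stream.watsonplatform.net/speech-to-text/api",
    "en-US_BroadbandModel",
    customization_id="YOUR-BROADBAND-CUSTOMIZATION-ID",
)
print(url)
```

Leaving `customization_id` out gives the plain base-model URL, which makes it easy to run the same audio with and without the customization.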

This is just one example, using one phrase. For real testing, you should test the whole model. In a demonstration from the development team, the utility increased an S2T model's accuracy from around 50% to over 80%.
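One common way to score a whole test set is word accuracy, that is, 1 minus the word error rate. The sketch below is an illustration of that metric and not necessarily how the figures above were measured; the function names are my own:

```python
def word_errors(reference, hypothesis):
    """Word-level edit distance (substitutions + insertions + deletions)."""
    ref, hyp = reference.lower().split(), hypothesis.lower().split()
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # match or substitution
        previous = current
    return previous[-1], len(ref)


def word_accuracy(pairs):
    """1 - WER over (reference, hypothesis) pairs; can drop below zero."""
    errors = total = 0
    for reference, hypothesis in pairs:
        e, n = word_errors(reference, hypothesis)
        errors += e
        total += n
    if total == 0:
        raise ValueError("no reference words to score")
    return 1 - errors / total
```

For the single phrase in this post, `word_accuracy([("can I get a puppy", "can I get a phone")])` scores one wrong word out of five. A real test would feed in every utterance in the test set, with and without the customization.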

Interested in trying this out for yourself? Try Watson Speech to Text for free with our 30-day trial.

(Note: This content was previously published on the author’s blog and is reposted here with the author’s permission.)
