Wowza Community

Is it possible to accomplish speech recognition in live streaming using speech-to-text services from Google or Azure?

Speech-to-text services from Google or Azure appears to support only from microphone and the file format as input stream.

So,I’m curious about is there a way to acheive that?

1 Like

Did you find any solution connect wowza SDK with azure cognitive-services-speech SDK?
First step would be to extract the audio stream from the live stream. What features does wowza provide to handle / redirect the audio feed in parallel to the default transcoding process?

I can see that the stage of creating by home means is beginning…
I know it is possible to create a wowza module that will automate the conversion of audio to text /closed captions/.
I don’t understand why the wowza team concentrated their efforts on Wowza Video.
I believe that such an audio-to-text conversion module would attract new WSE users. It would retain current WSE users.
Speech-to-text services from Google works great with open captioning.
Third-party programmers can handle English but other languages are much worse for them.

Hello. I’m former Wowza now working independently.
I’ve been working on a couple speech to text implementations for WSE, although not yet Google. If you are interested in such a module, I am to build it on a contract basis.

Reach out to

1 Like

… and yet interest is emerging. and well. You know the point.
Scott, you’ll get it done faster than Wowza will be interested in such a solution.
Greetings to you