since i wasn’t happy with the results of the SpeechRecognizer i shared over at VL.Speech i went on to try the azure one. this one is a cloud-service you need to get an api-key for. it does computation in the cloud and indeed works much better, even without any more noticable latency. also the full patch necessary, to get quick results is quite…manageable:
also have a look at the other available cognitive services sdks (for vision, search, language and knowledge) that should all be similarly easy to access as the above. and then don’t forget to show us what you find…
hey looks really promising!
vl beginner question here
not sure why it doesn’t work…no error shown in vl
key, region and microphone input looks good
any hint on how to debug it!
thx
yep i try both keys… on win 8.1 and win 10 machines…
from a fresh new vvvv alpha install installing the Microsoft.Cognitive.Service.Speech
nuget 1.3.0
no luck so far!
Hallo! Sorry for bringing this topic back on top but it’s still opened, relies on the demo patch provided, and an issue already explained before.
So, same as @circuitb , I’m also having trouble using your demo patch @joreg .
No issue reported from Gamma 2021.4.6.
My subscription key and region are correctly copied from Azure Dashboard.
When I bang Start, I can see that vvvv is using my microphone thanks to the mic icon in Windows system tray. When I bang Stop IOBox the icon disappears. So looks like this is correctly working.
Nevertheless when I’m talking, I get no text output.
If I try to use the Azure Speech-To-Text service from the command line as explained from Microsoft documentation (so without using VL), then it’s working perfectly.
Coming back here because I’ve found out why the patch was actually not working and this might be useful for future people. Apparently Windows decided to change my default microphone for no reason… Thing is, I thought it was the correct one used by VL because it was the correct one used by other services requiring microphone (Zoom, Google Meet, etc.). I don’t exactly understand why Windows changed my setting all of a sudden, but I don’t want to try understanding that, haha.
As I was using another Create (SpeechRecognizer) with AudioConfig as a second input pin + FromDefaultMicrophoneInput node connected to it, I was expected that my default microphone input would be used. But since Windows changed my default mic, then obviously that couldn’t work.
But indeed you are totally right, I should have created a CaptureDevice node from the Audio category to check if the correct input device was used. I will do that for now on!