I've dabbled with a few of the commercial VXML products out there (like Nuance, VoiceGenie, etc) and you just can't really do what you're looking for. It's called "speech transcribing", and some third-party vendors offer plugins for the voice servers to provide this capability, but even then it really sucks.
One workaround though (which also sucks) is to build a grammar and VXML document that queries for each letter, one after another, until the user says something like "done".
In reply to Re: How to make a form in VoiceXML (VXML) with input of the equivalent of input type="text"
by LukeyBoy
in thread How to make a form in VoiceXML (VXML) with input of the equivalent of <input type="text">
by ice6200
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |