in reply to Convert wav file to letters

That's what I thought. Be warned, voice recognition is hard and to be honest, Perl wouldn't be my first choice of language for doing it in.

Some of the questions you need to resolve are:

  • Are you trying to match one voice (that matches your training set) or many voices (with or without matching training sets)?
  • Individually spoken characters are not going to match occurances of those letters in continuous speech.

    Think about the different sounds that the letter 'c' has in "concession", or the 't's in 'traction'. Every letter in the alphabet has multiple sounds depending upon the word it is in, where in the word it is, the accent of the speaker (US ba-th -v- UK bar-th; US too-na -v- UK (ch)tu-na etc).

  • Most VRS uses syllables or phonemes because many words have similar sounds spelt differently. Eg. The 'shun' sounds at the end of 'comprehension', 'composition' etc.

    Unless your in for the long haul of deep research, you probably should look at existing solutions and libraries rather than rtying to stat from scratch.


    Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
    Lingua non convalesco, consenesco et abolesco. -- Rule 1 has a caveat! -- Who broke the cabal?
    "Science is about questioning the status quo. Questioning authority".
    The "good enough" maybe good enough for the now, and perfection maybe unobtainable, but that should not preclude us from striving for perfection, when time, circumstance or desire allow.
  • Replies are listed 'Best First'.
    Re^2: Convert wav file to letters
    by Anonymous Monk on Jul 22, 2005 at 23:19 UTC
      Thanks. My problem should be much easier. The "speech" are not words, they are simply letters, i.e., someone is spelling out the words. And there is only one voice, and we can even assume the person is very consistent. Thanks.