That's what I thought. Be warned, voice recognition is hard and to be honest, Perl wouldn't be my first choice of language for doing it in.

Some of the questions you need to resolve are:

  • Are you trying to match one voice (that matches your training set) or many voices (with or without matching training sets)?
  • Individually spoken characters are not going to match occurances of those letters in continuous speech.

    Think about the different sounds that the letter 'c' has in "concession", or the 't's in 'traction'. Every letter in the alphabet has multiple sounds depending upon the word it is in, where in the word it is, the accent of the speaker (US ba-th -v- UK bar-th; US too-na -v- UK (ch)tu-na etc).

  • Most VRS uses syllables or phonemes because many words have similar sounds spelt differently. Eg. The 'shun' sounds at the end of 'comprehension', 'composition' etc.

    Unless your in for the long haul of deep research, you probably should look at existing solutions and libraries rather than rtying to stat from scratch.


    Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
    Lingua non convalesco, consenesco et abolesco. -- Rule 1 has a caveat! -- Who broke the cabal?
    "Science is about questioning the status quo. Questioning authority".
    The "good enough" maybe good enough for the now, and perfection maybe unobtainable, but that should not preclude us from striving for perfection, when time, circumstance or desire allow.

    In reply to Re: Convert wav file to letters by BrowserUk
    in thread Convert wav file to letters by Anonymous Monk

    Title:
    Use:  <p> text here (a paragraph) </p>
    and:  <code> code here </code>
    to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.