Chad C has asked for the wisdom of the Perl Monks concerning the following question:
I am hoping you guys can help me out for school I have to create a Perl script that takes out text from one file (a transcript) then try's to find the same word in a dictionary file that contains the word and and its pronunciation spelling and then writes the word with the pronunciation spelling into a new file. Below is an example.
YES NOW YOU KNOW IF IF EVERYBODY LIKE IN AUGUST WHEN EVERYBODY'S ON VACATION OR SOMETHING WE CAN DRESS A LITTLE MORE CASUAL OR
It would have to find each word in this text in a dictionary file such as this
YERKEY Y ER1 K IY0
YERMAN Y ER1 M AH0 N
YERXA Y ER1 K S AH0
YES Y EH1 S
YESES Y EH1 S IH0 Z
and report back all the terms with the pronunciation.
This is is a continuing of last semesters work I found a script one person created but when I run it its not outputting at all to the new file its only printing the first section on my terminal screen. I have never done perl scripting before and am trying to learn on the fly as the teacher won't help us out. Anything you guys could do would be immensely appreciated. Thanks
#!/usr/bin/perl if( $#ARGV != 2 ) { print "Compares the list of words in a file to the words in a dict +ionary and outputs the words available with pronunciations\n"; print "perl GenerateDictionary WordFile DictionaryFile OutputFile\ +n"; exit; } open( WORD_FILE, "$ARGV[0]" ); open( DICT_FILE, "$ARGV[1]" ); open( OUTP_FILE, ">$ARGV[2]" ); @theDICT = <DICT_FILE>; close( DICT_FILE ); while( <WORD_FILE> ) { my($line) = $_; chomp($line); foreach( @theDICT ) { $tmpLine = $_; @items = split( / /, $tmpLine ); if( @items[0] eq $line ) { print $line."\t".$tmpLine; print OUTP_FILE $tmpLine; } } } close( WORD_FILE ); close( OUTP_FILE ); exit;
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: Matching Text
by NetWallah (Canon) on Apr 03, 2012 at 04:23 UTC | |
by jwkrahn (Abbot) on Apr 03, 2012 at 05:11 UTC | |
by NetWallah (Canon) on Apr 03, 2012 at 05:23 UTC | |
by Chad C (Initiate) on Apr 03, 2012 at 16:22 UTC | |
by Chad C (Initiate) on Apr 09, 2012 at 23:09 UTC | |
by Chad C (Initiate) on Apr 10, 2012 at 19:28 UTC |