Hello, here is my problem. I have written a script that looks for duplicate strings in 3 different files and outputs which stings are duplicated and in which files. This worked great until I was tasked with modifying the script to search for an unknown number of files and report on those as well. Below is the current script:

use File::Copy; use File::Find; ### Parse numerical characters and trailing white-space from rmtdb.lrl + for matching. copy("rmtdb.lrl" , "rmtdb.tmp") or die "rmtdb.lrl file cannot be copied: $!\n"; system "cat rmtdb.tmp | cut -d ' ' -f1 > rmtdb.tmp1"; ### Open File Handles. open comdb, "< dblist.comdbg" or die "Cannot open connection to dblist.comdb: $!\n"; open varldb, "< dblist.varldb" or die "Cannot open connection to dblist.varldb: $!\n"; open rmtdb, "< rmtdb.tmp1" or die "Cannot open connection to rmtdb.lrl: $!\n"; ### Create Lists @comdb = <comdb>; @varldb = <varldb>; @rmtdb = <rmtdb>; ### Close File Handles. close comdb; close varldb; close rmtdb; ### Case-shift rmtdb to lowercase. foreach (@rmtdb) {s/$_/\L$_/gi;} ### Begin matching. foreach $db (@comdb) # comdb against varldb. { @result = grep /^\Q$db\E$/i , @varldb; push(@com2var , @result); } foreach $db (@comdb) # comdb against rmtdb. { @result = grep /^\Q$db\E$/i , @rmtdb; push(@com2rmt , @result); } foreach $db (@varldb) # varldb against rmtdb. { @result = grep /^\Q$db\E$/i , @rmtdb; push(@var2rmt , @result); } ### Sort matches for final output. foreach (@com2var) { chomp($_); $hash1{$_}="dblist.comdbg dblist.varldb"; } foreach (@com2rmt) { chomp($_); if (exists $hash1{$_}) { $hash1{$_}="dblist.comdbg dblist.varldb rmtdb.lrl"; } else { $hash1{$_}="dblist.comdbg rmtdb.lrl"; } } foreach (@var2rmt) { chomp($_); if (! exists $hash1{$_}) { $hash1{$_}="dblist.varldb rmtdb.lrl"; } } ### Final Output. print "\n"; foreach (keys %hash1) { print "$_ is duplicated in: $hash1{$_}\n"; } print "\n"; ### Cleanup unlink "rmtdb.tmp", "rmtdb.tmp1"; exit 0;

Now there can be up to 20 rmt(*)db.lrl files in a given directory. I've figured out how to find the files, but I'm having trouble with the matching afterwards.


In reply to String matching by jwesley

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.