Can you be more specific as to what you're having trouble with? You know how to find duplicates between files. You found the files of interest. I don't understand "matching afterwards".
Repeating the code three times for each pair of files is bad to begin with, and clearly will not scale to larger groups of files! Use a loop. You don't need separate hashes for everything, and I don't see the point of temp files.
Here's how I would do it: For each file, read each line and hash it. Store the hash (not the whole line) as the key to a master hash, with the value being a list of file names it was seen in. So, for each line, push the current file name onto the value of that line's key.
After going through all the files, iterate through each hash entry and note which ones have more than one item in the value.
In reply to Re: Sting matching
by John M. Dlugosz
in thread String matching
by jwesley
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |