The problem with the filenames I seen for mp3's and the like is that everyone tends to classify them differently. The words used may be the same, but the order tends to get switched around. Some classify my the musician surname/first name/album/track, others by any number of permutations of those plus other stuff.
You might get somewhere if you striped non-alphas and spaces, and the used String::Approx,String::Similarity, Text::Levenstien or if speed is a concern Text::LevenstienXS, though I've had trouble getting the latter to compile.
In reply to Re: similar texts !?
by BrowserUk
in thread similar texts !?
by bugsbunny
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |