in reply to Re^3: Analysing a (binary) string.
in thread Analysing a (binary) string. (Solved)
Not sure how it compares to the frequency analysis.
In the generic, that might be worth pursuing though my (fairly extensive) experience of fuzzy matching techniques is that they are always slow.
For the specific case of my data, the frequency analysis proved so simple and efficient that it wasn't even worth timing it. My first attempt at the code worked first time and found all the reps that were there to be found in just a few seconds. There was no reason (for me) to pursue this further.
If my dataset had not proven to be so amenable to the frequency analysis method -- near perfect inverse log distribution -- I might still be looking for another method, but I have plenty of other nuts to crack with this particular dataset :)
|
|---|