Can you please clarify your second requirement? The relation "two mismatches" is not transitive, e.g. there are two mismatches between AAAA and AAGG and two mismatches between AAGG and TTGG but four mismatches between AAAA and TTGG. Would you consider all three to belong to one cluster?
In reply to Re: Find duplicate based on specific fields while allowing 2 mismatch
by hdb
in thread Find duplicate based on specific fields while allowing 2 mismatch
by amitgsir
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |