in reply to Re^2: redundancy Checker
in thread redundancy Checker
How do you determine redundant schools?
I had to import data from a system, that might've had the 'University of Louisville Speed School' as 'UL' 'U of L' 'U Louisville' 'Univ. Louisville', 'Speed School', etc.
If you're looking for exact string duplicates, it's fairly easy to just in SQL, assuming we're looking for duplicated entries of field1, field2:
SELECT COUNT(*) AS duplicates, field1, field2 FROM some_table GROUP BY field1, field2 HAVING duplicates > 2
Then you know which records to bother looking at, rather than having to go through the whole table.
|
|---|