in reply to Merge/Purge address data
Procecessing duplicate manually entered free-form records is hard to do by hand, and darn near impossible by machine.
It's hard enough parsing names, and streets, sometimes even figuring out what city someone is in can be challenging.
I would being by parsing out the street name and number, and the person's last name. Use those to generate a similarity index, for people at the same address, and flag those with a match exceeding some command-line specified value. Then you can examine those personally, and make a decision.
Mind you, my father was Thomas T. Legrady, and so am I. I occassionally get people who think I've been dead for ten years, but I insist on continuing to breath.
--
TTTATCGGTCGTTATATAGATGTTTGCA
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: Re: Merge/Purge address data
by qq (Hermit) on Nov 11, 2003 at 21:37 UTC |