in reply to Merge/Purge address data

The ultimate solution sounds quite complex, you may find it helpful to use one of these CPAN modules in processing your data as part of a bigger solution: http://search.cpan.org/~kimryan/
Lingua::EN::AddressParse
Lingua::EN::NameParse