in reply to Re: De Duping Street Addresses Fuzzily
in thread De Duping Street Addresses Fuzzily

Ditto for the suggestion of using Lingua::EN::AddressParse. Even if the high level methods don't suit your exact problem space, you may find the back-end code and normalisations useful in putting your data in a more consistent format before sorting/processing. It also handles a number of special cases that occur in the real world.
  • Comment on Re^2: De Duping Street Addresses Fuzzily