in reply to Efficient Fuzzy Matching Of An Address

Some comments:
  1. The general term for what you are doing overall is called record linkage.
  2. The particular phase you are trying optimize might be called candidate record pruning.
  3. I learned a lot by reading the docs for FEBRL. They used Markov Modelling to prune the search space. Also there are some good approximate matching methods there not implemented in pure Perl (such as Jaro-Winkler).
  • Comment on Re: Efficient Fuzzy Matching Of An Address

Replies are listed 'Best First'.
Re^2: Efficient Fuzzy Matching Of An Address
by Anonymous Monk on Apr 25, 2009 at 13:50 UTC
    Look at www.matchlogics.com