in reply to Re: Merge/Purge address data
in thread Merge/Purge address data
And you should see what they do in Thailand!
Not to mention how a naive person from a developing country might write any type of address ...
I believe I need to limit the problem space to US addresses, rather than solve the problem for the global address space ...
But as far as I can tell, a large enough training set would enable a Bayesian or fuzzy solution to distinguish 'twixt Dutch and German 'Drs.'
PS. How uncanny! My younger sister has a linguistics doctorate, her first language is German, her second Dutch!
|
|---|