[Geocoding] geocoding misspellings
Stewart C. Russell
scruss at gmail.com
Wed May 29 11:55:22 UTC 2013
On 13-05-29 04:32 AM, Michal Palenik wrote:
>
> what would be the easiest option to connect misspelled names to their
> properly spelled counterparts?
How are your programming skills? The classic way of doing this is
approximate string matching (or "fuzzy matching") based on the
Levenshtein or Damerau-Levenshtein edit distance. There are modules to
do this for many scripting languages (like Text::Fuzzy in Perl). There
is also the command line tool 'agrep', which does approximate matching
from the shell.
I'd recommend you manually check the results. I know it's slow, but
there's no way to get this perfectly right automatically.
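
To give a rough idea of what the fuzzy match looks like, here is a
minimal sketch in Python. It assumes you already have a list of
correctly spelled names; the function names, the sample place names and
the cutoff of 2 edits are all made up for illustration, and it uses
plain Levenshtein rather than Damerau-Levenshtein (so a swapped pair of
letters counts as two edits).

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions and
    substitutions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def closest_names(misspelled, known_names, max_distance=2):
    """Return (name, distance) candidates within max_distance,
    nearest first. max_distance=2 is an arbitrary illustrative cutoff."""
    query = misspelled.casefold()
    scored = [(name, levenshtein(query, name.casefold()))
              for name in known_names]
    candidates = [pair for pair in scored if pair[1] <= max_distance]
    return sorted(candidates, key=lambda pair: (pair[1], pair[0]))


if __name__ == "__main__":
    # Hypothetical reference list of correct spellings.
    known = ["Bratislava", "Banská Bystrica", "Košice"]
    for name in ["Bratislva", "Kosice"]:
        print(name, "->", closest_names(name, known))
```

Returning every candidate with its distance, rather than only the single
best one, makes the manual check easier: ties and near-ties are the
cases most worth looking at by hand.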
cheers,
Stewart