[Geocoding] geocoding misspellings

Stewart C. Russell scruss at gmail.com
Wed May 29 11:55:22 UTC 2013


On 13-05-29 04:32 AM, Michal Palenik wrote:
> 
> what would be the easiest option to connect misspelled names to their
> properlyspelled counterparts?

How are your programming skills? The classic way of doing this is using
an approximate string match (or "fuzzy match") using the Levenshtein or
Damerau-Levenshtein methods. There are modules to do this for many
scripting languages (like Text::Fuzzy in Perl). There is also the
command line tool 'agrep' which does the same thing.

I'd recommend you manually check the results. I know it's slow, but
there's no way to get this perfectly right automatically.

cheers,
 Stewart




More information about the Geocoding mailing list