[Geocoding] Agenda

David Earl david at frankieandshadow.com
Tue Jul 14 18:57:10 BST 2009


Brian Quinion wrote:
>> ... UK is not a good test case...
> 
> The code handles the obvious variants, streets, saints, etc.
> Undoubtedly there are some that I've missed and the current list would
> need to be extended.  

What about the non-British stuff, like Sankt and the equivalence of 
german β and ss or ϋ and ue or danish Å and Aa for example? Or the way 
in which sometimes people write Potsdammerplatz and others Potsdammer 
Platz (though maybe your algorithm isn't as sensitive to word units as 
mine is). Can you fdeal with user searches for "Potsdammerplatz" when 
the database has "Potsdammerpl" & vv?

Do you deal with object types as well? For example, if a cinema is 
tagged "name=Odeon" will "Odeon cinema" give you a hit, or "cinemas near 
Chelsea"?

> Some of the examples Ed Freyfogle gave are
> possibly beyond what could reasonably be parsed, although it is always
> fun to try.

Indeed. (Though ironically the worst one I thought - the txtspk one, 
what was it, 2ton for Twoton or some such - would be almost trivial to 
do in Namefinder!).

Sometimkes the biggest issue is thinking what might happen. A recent 
request I had was very reasonably for "somewhere railway station" to 
match rather than "somewhere station" (and that of course leads to 
"somewhere train station" and "somewhere rail station" too).

>> Does it deal with updates as well as reloads (which are clearly impractical
>> as a frequent solution)?
> 
> This is the big 'todo' item, although osm2psql includes a framework
> for doing this that means that a lot of the work has already been
> done.  My intention was to try and get something out for people to
> test and then start work on this.

I don't know about your algorithm, but mine is focussed around names, 
and because people move objects, split ways, delete objects a change can 
result in a name not present in the update becoming "visible" to the 
search if a similarly named object is deleted. Deleting a way doesn't 
necessarily mean that street disappears from the search.

David





More information about the Geocoding mailing list