[Geocoding] [OpenStreetMap] #4827: dividing of street names (handling of composita)

OpenStreetMap trac at noreply.openstreetmap.org
Sat Dec 28 10:55:19 UTC 2013


#4827: dividing of street names (handling of composita)
--------------------------+-------------------------
  Reporter:  jotpe        |      Owner:  geocoding@…
      Type:  enhancement  |     Status:  new
  Priority:  minor        |  Milestone:
 Component:  nominatim    |    Version:
Resolution:               |   Keywords:
--------------------------+-------------------------

Comment (by lonvia):

 This could even be done during indexing in SQL by simply adding an
 unhyphenated version to the search terms, and it would be less of a hack
 there.
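
 A minimal sketch of that idea, written in Python for readability rather
 than in SQL as suggested. The function name search_terms is hypothetical
 and not part of Nominatim, and it assumes that "unhyphenated" means the
 hyphens are deleted so the parts are joined:

```python
def search_terms(name: str) -> set[str]:
    """Return the lower-cased name plus an unhyphenated variant.

    Assumption: "unhyphenated" means deleting the hyphens, so that
    the parts of the name are joined into a single word.
    """
    lowered = name.lower()
    terms = {lowered}
    if "-" in lowered:
        # Add the joined form as an additional term; keep the original.
        terms.add(lowered.replace("-", ""))
    return terms
```

 For example, search_terms("Karl-Marx-Allee") yields both
 "karl-marx-allee" and "karlmarxallee", while a name without hyphens
 yields only itself.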

 I don't see too much of an issue with reducing hyphens (1), but I'm not
 sure about spaces. It is simply too difficult to distinguish composite-like
 words (e.g. Freiberger Weg) from true multi-word names (e.g. Auf dem Berg),
 and doing so would introduce a lot of bad search terms. They probably
 wouldn't do much harm to searching itself, but we already have issues with
 DB indexes over the search terms growing too large, so the fewer
 unnecessary terms the better.

 (1) Thinking a bit further, it might even be a good idea to always remove
 hyphens and full stops from the complete word while still adding the
 composita parts as partial words.
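
 The footnote could be sketched roughly as follows. This is an
 illustration only, not Nominatim's tokenizer: the name tokenize and the
 return shape are assumptions, and the input is assumed to be a single
 token without spaces, since spaces are the unresolved case above.

```python
import re


def tokenize(token: str) -> tuple[str, list[str]]:
    """Split one space-free name token into a full word and partial words.

    The full word has all hyphens and full stops removed. The partial
    words are the pieces between those characters (hypothetical scheme).
    """
    lowered = token.lower()
    # Complete word: always drop hyphens and full stops.
    full_word = re.sub(r"[-.]", "", lowered)
    # Composita parts: split on the same characters, skipping empty pieces.
    parts = [p for p in re.split(r"[-.]", lowered) if p]
    # A token with a single part contributes no extra partial words.
    partial_words = parts if len(parts) > 1 else []
    return full_word, partial_words
```

 With this sketch, tokenize("St.-Anna-Weg") gives the full word
 "stannaweg" and the partial words "st", "anna" and "weg".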

-- 
Ticket URL: <https://trac.openstreetmap.org/ticket/4827#comment:3>
OpenStreetMap <http://www.openstreetmap.org/>
OpenStreetMap is a free editable map of the whole world


