[Geocoding] [OpenStreetMap] #4895: In the search for street names in Spanish, stop words should be eliminated.

OpenStreetMap trac at noreply.openstreetmap.org
Wed Jul 10 19:08:43 UTC 2013


#4895: In the search for street names in Spanish, stop words should be eliminated.
--------------------------------------------+-------------------------
 Reporter:  Emilio Gomez                    |      Owner:  geocoding@…
     Type:  defect                          |     Status:  new
 Priority:  major                           |  Milestone:
Component:  nominatim                       |    Version:
 Keywords:  geocoding, spanish, stop words  |
--------------------------------------------+-------------------------
 The Royal Spanish Academy indicates that the preposition "'''de'''" (of,
 in English) should never be omitted in the names of streets, avenues and
 promenades, unless the name is an adjective: "Calle '''de''' Esproceda",
 "Plaza '''de''' Colón", "Avenida '''de''' América", "Paseo '''de'''
 Gracia", in the first case, and "Calle Mayor" or "Plaza Nueva" for the
 second case.

 But the right way to name a street is put the preposition '''de''' (of)
 after of the type of road (examples: Calle '''de''' Alcalá, Avenida
 '''de''' Pérez Galdos, Plaza '''de''' España, etc.), it's very common skip
 it when contracting the name. This peculiarity must be taken into account
 by search engines.

 Nominatim not currently have this in mind, which makes the search engine
 not shows a lot of streets that actually exist in OpenStreetMap. Examples:

 *
 "[http://nominatim.openstreetmap.org/search.php?q=Calle+Hern%C3%A1n+Cort%C3%A9s%2C+Santander%2C+Spain&viewbox=-3.81%2C43.47%2C-3.79%2C43.46
 Calle de Hernán Cortés, Santander, Spain]" -> No search results found.
 *
 "[http://nominatim.openstreetmap.org/search.php?q=Calle+de+Hern%C3%A1n+Cort%C3%A9s%2C+Santander%2C+Spain&viewbox=-151.18%2C61.15%2C151.18%2C-61.15
 Calle Hernán Cortés, Santander, Spain]" -> Find the correct street.

 What makes also Nominatim unhelpful for geocoding reverse directions in
 this language.

 Note that in Spanish besides the preposition de, another usual construct
 is to use '''el''', '''la''', '''los''' or '''las''' (all translate to
 '''the''') just after de: Carretera '''del''' faro, Calle '''de los'''
 Caídos '''de la''' División Azul, Calle de las Descalzas. Note that "'''de
 el'''" contracts itself to "'''del'''" in most cases.

 In [http://snowball.tartarus.org/algorithms/spanish/stop.txt this link]
 there is a list of stop words that should be ignored in searches, which
 include the prepositions above.

-- 
Ticket URL: <https://trac.openstreetmap.org/ticket/4895>
OpenStreetMap <http://www.openstreetmap.org/>
OpenStreetMap is a free editable map of the whole world



More information about the Geocoding mailing list