[Geocoding] [OpenStreetMap] #4895: In the search for street names in Spanish, stop words should be eliminated.
OpenStreetMap
trac at noreply.openstreetmap.org
Wed Jul 10 19:08:43 UTC 2013
#4895: In the search for street names in Spanish, stop words should be eliminated.
--------------------------------------------+-------------------------
Reporter: Emilio Gomez | Owner: geocoding@…
Type: defect | Status: new
Priority: major | Milestone:
Component: nominatim | Version:
Keywords: geocoding, spanish, stop words |
--------------------------------------------+-------------------------
The Royal Spanish Academy indicates that the preposition "'''de'''" (of,
in English) should never be omitted in the names of streets, avenues and
promenades, unless the name is an adjective: "Calle '''de''' Esproceda",
"Plaza '''de''' Colón", "Avenida '''de''' América", "Paseo '''de'''
Gracia", in the first case, and "Calle Mayor" or "Plaza Nueva" for the
second case.
But the right way to name a street is put the preposition '''de''' (of)
after of the type of road (examples: Calle '''de''' Alcalá, Avenida
'''de''' Pérez Galdos, Plaza '''de''' España, etc.), it's very common skip
it when contracting the name. This peculiarity must be taken into account
by search engines.
Nominatim not currently have this in mind, which makes the search engine
not shows a lot of streets that actually exist in OpenStreetMap. Examples:
*
"[http://nominatim.openstreetmap.org/search.php?q=Calle+Hern%C3%A1n+Cort%C3%A9s%2C+Santander%2C+Spain&viewbox=-3.81%2C43.47%2C-3.79%2C43.46
Calle de Hernán Cortés, Santander, Spain]" -> No search results found.
*
"[http://nominatim.openstreetmap.org/search.php?q=Calle+de+Hern%C3%A1n+Cort%C3%A9s%2C+Santander%2C+Spain&viewbox=-151.18%2C61.15%2C151.18%2C-61.15
Calle Hernán Cortés, Santander, Spain]" -> Find the correct street.
What makes also Nominatim unhelpful for geocoding reverse directions in
this language.
Note that in Spanish besides the preposition de, another usual construct
is to use '''el''', '''la''', '''los''' or '''las''' (all translate to
'''the''') just after de: Carretera '''del''' faro, Calle '''de los'''
Caídos '''de la''' División Azul, Calle de las Descalzas. Note that "'''de
el'''" contracts itself to "'''del'''" in most cases.
In [http://snowball.tartarus.org/algorithms/spanish/stop.txt this link]
there is a list of stop words that should be ignored in searches, which
include the prepositions above.
--
Ticket URL: <https://trac.openstreetmap.org/ticket/4895>
OpenStreetMap <http://www.openstreetmap.org/>
OpenStreetMap is a free editable map of the whole world
More information about the Geocoding
mailing list