[Geocoding] Namefinder future
David Earl
david at frankieandshadow.com
Mon Dec 14 11:18:15 GMT 2009
There's a mini discussion going on in one of the trac bug reports for
namefinder at the moment re "name not found"[1]. It's probably better to
have the wider discussion here.
The position with Namefinder is unchanged: the index is a year out of
date. I tried many times over a couple of months in the summer to reload
the whole planet, but the task is now so much larger than when I first
did it that a reload takes weeks, which is completely impracticable. On
the last occasion something in the system seems to have restarted after
the load had been running for some weeks.
It is clear that a major amount of work would have to be done to make
Namefinder practical again. I think it would probably work again if the
index could be reloaded, as daily updates aren't any more onerous than
they used to be. There are a couple of serious but solvable problems.
First, the updates must be kept from interfering with the searches. The
most obvious fix, which Tom suggested, slows the whole update process
down so much that it wasn't practical, but CycleStreets has a pragmatic
solution: swapping database files over to make an update available.
Second, searches on more than a few words are also problematic, and
switching to Postgres might help there, though who knows what other
problems that might introduce.
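For illustration only, the file-swap idea might look something like the
sketch below. This is not CycleStreets' actual code, which isn't
described in this message. It assumes the index lives in a single file;
publish_index and build_index are hypothetical names. The new index is
built beside the live one and then renamed over it, so a search never
sees a half-written file.

```python
import os
import tempfile

def publish_index(build_index, live_path):
    """Build a fresh index next to live_path, then swap it into place."""
    directory = os.path.dirname(os.path.abspath(live_path))
    # Build in the same directory so the final rename stays on one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".new")
    try:
        with os.fdopen(fd, "wb") as f:
            build_index(f)        # hypothetical callback that writes the new index
            f.flush()
            os.fsync(f.fileno())  # make sure the data is on disk before the swap
        # os.replace overwrites live_path in a single rename step.
        os.replace(tmp_path, live_path)
    except BaseException:
        # On any failure, leave the live index untouched and clean up.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

On POSIX systems a search that already has the old file open should
carry on reading the old data until it reopens the file. On Windows the
rename can fail while another process holds the live file open.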
Anyway, there is a lot to do and I am not motivated to do it. If it was
a matter of tweaking or just sitting there while it reloaded the index,
fine. But it is apparent that isn't going to work, and I can't see the
point in me investing the necessary time in Namefinder when Nominatim is
also doing a good job. No one else has expressed an interest in working
on Namefinder and the only development response has been to start from
scratch on a different project. I'd much rather be in a position where
we are co-operatively working together to produce a better collective
result than in competition with Brian, and there are so many other
things I can and am doing that Namefinder is never going to bubble to
the top if I can't see a good reason for it.
I don't know what the code status of Nominatim is. Is it in svn? Is it
GPL or similar? There doesn't seem to be a wiki page describing it or
how it works. Brian, would you welcome other contributions or is it
something you want to keep to yourself?
There are some things Namefinder does better still: looser matches
(though it could go much further), contextualising of the result (though
the OSM home page largely suppresses this); but some things it doesn't
even attempt (addresses); and its syntax for contextual searches needs
to be removed. But if it doesn't have a current index, none of
that has any worth.
I think we should now remove Namefinder from the OSM search results, as
its incomplete index is clearly unhelpful and counter-productive. We
can then free up the machine it is running on as well, maybe make it
available for Nominatim or other things.
Or if someone else wants to get into Namefinder and help improve it,
that is also a possible way forward.
But as things stand, it isn't sustainable and I'm just not motivated to
do the necessary work to make it so.
David
[1] http://trac.openstreetmap.org/ticket/1843