[Geocoding] Local Mirror of OSM Data...
Anthony
osm at inbox.org
Fri Oct 23 16:20:12 BST 2009
On Fri, Oct 23, 2009 at 6:13 AM, Emilie Laffray
<emilie.laffray at gmail.com> wrote:
> 2009/10/23 Peter Childs <pchilds at bcs.org>
>> I'm looking at using my own schema and a SAX parser.
>> Currently our UK Streets data uses a hybrid Soundex, but I am thinking that
>> Double Metaphone may be better. Unfortunately the UK Streets data we have is
>> old and a bit out of date (and has copyright issues), which is why I want to
>> move over to OSM.
>
> OK, if you are interested only in the UK, then you might want to
> concentrate on the UK first.
Is there any way to keep up-to-date on a subset of the world? I can
download Florida from CloudMade
(http://downloads.cloudmade.com/north_america/united_states/florida),
but I haven't found a way to keep it current other than 1)
re-downloading all of Florida periodically or 2) downloading the
changesets for the whole world and filtering them down to Florida.
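
A minimal sketch of option 2, assuming the world changes arrive as osmChange XML (the format used by API 0.6 diffs). The bounding box is a rough guess at Florida's extent and `nodes_in_bbox` is a hypothetical helper name, not part of any existing tool:

```python
import xml.etree.ElementTree as ET

# Rough bounding box for Florida (an assumption, for illustration only):
# (min_lon, min_lat, max_lon, max_lat)
FLORIDA_BBOX = (-87.7, 24.3, -79.8, 31.1)


def nodes_in_bbox(osc_xml, bbox=FLORIDA_BBOX):
    """Yield (action, node_id, version) for each node change inside bbox."""
    min_lon, min_lat, max_lon, max_lat = bbox
    root = ET.fromstring(osc_xml)
    for action in root:  # <create>, <modify> or <delete>
        for node in action.findall("node"):
            lat, lon = node.get("lat"), node.get("lon")
            if lat is None or lon is None:
                continue  # some deleted nodes carry no coordinates
            if (min_lat <= float(lat) <= max_lat
                    and min_lon <= float(lon) <= max_lon):
                yield action.tag, int(node.get("id")), int(node.get("version"))
```

This only handles nodes. Ways and relations carry no coordinates of their own, so filtering them would need a lookup of which member nodes are already in the local database.
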
I'll probably write a customized parser rather than use
a generic XML parser, since in my experience on other projects
doing so tends to improve performance significantly.
For my database, I installed rails-port, dumped the database, and
edited the dump file to fit my needs (I got rid of a bunch of the
foreign key constraints, got rid of the "current" tables, got rid of
the "private" tables and columns which I won't have access to, added
fields like "isdownloaded" to tables like the changesets table to let
me know how up to date the data is, etc.). I plan to implement the
"current" tables initially as a view, and ultimately as a materialized
view.
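
A rough sketch of the "current as a view" idea, using SQLite from Python so it runs anywhere. The table and column names are a simplified, hypothetical stand-in for the rails-port schema, not a copy of it:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- History table: one row per (id, version). Simplified, hypothetical columns.
CREATE TABLE nodes (
    id        INTEGER NOT NULL,
    version   INTEGER NOT NULL,
    visible   INTEGER NOT NULL,   -- 0 means this version is a deletion
    latitude  REAL,
    longitude REAL,
    PRIMARY KEY (id, version)
);

-- "Current" view: the highest version of each node.
CREATE VIEW current_nodes AS
SELECT n.*
FROM nodes AS n
WHERE n.version = (SELECT MAX(h.version) FROM nodes AS h WHERE h.id = n.id);
""")

conn.executemany(
    "INSERT INTO nodes VALUES (?, ?, ?, ?, ?)",
    [
        (1, 1, 1, 27.90, -82.50),
        (1, 2, 1, 27.95, -82.46),  # node 1 was moved
        (2, 1, 1, 25.76, -80.19),
        (2, 2, 0, 25.76, -80.19),  # node 2 was deleted
    ],
)

current = conn.execute(
    "SELECT id, version, visible FROM current_nodes ORDER BY id"
).fetchall()
```

Deleted nodes stay in the view with `visible = 0`. A materialized version would store the same result in a real table and refresh it as new history rows arrive.
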
I'm probably reinventing a lot of wheels, but I want access to both
the current and history items (and changesets), and I want to be able
to download bits and pieces as needed (*); and I didn't see any way to
do this with the tools already available.
(*) Alternatively, is there an easy way to download *everything*,
including current, history, and changeset info, preferably just for a
large subset of the world, in my case Florida? I could probably
accept that, but I couldn't find the data. So, I've basically decided
to implement a cache (I call the database osm_cache), and get the info
from the API when it's not in the cache (and cache it).
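
The cache-then-API approach could look roughly like this. `OsmCache`, `node_cache` and the `fetch` callable are hypothetical names for illustration. In practice `fetch` would wrap an API request for one version of a node, and the sketch assumes that a given version never changes once written, which is what makes caching it safe:

```python
import sqlite3


class OsmCache:
    """Read-through cache: look in the local database first, then the API."""

    def __init__(self, fetch, db_path=":memory:"):
        # fetch(node_id, version) -> XML string; supplied by the caller.
        self.fetch = fetch
        self.db = sqlite3.connect(db_path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS node_cache ("
            " id INTEGER NOT NULL,"
            " version INTEGER NOT NULL,"
            " xml TEXT NOT NULL,"
            " PRIMARY KEY (id, version))"
        )

    def get_node_version(self, node_id, version):
        row = self.db.execute(
            "SELECT xml FROM node_cache WHERE id = ? AND version = ?",
            (node_id, version),
        ).fetchone()
        if row is not None:
            return row[0]  # cache hit, no API call
        xml = self.fetch(node_id, version)  # cache miss, ask the API
        self.db.execute(
            "INSERT INTO node_cache (id, version, xml) VALUES (?, ?, ?)",
            (node_id, version, xml),
        )
        self.db.commit()
        return xml
```

Passing the fetcher in as a parameter keeps the cache testable without network access. The same pattern would extend to ways, relations and changesets.
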