[OSM-dev] Mass imports (TIGER and AND)
Dave Hansen
dave at sr71.net
Tue Aug 28 16:44:50 BST 2007
On Tue, 2007-08-28 at 09:34 +0200, Martijn van Oosterhout wrote:
> On 8/28/07, Dave Hansen <dave at sr71.net> wrote:
> > I've been very painfully uploading the TIGER-generated data through
> > JOSM. At the rate I'm going it will probably take 5 or 10 years to
> > upload the entire US. Literally. I'm uploading one or two counties a
> > day, and there are 3,234 counties in the country.
>
> Ah, do you have any idea on the number of requests per second that
> approximately equates to?
I just did 100 requests in 30 seconds, so ~3.3 per second.
> > So, I installed the rails port on my laptop, and sicced JOSM on it. The
> > uploads are maybe twice as fast as they are to the main OSM server. So,
> > the round-trip-time actually isn't that _huge_ of a performance
> > bottleneck.
>
> Over the last few days I've written a bulk_import.pl which takes a
> JOSM or osmChange file and uploads it to the server. It should have
> lower overhead than JOSM, but from what you're saying it seems that it
> won't really help. :(
>
> The AND import has > 8 million objects to create, so speed is of concern.
Actually, at 3.3/second, that's only 28 days. That's not so bad.
I'm estimating that the average TIGER county in the US is about 3MB, in
compressed OSM format. Using a random 3MB county, that means ~192,000
objects. There are about 3200 counties in the US, so (192,000 * 3200) /
(3.3 * 86,400 seconds in a day) = 2154.88 days. Counting leap days,
that's ~5.9 years. :)
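For anyone who wants to redo the arithmetic with their own numbers, here is a rough Python sketch of both estimates. The function name and constants are only illustrative; the inputs are the figures quoted above (3.3 requests/second, 8 million AND objects, ~192,000 objects per county, ~3200 counties), and it assumes one object per request with uploads running around the clock.

```python
SECONDS_PER_DAY = 86_400
DAYS_PER_YEAR = 365.25  # average year length, counting leap days


def days_to_upload(num_objects: float, requests_per_second: float) -> float:
    """Days needed to upload num_objects, assuming one object per request
    and a constant request rate with no downtime."""
    return num_objects / (requests_per_second * SECONDS_PER_DAY)


RATE = 3.3  # requests per second, from the 100-requests-in-30-seconds test

# AND import: a little over 8 million objects.
and_days = days_to_upload(8_000_000, RATE)

# TIGER import: ~192,000 objects per county times ~3200 counties.
tiger_days = days_to_upload(192_000 * 3200, RATE)
tiger_years = tiger_days / DAYS_PER_YEAR

if __name__ == "__main__":
    print(f"AND:   {and_days:.1f} days")
    print(f"TIGER: {tiger_days:.2f} days, about {tiger_years:.1f} years")
```

Doubling the rate halves both figures, so the estimate is only as good as the measured requests per second.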
I'm tempted to start chanting *U* *S* *A* over and over, but I think
I'll hold myself back on this one. ;)
-- Dave