[OSM-dev] Deleting TIGER node tags

Andy Allan gravitystorm at gmail.com
Tue Jul 21 18:48:16 BST 2009


On Tue, Jul 21, 2009 at 3:38 PM, Frederik Ramm<frederik at remote.org> wrote:
> Hi,
>
> Andy Allan wrote:
>>
>> Tiger node tags make up 85.43% of all node tags and take up:
>
> [...]
>
> I just did a little test, prepared an .osc document that removed the node
> tags from about 1000 nodes:
>
> http://www.openstreetmap.org/browse/changeset/1894387
>
> It came out at roughly 10 node changes per second. I count 177m nodes with
> TIGER tags, which means that the whole process would take about 200 days on
> one API thread. It might be slightly faster if you upload larger or smaller
> chunks - would have to do some experimenting to find the sweet spot. Time
> could also be saved by running it one the LAN (on dev), but again probably
> not a lot.

Ah, good stuff. I was assuming the best way to do it was a script on
dev doing 50,000 nodes at a time (i.e. max diff upload) when it came
to hitting the sweet-spot, but I'm interested in your experiments.

Like you, I'm a bit concerned about the side effects of creating x
billion changesets, but for replication, verification etc it's nice to
behave nicely (i.e. through the API). But that just gives everyone an
added incentive to stop other people doing ill-constructive bulk
imports in the first place! The sooner we can clear this from TIGER
the fewer people will use it as an example to aim for.

Cheers,
Andy




More information about the dev mailing list