[OSM-dev] Help needed for processing the planet.osm (for osmdoc.com)

Stefan de Konink stefan at konink.de
Mon Aug 17 15:25:02 BST 2009


On Mon, 17 Aug 2009, Lars Francke wrote:

> I need to aggregate data about the number of tags, tag combinations,
> keys and values. See http://osmdoc.com for the kind of data I need.

Do something smart on line 346 for on the fly counting. Though I think
that I can generate the page that you provide maybe faster using just
group by on an existing database... MonetDB uses string deduplication,
perfect for this job ;)

So that answers your:

"So what I'm looking for is a simple (I really don't want to have to
setup another Hadoop cluster) but still reasonably performant way to
process the planet.osm to aggregate the needed data in a suitable
format for importing it into a relational database."


Stefan





More information about the dev mailing list