[OSM-talk] US City Import?

Beej Jorgensen beej at beej.us
Sat Nov 24 01:53:44 GMT 2007


Robert (Jamie) Munro wrote:
> Beej Jorgensen wrote:
>> 3. Some cities are in the OSM database already.  Data duplication,  blah 
>> blah.
> 
> Can you try and look for them by comparing names with nearby names in
> Planet.osm, or using the zappy api?

I could.  All that would be left is to decide which data was "right". :)

Getting all the cities and towns in the US with the xapi is no problem, 
so I could read that in to not upload duplicates.

A few quick queries show that there are currently about 1000 cities, 
towns, villages, and hamlets in total in the US.

> I'd like to see the date alongside all the census tags. Then we could
> know how up to date the values are, and after future censuses we could
> keep historical data.

Something like:

	<tag k="population" v="3040;YYYY-MM-DD" />

?

This almost sounds like something that should be standardized, if we go 
this route.  The census data files I have go back a few years, so I 
could import the lot of it.

A question I have here is: is historical census data something that 
should go in the OSM database?  It can easily be crossreferenced by 
interested parties later from external sources.  Furthermore, wouldn't 
future population changes be automatically saved in the node history?

> If you make the files, someone can probably load it in to the Tiger bulk
> uploader that runs on the "dev" server.

This probably won't be necessary, but I could do it.  I estimate it 
would take a little less than a day to do the whole US from here, or 
about 2-3 hours if I only did the major cities.

Another question... I use the stock definitions of the city sizes, 
currently:

hamlet < 100  (I made this up--page says "a few" houses)
village < 10000
town < 100000
city >= 100000

This means that nothing in Montana, for instance, will be a "city". 
Should the city sizes be uniform over the whole US, or should I scale 
them in each state?  I'm for uniform, since things might look really 
weird on a state boundary.

-Beej





More information about the talk mailing list