[Talk-transit] Naptan import

Christoph Böhme christoph at b3e.net
Mon Jul 27 21:49:35 BST 2009


Good evening,

Peter Miller <peter.miller at itoworld.com> schrieb:
> On 26 Jul 2009, at 22:14, Christoph Böhme wrote:
> > I also created a copy of the NOVAM viewer and changed it to display
> > NTPG data instead of bus stops:
> >
> > http://www.mappa-mercia.org/cgi-bin/nptg.wsgi/viewer.html
> 
> Great stuff, and clearly there are many additional place-names in
> NPTG that are not in OSM a present in many parts of the county. I
> checked North Norfolk and bits of Scotland and there are a good
> number of additional places.

I have now also added all nodes with place=* tags from OSM. The NPTG
import will really add a lot of additional places! OSM has only 25397
places in the UK at the moment. However, I was a bit suprised to see
some hamlets in the OSM data which are not in the NPTG data. Do you
know of any gaps in the NPTG data?

> The LocalityClassification field should be more useful and should  
> contain city, town, village, hamlet, suburb, urbancentre, place of  
> interest, other, or unrecorded. I am not sure how well this field is  
> populated - possibly it is not well populated at all. UrbanCentre
> can possibly be ignored.  

The LocalityClassification tag is used 856 times in the dataset. That is
about 2% of all localities.

> The field may be well populated in some parts of the country and not
> in other. I am not sure how much NPTG is used for Points of Interest.
> There is a POI model in NPTG but possibly we treat this separately or
> not at all or import the data as invisible to start with. My main
> interest is the locality names and the main technical job will
> probably be to spot duplicates with what is in OSM already.

Finding duplicates should not be too difficult. We basically just need
to check for each imported location if there are any places with the
same name within a reasonable distance. Except for typos and different
spellings that should work very well. The positions of locations in
both datasets also match nicely which should make it even easier to
find duplicates.

> Would it be worth creating a NPTG Import wiki page and an NPTG
> Import user to do the actual import - ie, keep the documentation and
> audit trail for the two imports separate?

I am in favour of keeping them separate. Both datasets are fairly
independent and we will probably use different methods to import them.
Having everything on one wiki page will be confusing to users, who might
be interested only in one of the imports.

	Cheers,
	Christoph




More information about the Talk-transit mailing list