[Talk-ca] OSM data quality in Canada

Steve Singer steve at ssinger.info
Sat Jun 20 19:46:43 UTC 2015


On Wed, 17 Jun 2015, Martijn van Exel wrote:

> Hi Andrew, 
>
> Thanks for elaborating on the CanVec / Geobase imports! This also raises new questions.. See below.
>
>> On Jun 17, 2015, at 3:00 PM, Andrew MacKinnon <andrewpmk at gmail.com> wrote:
>> 
>> A lot of the data in Canada was imported from CanVec and Geobase,
>> some of it by me several years ago. The imported data is pretty poor
>> quality in many places. I haven't done much work on this recently, as
>> imports have a bad reputation in OSM and I am mostly concerned with
>> surveying. For example:
>> 
>> - Some older road data comes from an import which combined CanVec and
>> Statistics Canada road names, attempting to match the road names in
>> Statistics Canada with roads without names from CanVec, and this data
>> is poor quality.
>
> Is this described in more detail anywhere? Are the data / scripts / 
> process still available? Which dat was poor quality, CanVec or Statistics 
> Canada?

The StatsCan geometries were really poor at least as bad as the original 
TIGER stuff but they were the only source of road names in some places.

The scripts used for the geobase->osm (and attaching statscan names) are 
available at 
http://svn.openstreetmap.org/applications/utils/import/geobase2osm
I only did this in Alberta and Ontario.

We tried to use roadmatcher to only include road segments that we were 
pretty sure didn't already exist in OSM.  This often left gaps in road 
segments where roadmatcher wasn't sure if something was or wasn't included. 
Also we didn't have any way of automatically attaching the existing OSM 
ways with the new geobase ways which left A LOT of unconnected roads.  This 
has mostly been fixed (often thanks to keeprite and maproulette) but it 
tooks many years.

Some of the initial sections also didn't connect new geobase roads with each 
other due to a bug the import script, we tried to fix this with repair 
scripts at the time.


Steve




More information about the Talk-ca mailing list