[Imports] Road data imports from CC0 government database for Sweden

Anders Torger anders at torger.se
Wed Mar 24 07:36:24 UTC 2021


Hello Frederik,

The skepticism is as follows. In all OSM communities there are some that 
prefer if all mapping is made manually and no imports are made at all, 
to conserve the originality of OSM data. In this view, quality in terms 
of positional accuracy, richness of tags and completeness of the map is 
secondary. I have much respect for this view and understand that getting 
data from an external database can be seen as a bit "boring" and taking 
the joy out of mapping.

There's also been concerns that the import would be made in a careless 
way, such as deleting and replacing with fully automatically translated 
data without review. There's also been concerns that NVDB which has a 
built in classification of roads in 9 levels does not match how OSM has 
classified roads (trunk->tertiary), and that these levels would replace 
the classification already done.

I would not say that these concerns are "irrelevant", I have much 
respect for these views and concerns. However, we have designed our 
import process in a way that in that context they are indeed irrelevant. 
NVDB is also in terms of data complementary, it doesn't have all the 
information OSM has, paths, and even some small roads are lacking, 
detail inside cities is lacking etc, it doesn't have any area polygons 
etc. The information it has is good, but there's still a large need to 
make local manual mapping as well, so it's certainly not a replacement 
for the traditional mapping. Just a help to keep the overall map in a 
high quality state, and relevant for Sweden, better positional accuracy 
which today is rather poor in many places.

As always, if a careless person start applying the automatic translated 
layer without following the appropriate manual merging process, these 
concerns are surely relevant. Therefore it's important to us in the 
community to follow up and see what people are actually doing so it's 
not misused. We present a toolbox and a process, we cannot stop anyone 
from misusing it until we detect it has happened. Fortunately there are 
quite many of us in the Swedish community that have an eye on various 
supervision tools to detect suspicious activity, just like what happened 
past year.

The process is detailed here (in Swedish)
https://wiki.openstreetmap.org/wiki/Import/Catalogue/Sweden_highway_import/Import_guidelines

A few details from it:

1) Only modify operations are used, never delete/insert, to keep history 
and make sure existing tags and relations are properly transferred. The 
key tool used here is JOSMs "replace geometry", which is used in a 
special workflow with matching segment splitting to make sure tag 
transitions happen in the right places and everything gets transferred 
and possible tag conflicts are manually resolved.

2) There are extensive documentation on how the NVDB translated layer 
should be pre-processed manually, adjusting highway types, making sure 
bridges are properly placed (the bridge data has unfortunately not good 
positional accuracy in NVDB), describing limitations in multi-lane roads 
and how these need manual additions etc.

3) A process of how to use the JOSM validator and interpret its warnings 
is documented, to go through issues in translated NVDB layer, and in the 
manual merging phase to make sure all roads are connected and no 
duplicates are accidentally left in.

4) A process of how to adjust positional errors of other features, like 
buildings and natural features, to match the improved accuracy of roads. 
In many places in Sweden the positional accuracy is so poor that if you 
just care about roads they will end up going right through buildings. In 
fact, in some municipalities there will be more work with correcting 
positional errors than actual upgrading the road geometry, and this too 
is an important goal for this project.

5) Suggestions of how to manually enrich the data with things like 
turning circles (prominent features in our forestry road network for 
example)

6) Discussions about different styles in NVDB geometry and pre-existing 
OSM geometry, especially in cities and how to relate to these, for 
example if cycleways are mapped separately or not, how noexit roads are 
mapped, turning circles etc.

7) NVDB is never used as the sole source in the merging process. You 
always use aerial photos, also make use of the Lantmäteriet raster map 
which is a great positional guide and provides some additional details, 
and of course carefully observe the previously existing data. Highway 
types are nearly always kept the same as previously mapped, unless those 
are obviously wrong.

Indeed there is zero relationship with this and the attack we had on 
Sweden last year, I followed that personally and was actually one of the 
"policing" guys which analyzed a lot of those accounts and the change 
sets and reported the activity. That import used some sort of old 
foreign database (not NVDB or any other publically available Swedish 
database) and was obviously highly automated and left roads unconnected, 
and did actually not provide much tags like maxspeed, road names and 
road surfaces which would be expected by a quality import from NVDB. We 
in the Swedish OSM community never figured out what database they used 
or what their motives were. It was surely not a helpful or careful 
import, even using hijacked accounts to some extent. There were no 
Swedish local mappers involved in that activity, we didn't have the 
ability to track it down but it seemed like it came from Russia and/or 
other eastern countries.

(For the regular mapping activity there are mostly Swedish mappers 
together with some very active and experienced mappers from the German 
community.)

Using our manual merging process, the mapping speed compared to 100% 
manual mapping is say 5-10 times higher, so it's fast, but not 
super-fast, no way near an automated import. Going through Sweden is 
expected to take a number of years. Going through a municipality of 
medium size outside the rural areas is about 80 hours of manual work. If 
there are detailed mapping with large positional errors in a mountainous 
region, it will take more time than that. Updating cities with several 
multi-lane roads and bridges and tunnels will also takes lots and lots 
of time, but we will not focus on dense cities to start with as these 
are already mapped with pretty decent quality.

I've made an announcement to talk-se as the process requries, no reply 
yet. But it was recent and there is very low activity in that group. 
Still waiting. I actually do not know about the visibility, there's 
relatively low activity in also in the Facebook group, lots of passive 
listeners which I do not know the names of. That I ended up there is by 
going through all the various forums etc to find where "all the action 
were", and while relatively low activity it was more than in the other 
places. (Facebook groups for all sorts of things is very popular in 
Sweden).

I do know about a few long-term guys active in the facebook group which 
are also active in some of the international mailing lists (not sure 
about talk-se specifically though), and also several prominent mappers 
far up in the Swedish top list, including those that are known to use 
NVDB data in their personal mapping and has published previous 
processes, like this: 
https://wiki.openstreetmap.org/wiki/%C3%96ppnadata_erfarenheter, and we 
have got endorsement from them.

We are not expecting a huge number of mappers to join in on this effort. 
Most mappers active in Sweden are casual iD users, and to do this type 
of work you need to be an experienced JOSM user preferably with plenty 
of time on your hands. I would personally expect 1 - 5 active people at 
any given time doing this work, although I hope for more.

/Anders

On 2021-03-23 23:50, Frederik Ramm wrote:
> Hi,
> 
> On 3/23/21 21:51, Anders Torger wrote:
>> This work has been in the open during the whole process and so far 
>> been discussed in the most popular informational communication channel 
>> in the Swedish OSM Community, which is a facebook group. It has been 
>> very well received, with a dash of the usual amount of healthy 
>> skepticism that any import effort generates.
> 
> Since the Facebook group does not have a publicly accessible archive,
> could you be so kind as to summarize the discussion, outlining what
> the "usual amount of healthy skepticism" was about and whether the
> different points of critique have been (a) addressed in the import
> plan or (b) considered irrelevant and if so, why?
> 
> There was a very large (90k changesets) undiscussed and quite harmful
> import of road data in Sweden end of last year where significant
> effort had been spent on making the import hard to detect and
> difficult to revert. If I remember correctly, that import suffered
> from having lots of small bits of track that connected to nothing. I
> hope there is zero relationship between that effort and what you are
> proposing here? Have any of the names on the list
> (https://lists.openstreetmap.org/pipermail/talk-se/2020-December/003928.html)
> popped up anywhere in the Facebook discussion?
> 
> Bye
> Frederik



More information about the Imports mailing list