[Imports] Import of Flemish Government data (building footprints and addresses)

Gertjan Idema g.idema at zonnet.nl
Mon Nov 5 19:35:26 UTC 2018


Dear fellow-mappers,

In The Netherlands we went through this process 4 years ago. This means 
we can talk from experience.

After the preparation period, we started the real import process around 
march 2014 with a group of some 30 volunteers. At the end of 2014, the 
initial import was finished. At the time, we decided to import the 
building id's, but not the address id's. The address id's we not 
available in the WFS data source we used and we reasoned that the 
addresses can be identified by the address tags.

Since then, we are in the process of maintaining the imported buildings 
and address. This concerns around 11.000 new buildings and the same 
number of new addresses every month.
The tooling we use in this process depends highly on the building id's. 
Not having have the address id's seriously complicates the maintenance 
of addresses, Even though the combination of postcode and house number 
are unique in The Netherlands. Being official government id's, the 
building and address id's in The Netherlands get more and more official 
applications day by day. They are already required in notarial acts and 
used by the chamber of commerce. Leaving them out would not be an option 
in my opinion.

We have no issue with mappers being afraid to touch buildings because of 
the building id's. Most people know where to find the id, or file an 
import request on the forum for a building or a block of buildings. Some 
complex buildings like Utrecht Central station keep being mapped 
manually because the official data is too complex.
The main issue we have is with addresses being deleted, because people 
delete a node instead of the poi tags. This is nothing new, but it is 
detected more often because of the continuous comparison between OSM and 
official building and address data.

Gertjan Idema


On 01/11/2018 15:39, Pieter Vander Vennet wrote:
> Hello everyone,
> Original Poster here.
>
> Thanks for your remarks. A lot of peopleactivein the Belgian community 
> are following this discussionwith interest and some bewilderment.Like 
> the first message, this is a reply that has been written by several 
> memberstogether.
> We did have a hard time filtering the actual discussion about our 
> import from the thread. We would kindly invite everyone to take the 
> discussion of other imports and the more philosophical points  to a 
> more appropriate place (e.g. the talk-list) and stick to the topic of 
> the Flemish buildings import.
>
> We have been working on this for two years and don't mind working on 
> it for a few more months, before running the actual integration. 
> Weare, in the first place, looking for advice and guidance so as to 
> further improve our methods - together with witha go-ahead once all 
> issues are resolved.
>
> In case this was not clear before:this is _not_ a flat import where 
> all the data is dumped into OSM automatically. This is an effort of 
> the Belgian community, where mappers select a subset of the data they 
> choose to integrate piecemeal, usually one street at a time. The data 
> is loaded in JOSM and checked against the aerial imagery involving the 
> mapper's common sense.
> As an example, this is a screenshot of the tool in action: 
> https://matrix.org/_matrix/media/v1/download/matrix.org/WmemnfTQZOSfoKoIlByFBHmv. 
> You can see that the tool reuses tags from OSM and is offering the 
> mapper the choicewhich geometry should be used(by not saving the 
> import). In combination with aerial imagery, this should cause little 
> to no problemsin regard with the original OSM-data.
>
> Mateusz and Frederik, your points regarding documentation are being 
> addressed as we speak.
>
> # External IDs
>
> The most discussed point seems to be the IDs we wanted to include.
> We believe theIDswill significantly ease updates to our buildings when 
> the GRB is updated,and make the whole process more robust.
> Theyprovide a way to update and crossreference the data now and in the 
> future.Exactly by keeping theseIDs, updating data down the road will 
> be smoother and prevent later changes from OSM contributors to be 
> overwritten. As an added benefit, keeping IDsmakes the import 
> tool-agnostic. It will also make it much easier to flag errors in the 
> source data.
>
> We aren't afraid that people will refrain from editing source-tagged 
> objects: new users (using theiD editor) will probably not notice those 
> tags in the first place; advanced users will know about them. And 
> intermediate users will probably look them up. Merging and splitting 
> are rather rare operations –once the geometry is accurate, they should 
> becomeunnecessary operations. Finally, as the data set is available 
> for free under an open license, anyone can verify the data 
> independently (though noton the ground, of course).
>
> We will address the worries about the external IDs on a point by point 
> basis below:
>
> From *Frederik Ramm:*
>
>   * /If you delete a building////that has such an ID, how will you
>     ensure i//t//isn't brought in again////through a later "update
>     import"? Etc.//"/
>
>
> There will not be "an update import". The tool is built for continuous 
> use. It will improve the geometry of existing buildings and already 
> looks at the source tags. The tool is builtfor heavy mappers who will 
> be monitored by our closely knit community. Importing and updating 
> will be done street by streetor block by blockbasis. We believe we 
> cantrustthese mappers to analyze situations like this on a case by 
> case basis. In this case, whether of not the deleted building had an 
> ID does not matter much.
>
>
> These points, respectively by Frederik Ramm, Christoph Hormann and 
> Mateusz Konieczny are quite similar. *We'll answer them together.*
>
>   * /The idea of having an "audit trail" for every single geometry by
>     way of an Id for that individual geometry is interesting, but I
>     think that it is totally sufficient if a changeset carries the
>     information that this changeset has been imported from XYZ data
>     source at time stamp T; everything else can be researched down the
>     line if the need should/
>   * / For this purpose it is completely unnecessary to bother the OSM 
>     community with external IDs.  If you want to check if the data has
>     been unchanged since you added it then do exactly that - check if
>     there are  any newer versions of the objects that have originally
>     been added in  the import./
>
>   * /* What is the point of adding tags like source:geometry:entity
>     given that like any other tags they may be edited once added to OSM?/
>
>
> It is not, and has never been, the intention to fully automate this. 
> As we are trying to make clear, this info is needed and used all the 
> time, and not at some vague "maybe we do an update sometime". We are 
> not using the IDs to make a direct (full database) comparison to see 
> what exists in one and not in the other on a gigantic scale. We are 
> not looking for an 'auto update OSM with all new buildings added to 
> GRB'. We use the IDs, on a SINGLE BUILDING comparison basis only to 
> see what's changed (geometry touch-ups, or entirely replacing a building).
>
> "THEY WON'T BE STABLE ANYWAY"
> If the external references are edited, the tool will flag the building 
> as needing an update. The only thing a (deliberate or accidental) 
> unneeded change of the ID would do, is alert the tool's users that 
> something doesn't add up. Either the building has been replaced in 
> reality (and thus having a new UIDN), and you can improve the geometry 
> of the new building. If no apparent change to the physical building 
> can be traced, the UIDN can be restored.  The amount of 'false 
> positives' due to unintended edits of the IDs is expect not to come 
> remotely close to the useful flaggings.
> Without the tags, it's hard to tell which buildings have been 
> imported: you would need complicated spatial heuristics because we 
> don't blindly copy buildings, we improve them through other sources. 
> Once mapped, the geometry changing would happen more often than the 
> tags changing, so we'd have a lot more false positives.
>
>
> "WHY NOT JUST CREATE A DATABASE OF LINKS EXTERNALLY?"
> In theory it would be possible to have the tool keep a register of 
> which OSM ID maps to which ID in the GRB or to evaluate changesets to 
> get similar info. Some issues with this approach:
>
>   * Because we aren't doing an automatic import but manually add the
>     buildings through JOSM, this would require us to copy the OSM ID
>     of each building manually, or rely on heuristics.
>
>   * We don't want to centralize the link between OSM and GRB. There
>     isn't one single person doing the import, it's something several
>     people in the communitywork on. Anyone could host the tool if its
>     current maintainer disappears.
>
>   * Pushing the analysis of what exactly has happened towards a
>     changeset is an unnecessary burden on a mapper. The changeset will
>     not just contain "here be new buildings", but also "in this case,
>     we just used part of the geometry of the GRB building, but we left
>     part of the building geometry intact because that was better in
>     OSM". After having analysed the changeset, the mapper would still
>     have to look up in an import database what exactly the
>     relationship between the OSM and GRB objects was at the time of
>     import.
>
>   * An external database providing the link between OSM and GRB
>     objects would be outdated after a day and almost impossible to
>     update with changes on the OSM side
>
>
>
> "WHY NOT JUST COMPARE GEOMETRIES?"
> Doing a geometrical analysis to analyze differences would be 
> impractical, not just because this would be computationally heavy, but 
> also because it would lead to too much false positives. For example 
> because of tiny changes, but also because we do not blindly use source 
> geometries since we first address overnoding.
>
>
>
> I hope that all confusion is cleared now and that we can move this 
> issue forward.
>
> With Regards,
> The Belgian Mappers,
> Pieter Vander Vennet
>
> _______________________________________________
> Imports mailing list
> Imports at openstreetmap.org
> https://lists.openstreetmap.org/listinfo/imports
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstreetmap.org/pipermail/imports/attachments/20181105/3d1759fd/attachment-0001.html>


More information about the Imports mailing list