[OSM-talk-be] Importing Villo! API data
CedB12
cednospam at gmail.com
Sun Oct 15 11:48:42 UTC 2017
Hello all,
Lately I have been looking at the Villo! dataset from the JCDecaux API
at [1], which is released under the Etalab Open License (see also [2]).
I want to consult the community about the use of this data to improve
the tagging of the stations we have already mapped. I would also like to
discuss a potential import of the hundred or so stations that are
reported in the API but have not been mapped yet in OSM.
My priority is to fix the tagging of station names and reference
numbers, which are often wrong or missing in the already-mapped
stations. I am aware of a few quality issues in the names reported by
the API (which, as far as I know, are actually the names reported at the
stations themselves), so this cannot be a fully automated process. As
far as ref tags are concerned, only 25 existing station nodes do not
match the API. I have not pushed any changes yet, in case this thread
brings up an objection to the use of this API data.
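As a rough illustration of how such a ref comparison could be done, here is a minimal sketch. It pairs each OSM node with the nearest API station within a distance threshold and reports nodes whose ref is missing or different. The record shapes are assumptions made for this example (an API station as a dict with `number` and `position.lat`/`position.lng`, an OSM node as a dict with `id`, `lat`, `lon`, `tags`), as are the function names and the 50 m threshold; it is not the procedure actually used to obtain the figure of 25.

```python
from math import radians, sin, cos, asin, sqrt

EARTH_RADIUS_M = 6371000.0

def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two WGS84 points."""
    dlat = radians(lat2 - lat1)
    dlon = radians(lon2 - lon1)
    a = sin(dlat / 2) ** 2 + cos(radians(lat1)) * cos(radians(lat2)) * sin(dlon / 2) ** 2
    return 2 * EARTH_RADIUS_M * asin(sqrt(a))

def find_ref_mismatches(osm_nodes, api_stations, max_dist_m=50.0):
    """Return (osm_id, osm_ref, api_number) for nodes whose ref disagrees
    with the nearest API station. Nodes with no station nearby are skipped.
    The input shapes are assumptions of this sketch."""
    mismatches = []
    for node in osm_nodes:
        nearest, nearest_dist = None, None
        for station in api_stations:
            pos = station["position"]
            dist = haversine_m(node["lat"], node["lon"], pos["lat"], pos["lng"])
            if nearest_dist is None or dist < nearest_dist:
                nearest, nearest_dist = station, dist
        if nearest is None or nearest_dist > max_dist_m:
            continue  # no API station close enough to compare against
        osm_ref = node["tags"].get("ref")
        # Compare ignoring leading zeros, e.g. "042" vs 42.
        if osm_ref is None or osm_ref.lstrip("0") != str(nearest["number"]):
            mismatches.append((node["id"], osm_ref, nearest["number"]))
    return mismatches
```

The output is only a list of candidates: a station that was moved, or two stations closer together than the threshold, would still need a manual check before any edit.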
More importantly, given the quality issues in the API names, we would
need to discuss how exactly we want to tag names vs. what the "official"
names are.
To give you a quick example of the kind of problems we can find in the
API, consider the station named "342 - MAISON COMMUNALE DE
BERCHEM ST AGHATE". As with all other stations, the name is in all caps.
This one in particular contains a misspelling: the commune is actually
spelled "Berchem-Ste-Agathe". Also, unlike other stations, this one has
no official Dutch name, and it is not clear to me whether we should
provide our own translation in the name and name:nl tags.
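To make the example concrete, here is a minimal sketch of how an API label of this form could be split into a reference number and a name, with a mixed-case suggestion generated for manual review. The "number - NAME" pattern is inferred from the single example above, and the function names and the list of lowercase words are assumptions of this sketch. It does not correct misspellings or decide on translations, so a human still has to review every suggestion.

```python
import re

# Assumed label layout, inferred from the example: "<digits> - <NAME>".
LABEL_PATTERN = re.compile(r"^\s*(\d+)\s*-\s*(.+?)\s*$")

# Small words kept lowercase when not in first position (illustrative list).
LOWERCASE_WORDS = {"de", "du", "des", "la", "le", "les", "et",
                   "van", "der", "den", "het", "en"}

def split_api_label(raw):
    """Split an API label into (ref, name); ref is None if no number prefix."""
    match = LABEL_PATTERN.match(raw)
    if match is None:
        return None, raw.strip()
    return match.group(1), match.group(2)

def suggest_name(label):
    """Turn an all-caps label into a mixed-case suggestion for manual review."""
    words = label.lower().split()
    result = []
    for index, word in enumerate(words):
        if index > 0 and word in LOWERCASE_WORDS:
            result.append(word)
        else:
            # Capitalize each hyphen-separated part, e.g. "sainte-agathe".
            result.append("-".join(part.capitalize() for part in word.split("-")))
    return " ".join(result)

ref, name = split_api_label("342 - MAISON COMMUNALE DE BERCHEM ST AGHATE")
print(ref, "|", suggest_name(name))
```

For the station above this yields the ref "342" and the suggestion "Maison Communale de Berchem St Aghate", which still carries the API's misspelling and abbreviation. That is why the result can only be a starting point for the name tags rather than something to upload as-is.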
I actually got a little bit ahead of myself and prepared a diary
entry draft as well as a more detailed and specific email for this
mailing list, but I now realize that unloading all of this at once might
feel a bit forceful. So before I go into the details of all the
quirks in the API data and formulate a general proposal for tagging, I
wanted to take a more open-ended approach and ask if anyone had anything
to share regarding our mapping and tagging of Villo! stations. I am also
interested in your thoughts on how we should tag the station I gave
above as an example (in terms of name, name:fr, name:nl, and maybe other
kinds of name tags like official_name).
But before that, I would like to make sure that it is OK to import
Etalab-licensed data, because otherwise this effort will be pointless. I
assume it must be fine because the license states that it is compatible with
"any licence which requires at least the attribution of the «
Information »" [3], including the Open Government Licence, which is in
turn listed on the OSM wiki page on ODbL compatibility [4]. How are the
requirements of the license (attribution by source name + date + URL)
handled, though?
Also, does an operation of this scale (tagging a subset of 200 existing
nodes and possibly importing another 100) require that I follow the
import guidelines?
Thanks,
Cédric
[1] https://developer.jcdecaux.com/
[2] http://opendatastore.brussels/en/dataset/villo
[3] https://developer.jcdecaux.com/files/Open-Licence-en.pdf
[4] https://wiki.openstreetmap.org/wiki/Import/ODbL_Compatibility