[OSM-talk] Adding Wikidata tags to 70k items automatically
Archer
arch3r at gulli.com
Sun Aug 31 16:14:47 UTC 2014
Please don’t get me wrong. I’m a big fan of Wikidata, but I’m against
an automated import. The mismatches list gives good examples showing that
your matching algorithm doesn’t work very well:
http://edwardbetts.com/osm-wikidata/mismatches.html
Some examples:
1. Isar Nuclear Power Plant <http://wikidata.org/wiki/Q569510>: your
algorithm matches only one reactor of the power plant, Isar 2
<http://www.openstreetmap.org/way/32918120>, but the right match
would be Kernkraftwerke Isar <http://www.openstreetmap.org/way/23802422>
2. Heligoland <http://wikidata.org/wiki/Q3038>: you’ve matched the island
Heligoland <http://www.openstreetmap.org/relation/3787052>, but the right
match would be the municipality Heligoland
<http://www.openstreetmap.org/relation/1157962> (a separate Wikidata
object exists for the island)
3. Puerto Rico <http://wikidata.org/wiki/Q1183>: the Wikidata object says
it „is an unincorporated area of the United States“, so the right match
would be the administrative relation Puerto Rico
<http://www.openstreetmap.org/relation/306157>, but your algorithm matches
the island node Island of Puerto Rico
<http://www.openstreetmap.org/node/357271412>
I also don’t understand why you prefer nodes over ways or relations.
Ways and relations provide more information (e.g. the extent of an area)
than nodes. The matching algorithm should look for relations first; if
there is no relation, it should search for ways. Nodes should come last.
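
To make the suggested order concrete, here is a minimal sketch in Python.
It is not the code behind the mismatches list; the candidate format (a dict
with "type" and "id") and the function name pick_best_candidate are
assumptions made up for illustration.

```python
from typing import Optional

# Lower number = preferred element type (relation, then way, then node).
TYPE_PRIORITY = {"relation": 0, "way": 1, "node": 2}


def pick_best_candidate(candidates: list[dict]) -> Optional[dict]:
    """Return the candidate with the most preferred OSM element type.

    Each candidate is a hypothetical dict such as
    {"type": "way", "id": 23802422}. Returns None if the list is empty.
    """
    if not candidates:
        return None
    # min() keeps the first candidate among those with equal priority.
    return min(candidates, key=lambda c: TYPE_PRIORITY[c["type"]])


# The Puerto Rico case from above: the relation wins over the node.
candidates = [
    {"type": "node", "id": 357271412},
    {"type": "relation", "id": 306157},
]
best = pick_best_candidate(candidates)
```

Element type alone would not fix the Isar or Heligoland mismatches, where
both candidates are ways or both are relations, so this ordering is only a
tie-breaker on top of a check that the objects really describe the same
thing.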
What does your matching algorithm do when a Wikidata object describes
several different things and therefore should be split?
A good example is the Wikidata object for Thasos
<https://www.wikidata.org/wiki/Q204096>, which currently describes both the
island and the municipality “Thasos”. It has to be split into two
Wikidata objects so that you can say “the island Thasos lies in the
administrative division Thasos”. There are other cases like this in
Wikidata, such as mixed-up nature reserves, lakes and administrative
divisions, which have to be resolved before the IDs can be imported into OSM.
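
One way such mixed-up objects could be flagged before an import is sketched
below in Python. This is only an illustration: the type labels, the list of
conflicting pairs and the function name needs_split are assumptions, and a
real check would have to read the types from the Wikidata object itself.

```python
# Hypothetical pairs of types that should not be described by one object.
CONFLICTING_TYPES = [
    {"island", "municipality"},
    {"lake", "nature reserve"},
    {"lake", "administrative division"},
]


def needs_split(types: set[str]) -> bool:
    """Return True if the given set of type labels contains a conflicting pair."""
    return any(pair <= types for pair in CONFLICTING_TYPES)


# An object described as both an island and a municipality gets flagged.
flagged = needs_split({"island", "municipality"})
```

Objects flagged this way would be held back from the import until someone
has split them by hand.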