[OSM-dev] Wikipedia Matching
dieterdreist at gmail.com
Sun May 21 11:19:34 UTC 2017
2017-05-19 13:23 GMT+02:00 Andy Townsend <ajt1047 at gmail.com>:
> I'd certainly take some of those added tags with a pinch of salt. A
> number "place" objects near me have been linked to wikidata items by a
> well-meaning wikipedian, but unfortunately they don't actually match. What
> tends to happen is something like:
> o OSM has a place object for a village and an admin entity
> o An OSM user adds a wikipedia tag to the admin entity. The wikipedia
> entry describes itself as covering both the village and the admin entity,
> so that's OK.
> o A wikipedian writes a bot that creates a wikidata item from the
> wikipedia article. The bot creates wikidata entries for villages, not
> admin entities. That's not entirely wrong, because the wikipedia article
> actually covers both.
> o A different wikipedian spots that there is an OSM admin entity and a
> wikidata item with the same name in a similar location and links them via a
> wikidata tag. This results in the wrong OSM entity being linked to a
> wikidata item.
+1, this is an example for a general issue with wikipedia, mostly their
articles cover both, the socio-geographic place and the administrative
entity, with parts of the data (e.g. population) usually referring to
administrative territorial entities. Wikidata has generally an item for
every article of Wikipedia, but not much beyond wikipedia articles, so even
if they state they are about an administrative territorial entity, often
they are also about the place, so linking it to OSM objects is kind of
problematic because there is some overlap in what is represented, but it is
not the "same thing" that is described.
It is not a huge problem to have slightly different wikidata/WP objects
(but with significant overlap) linked to OSM via tags, but people should be
aware that the linked wikidata object is not the same as the osm object,
but really is just a (hopefully) useful link to something somehow related.
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the dev