[OSM-talk] Semi-auto converting Wikipedia -> Wikidata tags
Martin Koppenhoefer
dieterdreist at gmail.com
Fri Nov 25 23:24:53 UTC 2016
sent from a phone
> Il giorno 25 nov 2016, alle ore 22:55, Yuri Astrakhan <yuriastrakhan at gmail.com> ha scritto:
>
> . I am simply converting existing Wikipedia tag into the Wikidata tags, because there is always a 1 to 1 matching between them,
you are checking individually and critically whether the osm objects fit to the wikidata object definitions, or are you just adding wikidata tags for wikipedia articles that are already linked from osm?
Afaik many wikidata objects are linked to several wikipedia articles (because of wp articles being written in different languages). Using wikipedia quite a bit in 3 languages I have found that inconsistencies aren't that rare ("wrong" articles interlinked). Partly this is because wp articles in different languages are mostly not translations but are articles that have varying coverage and levels of detail and focus (i.e. a wikidata object that fits onto an English article does not necessarily fit on the German article that is linked to the English article). Some linked articles are also simply wrong.
One example: In the field of geographic places and settlements it can occur that socio-geographic places and political territorial entities are either mixed in the same article or are split over different articles, and it might also differ between languages (some languages might have 1 article dealing with both, others might have 2 and more). Wikidata seems to have a preference for administrative entities (not sure, it is just a first impression) and related statements in all cases I have seen so fat (even when there's a different object that also deals with the administrative entity).
Misguided wikipedia tags are not very frequent in osm, but they do occur of course. Blindly adding corresponding wikidata tags might make it look more consistent even if the tag is wrong, because both tags seem to confirm each other.
cheers,
Martin
More information about the talk
mailing list