[OSM-talk] Proposed bot edit: remove "surface=no data" and other useless surface values
Mateusz Konieczny
matkoniecz at tutanota.com
Sun Sep 11 13:19:39 UTC 2022
Jul 29, 2022, 07:11 by pella.samu at gmail.com:
> > I propose to remove the following surface tags by doing an automated edit:
>
> some comments:
>
> 1.) Root cause analysis
> It would be nice to have a 2-3 month grace period,
> so that those who want to analyse the data in more depth can do so
> (tag combinations, distinct users, years, OSM editors, etc.).
>
If anyone did this and has comments: please comment now.
>
> 2.) count_tags()==1 and surface=*
> Sometimes the surface tag is the only tag on the geometry.
> In this case, removing that last tag will not fix the problem.
> What is your proposal in this case (keep or drop the geometry)?
>
The bot will edit tags only. This may cause some elements to be listed
by various QA tools and so become more easily detectable errors,
which is a good thing.
In other words: no geometries will be deleted.
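
To illustrate the tag-only behaviour, here is a minimal sketch in Python. It is
not the actual bot code: the function name strip_null_surface and the
NULL_SURFACE_VALUES set are hypothetical, and the set holds only the six values
explicitly listed in the proposal quoted further down.

```python
# Hypothetical sketch, not the real bot implementation.
# The six values come from the proposal quoted later in this thread.
NULL_SURFACE_VALUES = {
    "unclassified",
    "no data",
    "unknown",
    "undefined",
    "unspecified",
    "Unspecified",
}


def strip_null_surface(tags):
    """Return a copy of `tags` without a null-like surface value.

    Only the tag is dropped. The element itself is never deleted,
    even if the returned dict ends up empty.
    """
    if tags.get("surface") in NULL_SURFACE_VALUES:
        return {key: value for key, value in tags.items() if key != "surface"}
    return dict(tags)


# An element whose only tag was surface=unknown becomes untagged,
# which QA tools can then report.
print(strip_null_surface({"surface": "unknown"}))
# Values such as surface=yes are left alone.
print(strip_null_surface({"highway": "residential", "surface": "yes"}))
```

An element that ends up with no tags stays in the database, so the problem
becomes visible to QA tools instead of being hidden by a meaningless value.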
> 3.) count_tags()==2 and surface=* and name=*
> To a local editor, I think the two tags together give more information than the name=* tag alone.
> This means that half-corrected data can mean more human work later.
> ( not always of course )
>
Can you give an example of real data where this happens? I cannot imagine
a case where surface=undefined would be useful information.
>
> 4.) Transparency:
> It would be nice if there were some kind of continuous data collection (statistics) on corrections.
> Some kind of simple dashboard about the automated edit.
>
> Anyway, the problem is real, and some solution is needed. :-)
>
This would require a substantial amount of coding.
Anyone interested: feel free to parse the changesets and visualise them.
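
For anyone who wants a starting point, a rough sketch in Python of such parsing
follows. It assumes the changeset list is available as XML with created_at and
changes_count attributes on each changeset element, as the OSM API returns it.
The function name edits_per_day is hypothetical and the SAMPLE data is made up
for illustration, not real bot output.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def edits_per_day(changesets_xml):
    """Sum changes_count per calendar day from a changeset list in XML."""
    root = ET.fromstring(changesets_xml)
    per_day = Counter()
    for changeset in root.iter("changeset"):
        # created_at is an ISO 8601 timestamp; the first 10 characters are the date.
        day = changeset.get("created_at", "")[:10]
        per_day[day] += int(changeset.get("changes_count", "0"))
    return dict(per_day)


# Invented sample data, only to show the expected input shape.
SAMPLE = """<osm>
  <changeset id="1" created_at="2022-09-10T08:00:00Z" changes_count="40"/>
  <changeset id="2" created_at="2022-09-10T09:30:00Z" changes_count="25"/>
  <changeset id="3" created_at="2022-09-11T07:15:00Z" changes_count="10"/>
</osm>"""

print(edits_per_day(SAMPLE))
```

The per-day totals could then be fed into any plotting tool to get the simple
dashboard asked for above.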
> best,
> Imre
>
>
> Mateusz Konieczny via talk <talk at openstreetmap.org> wrote (on Thu, Jul 28, 2022, 6:56):
>
>> I propose to remove the following surface tags by doing an automated edit:
>>
>> surface=unclassified <https://taginfo.openstreetmap.org/tags/surface=unclassified>
>> surface=no data <https://taginfo.openstreetmap.org/tags/surface=no%20data>
>> surface=unknown <https://taginfo.openstreetmap.org/tags/surface=unknown>
>> surface=undefined <https://taginfo.openstreetmap.org/tags/surface=undefined>
>> surface=unspecified <https://taginfo.openstreetmap.org/tags/surface=unspecified>
>> surface=Unspecified <https://taginfo.openstreetmap.org/tags/surface=Unspecified>
>>
>> and other such null values, explicitly expressing that surface is not
>> tagged. Note that
>>
>> surface=yes
>> surface=*
>> surface=no
>> surface=<different>
>> surface=surface
>> surface=Maxar
>> surface=a
>>
>> and similar values would NOT be removed, despite being utterly useless
>> as surface=* values, because repair may be possible, or they may belong
>> to some unusual tagging scheme which is actually useful.
>>
>> The edit would be automatic, rerun from time to time, split into small
>> changesets by geographic area, and run by the bot account
>> https://www.openstreetmap.org/user/Mateusz%20Konieczny%20-%20bot%20account/history
>>
>> Why is it useful? It helps newbies avoid becoming confused. It
>> protects against such values becoming established, without the drudgery
>> that manual cleanup would require. It also makes it easier to
>> add missing surface=* values.
>>
>> Why an automatic edit? I have a massive queue (in the thousands and tens of
>> thousands) of automatically detectable issues which are not reported by
>> mainstream validators, which require fixes, and where the fix requires
>> review or complete manual cleanup.
>>
>> There is no point in manual drudgery here, as these values are completely useless.
>>
>> These values do NOT require manual review. If these cases turn out
>> to be a useful signal of invalid editing, then I will keep
>> reviewing nearby areas where the bot edited.
>>
>> Yes, the bot edit WILL cause objects to be edited. Nevertheless, as a result
>> map data quality will improve.
>>
>> I have experience with bot edits and will repair any damage caused
>> by bot edits that I operate.