[OSM-talk] Proposed bot edit: remove "surface=no data" and other useless surface values

Imre Samu pella.samu at gmail.com
Fri Jul 29 05:11:32 UTC 2022


> I propose to remove following surface tags by doing an automated edit:

some comments:

1.)  Root cause analysis
It would be nice to have a 2-3 month grace period,
so that those who want to analyse the data in more depth can do so
(tag combinations, distinct users, years, OSM editors, etc.)

2.)  count_tags()==1 and surface=*
Sometimes the surface tag is the only tag on the geometry.
In this case, removing that last tag will not fix the problem.
What is your proposal in this case?  (keep or drop the geometry?)

3.) count_tags()==2 and surface=*  and name=*
To a local editor, I think the two tags together give more information than
the name=* tag alone.
This means that half-corrected data can mean more human work later
(not always, of course).
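
Cases 2 and 3 could be separated out before the bot touches anything.
A minimal sketch, assuming each object's tags are available as a plain
Python dict; the function name, the labels and the value set are only
my illustration (the set is copied from the list in the proposal), not
part of the proposed bot:

```python
# Hypothetical helper: names and labels are illustrative only.
# The values are the null surface values listed in the proposal.
NULL_SURFACE_VALUES = {
    "unclassified", "no data", "unknown",
    "undefined", "unspecified", "Unspecified",
}

def classify_surface_removal(tags):
    """Say what would be left if a null surface=* value were removed."""
    if tags.get("surface") not in NULL_SURFACE_VALUES:
        return "untouched"                 # not a candidate for the bot
    remaining = {k: v for k, v in tags.items() if k != "surface"}
    if not remaining:
        return "would_become_untagged"     # case 2: surface was the only tag
    if set(remaining) == {"name"}:
        return "name_only_left"            # case 3: only name=* remains
    return "safe_to_strip"
```

Objects in the first two special groups could be listed for manual
review instead of being edited automatically.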

4.) Transparency:
It would be nice if there were some kind of continuous data collection
(statistics) on corrections.
Some kind of simple dashboard about the automated edit.
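
The statistics do not need to be complicated. A rough sketch, assuming
the bot keeps a log of (date, removed value) pairs with dates written
as "YYYY-MM-DD"; the log format and the function name are my
assumptions, not something the proposal defines:

```python
from collections import Counter

def summarize_removals(removals):
    """Summarize a log of (date, removed_value) pairs for a dashboard.

    Assumes dates are strings in "YYYY-MM-DD" form (hypothetical log format).
    """
    removals = list(removals)  # allow generators; we iterate twice
    per_value = Counter(value for _, value in removals)
    per_month = Counter(date[:7] for date, _ in removals)  # "YYYY-MM"
    return {
        "total": len(removals),
        "per_value": per_value,
        "per_month": per_month,
    }
```

Publishing these counters after each bot run would already show whether
the null values keep coming back.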

Anyway, the problem is real, and some solution is needed.  :-)

best,
 Imre


Mateusz Konieczny via talk <talk at openstreetmap.org> wrote (on
Thu, Jul 28, 2022, 6:56):

> I propose to remove following surface tags by doing an automated edit:
>
> surface=unclassified
> https://taginfo.openstreetmap.org/tags/surface=unclassified
> surface=no data https://taginfo.openstreetmap.org/tags/surface=no%20data
> surface=unknown https://taginfo.openstreetmap.org/tags/surface=unknown
> surface=undefined https://taginfo.openstreetmap.org/tags/surface=undefined
> surface=unspecified
> https://taginfo.openstreetmap.org/tags/surface=unspecified
> surface=Unspecified
> https://taginfo.openstreetmap.org/tags/surface=Unspecified
>
> and other such null values, explicitly expressing that surface is not
> tagged. Note that
>
> surface=yes
> surface=*
> surface=no
> surface=<different>
> surface=surface
> surface=Maxar
> surface=a
>
> and similar values would NOT be removed, despite being utterly useless
> as a surface=* value, as repair may be possible, or it is possible that
> it is some unusual tagging scheme which is actually useful.
>
> The edit would be automatic, rerun from time to time, split into small
> changesets by geographic area and run by the bot account
> https://www.openstreetmap.org/user/Mateusz%20Konieczny%20-%20bot%20account/history
>
> Why is it useful? It helps newbies avoid becoming confused. It
> protects against such values becoming established, without the drudgery
> that manual cleanup would require. It also makes it easier to
> add missing surface= values.
>
> Why an automatic edit? I have a massive queue (in the thousands and tens
> of thousands) of automatically detectable issues which are not reported by
> mainstream validators, require fixes, and where the fix requires review or
> complete manual cleanup.
>
> There is no point in manual drudgery here, with values that are
> completely useless.
>
> These values do NOT require manual review. If these cases turn out
> to be a useful signal of invalid editing, then I will keep
> reviewing nearby areas where the bot edited.
>
> Yes, the bot edit WILL cause objects to be edited. Nevertheless, as a
> result map data quality will improve.
>
> I have experience with bot edits and will repair any damage caused
> by bot edits that I operate.