[OSM-talk] Worldwide non-surveyed tag edits
John Baker
rovastar at hotmail.com
Thu Jun 12 13:33:39 UTC 2014
There are always bad practice examples and they are good ones. Guessing amenities/features from names can be problematic as can spell checking. It would be interesting to know if community centre example overwrote "amenity=social_outreach" which is worse.
One major bone of contention is that you comment of:
"The policy of "if no one complains, then it's ok" is in place so people
can actually do gardening without needing to discuss everything before."
Is it? That seems to be an unwritten rule. It for sure is not in the policy document.
http://wiki.openstreetmap.org/wiki/Mechanical_Edit_Policy
If it was then fine but it is not.
Lets establish a few facts.
1) Gardening happens frequently. (I am surprised actually how much given the support/admissions on this list)
2) Gardening happens without people following the policy. (you can see this because there is very little documentation/mailing list chat about all these edits)
3) Because of this there is actually visibility of the gardening happening and hence the ability to spot error before they happen.
As said very few follow the policy when doing gardening it might be useful to consider why that is.
I consider there is too much bureaucracy with the current policy and suggest that that should change.
That not should be seen as allow more mechanical edits/gardening but gaining valuable visibility of the changes.
I would probably consider "if no one complains, then it's ok" unwritten rule is too soft and would like to foster a culture of people who do the gardening to regularly communicate by simple means before untaking them and a culture of those questioning them to be a little more understanding that they are trying to do good and help them see potential problems.
And yes warn/block/punish those that do not comply but it has to be easier to comply to begin with.
Lets take an real world example from yesterday I did and posted in IRC (to see if that would be a suitable communication channel)
sport=crazy_golf tag change it to sport=miniature_golf
crazy golf is what we call it in the UK others call it miniature golf
http://en.wikipedia.org/wiki/Miniature_golf
Now obviously I got the usual responses. But it did get noticed that was the point and picking a minor problematic example.
however IMHO the correct response should/could have been something like:
"Changing sport=crazy_golf to sport=miniature_golf is meaningless and possibly wrong. Better would be to leave the tag alone (some might want that) and additionally tag with the established tag http://wiki.openstreetmap.org/wiki/Tag:leisure%3Dminiature_golf"
I think this is more useful it improves the map gives proper feedback and improves the database. (I in fact did that above looking at aerial imagery to confirm those and left alone those that didn't.)
From: john.packer7 at gmail.com
Date: Thu, 12 Jun 2014 09:36:55 -0300
To: talk at openstreetmap.org
Subject: Re: [OSM-talk] Worldwide non-surveyed tag edits
Friends,
Some examples of bad and good gardening, and how unpredictable this business can be:
1. Once I searched for wikidata values without "Q" as a prefix. I found some values (not many), and to guarantee the quality of my work, I opened each object's wikidata page to verify the new value would be the correct one. Turns out that one of the objects had a tag "wikidata=1960", and it didn't correspond to "wikidata=Q1960". It was in another language, so I couldn't find the correct one, so I messaged the user that added this data, and he verified my assumption: it was supposed to be start_date instead of wikidata;
2. Once, some guy removed around 80 websites or so in South America. They were all websites that Keepright reported as not found or something equivalent. I was annoyed by this change, and asked him to revert his changeset, which he quickly complied. At first the logic seems solid on this kind of change, but we never know if the website will remain offline or it was just temporary, if Keepright (or the gardener himself) simply can't access the website from his location, if there is a typo in the url (e.g. ".com" instead of ".org"), etc. To aggravate the problem, as some of you may know, recently I found out there were some URLs that were "corrupted" by invisible characters (see [1]). They probably would be removed by these kind of changes, even though they only need a simple fix.
3. In the beginning of this year, there was some commotion in the brazilian community because some guy started manually using a spell-checker in street's names in Brazil. It did fix some hard to find mistakes (like Kubitscheck instead of Kubitcheck), but also could introduce errors (like it did on a street which had an indigenous name). The user was an considerably experienced mapper, and was tech-savvy, but there was no way to guarantee this kind of work would be free of errors, so he stopped doing that.
This kind of correction can be useful in some select cases: Here in Brazil, there are some official maps that are devoid of diacritics, so it would be completely valid to manually run an ortographic corrector on streets that were added from these maps (and not by survey).
4. A friend of mine, which is a really experienced mapper, sometimes go around gardening the map. Once I was reviewing one of his changesets, and he added a tag amenity=community_center to a place with "Community Center" in the name, even though it was actually "amenity=social_outreach". Usually he does good gardening, but it's just to show that even an experienced mapper makes mistakes (just imagine what an enthusiastic OSM newbie that wants to go on a gardening spree worldwide can do)
Besides the numerous cases of users that try to "tidy up" some objects, and unintentionally make the object lose accuracy.
What I mean to conclude:
Gardening (without local knowledge) IS a tricky business, and it should be treated like so.
It is far too easy to change another user's work, and it is orders of magnitude more expensive to actually go and collect the data.
The policy of "if no one complains, then it's ok" is in place so people can actually do gardening without needing to discuss everything before.
But unless you want to risk having your work reverted (and therefore lose the time spent on it), you should probably discuss changes that might affect a considerable number of users, are mechanic changes, may be polemic, etc.
Oh, and personally I think it's every gardener's DUTY to make meaningful changeset comments.
[1]: http://overpass-turbo.eu/s/3eJ
Roland,
> > Please go to taginfo
> https://taginfo.openstreetmap.org/
> choose "highway" there, browse through the values and tell me which of
> the values you would like to change.
>
Post processing that you spoke is about process the bad data into a readable form after the fact.
It is used when you do not know what state the data is in and fixing the bad data.
Potentially this could be anything.
Also the data has already been fixed by the mech edits/gardening that has been done before and is done on a regular basis. I know I have done residential typos in the past. So many for this example might not occur still however for post processing you need to consider that they might and do happen.
In theory if the data is perfect to begin with there is no need for any post processing.
But searching on taginfo highlight the potential in-correctness of the data. There are currently 1049 values for highway.
I could reverse your point and say are all of these 1049 values correct?
For example is this correct?
highway = "Bandar Road"
https://taginfo.openstreetmap.org/tags/highway=Bandar%20Road#overview
which is here:
http://www.openstreetmap.org/#map=16/16.5072/80.6299
Now how do we fix that with any form of mechanical editing?
Do I follow all the rules? wait a few weeks for a consensus on the import mailing list?
Now I could seeing that just fix it to
name = "Bandar Road"
highway = unclassified (I could at aerial imagery and guess the correct type)
The point of this type of gardening is to fix errors like this and make a better map. Some people are happy to leave that there until a local mapper fixes it as it will ruin the local community if I fix. They will be threats or actual blocks/bans etc if any fixes this that has no local knowledge and does this in a mechanical way. Even using in conjunction with aerial imagery may not be ok.
> Couldn't this be even worse than applying those changes directly in
> the database?
>The postprocessing refers to the final data consumer, not the map on
>osm.org. The map on osm.org is specifically designed for giving mappers
>feedback. Therefore, it has no such postprocessing and will never have.
The map at osm.org does have post processing to varying degrees most of it simple stuff (it is a bridge if it is true, yes or 1) and is a data consumer just as much as anyone else. Creating maps is probably the greatest data consumer use of openstreetmap data. The map is designed for various reasons (and that changes over time) one of them is mappers feedback.
Post processing is a balance of doing everything and there is an overhead. Not much ignoring the case of something like that some might remove whitespace around a tag but beyond that there is very little most of time the solution is fixing the data.
Thank you for your ongoing discussion.
_______________________________________________
talk mailing list
talk at openstreetmap.org
https://lists.openstreetmap.org/listinfo/talk
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstreetmap.org/pipermail/talk/attachments/20140612/ea9c47d7/attachment.html>
More information about the talk
mailing list