[OSM-talk] Identical Duplicate buildings
Wayne Emerson, Jr.
ibemerson at verizon.net
Sat Jan 30 16:01:51 UTC 2021
To analyze this situation I decided to check my local area and examine
the problematic changesets in detail. So far I have found a few
different scenarios:
1.) Duplicate ways uploaded in one changeset which took a long time to
process.
Changeset 92289777 created 65 duplicate parking spaces (created
2020-10-11 00:43:02; closed 2020-10-11 00:46:45) with the iD editor. The
changeset took 3 minutes and 43 seconds to upload, which is unusually
long.
The 2 sets of duplicate parking spaces had a gap in their numbering
sequence:
Set 1 was numbered ID# w857666299 through w857666363
Set 2 was numbered ID# w857666698 through w857666762
These were the only new ways uploaded in this changeset.
So I decided to check the way IDs that fell between the above sequences:
w857666364 from changeset 92289780 (created 2020-10-11 00:43:16; closed
2020-10-11 00:43:17)
w857666697 from changeset 92289805 (created 2020-10-11 00:46:12; closed
2020-10-11 00:46:13)
This helps us to see the timing of the 2 upload attempts of the duplicates:
Set 1 was uploaded between 2020-10-11 00:43:02 and 00:43:16
Set 2 was uploaded between 2020-10-11 00:46:13 and 00:46:45
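The arithmetic behind these figures can be checked with a short Python sketch. It uses only the timestamps and way IDs quoted above; the helper name seconds_between and the variable names are my own, not part of any OSM tool.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S"


def seconds_between(start: str, end: str) -> int:
    """Whole seconds elapsed between two timestamps in FMT."""
    delta = datetime.strptime(end, FMT) - datetime.strptime(start, FMT)
    return int(delta.total_seconds())


# Changeset 92289777: created and closed times as quoted above.
changeset_duration = seconds_between("2020-10-11 00:43:02", "2020-10-11 00:46:45")

# Number of way IDs in each duplicate set (inclusive ranges).
set1_count = 857666363 - 857666299 + 1
set2_count = 857666762 - 857666698 + 1

# Way IDs allocated to other changesets between the two sets.
ids_between = 857666698 - 857666363 - 1

# Time between the end bound of set 1 and the start bound of set 2.
gap_seconds = seconds_between("2020-10-11 00:43:16", "2020-10-11 00:46:13")

print(changeset_duration, set1_count, set2_count, ids_between, gap_seconds)
```

Both sets contain 65 IDs, and the two upload attempts are separated by roughly three minutes in which other mappers' ways were assigned the IDs in between.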
I have sent a message to the mapper of the changeset in question to see
if they can recall what happened during their upload, but have had no
response yet.
2.) Another set of duplicates I found was uploaded in 2 different
changesets, 36 seconds apart, by the iD editor.
https://overpass-api.de/achavi/?changeset=76618180
2019-11-05 Created: 00:36:10 Closed: 00:36:11
4 ways deleted; 10 ways modified; 7 new ways
https://overpass-api.de/achavi/?changeset=76618192
2019-11-05 Created: 00:36:47 Closed: 00:37:06
7 ways modified; 7 new ways (duplicates)
Only 7 out of 10 modified ways were re-uploaded. Each of the modified ways
had the same tags & geometry as the previous changeset but still had
their version number increased by 1.
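A check for this kind of "empty" version bump could look roughly like the Python sketch below. The dictionary layout (version, tags, nodes) and the sample values are hypothetical, chosen only for illustration. Comparing node ID lists is also a simplification: it would not notice a node that was moved without the way itself changing.

```python
def is_noop_version(prev: dict, curr: dict) -> bool:
    """True if curr is the next version of prev with identical tags and node list."""
    return (
        curr["version"] == prev["version"] + 1
        and curr["tags"] == prev["tags"]
        and curr["nodes"] == prev["nodes"]
    )


# Hypothetical way versions, not taken from the changesets above.
v2 = {"version": 2, "tags": {"building": "yes"}, "nodes": [1, 2, 3, 4, 1]}
v3 = {"version": 3, "tags": {"building": "yes"}, "nodes": [1, 2, 3, 4, 1]}
v4 = {"version": 4, "tags": {"building": "house"}, "nodes": [1, 2, 3, 4, 1]}

print(is_noop_version(v2, v3))  # same tags and nodes, version + 1
print(is_noop_version(v3, v4))  # tags changed
```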
The 3 ways from the first changeset that were not re-uploaded:
https://pewu.github.io/osm-history/#/way/569219081
https://pewu.github.io/osm-history/#/way/568979084
https://pewu.github.io/osm-history/#/way/214527867
3.) Potlatch 2 created these duplicate ways:
Changeset: 75951029
original: https://pewu.github.io/osm-history/#/way/737004106
duplicate: https://pewu.github.io/osm-history/#/way/737376779
Changeset: 76010386
original: https://pewu.github.io/osm-history/#/way/737004108
duplicate: https://pewu.github.io/osm-history/#/way/737360688
In these 2 changesets the mapper moved the pre-existing swimming pools
but they were uploaded as duplicates.
Does this provide any more clues about what is going on? One possible
complication for using a bot to remove duplicates is that some duplicates
I found (but not referenced above) were later added to relations, so a
bot would need to pick the duplicate that is not in a relation to delete.
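As a rough illustration of that selection rule, here is a minimal Python sketch. The function name, the way IDs and the set of relation members are all made up. The tie-break of deleting the higher (newer) ID when neither way is in a relation is my own assumption, not an agreed policy.

```python
from typing import Optional, Set


def pick_way_to_delete(way_a: int, way_b: int,
                       relation_members: Set[int]) -> Optional[int]:
    """Return the ID of the duplicate to delete, or None if a human should decide."""
    a_in = way_a in relation_members
    b_in = way_b in relation_members
    if a_in and b_in:
        return None          # both are relation members: leave for manual review
    if a_in:
        return way_b         # keep the way that a relation depends on
    if b_in:
        return way_a
    return max(way_a, way_b)  # assumption: the higher ID is the later duplicate


# Hypothetical IDs: way 1002 has been added to a relation, way 1001 has not.
members = {1002}
print(pick_way_to_delete(1001, 1002, members))
```

A real bot would also have to compare the history and tags of the two ways before deleting anything.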
-Wayne