[Talk-ca] duplicate address data
Gerd Petermann
gpetermann_muenchen at hotmail.com
Thu Mar 26 19:48:59 UTC 2015
Hi Paul,
up to now I have not edited much data in OSM; in particular, I have
never done a mass update. My idea is to create a list of identical
objects which could be removed without losing information.
Something like
duplicate addr:interpolation way <id1> <id2>
I wonder why nobody has coded a bot for this?
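
A minimal sketch of how such a list could be produced, assuming the ways have already been loaded into plain Python dicts. The dict layout ('id', 'tags', 'coords') and the function name are hypothetical, not part of any OSM tool. It compares only the way tags (ignoring source) and the geometry; the addr:housenumber tags on the end nodes would still need to be checked before anything is deleted.

```python
from collections import defaultdict


def find_duplicate_interpolations(ways):
    """Return report lines for interpolation ways that look identical.

    ways: iterable of dicts with the (assumed) keys
      'id'     -> int
      'tags'   -> dict of tag key/value strings
      'coords' -> list of (lat, lon) pairs in way order
    """
    groups = defaultdict(list)
    for way in ways:
        if "addr:interpolation" not in way["tags"]:
            continue
        # Ignore 'source' so copies from different CanVec releases compare equal.
        tags = tuple(sorted((k, v) for k, v in way["tags"].items() if k != "source"))
        geometry = tuple(map(tuple, way["coords"]))
        groups[(tags, geometry)].append(way["id"])

    lines = []
    for ids in groups.values():
        ids.sort()
        # Report every further copy against the lowest id in the group.
        for other in ids[1:]:
            lines.append(f"duplicate addr:interpolation way {ids[0]} {other}")
    return lines
```

The output is only a candidate list for manual review, not input for an automated deletion.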
Gerd
To: gpetermann_muenchen at hotmail.com
CC: talk-ca at openstreetmap.org
From: penorman at mac.com
Subject: Re: [Talk-ca] duplicate address data
Date: Thu, 26 Mar 2015 19:38:48 +0000
On Mar 26, 2015, at 11:55 AM, Gerd Petermann <gpetermann_muenchen at hotmail.com> wrote:
Example: The ways
http://www.openstreetmap.org/way/99649911
and
http://www.openstreetmap.org/way/83504524
One has source=NRCan-CanVec-7.0, the other source=CanVec 6.0 - NRCan
Is there a good reason for this redundancy?
If not, what is the best way to remove these duplicates?
I can think of different ways:
1) keep only the oldest entry
2) keep only the newest entry
3) keep the older and add a note that the data is confirmed by NRCan-CanVec-7.0

This is a case where someone imported CanVec improperly without resolving conflicts with existing data.
If both interpolation ways are the same, it doesn't matter which is deleted. You could investigate the changeset the more recent one came from and see if the entire thing needs reverting.
If they differ, look to see if one has been edited and keep that one. If neither has been edited, I'd presume CanVec 7 to be more accurate than CanVec 6 in the absence of any other information.
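
The rule of thumb above could be sketched roughly as follows. This is only an illustration under stated assumptions: the dict keys, the function names, and the idea that "edited" means an object version greater than 1 are all hypothetical, and the source check simply looks for the two spellings of the source tag quoted in this thread.

```python
def is_canvec7(source):
    """Assumed test: the source tag mentions CanVec 7 in either spelling."""
    return "CanVec-7" in source or "CanVec 7" in source


def choose_way_to_keep(a, b, identical):
    """Return the id of the way to keep, or None if a human should decide.

    a, b: dicts with the (assumed) keys 'id', 'version', 'source'.
    identical: True if both ways carry the same data.
    """
    if identical:
        # It does not matter which one is deleted; keeping 'a' is arbitrary.
        return a["id"]

    # Assumption: version > 1 means the way was edited after the import.
    a_edited = a["version"] > 1
    b_edited = b["version"] > 1
    if a_edited != b_edited:
        return a["id"] if a_edited else b["id"]

    if not a_edited:
        # Neither was edited: presume CanVec 7 is more accurate than CanVec 6.
        a7, b7 = is_canvec7(a["source"]), is_canvec7(b["source"])
        if a7 != b7:
            return a["id"] if a7 else b["id"]

    # Both edited, or the sources give no preference: leave for manual review.
    return None
```

Cases the rule does not cover, such as both ways having been edited, return None so that nothing is decided automatically.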