[talk-au] Possible mechanical edit of address names

Warin 61sundowner at gmail.com
Sun Aug 20 08:22:20 UTC 2023


Hi

I'm considering doing a mechanical edit of address names.

There are over 3,000 conflicts between address names and the local road 
names. The mechanical edit would target capitalization of, for example, 
'street' to 'Street'.

Source of the conflicts comes from rendering errors which are then made 
into a comma delimited list and the features downloaded by JOSM and 
saved in to a .osm file. The text to change would be limited to a list 
of words with the text starting with a space character and ending with a 
' - this removes the possibility of changing a word that contains the 
searched for text as part of an actual word.

The words to search and change would be those common such as 'street', 
'road', 'highway', 'drive', 'crescent' etc.

I'm intending to write it in Python though I am not a python programmer 
so there will be a learning curve delay.

Any thoughts particularly if you think this is a "bad idea" or some 
possible improvement.

I did write a similar program in another language, copyright so I cannot 
share it) that had echoed the change to the screen and required a 
keyboard confirmation to complete the change.. I found that to be both 
exhausting and to have found no errors. That only involved a few 
hundred, this one looks to be much larger. Of course that will be checks 
performed to ensue it does as intended.


-----------------

I have already done some 100+ of these, a good proportion are the above 
capitalization problem. Some are things like O'riely that should be 
O'Riely .. and some are the Mc or Mac problem that does not have a set 
rule. Of course some don't fall into a category. But if 1,000 or more 
can be fixed mechanically then I'd prefer to go that way.

As for the Mc/Mac problem I'd go with the OSM road name as that is 
displayed where as the address name is hidden so the road name is more 
likely to be corrected and therefore more likely to be correct, however 
this would be done manually so is not the main topic here.




More information about the Talk-au mailing list