<html><head></head><body><div><br>I think most of the reduction is down to me working through the obvious formatting issues last weekend. I imported the Geofabrik GB extract into an osm2pgsql database and looked at the addr:postcode and postal_code tags. The database made it easy to get the relevant OSM object IDs, so I could download each object in JOSM and fix it in turn, using some judgement along the way.<br><br></div>
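<div>For anyone wanting to repeat a formatting pass like this, the kind of check involved can be sketched roughly as below. This is only a minimal illustration and not the actual query used: the pattern is a simplified assumption about what a full UK postcode looks like, and the names POSTCODE_PATTERN, normalise and classify are made up for the example.<br><br></div>
<pre>
```python
import re

# Simplified pattern for a full UK postcode such as "EH1 1AA" or "SW1A 2AA".
# Assumption: this is looser than the official rules and ignores special
# cases such as "GIR 0AA".
POSTCODE_PATTERN = re.compile(r"[A-Z]{1,2}[0-9][0-9A-Z]? [0-9][A-Z]{2}")


def normalise(value: str) -> str:
    """Uppercase, drop all whitespace, then put one space before the last three characters."""
    compact = "".join(value.split()).upper()
    if len(compact) > 3:
        return compact[:-3] + " " + compact[-3:]
    return compact


def classify(value: str) -> str:
    """Sort a tag value into 'ok', 'formatting' (mechanically fixable) or 'needs review'."""
    if POSTCODE_PATTERN.fullmatch(value):
        return "ok"
    if POSTCODE_PATTERN.fullmatch(normalise(value)):
        return "formatting"
    return "needs review"
```
</pre>
<div>Values classed as "formatting" only differ in case or spacing and can be corrected to the output of normalise; anything classed as "needs review" still wants a human to look at it.<br><br></div>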
<div>I've also fixed a few postcodes that were a long way from where OS OpenData Code-Point Open places their centroids -- things like objects in Edinburgh (EH) mistakenly given an Enfield (EN) postcode -- again identified using my import into PostGIS. Where a case was less clear, I've added notes so that local mappers can review it.<br><br></div>
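<div>The distance comparison can be sketched along the following lines. This is a rough stand-in written in plain Python rather than the PostGIS query itself, and it rests on some assumptions: the Code-Point Open centroids have already been converted to WGS84 latitude/longitude, the 1 km default simply mirrors the threshold in the quoted report below, and the input layout and function names are hypothetical.<br><br></div>
<pre>
```python
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; adequate for a rough check


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two WGS84 points."""
    phi1, phi2 = radians(lat1), radians(lat2)
    dphi = phi2 - phi1
    dlam = radians(lon2 - lon1)
    a = sin(dphi / 2) ** 2 + cos(phi1) * cos(phi2) * sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_KM * asin(sqrt(a))


def far_from_centroid(objects, centroids, threshold_km=1.0):
    """Return (osm_id, postcode, distance_km) rows beyond the threshold, farthest first.

    objects:   iterable of (osm_id, postcode, lat, lon) tuples (hypothetical layout)
    centroids: dict mapping postcode to a (lat, lon) tuple
    """
    flagged = []
    for osm_id, postcode, lat, lon in objects:
        centroid = centroids.get(postcode)
        if centroid is None:
            continue  # no centroid known for this postcode, so nothing to compare
        distance = haversine_km(lat, lon, centroid[0], centroid[1])
        if distance > threshold_km:
            flagged.append((osm_id, postcode, round(distance, 1)))
    return sorted(flagged, key=lambda row: row[2], reverse=True)
```
</pre>
<div>An EH/EN mix-up shows up at the top of such a list with a distance of several hundred kilometres, whereas objects a little over the threshold are more likely to be genuine outliers within a large postcode unit.<br><br></div>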
<div>Gregory<br><br><br></div>
<div class="gmail_quote" >On 16 Nov 2016, at 20:36, "Robert Whittaker (OSM lists)" <<a href="mailto:robert.whittaker+osm@gmail.com" target="_blank">robert.whittaker+osm@gmail.com</a>> wrote:<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
<pre class="blue">My daily report of addr:postcode value errors at<br><a href="http://robert.mathmos.net/osm/postcodes/osm-errors.html">http://robert.mathmos.net/osm/postcodes/osm-errors.html</a> seems to be<br>being used by at least one other person, since the number of errors<br>showing there has dropped significantly now. The page is regenerated<br>daily, but unfortunately the data hasn't been refreshed for a few days<br>now because the source data on which it relies (the Geofabrik GB<br>extract via the GB Taginfo instance) hasn't been updated in that time.<br><br>I've also started playing with a second report that lists location<br>discrepancies of postcode-tagged OSM objects compared with the<br>postcode centroid locations in Code-Point Open. This is less of an<br>exact science, since postcodes will not all be located at the centroid<br>for that postcode unit, and the allowable deviations vary depending on<br>the unit. However, you can find an initial list of postcodes that are<br>more than 1km from their official centroid at<br><a href="http://robert.mathmos.net/osm/postcodes/location-errors.cgi">http://robert.mathmos.net/osm/postcodes/location-errors.cgi</a> -- there<br>are about 1500 of them, although quite a few are in groups where the<br>same postcode is on multiple neighbouring objects. Presumably most of<br>the 1500 will be cases of a typo being made by an editor or in the<br>data source they used, so they'll need manual checking and updating.<br><br>If anyone fancies looking at any of these please feel free to dive in.<br>If you find any false positives (i.e. errors in the processing, or<br>postcodes that genuinely are that far from their centroid), please let<br>me know, and I'll see if there's anything that can be improved in the<br>tool, or if they need to be marked manually as ok.<br><br>Robert.<br></pre></blockquote></div></body></html>