<div dir="ltr"><div>Dear all,</div><div>Following to our import plan discussion please take a moment to read more details and clarification about the process.</div><div><br></div><div><p class="MsoNormal"><b><span lang="EN-CA">Background:</span></b></p><p class="MsoNormal"><span lang="EN-CA">When Triplinx was first launched, we received a lot of feedback regarding missing addresses. We explored a number of potential solutions (including building our own local copy of the OSM database merged with the StatCan address data), but decided that the best solution for both Triplinx and the OSM community was to add the data directly to OSM. This meant that we had to spend quite a bit of time ensuring that the data was correct, as we knew there was a greater element of accountability when publishing to OSM (as opposed to merging with a local copy of the database).</span></p><p class="MsoNormal"><span lang="EN-CA"> </span></p><p class="MsoNormal"><b><span lang="EN-CA">Our Initial Import:</span></b></p><p class="MsoNormal"><span lang="EN-CA">I think there is still a bit of a misunderstanding about what we actually did. While we did use JOSM to import all of our data at once, we also spent several weeks prior to the upload manually reviewing the data in JOSM to resolve issues such as duplication and node conflicts. Originally we had intended to split the import into much smaller chunks, but JOSM was capable of handling our entire dataset and the modifications to it, so we decided to do a single JOSM upload. While this certainly doesn’t excuse our lack of communication about the process, I do want to emphasize that there was quite a bit of manual review before the upload.</span></p><p class="MsoNormal"><span lang="EN-CA"> </span></p><p class="MsoNormal"><b><span lang="EN-CA">Process of Identifying and Creating Missing Address Ranges:</span></b></p><p class="MsoNormal"><span lang="EN-CA">Our process focused solely on address ranges, and the goal was to identify the gaps in the existing address ranges and populate these gaps with address ranges generated from StatCan Road Network data. We aimed to replicate the structure of the existing CanVec address range data.</span></p><p class="MsoNormal"><span lang="EN-CA"> </span></p><p class="MsoNormal"><span lang="EN-CA">At a high level, our process of identifying gaps in the address ranges is summarized below:</span></p><p class="MsoNormal"><span lang="EN-CA"> </span></p><p class="" style="text-indent: -0.25in;"><span lang="EN-CA" style="font-family:Symbol">·<span style="font-stretch:normal;font-size:7pt;font-family:'Times New Roman'"> </span></span><span lang="EN-CA">For each side of each StatCan road segment with a valid address range (start value and end value exist and are different):</span></p><p class="" style="margin-left:1in"><span lang="EN-CA" style="font-family:'Courier New'">o<span style="font-stretch:normal;font-size:7pt;font-family:'Times New Roman'"> </span></span><span lang="EN-CA">Create buffer to the appropriate side (left or right) of the street segment</span></p><p class="" style="margin-left:1in"><span lang="EN-CA" style="font-family:'Courier New'">o<span style="font-stretch:normal;font-size:7pt;font-family:'Times New Roman'"> </span></span><span lang="EN-CA">Find all OSM address ranges that fall within the buffer and compute the intersection of these ranges and the buffer (extract only the portion within the buffer)</span></p><p class="" style="margin-left:1in"><span lang="EN-CA" style="font-family:'Courier New'">o<span style="font-stretch:normal;font-size:7pt;font-family:'Times New Roman'"> </span></span><span lang="EN-CA">For each OSM address range within the buffer:</span></p><p class="" style="margin-left:1.5in"><span lang="EN-CA" style="font-family:Wingdings">-<span style="font-stretch:normal;font-size:7pt;font-family:'Times New Roman'"> </span></span><span lang="EN-CA">Localize the start and end of each address range to the street segment</span></p><p class="" style="margin-left:1.5in"><span lang="EN-CA"><span style="font-stretch:normal"><font face="Wingdings">-</font></span><span style="font-family:'Times New Roman';font-stretch:normal;font-size:7pt"> </span></span><span lang="EN-CA">Compute the distance along the segment from the localized start coordinate to the localized end coordinate</span></p><p class="" style="margin-left:1in"><span lang="EN-CA" style="font-family:'Courier New'">o<span style="font-stretch:normal;font-size:7pt;font-family:'Times New Roman'"> </span></span><span lang="EN-CA">If the sum of the distances divided by the length of the segment is less than the threshold (we used 0.2):</span></p><p class="" style="margin-left:1.5in"><span lang="EN-CA" style="font-family:Wingdings">-<span style="font-stretch:normal;font-size:7pt;font-family:'Times New Roman'"> </span></span><span lang="EN-CA">This address range is poorly represented by existing OSM data, and is a good candidate to be added</span></p><p class="" style="margin-left:1.5in"><span lang="EN-CA" style="font-family:Wingdings">-<span style="font-stretch:normal;font-size:7pt;font-family:'Times New Roman'"> </span></span><span lang="EN-CA">The segment is shifted to the appropriate side and trimmed (to replicate the structure of the CanVec data) and is added to an XML file</span></p><p class="" style="margin-left:1in"><span lang="EN-CA" style="font-family:'Courier New'">o<span style="font-stretch:normal;font-size:7pt;font-family:'Times New Roman'"> </span></span><span lang="EN-CA">Else:</span></p><p class="" style="margin-left:1.5in"><span lang="EN-CA" style="font-family:Wingdings">-<span style="font-stretch:normal;font-size:7pt;font-family:'Times New Roman'"> </span></span><span lang="EN-CA">This address is likely represented by existing OSM address data</span></p><p class="MsoNormal"><span lang="EN-CA"> </span></p><p class="MsoNormal"><b><span lang="EN-CA">Duplication:</span></b></p><p class="MsoNormal"><span lang="EN-CA">No process is perfect, so I certainly wouldn’t expect that there will be no duplication in our data. Our process is particularly susceptible to duplication when the addresses are represented by existing address nodes (rather than interpolated ways). That being said I believe we have manually reviewed our data extensively to remove this sort of duplication, and intend to continue to do so if/when the import is complete.</span></p><p class="MsoNormal"><span lang="EN-CA"> </span></p><p class="MsoNormal"><b><span lang="EN-CA">Benefit To The Community:</span></b></p><p class="MsoNormal"><span lang="EN-CA">The vast majority of the address range data seems to come from some version of the CanVec data. While this data is reasonably comprehensive in some areas, it also has quite a few gaps. By adding the StatCan data, we can fill these some of these gaps in a consistent manner with a single data source. From our perspective, adding address data for areas/streets that don’t have this data is a step in the right direction.</span></p><p class="MsoNormal"><span lang="EN-CA"> </span></p><p class="MsoNormal"><span lang="EN-CA">Please find on attachment a portion of our generated data. For details about import plan feel free to see:</span></p><p class="MsoNormal"><span lang="EN-CA"><br></span></p><p class="MsoNormal"><span lang="EN-CA"><a href="https://wiki.openstreetmap.org/wiki/Triplinx_Metrolinx_Import_Plan">https://wiki.openstreetmap.org/wiki/Triplinx_Metrolinx_Import_Plan</a><br></span></p><p class="MsoNormal"><span lang="EN-CA"><a href="https://wiki.openstreetmap.org/wiki/User_talk:Triplinx.canada">https://wiki.openstreetmap.org/wiki/User_talk:Triplinx.canada</a><br></span></p><p class="MsoNormal"><span lang="EN-CA"><a href="http://wiki.openstreetmap.org/wiki/Import/Catalogue">http://wiki.openstreetmap.org/wiki/Import/Catalogue</a><br></span></p><p class="MsoNormal"><span lang="EN-CA"><a href="http://wiki.openstreetmap.org/wiki/Contributors#Triplinx_Metrolinx">http://wiki.openstreetmap.org/wiki/Contributors#Triplinx_Metrolinx</a><br></span></p><p class="MsoNormal"><span lang="EN-CA"><br></span></p><p class="MsoNormal"><span lang="EN-CA">For those who wants to check the initial changeset that are reverted by DWG:</span></p><p class="MsoNormal"><span lang="EN-CA"><a href="https://www.openstreetmap.org/changeset/36946223">https://www.openstreetmap.org/changeset/36946223</a><br><a href="https://www.openstreetmap.org/changeset/36944498">https://www.openstreetmap.org/changeset/36944498</a><br><a href="https://www.openstreetmap.org/changeset/36943764">https://www.openstreetmap.org/changeset/36943764</a><br><a href="https://www.openstreetmap.org/changeset/36942733">https://www.openstreetmap.org/changeset/36942733</a><br><a href="https://www.openstreetmap.org/changeset/36940905">https://www.openstreetmap.org/changeset/36940905</a><br><a href="https://www.openstreetmap.org/changeset/36939163">https://www.openstreetmap.org/changeset/36939163</a><br></span></p><p class="MsoNormal"><span lang="EN-CA"><br></span></p><p class="MsoNormal"><span lang="EN-CA">We are look forward to having your feedback,</span></p><p class="MsoNormal"><span lang="EN-CA"><br></span></p><p class="MsoNormal"><span lang="EN-CA">Sincerely yours,</span></p><p class="MsoNormal"><span lang="EN-CA"><br></span></p><p class="MsoNormal"><span lang="EN-CA">Mojgan</span></p><p class="MsoNormal"><span lang="EN-CA"><br></span></p></div><div><div class="gmail_signature"><div dir="ltr"><div class="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div style="font-size:12.8000001907349px"><span style="font-family:georgia,serif"><b><font color="#666666">Mojgan (Amaneh) Jadidi, Ph.D.</font></b></span></div><div style="font-size:12.8000001907349px"><i><b><font color="#999999">Intern, Applied Research & Corporate Monitoring</font></b></i></div><div style="font-size:12.8000001907349px"><i style="font-size:12.8000001907349px"><font color="#999999">Planning & Policy <b>| </b></font></i><i style="font-size:12.8000001907349px"><span style="font-family:Arial,sans-serif"><font color="#999999" size="2">Metrolinx | 97 Front Street West, Toronto, ON, M5J 1E6 | T: <a href="tel:416-202-5844" value="+14162025844" target="_blank">416-202-5844</a></font></span></i><font face="arial, helvetica, sans-serif" color="#999999"><i><b><br></b></i></font></div><div style="font-size:12.8000001907349px"><i style="color:rgb(153,153,153);font-family:arial,helvetica,sans-serif;font-size:12.8000001907349px"><b><br></b></i></div><div style="font-size:12.8000001907349px"><i style="color:rgb(153,153,153);font-family:arial,helvetica,sans-serif;font-size:12.8000001907349px"><b>Postdoctoral Research Fellow</b></i><font color="#999999"><font face="arial, helvetica, sans-serif"><i><br></i></font></font></div><div style="font-size:12.8000001907349px"><font color="#999999"><font face="arial, helvetica, sans-serif"><i>GeoICT Lab | York University | 4700 Keele St, Toronto, ON, </i></font><span style="font-size:13px;line-height:16.1200008392334px"><i>M3J 1P3</i></span></font></div></div></div></div></div></div></div></div></div></div></div></div></div></div>
</div>