<div dir="ltr"><div><br></div><div>Hi Martin, I have limited experience with imports, but I helped with the building import in Portland, OR. I looked at your Westport file. A few things I would pass on:</div><div><br></div><div>- If a field is null, I don't think the tag should be in the import. Null fields make the files unnecessarily large and provide no value.</div><div>- We removed the buildings that conflicted with existing buildings in advance.</div><div>- We had a separate process to handle buildings with multiple addresses. <br></div><div>- We tiled the geography into squares of no more than 500 buildings each. This makes it possible for an import volunteer to have a workflow where they import a square and then review every building for the conflicts you mentioned (roads crossing buildings, etc.). Importing more buildings at one time makes the review very difficult, especially if there are lots of such corrections.<br></div><div><br></div><div>Our GitHub repository is here, in case there is anything else of value to you: <a href="https://github.com/pdxosgeo/pdxbldgimport">https://github.com/pdxosgeo/pdxbldgimport</a>, and our wiki page is here: <a href="https://wiki.openstreetmap.org/wiki/Portland,_OR_Bldg_import">https://wiki.openstreetmap.org/wiki/Portland,_OR_Bldg_import</a></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, Sep 10, 2020 at 6:31 PM Martin Machyna <<a href="mailto:machyna@gmail.com">machyna@gmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">To update this thread: I have integrated addresses from the CT Open Data dataset and also updated the wiki page (<a href="https://wiki.openstreetmap.org/wiki/Connecticut/Western_COG_Building_Import" target="_blank">https://wiki.openstreetmap.org/wiki/Connecticut/Western_COG_Building_Import</a>).<div><br></div><div>The whole dataset can be viewed here: <a 
href="https://drive.google.com/file/d/10hUl09WSmK-I8h1hkMlELIGwDYsEziSe/view?usp=sharing" target="_blank">https://drive.google.com/file/d/10hUl09WSmK-I8h1hkMlELIGwDYsEziSe/view?usp=sharing</a><br>For quick loading I also made a subset for one town (Westport): <a href="https://drive.google.com/file/d/1oMSbHXpPY5eLSHhGlikGy2uudyq3VyoV/view?usp=sharing" target="_blank">https://drive.google.com/file/d/1oMSbHXpPY5eLSHhGlikGy2uudyq3VyoV/view?usp=sharing</a><br><br>If there is any issue, please let us know.<br><br>As a side note on the suitability of the CT Open Data buildings for import: I found a way to simplify all buildings automatically in Python.<br>Here is a quick before/after comparison: <a href="https://files.slack.com/files-pri/T029HV94T-F01AD0FNFD3/simplified.png" target="_blank">https://files.slack.com/files-pri/T029HV94T-F01AD0FNFD3/simplified.png</a> (you need a Slack account).<br>It doesn't look so bad, and we could consider it for the next round of import.<br><br>Just for future reference, in case someone needs to do the same, the Python code is:<br><br>import geopandas as gpd<br>from shapely import speedups  # only needed with Shapely versions before 2.0</div><div>speedups.enable()</div><div><br># Load the building footprints (with addresses)<br>address = gpd.read_file("Connecticut_Buildings_with_Addresses_experimental.shp")<br># Simplify each footprint; the tolerance is in the units of the layer's CRS.<br># simplify() returns only the geometries (a GeoSeries), so attribute columns are not written.<br>simple = address.simplify(0.000005, preserve_topology=True)<br>simple.to_file('Buildings-simplified.geojson', driver='GeoJSON')<br></div><div><br></div><div><br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Sat, Aug 29, 2020 at 6:50 PM <<a href="mailto:joe.sapletal@charter.net" target="_blank">joe.sapletal@charter.net</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div lang="EN-US"><div><p class="MsoNormal">I was going to look at the buildings too. I’ve used a tool in ArcGIS to correct some pretty awful buildings, but I couldn’t download them either. 
If there is no hurry, I’d check in again with the contact on Monday. It would be nice to have the buildings with addresses on them.<u></u><u></u></p><p class="MsoNormal"><u></u> <u></u></p><p class="MsoNormal">Joe<u></u><u></u></p><p class="MsoNormal"><u></u> <u></u></p><div style="border-color:rgb(225,225,225) currentcolor currentcolor;border-style:solid none none;border-width:1pt medium medium;padding:3pt 0in 0in"><p class="MsoNormal"><b>From:</b> Yury Yatsynovich <<a href="mailto:yury.yatsynovich@gmail.com" target="_blank">yury.yatsynovich@gmail.com</a>> <br><b>Sent:</b> Saturday, August 29, 2020 5:05 PM<br><b>To:</b> Julien Lepiller <<a href="mailto:osm@lepiller.eu" target="_blank">osm@lepiller.eu</a>><br><b>Cc:</b> <a href="mailto:imports@openstreetmap.org" target="_blank">imports@openstreetmap.org</a>; <a href="mailto:talk-us@openstreetmap.org" target="_blank">talk-us@openstreetmap.org</a><br><b>Subject:</b> Re: [Talk-us] [Imports] Import WestCOG building footprints in south-west Connecticut<u></u><u></u></p></div><p class="MsoNormal"><u></u> <u></u></p><div><p class="MsoNormal">Hi Julien,<u></u><u></u></p><div><p class="MsoNormal">Unfortunately, I have limited knowledge of the data quality, as I wasn't able to download it (the server returns an error). I let the CT point of contact (Scott) know about the problem -- he mentioned in our communication that he forwarded the issue to the tech support team, but I haven't heard from them since then and I'm still unable to download it. 
<u></u><u></u></p></div></div><p class="MsoNormal"><u></u> <u></u></p><div><div><p class="MsoNormal">On Sat, Aug 29, 2020, 4:57 PM Julien Lepiller <<a href="mailto:osm@lepiller.eu" target="_blank">osm@lepiller.eu</a>> wrote:<u></u><u></u></p></div><blockquote style="border-color:currentcolor currentcolor currentcolor rgb(204,204,204);border-style:none none none solid;border-width:medium medium medium 1pt;padding:0in 0in 0in 6pt;margin-left:4.8pt;margin-right:0in"><p class="MsoNormal">So, it's been a week since that last message. Do you think we should<br>import addresses and buildings at the same time? Should we import the<br>buildings first and take care of addresses later?<br><br>Yury, what are your thoughts about the data source quality? Do you<br>think it's a good idea to import from WestCOG and maybe rely on CT data<br>for the rest of CT? I tried playing with the data and I didn't see any<br>difference between drawing the buildings from scratch and having to<br>simplify and correct CT's data.<br><br>Thanks!<br><br>On Sat, 22 Aug 2020 19:36:23 -0400,<br>Martin Machyna <<a href="mailto:machyna@gmail.com" target="_blank">machyna@gmail.com</a>> wrote:<br><br>> Thanks, Julien, for pushing this forward!<br>> <br>> Yeah, I tried to get addresses from here:<br>> <a href="http://geodata-ctmaps.opendata.arcgis.com/datasets/bfa7da83da384c2aa809882179369dc4_0/features/305004" target="_blank">http://geodata-ctmaps.opendata.arcgis.com/datasets/bfa7da83da384c2aa809882179369dc4_0/features/305004</a><br>> and add them on top of the WestCOG buildings.<br>> <br>> The data is a big mess because it's a join_table of like 30 different<br>> address databases. 
I lost a bit of motivation there, but I could have<br>> a look at it again.<br>> <br>> Martin<br>> <br>> On Sat, Aug 22, 2020 at 2:19 PM Julien Lepiller <<a href="mailto:osm@lepiller.eu" target="_blank">osm@lepiller.eu</a>><br>> wrote:<br>> <br>> > On Sat, 22 Aug 2020 13:30:02 -0400,<br>> > Yury Yatsynovich <<a href="mailto:yury.yatsynovich@gmail.com" target="_blank">yury.yatsynovich@gmail.com</a>> wrote:<br>> > <br>> > > Hi Julien,<br>> > > The following communication that I've had recently with a CT<br>> > > official might be of interest to you:<br>> > ><br>> > > <br>> ><br>> > Oh, great! I think we already saw this data (I tried to contact them<br>> > too, but never got a reply :/). From what we saw (I think it was in<br>> > February?) the footprints have simplification issues (see<br>> > <a href="https://files.slack.com/files-pri/T029HV94T-FTDGDHXTM/image.png" target="_blank">https://files.slack.com/files-pri/T029HV94T-FTDGDHXTM/image.png</a> for<br>> > instance) where they are too detailed, not square enough, etc. Some<br>> > buildings also have holes in them, when there are none in the imagery.<br>> ><br>> > So I think it's too poor to be used directly, without a lot of manual<br>> > effort to simplify, square and redraw the shapes. However, the<br>> > address data is very interesting, so maybe we could extract from<br>> > it? 
Or we could use a separate dataset if they have addresses<br>> > separately.<br>> ><br>> > _______________________________________________<br>> > Imports mailing list<br>> > <a href="mailto:Imports@openstreetmap.org" target="_blank">Imports@openstreetmap.org</a><br>> > <a href="https://lists.openstreetmap.org/listinfo/imports" target="_blank">https://lists.openstreetmap.org/listinfo/imports</a><br>> > <br><br><br>_______________________________________________<br>Talk-us mailing list<br><a href="mailto:Talk-us@openstreetmap.org" target="_blank">Talk-us@openstreetmap.org</a><br><a href="https://lists.openstreetmap.org/listinfo/talk-us" target="_blank">https://lists.openstreetmap.org/listinfo/talk-us</a><u></u><u></u></p></blockquote></div></div></div>
</blockquote></div>
</blockquote></div>