<div dir="ltr"><div><div>I would still like to propose an import. Manually going through the data and each hospital's website is fine to find missing hospitals in OSM, but it's less feasible for the full 8,000 records.</div><div><br></div><div>Based on the feedback in this thread, it sounds like we'll need to better assess the quality of the data first. I've seen a previous approach of comparing 100 random records to OSM and measuring the accuracy based on that.</div><div>This would also require a separate wcedmisten_imports account.<br></div><div><br></div><div>I think this would still need to be a semi-automated import - where the HIFLD data is used as primary source, but a human still verifies every proposed change.<br></div><div><br></div>Addressing Greg's point about a fully automated script:<br>> It's not formally part of the norms, but I think imports should be done<br>
> with code, basically a program that takes the OSM db as it is and the<br>
> import db and produces a set of proposed editing actions<br>
> (additions/conflations) and things to inspect (object in OSM but not in<br>> import). That can then be debugged before any changes are made. It is<br>
> critical that it be automated so that it can be re-run with little effort.<br><br></div><div>I'm a bit unclear how this would work in practice. I feel like there are so many edge cases in OSM tagging styles that it would be quite difficult to ensure the program handles everything appropriately without checking everything.</div><div>Do you have any prior examples of successful fully automated import programs that have been run?<br><br></div><div>This is why I'm only proposing a semi-automated import, just to have human eyes on things.<br><br></div><div>Best,<br><br></div><div>William Edmisten (wcedmisten)<br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Fri, May 5, 2023 at 5:37 PM Mike Thompson <<a href="mailto:miketho16@gmail.com">miketho16@gmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div dir="ltr">Thanks for the additional information William!</div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Fri, May 5, 2023 at 1:08 PM William Edmisten <<a href="mailto:wcedmisten@gmail.com" target="_blank">wcedmisten@gmail.com</a>> wrote:</div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div><div><div>I hope to
distinguish two different projects which I may have conflated in my
initial Wiki page: 1) using HIFDL Hospitals as reference material, but
not primary source, to find missing hospitals in OSM and 2) the actual
import of the HIFDL Hospitals dataset, which would rely on HIFDL as the primary source rather than as reference.<br></div></div></div></div></blockquote><div>Are you therefore dropping the proposal for an import? It seems that if you are going to do all of this manually, there is no reason for an import. </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div><div><div><br></div><div><br></div>I
do not believe 1) constitutes an import, because it is only using the
dataset as reference to find missing POIs, but using objectively
verifiable sources (each hospitals' website), similar to the workflow of
many "armchair" mappers. I believe this is a widely accepted practice
in the OSM community.<br></div></div></div></blockquote><div>Your original email, and the wiki page that it cited, led me to believe that you intended to do an import, and thus the rules for imports would apply, and thus my comments.</div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div><div><br><br>To the point on VAL_DATE, I believe checking the hospital website mitigates this issue also.<br></div></div></div></blockquote><div>You might consider going to each state and getting its current list of licensed hospitals, this is what HIFLD does to build/maintain this dataset. This would avoid the issue of old data in the HIFLD dataset.</div><div><br></div><div>Mike</div></div></div>
</blockquote></div>