[Imports] HIFLD
James Crawford
sherbets at disroot.org
Sun Oct 2 21:19:07 UTC 2022
Hi Mike,
On 10/2/2022 1:51 PM, Mike Thompson wrote:
> * HIFLD seems to contain 570 separate datasets, it is unreasonable to
> expect the community to review all of them to approve this import.
> The people proposing the import need to narrow down the scope to just
> a few datasets, or perhaps just one to start.
The imports listed on the wiki page are the datasets that I have deemed
useful for importing/obtaining metadata for OSM. the full list link on
the HIFLD website is somewhat misleading to the total amount of datasets
that are eligible for OSM. there are a lot of datasets that have
duplicate entries as both "feature layer" and "file geodatabase" but
contain the same data. If you filter the data to just contain the
feature layers, it narrows it down to about 376 datasets. But I've
omitted a lot of datasets from my list of candidates for a couple reasons:
1: it already has been imported into OSM
the HIFLD includes data from the NHD, GNIS, and TIGER datasets. these
have already been imported into OSM and as such I could exclude these
from consideration
2: the quality is too low
from the data I looked at in the mining datasets for example, the
quality was overwhelmingly poor, and I chose to omit these because it
would be closer to vandalism to add these due to the low quality.
3: the contents aren't within the scope of OSM
There are quite a few datasets, such as "Historical Tsunami Event
Locations" or "Environmental Protection Agency (EPA) Regions that don't
record anything that physically exists on the ground and as such aren't
verifiable or within the scope to include in OSM.
from a quick count, I already determined that about 92 datasets were
eligible for OSM (listed on the wiki). when I reassess quality for the
datasets, I may make the number even smaller.
> * As Greg suggested, we need to know how specifically the fields in
> these datasets are going to be transferred into OSM tags. Ideally
> having the scripts used to do so, and the output in OSM format to
> ensure the process worked as expected.
I've been writing specific instructions on an individual dataset basis
in order to clearly translate tags in the database to tags in OSM. If I
can figure out the scripting, I will document it then as well.
> * At least one of the datasets (Major Sports Venues) doesn't even
> contain geometry/coordinates that I can determine, so I am not sure
> how it can be imported.
"Major Sports Venues" isn't on the list of datasets that I deemed
eligible for import probably for this reason.
> * I would recommend that where possible, the data be obtained from the
> original source (other Federal agency, state, etc.). This ensures that
> the most recent data is used (still might not be recent enough), and
> that nothing has been lost in the process of merging/translating the
> data into HIFLD. I don't think HIFLD did any "value add" other than
> aggregating various sources and perhaps some standardization (e.g.
> individual features were not verified).
I can document and do this where possible.
> * Some of the data may not belong in OSM, e.g. "American Red Cross
> Chapter Regions"
I'm aware of this, and I haven't listed any of these region datasets in
my list of eligible data.
-James Crawford (SherbetS)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstreetmap.org/pipermail/imports/attachments/20221002/a6eba052/attachment.htm>
More information about the Imports
mailing list