[Talk-GB] [Spam] Looking for places to map?
Peter Miller
peter.miller at itoworld.com
Wed Feb 24 00:54:57 GMT 2010
Great stuff. I assume you are aware of woeids from Yahoo which have
some interesting boundaries derived from crowd sourced data in Flickr
and other places. Details here.
http://developer.yahoo.com/geo/geoplanet/guide/concepts.html#woeids
It would be a great project to match up boundaries from yahoo woeids
with Wikipedia articles for UK places, OSM place names and Traveline
NatGaz database and also I guess the Cycle Streets DB. Note that we
still haven't imported the NatGaz DB (with 50,000 place names in a
hierarchy) into OSM and are probably missing many places on the map in
more obscure parts of the country as a result.
Regards,
Peter
On 23 Feb 2010, at 14:18, Peter Reed wrote:
> As the weather improves we are all going to be out and about tracing
> roads for OSM. So there are various discussions, and work under way
> to help find, prioritise and then fill the most important gaps in
> the map. Larger towns and bigger areas that need attention are
> fairly well known, but experience suggests that it can be more
> difficult to find smaller and more local places where a short
> session could make a difference quickly.
>
> So this is an attempt to help, at a bit more of a micro-level –http://www.reedhome.org.uk/Documents/osmembed.html?kml=KML/osmcategory.kml
>
> (to view the same thing on Google maps use this http://maps.google.co.uk/maps?f=q&source=s_q&hl=en&geocode=&q=http:%2F%2Fwww.reedhome.org.uk%2FDocuments%2FKML%2Fosmcategory.kml&sll=51.525483,-0.746802&sspn=0.01287,0.033023&ie=UTF8&z=6
> )
>
> Broadly – green areas look as though they are already fairly well
> covered, red areas look as though they would benefit from some
> attention, and the rest are grey.
>
> WARNINGS
>
> This is very much a first cut, pre-beta, etc. We know there are
> places missing, some gaps in the data, and we are aware that it
> doesn’t always get things right (we think it does often enough to
> be useful).
> It really cannot tell how well an area is mapped from the data. It
> can only try to find areas that look a bit thin - and some areas
> look thin when they are not.
> At the moment we are concentrating on roads, not the full list of
> features in areas that are most thoroughly mapped.
>
> As a result this sometimes flags up an area as “poor” when in
> reality it has been perfectly well mapped. For example, there are
> two areas near here that a lot of work has gone into, where the
> classification is not right. So if your favourite areas shows up
> wrongly, please don’t take it personally.
>
> Still, it seems to point in the right direction quite a lot of the
> time, and it’s offered up on that basis. In my own area (which is
> already fairly well covered) it has flagged up some towns that I
> already knew were a bit thin, and another half dozen options that I
> can reach fairly easily, but didn’t know about.
> If it looks as though it will prove useful then future plans will
> address some of the current limitations by refining the borders,
> filling gaps, correcting errors in the underlying measures, and
> tweaking the arithmetic.
>
> However, some mis-classification will always be inevitable. Read on
> to see why.
>
> THIS IS HOW IT WORKS:
>
> We start with a list of about 1,600 UK settlements, and a figure for
> the population that lives there. Baring a few errors and omissions,
> the settlements are the same ones that Cyclestreets uses for local
> areas -http://www.cyclestreets.net/area/
>
> We then try to find a boundary for each of these settlements. This
> is based either on the local authority admin area where there is
> one, the naptan pay-scale area if there is one, or if all else
> fails, a guesstimate of how big the settlement must be based on
> population density.
> (FWIW we already realise that some of the pre-defined areas are too
> big, some of guesstimates are off-centre, or the wrong size, and the
> guesstimates don’t work well on the coastline. But it’s a start, and
> most settlements don’t look too far out.)
>
> Where we know the actual length of roads in a settlement from
> Department for Transport data we use that to classify the area, but
> this only works for bigger towns and cities, so for the rest we are
> trying to figure out how well they are covered without knowing the
> true length of roads on the ground.
>
> At the moment we do this using various ratios. Within each
> settlement boundary we measure the length of roads in the OSM
> database, and from that we calculate three measures: the length of
> roads per sq km (the road density within the settlement), the length
> of road per head of population ,and the proportion of roads that are
> major (primary, secondary and motorway).
>
> The underlying hypothesis is that a thoroughly mapped area should
> have a relatively high road density, plenty of road per head of
> population, and a relatively low proportion of major roads (because
> it’s the unclassified and residential roads that tend to be missing,
> not the major roads).
> However, there are anomalies - some areas are thinly populated, some
> are at the intersection of a lot of major roads, some have more
> tightly packed houses (so a high road density), while others have
> big gardens (and hence a low road density). So inevitably our
> hypothesis sometimes breaks down.
> To avoid tripping over some of the more extreme cases, we therefore
> highlight as “good” only those areas that fall into the top quartile
> on at least two of the ratios, and to be classified as “poor” we
> pick only those areas that fall into the bottom quartile on at least
> two ratios. Everywhere else is in the middle.
>
> That covers complete settlements. On top of that we have plotted all
> the bigger residential areas (landuse = residential) where there
> don’t seem to be many roads. We can’t know the population of these
> areas so we just highlight those with a low road density (road
> length / area). If the road density in a residential area is above
> the threshold, or if it is a very small residential area (where the
> measures can be unreliable), then we just ignore it. We have only
> plotted the bigger, low density residential areas.
>
> One final caveat – the base data on which this is all based was
> extracted several weeks ago, so it’s a little bit out of date. An
> updated version will follow when time permits.
>
> IN CONCLUSION
>
> We have several ideas on how this can be improved, based on earlier
> suggestions. We welcome more comments and ideas.
> Even more importantly, we welcome more roads on the map.
> So most of all we hope this helps members of the community find some
> handy places where they can quickly make a difference by plugging a
> few gaps. Remember that it’s all a bit hit-and-miss though. It’s
> probably a good idea to check some other sources as well before
> rushing out with the GPS.
> _______________________________________________
> Talk-GB mailing list
> Talk-GB at openstreetmap.org
> http://lists.openstreetmap.org/listinfo/talk-gb
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstreetmap.org/pipermail/talk-gb/attachments/20100224/68cc2ec6/attachment-0001.html>
More information about the Talk-GB
mailing list