[OSM-dev] Mixed character encoding in planet.osm - plan for fixing it

Joerg Ostertag (OSM Munich/Germany) openstreetmap at ostertag.name
Sat Nov 4 17:15:43 GMT 2006


...
> In parallel, I am thinking of cleaning up the database. First, I will
> try to make a list of all entries with non-valid UTF-8 encoding, based
> on the latest planet.osm.
>
> Is anyone else working on cleaning up this issue? I don't wan't to
> interfere.

to find the errors in planet.osm, you might find the following usefull:
./utils/planet.osm/C/UTF8sanitizer

-
Joerg




More information about the dev mailing list