[OSM-dev] Non-UTF-8 German Umlauts in planet.osm

Jan-Benedict Glaw jbglaw at lug-owl.de
Thu Mar 15 20:40:48 GMT 2007


On Thu, 2007-03-15 20:33:50 +0000, Artem Pavlenko <artem at mapnik.org> wrote:
> On 15 Mar 2007, at 20:27, Jan-Benedict Glaw wrote:
> > Current planet.osm has a sharp-s in (probably) ISO-8859-1{,5}, which
> > breaks the PostGIS import:
[...]
> > Any chance to report (and in case of tags: drop) non-UTF-8 stuff
> > during planet.osm generation?
> 
> You need to UTF8 sanitize planet first:
> 
> UTF8sanitize < planet.osm > planet-utf8.osm

jbglaw at nini:~/planet.osm$ head -1 planet-070314.osm
<?xml version="1.0" encoding="UTF-8"?>

Will do.  ...and I'll try to fix the non-UTF-8 codes manually. And we
shouldn't state that it's UTF-8 if it really isn't.

MfG, JBG

-- 
      Jan-Benedict Glaw      jbglaw at lug-owl.de              +49-172-7608481
Signature of:                   ...und wenn Du denkst, es geht nicht mehr,
the second  :                          kommt irgendwo ein Lichtlein her.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: Digital signature
URL: <http://lists.openstreetmap.org/pipermail/dev/attachments/20070315/c7096c2a/attachment.pgp>


More information about the dev mailing list