[OSM-dev] Non-UTF-8 German Umlauts in planet.osm

Jan-Benedict Glaw jbglaw at lug-owl.de
Thu Mar 15 20:27:28 GMT 2007


Hi!

The current planet.osm contains a sharp s (ß) encoded as (probably)
ISO-8859-1 or ISO-8859-15 instead of UTF-8, which breaks the PostGIS import:


jbglaw at nini:~/planet.osm$ bzcat planet-070314.osm.bz2 |./osm2pgsql/osm2pgsql -
NOTICE:  CREATE TABLE / PRIMARY KEY will create implicit index "tmp_segments_pkey" for table "tmp_segments"
NOTICE:  CREATE TABLE / PRIMARY KEY will create implicit index "tmp_nodes_pkey" for table "tmp_nodes"
Processing: Node(8550k)
Processing: Segment(8970k)
Processing: Way(376k)-:73188476: parser error : Input is not proper UTF-8, indicate encoding !
Bytes: 0xDF 0x65 0x22 0x20
    <tag k="name" v="Volkersbrunner Straße" />
                                        ^
- : failed to parse
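
The four bytes reported by the parser can be checked by hand. The
following is a minimal Python sketch, not part of osm2pgsql; the helper
name is_valid_utf8 is made up for illustration. It shows that 0xDF
followed by 0x65 is not valid UTF-8 (0xDF opens a two-byte sequence, but
0x65 is not a continuation byte), while the same bytes read as
ISO-8859-1 or ISO-8859-15 give the tail of "Straße":

```python
# Bytes copied from the parser error above: 0xDF 0x65 0x22 0x20
raw = bytes([0xDF, 0x65, 0x22, 0x20])


def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes cleanly as UTF-8 (hypothetical helper)."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# 0xDF is a UTF-8 lead byte and must be followed by a byte in 0x80..0xBF.
print(is_valid_utf8(raw))

# In both ISO-8859-1 and ISO-8859-15, 0xDF is the sharp s.
print(raw.decode("iso-8859-1"))
print(raw.decode("iso-8859-15"))

# The byte sequence that should have been in the file for a sharp s.
print("ß".encode("utf-8"))
```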

Is there any chance of reporting non-UTF-8 data (and, in the case of
tags, dropping it) during planet.osm generation?
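
As a rough idea of what such a report could look like, here is a small
Python sketch that scans a byte stream line by line and yields the lines
that do not decode as UTF-8. It is only an illustration under my own
assumptions: the function name find_invalid_utf8_lines and the sample
data are made up, and this is not how planet.osm is actually generated.
Splitting on newlines before decoding is safe because the byte 0x0A
never occurs inside a multi-byte UTF-8 sequence.

```python
import io
from typing import BinaryIO, Iterator, Tuple


def find_invalid_utf8_lines(stream: BinaryIO) -> Iterator[Tuple[int, bytes]]:
    """Yield (line number, raw line) for every line that is not valid UTF-8.

    Hypothetical helper for illustration; line numbers start at 1.
    """
    for lineno, line in enumerate(stream, start=1):
        try:
            line.decode("utf-8")
        except UnicodeDecodeError:
            yield lineno, line


# Made-up sample: line 1 is proper UTF-8, line 2 carries a raw 0xDF byte.
sample = io.BytesIO(
    b'<tag k="name" v="Hauptstra\xc3\x9fe" />\n'
    b'<tag k="name" v="Volkersbrunner Stra\xdfe" />\n'
)

for lineno, line in find_invalid_utf8_lines(sample):
    print(lineno, line)
```

The same function could be pointed at sys.stdin.buffer to check the
output of bzcat before it reaches the importer.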

Best regards, JBG

-- 
      Jan-Benedict Glaw      jbglaw at lug-owl.de              +49-172-7608481
Signature of:                http://catb.org/~esr/faqs/smart-questions.html
the second  :