[OSM-dev] Non-UTF-8 German Umlauts in planet.osm

Artem Pavlenko artem at mapnik.org
Thu Mar 15 20:33:50 GMT 2007


You need to UTF8 sanitize planet first:

UTF8sanitize < planet.osm > planet-utf8.osm

The utility somewhere in svn tree.
HTH
Artem

On 15 Mar 2007, at 20:27, Jan-Benedict Glaw wrote:

> Hi!
>
> Current planet.osm has a sharp-s in (probably) ISO-8859-1{,5}, which
> breaks the PostGIS import:
>
>
> jbglaw at nini:~/planet.osm$ bzcat planet-070314.osm.bz2 |./osm2pgsql/ 
> osm2pgsql -
> NOTICE:  CREATE TABLE / PRIMARY KEY will create implicit index  
> "tmp_segments_pkey" for table "tmp_segments"
> NOTICE:  CREATE TABLE / PRIMARY KEY will create implicit index  
> "tmp_nodes_pkey" for table "tmp_nodes"
> Processing: Node(8550k)
> Processing: Segment(8970k)
> Processing: Way(376k)-:73188476: parser error : Input is not proper  
> UTF-8, indicate encoding !
> Bytes: 0xDF 0x65 0x22 0x20
>     <tag k="name" v="Volkersbrunner Stra?e" />
>                                         ^
> - : failed to parse
>
> Any chance to report (and in case of tags: drop) non-UTF-8 stuff
> during planet.osm generation?
>
> MfG, JBG
>
> -- 
>       Jan-Benedict Glaw      jbglaw at lug-owl.de               
> +49-172-7608481
> Signature of:                http://catb.org/~esr/faqs/smart- 
> questions.html
> the second  :
> _______________________________________________
> dev mailing list
> dev at openstreetmap.org
> http://lists.openstreetmap.org/cgi-bin/mailman/listinfo/dev





More information about the dev mailing list