[OSM-dev] Non-UTF-8 German Umlauts in planet.osm
Artem Pavlenko
artem at mapnik.org
Thu Mar 15 20:33:50 GMT 2007
You need to UTF8 sanitize planet first:
UTF8sanitize < planet.osm > planet-utf8.osm
The utility somewhere in svn tree.
HTH
Artem
On 15 Mar 2007, at 20:27, Jan-Benedict Glaw wrote:
> Hi!
>
> Current planet.osm has a sharp-s in (probably) ISO-8859-1{,5}, which
> breaks the PostGIS import:
>
>
> jbglaw at nini:~/planet.osm$ bzcat planet-070314.osm.bz2 |./osm2pgsql/
> osm2pgsql -
> NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index
> "tmp_segments_pkey" for table "tmp_segments"
> NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index
> "tmp_nodes_pkey" for table "tmp_nodes"
> Processing: Node(8550k)
> Processing: Segment(8970k)
> Processing: Way(376k)-:73188476: parser error : Input is not proper
> UTF-8, indicate encoding !
> Bytes: 0xDF 0x65 0x22 0x20
> <tag k="name" v="Volkersbrunner Stra?e" />
> ^
> - : failed to parse
>
> Any chance to report (and in case of tags: drop) non-UTF-8 stuff
> during planet.osm generation?
>
> MfG, JBG
>
> --
> Jan-Benedict Glaw jbglaw at lug-owl.de
> +49-172-7608481
> Signature of: http://catb.org/~esr/faqs/smart-
> questions.html
> the second :
> _______________________________________________
> dev mailing list
> dev at openstreetmap.org
> http://lists.openstreetmap.org/cgi-bin/mailman/listinfo/dev
More information about the dev
mailing list