[OSM-dev] UTF-8 errors in our DB, or elsewhere?

Grant Slater openstreetmap at firefishy.com
Tue Feb 12 02:27:01 GMT 2008


Frederik Ramm wrote:
> in the course of producing shapefiles, I applied the libxml2 built-in
> character set conversion from UTF-8 to Latin-1 to our tag values, and
> found a lot of problems (about 20k nodes/ways) where it complained.
>   

UTF-8 to Latin-1 is lossy, that is likely what caused most of the 
warnings/problems...

> Here's a list of objects that libxml2 complained about (not complete
> as I didn't process a full planet):
>
> http://www.remote.org/frederik/tmp/utf8.txt
>   

A quick view of this document in Firefox, forcing the character encoding 
(View->Character Encoding->UTF 8), most of the characters seem to 
display as expected.

/ Grant





More information about the dev mailing list