[OSM-dev] planet.osm - fix

Lars Aronsson lars at aronsson.se
Tue Aug 15 16:53:25 BST 2006


Michael Strecke wrote:

> According to the docs provided by David, the "HTML-escaped" characters
> &#..; represent UTF-16/32 characters (and only UTF-16/32). Adding just
> another encoding scheme to a simple street name :-)

This must be wrong.  The terms UTF-8, UTF-16 and UTF-32 apply only to 
the binary encodings of Unicode, not to the XML/HTML &#..; numeric 
character references, which are plain ASCII text naming a Unicode 
code point.
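
To illustrate the distinction, here is a minimal Python sketch.  It 
is not taken from the planet.osm tooling; the street name and the 
variable names are only assumptions for the example.  It resolves a 
numeric character reference to a code point first, and only 
afterwards picks a UTF to serialise that code point to bytes.

```python
import html

# Decimal and hexadecimal numeric character references for U+00DF
# (LATIN SMALL LETTER SHARP S). Both are plain ASCII strings.
decimal_ref = "Stra&#223;e"
hex_ref = "Stra&#xDF;e"

# Resolving the reference yields a Unicode string; no UTF is involved yet.
decoded = html.unescape(decimal_ref)
assert decoded == html.unescape(hex_ref) == "Stra\u00dfe"

# The reference named a code point, nothing more.
code_point = ord(decoded[4])

# A UTF only comes into play when the string is written out as bytes.
utf8_bytes = decoded.encode("utf-8")
utf16_bytes = decoded.encode("utf-16-be")
utf32_bytes = decoded.encode("utf-32-be")

print(hex(code_point), utf8_bytes, utf16_bytes, utf32_bytes)
```

The same reference produces the same code point whichever of the 
three byte encodings is chosen afterwards.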

> > "Straße" or "Genter Straße" since these are

The first (&#xDF;) is correct; the second example is badly broken.

This discussion reveals some grave misunderstandings of how 
Unicode works.  That is sad.  People should read up.


-- 
  Lars Aronsson (lars at aronsson.se)
  Aronsson Datateknik - http://aronsson.se



