[OSM-dev] REST API encoding

David Sheldon dave-osm at earth.li
Tue Jul 11 15:04:49 BST 2006


On Tue, Jul 11, 2006 at 02:54:37PM +0200, Thomas Walraet wrote:
> >    <tag k="name" v="t&#xC3;&#xA9;st"/>
> 
> It should be <tag k="name" v="t&#xC3A9;st"/> if we want to use unicode 
> entities.

No, it shouldn't; it should be <tag k="name" v="t&#xE9;st"/>

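UTF-8 to Unicode conversion isn't just a case of concatenating the hex
digits of the bytes: the lead and continuation bytes carry marker bits
that have to be stripped before the remaining bits are joined.
http://www.cl.cam.ac.uk/~mgk25/unicode.html#utf-8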
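
As a rough illustration of why C3 A9 gives U+00E9 and not U+C3A9, here is
a minimal Python sketch of decoding a two-byte UTF-8 sequence by hand. The
function name decode_two_byte_utf8 is made up for this example, and it only
handles the two-byte form (110xxxxx 10yyyyyy), without rejecting overlong
encodings.

```python
def decode_two_byte_utf8(b1: int, b2: int) -> int:
    """Decode a two-byte UTF-8 sequence (110xxxxx 10yyyyyy) to a code point.

    Hypothetical helper for illustration only: it does not handle one-,
    three- or four-byte sequences and does not reject overlong forms.
    """
    if b1 & 0xE0 != 0xC0:
        raise ValueError("first byte is not a two-byte lead byte (110xxxxx)")
    if b2 & 0xC0 != 0x80:
        raise ValueError("second byte is not a continuation byte (10yyyyyy)")
    # Strip the marker bits, then join the 5 + 6 payload bits.
    return ((b1 & 0x1F) << 6) | (b2 & 0x3F)


code_point = decode_two_byte_utf8(0xC3, 0xA9)
print(f"U+{code_point:04X}")

# Cross-check against Python's own UTF-8 decoder.
assert chr(code_point) == bytes([0xC3, 0xA9]).decode("utf-8")
```

The payload bits are 00011 and 101001, which join to 000 1110 1001, i.e.
0xE9, so the character reference has to be &#xE9;.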

> The actual response seems to be an UTF-8 string encoded as if it was 
> ISO-8859-1.

It seems that each byte of the UTF-8 sequence is being encoded as if it
were an individual Unicode character.
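
For what it's worth, the two descriptions come to the same thing, because
ISO-8859-1 maps each byte value directly to the Unicode code point with
the same number. The following Python sketch reproduces the symptom under
that assumption; it is a guess at the kind of mistake involved, not the
server's actual code, and to_char_refs is a made-up helper name.

```python
def to_char_refs(text: str) -> str:
    """Replace every non-ASCII character with an &#x...; reference.

    Hypothetical helper for illustration; not taken from the API code.
    """
    return "".join(ch if ord(ch) < 128 else f"&#x{ord(ch):X};" for ch in text)


name = "t\u00e9st"  # "test" with e-acute (U+00E9) as the second letter

# Correct: one reference per Unicode character.
correct = to_char_refs(name)

# Symptom: the UTF-8 bytes are treated as one character each
# (which is exactly what decoding them as ISO-8859-1 does).
misread = name.encode("utf-8").decode("latin-1")
broken = to_char_refs(misread)

# A client can undo the damage by reversing the misinterpretation.
repaired = misread.encode("latin-1").decode("utf-8")

print(correct)
print(broken)
```

Here correct comes out as t&#xE9;st and broken as t&#xC3;&#xA9;st, which
matches the response quoted above.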

David


-- 
    "0 tends to simplify things a bit when you multiply by it..."
                            -- Bill McColl



