[OSM-dev] broken utf8 in minute changeset 200907140650
Tom Hughes
tom at compton.nu
Tue Jul 14 22:51:39 BST 2009
On 14/07/09 22:30, Jon Burgess wrote:
> The biggest issue seems to be with step 1. Using the compose sequence
> for á (compose ' a) I get the following:
>
> First box: á
> Second: c3 a1
> Third: C3 83 C2 A1
Which is a straightforward double UTF-8 encoding. The second box is the
encoding of the first and third is the encoding of the second:
dunsmere [~] % echo -n "\xe1" | iconv -f iso-8859-1 -t utf-8 | od -t x1
0000000 c3 a1
dunsmere [~] % echo -n "\xc3\xa1" | iconv -f iso-8859-1 -t utf-8 | od -t x1
0000000 c3 83 c2 a1
Tom
--
Tom Hughes (tom at compton.nu)
http://www.compton.nu/
More information about the dev
mailing list