[OSM-dev] way 27483626 UTF-8 truncation

Brett Henderson brett at bretth.com
Sat Oct 4 09:34:12 BST 2008


Florian Lohoff wrote:
> On Sat, Oct 04, 2008 at 03:24:12PM +1000, Brett Henderson wrote:
>   
>>> Another 2 change files contain utf-8 bugs and osmosis refuses to process
>>> them:
>>>
>>> 200810031022-200810031023.osc
>>> 200810031023-200810031024.osc
>>>  
>>>       
>> I've tested both of these files and they seem okay.  The only problem I 
>> can find is way 27483626 which has a broken "note" tag in file 
>> 2008100310-2008100311.osc.  Are you sure these files are broken?
>>     
>
> wget -O - http://planet.openstreetmap.org/minute/200810031022-200810031023.osc.gz | gzip -d | iconv -f utf8 -t utf8
> [...]
>     <way id="14783001" timestamp="2008-10-03T10:22:11Z" user="logictheo">
>       <nd ref="145957773"/>
>       <nd ref="163161140"/>
>       <nd ref="146004252"/>
>       <nd ref="301736490"/>
>       <tag k="name" v="Οδός Ιουστινιανού"/>
>       <tag k="created_by" v="Potlatch 0.6a"/>
>       <tag k="highway" v="residential"/>
>       <tag k="name:en" v="Ioustinianou Street"/>
>       <tag k="note" v="Ρώτησα ένα φίλο που μένει καιρό εδώ εάν αυτός ήταν κάποτε δρόμος. Το κοίταξα και από κοντά. Βλέπω μπάρες και στις 2 άκρες που είναι για να εμπ
> iconv: illegal input sequence at position 16342
>
>
> wget -O - http://planet.openstreetmap.org/minute/200810031023-200810031024.osc.gz | gzip -d | iconv -f utf8 -t utf8
> [...]
>     <way id="27483626" timestamp="2008-10-03T10:23:02Z" user="logictheo">
>       <nd ref="301736490"/>
>       <nd ref="145958259"/>
>       <nd ref="301736491"/>
>       <tag k="name" v="Οδός Ιουστινιανού"/>
>       <tag k="created_by" v="Potlatch 0.6a"/>
>       <tag k="highway" v="pedestrian"/>
>       <tag k="name:en" v="Ioustinianou Street"/>
>       <tag k="note" v="Ρώτησα ένα φίλο που μένει καιρό εδώ εάν αυτός ήταν κάποτε δρόμος. Το κοίταξα και από κοντά. Βλέπω μπάρες και στις 2 άκρες που είναι για να εμπ
> iconv: illegal input sequence at position 58891
>
> Flo
>   
Ah, sorry. I misread your first email. I didn't realise you were 
referring to minute changesets. I didn't realise there were two errors 
in that hourly file. I have to leave now, I'll try to take another look 
tomorrow morning (approx 15 hours from now).





More information about the dev mailing list