[OSM-dev] way 27483626 UTF-8 truncation

Florian Lohoff flo at rfc822.org
Sat Oct 4 09:36:15 BST 2008


On Sat, Oct 04, 2008 at 06:34:12PM +1000, Brett Henderson wrote:
> 
> Florian Lohoff wrote:
> >On Sat, Oct 04, 2008 at 03:24:12PM +1000, Brett Henderson wrote:
> >  
> >>>Another 2 change files contain utf-8 bugs and osmosis refuses to process
> >>>them:
> >>>
> >>>200810031022-200810031023.osc
> >>>200810031023-200810031024.osc
> >>> 
> >>>      
> >>I've tested both of these files and they seem okay.  The only problem I 
> >>can find is way 27483626 which has a broken "note" tag in file 
> >>2008100310-2008100311.osc.  Are you sure these files are broken?
> >>    
> >
> >wget -O - 
> >http://planet.openstreetmap.org/minute/200810031022-200810031023.osc.gz | 
> >gzip -d | iconv -f utf8 -t utf8
> >[...]
> >    <way id="14783001" timestamp="2008-10-03T10:22:11Z" user="logictheo">
> >      <nd ref="145957773"/>
> >      <nd ref="163161140"/>
> >      <nd ref="146004252"/>
> >      <nd ref="301736490"/>
> >      <tag k="name" v="Οδός Ιουστινιανού"/>
> >      <tag k="created_by" v="Potlatch 0.6a"/>
> >      <tag k="highway" v="residential"/>
> >      <tag k="name:en" v="Ioustinianou Street"/>
> >      <tag k="note" v="Ρώτησα ένα φίλο που μένει 
> >      καιρό εδώ εάν αυτός ήταν κάποτε 
> >      δρόμος. Το κοίταξα και από κοντά. 
> >      Βλέπω μπάρες και στις 2 άκρες που 
> >      είναι για να εμπ
> >iconv: illegal input sequence at position 16342
> >
> >
> >wget -O - 
> >http://planet.openstreetmap.org/minute/200810031023-200810031024.osc.gz | 
> >gzip -d | iconv -f utf8 -t utf8
> >[...]
> >    <way id="27483626" timestamp="2008-10-03T10:23:02Z" user="logictheo">
> >      <nd ref="301736490"/>
> >      <nd ref="145958259"/>
> >      <nd ref="301736491"/>
> >      <tag k="name" v="Οδός Ιουστινιανού"/>
> >      <tag k="created_by" v="Potlatch 0.6a"/>
> >      <tag k="highway" v="pedestrian"/>
> >      <tag k="name:en" v="Ioustinianou Street"/>
> >      <tag k="note" v="Ρώτησα ένα φίλο που μένει 
> >      καιρό εδώ εάν αυτός ήταν κάποτε 
> >      δρόμος. Το κοίταξα και από κοντά. 
> >      Βλέπω μπάρες και στις 2 άκρες που 
> >      είναι για να εμπ
> >iconv: illegal input sequence at position 58891
> >
> >Flo
> >  
> Ah, sorry. I misread your first email. I didn't realise you were 
> referring to minute changesets. I didn't realise there were two errors 
> in that hourly file. I have to leave now, I'll try to take another look 
> tomorrow morning (approx 15 hours from now).

To get the ROMA database back in sync I replaced the broken notes with
"broken-utf8". As notes typically do not get rendered, that is not a
problem for me. ROMA was down for half a day before I discovered the
broken files and fixed them ...
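
For anyone who wants to script the same workaround, here is a minimal
sketch in Python. It is not the procedure actually used on ROMA; the
function names and the regex-based matching of v="..." attributes are
assumptions made for illustration. It works on raw bytes, reports the
offset of the first illegal sequence (similar to what iconv prints), and
swaps any attribute value that is not valid UTF-8 for a placeholder.

```python
import re

# Matches v="..." attribute values in raw bytes. Values may span several
# lines, as the long "note" tags in the change files do.
VALUE_RE = re.compile(rb'v="([^"]*)"')


def first_invalid_utf8_offset(data: bytes):
    """Return the byte offset of the first invalid UTF-8 sequence, or None."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        return err.start
    return None


def scrub_broken_values(data: bytes, placeholder: bytes = b"broken-utf8") -> bytes:
    """Replace every v="..." value that is not valid UTF-8 with a placeholder."""

    def fix(match):
        if first_invalid_utf8_offset(match.group(1)) is None:
            return match.group(0)  # value decodes cleanly, keep it as-is
        return b'v="' + placeholder + b'"'

    return VALUE_RE.sub(fix, data)


if __name__ == "__main__":
    # Hypothetical sample: a note cut off in the middle of a two-byte character.
    sample = '<tag k="note" v="εμπ'.encode("utf-8") + b'\xce"/>\n'
    print(first_invalid_utf8_offset(sample))
    print(scrub_broken_values(sample).decode("utf-8"), end="")
```

A byte-level regex is used here because an XML parser would reject the
file before the broken value could be reached. It assumes the truncated
value is still followed by its closing quote, as it appears to be in the
two files above.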

Flo
-- 
Florian Lohoff                  flo at rfc822.org             +49-171-2280134
	Those who would give up a little freedom to get a little 
          security shall soon have neither - Benjamin Franklin