[OSM-dev] Minutely Diff Broken UTF-8

Christopher Schmidt crschmidt at metacarta.com
Sun Feb 1 06:08:00 GMT 2009


On Sun, Feb 01, 2009 at 04:44:46PM +1100, Brett Henderson wrote:
> I've manually truncated the broken tag in the relation in the 
> daily/hourly/minute changeset files on the planet server.  If you now 
> have missing data in your database you should be able to reset the 
> timestamp and begin applying changesets including the missing data again.
> 
> I've also asked TomH to fix the broken tag in the db but it might take a 
> while given that it's the weekend.

Shaun mentioned that the reason for this one is that the database allows
255 'bytes' -- and in this case, the 255th byte cuts off in the middle
of a character. This should be fixed in 0.6, which will reject thse
edits outright.

-- Chris

> Brett
> 
> Christopher Schmidt wrote:
> >Osmosis's --rci task just blew up on me because of a broken diff. i
> >narrowed the problem down to 200901312208-200901312209.osc.gz , which
> >has invalid UTF-8, specifically, in the description tag of relation 28417 .
> >
> >I've skipped this update by manually changing my timestamp, but
> >http://openstreetmap.org/browse/relation/28417 and so on still looks
> >broken, so this is primarily a mail to let people know that something is
> >broken with utf-8 stuff in the db.
> >
> >Regards,
> >  
> 

-- 
Christopher Schmidt
MetaCarta




More information about the dev mailing list