[OSM-talk] Planet dump
Jonas Svensson
jonass at lysator.liu.se
Wed Dec 13 11:21:40 GMT 2006
On Wed, 13 Dec 2006, Jonas Svensson wrote:
> On Wed, 13 Dec 2006, SteveC wrote:
>
> > * @ 13/12/06 09:13:50 AM jonass at lysator.liu.se wrote:
> > > On Wed, 13 Dec 2006, SteveC wrote:
> > >
> > > > * @ 13/12/06 08:20:04 AM sxpert at sxpert.org wrote:
> > > > > Jonas Svensson wrote:
> > > > > > The new planet dump fails on utf-8 validation. :-(
> > > > > > Some names seems ok and some is broken.
> > > > > > node id="477715" is a broken one I beleive, while node id="100479" seems
> > > > > > to be ok. Could it be that some got broken in the translation from utf-8
> > > > > > to latin-1 when the database table were set to latin-1?
> > > > > >
> > > > > > /Jonas
> > > > > >
> > > > > planet-061213.osm:397870: parser error : Input is not proper UTF-8,
> > > > > indicate encoding !
> > > > > Bytes: 0xDF 0x65 0x22 0x20
> > > > > <tag k="name" v="Speicherstra???e" />
> > > > > ^
> > > > > planet-061213.osm : failed to parse
> > > >
> > > > is it broken out of the API?
> > >
> > > That particular one (node id="477715") is broken when retrieved by the api
> > > I think. Something is happening with the "ss" and the "e".
> >
> > Ok, then this means it's broken in the database.
> >
> > Are there lots of them?
>
> Almost 500. I can give you linenumbers referring to the dump.
The log is at <http://www.mozoft.com/OSM/planet-061213.errors.txt> should
anyone like to see it.
/Jonas
More information about the talk
mailing list