[OSM-talk] invalid UTF8 characters redux

Joerg Ostertag openstreetmap at ostertag.name
Fri Jul 6 16:39:57 BST 2007


Am Freitag, 6. Juli 2007 16:49:53 schrieb David Earl:
> Some months ago I corrected all the tag values which had invalid UTF8
> characters in them. I'm pleased to see that in processing the planet
> file every week since then, no more have appeared.
>
> This may just be luck. On the other hand the rails port happened about
> the same time, and I'm wondering if the api would actually reject
> invalid UTF8 in the uploaded XML - most XML parsers seem to.
>
> If this is the case, we can dispense with the 'sanitize' program for
> removing bad UTF8.

I already disabled the sanitizing in the perl modules for downloading and 
importing into gpsdrive/mysql,csv,... 
And it seems to work fine here.

-
Joerg




More information about the talk mailing list