[OSM-talk] invalid UTF8 characters redux
Joerg Ostertag
openstreetmap at ostertag.name
Fri Jul 6 16:39:57 BST 2007
Am Freitag, 6. Juli 2007 16:49:53 schrieb David Earl:
> Some months ago I corrected all the tag values which had invalid UTF8
> characters in them. I'm pleased to see that in processing the planet
> file every week since then, no more have appeared.
>
> This may just be luck. On the other hand the rails port happened about
> the same time, and I'm wondering if the api would actually reject
> invalid UTF8 in the uploaded XML - most XML parsers seem to.
>
> If this is the case, we can dispense with the 'sanitize' program for
> removing bad UTF8.
I already disabled the sanitizing in the perl modules for downloading and
importing into gpsdrive/mysql,csv,...
And it seems to work fine here.
-
Joerg
More information about the talk
mailing list