[OSM-dev] dev server: script killed?

Jonas Svensson jonass at lysator.liu.se
Thu Aug 17 17:28:43 BST 2006


On 17 Aug 2006 at 18:03, Joerg Ostertag (OSM Munich/Ge wrote:

> that'S what I try. But it's a little bit of work; since the only think to edit 
> I've found is sed. ANd I'm still looking for something like 
> sed -e 's/[\x80-\xFF]/_/g'
> but in sed inside [] not \ works. Any hints or ideas?
> -
> Joerg

Not about sed, sorry. But maybe my UTF8sanitizer: 
<http://www.lysator.liu.se/~jonass/UTF8sanitizer.c>

Compile:

gcc -Wall -O2 -o UTF8sanitizer UTF8sanitizer.c

Usage something like this in bash:

bzcat planet-2006-08-14.osm.bz2 | UTF8sanitizer 2> UTF8errors.txt | 
check_osm.pl

or if you want a copy:

bzcat planet-2006-08-14.osm.bz2 | UTF8sanitizer 2> UTF8errors.txt 
>planet-2006-08-14.osm.bz2

/Jonas
/Jonas Svensson
-- 
jonass at lysator.liu.se, <http://www.lysator.liu.se/~jonass/>





More information about the dev mailing list