[OSM-dev] 04to05.pl doesn't like Puerto Rico

Dave Hansen dave at sr71.net
Thu Jan 17 17:28:54 GMT 2008


Well, I've done virtually the entire US's TIGER data with the script,
with no issues, but it finally choked on Puerto Rico.

It gets this:

not well-formed (invalid token) at line 330, column 38, byte 14569
at /usr/local/lib/perl/5.8.8/XML/Parser.pm line 187

when running on this file:

http://dev.openstreetmap.org/~daveh/tiger.files/counties/PR/Adjuntas.osm

I think it's the crazy characters in tags like this:

	<tag k="name" v="Carr Sillo de Calder�n"/>
	<tag k="tiger:name_base" v="Carr Sillo de Calder�n"/>

Being a stupid American, I have no real knowledge of character sets and
that fun.  Any idea what the right way to fix this is?

-- Dave








More information about the dev mailing list