[OSM-talk] planet.dump

Jonas Svensson jonass at lysator.liu.se
Wed Aug 2 10:00:24 BST 2006


On Wed, 2 Aug 2006, David Sheldon wrote:

> The dump is supposed to be in XML. In XML '&' MUST ALWAYS be encoded as
> '&'. I believe the dump should be in UTF-8 anyway, but it is
> probably safest to encode any non-ascii characters using the appropriate
> entity references. For exampl £ rather than a (British) pound sign.
> The HTML £ is not valid XML unless you elsewhere define the entity
> "pound".

The & was just a sign of the lack of encoding in the july dump. There is a
lot of non-ascii characters in that file. Hopefully the august dump will
be properly encoded.

/Jonas





More information about the talk mailing list