<div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote">On Wed, Jan 31, 2018 at 6:49 PM, marc marc <span dir="ltr"><<a href="mailto:marc_marc_irc@hotmail.com" target="_blank">marc_marc_irc@hotmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">I remain convinced that spelling rules have no place in osm tags<br>
even if it would be convenient.<br></blockquote><div><br></div><div>They are not spelling rules. And your comment implies that people shouldn't try to spell names<br>correctly. If "spelling rules have no place in OSM tags" then there can be no objection to mapping<br></div><div>the capital of the UK as "Lundun," etc. There is every reason to hope that mappers will spell<br>names correctly according to the rules of the local language. So spelling rules do have a<br></div><div>place in the values of free-form OSM tags such as name=*.<br></div><div><br></div><div>These are, in fact, local typographical conventions. Almost as important as local spelling rules.<br></div><div>Hyphenation conventions, for example, can be quite complex. Even in English, it would be<br>undesirable for "Therapist's Lane" to be hyphenated as "The- rapist's Lane."<br></div><div><br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
If they are to be added, the primary tools should first be asked to<br>
manage them before considering their use. otherwise the slightest search<br>
on a street name can fail, it's worse than having an incorrect return to<br>
the line.<br></blockquote><div><br></div><div>I'd be very scathing if there is <b>any</b> widely-used OSM tool that does not handle Unicode<br></div><div>correctly. It's been over 40 years since 7-bit ASCII could be considered adequate. A<br></div><div>couple of decades since operating systems did not handle Unicode as standard.<br><br></div><div>There are specifications of how Unicode strings should be compared. Programming<br>libraries which follow those specifications shouldn't suffer the problems you anticipate.<br></div><div>Programming libraries which don't follow Unicode specifications are <b>broken</b>.<br><br></div><div>You appear not to know that correct Unicode handling is essential for many languages<br></div><div>where a single glyph may be composed of two or more combining characters. Without<br></div><div>correct Unicode handling it is impossible to represent names correctly in those<br></div><div>languages.<br></div><div><br></div><div>The correct course of action is to check that widely-used OSM tools handle Unicode<br></div><div>correctly (which they should, anyway) and fix them if they do not.<br><br>-- <br></div><div>Paul<br></div></div></div></div>