[Talk-GB] City names translation

David Woolley forums at david-woolley.me.uk
Tue Aug 5 07:16:22 UTC 2014


On 05/08/14 00:37, Lester Caine wrote:
> Simply writing a name in a different
> alphabet is something that the renderer can do if required.

There is rarely a 1:1 mapping between different alphabets, except within 
a single country, and the mapping depends on both source and destination 
languages - and in fact dialect.  If you are going to do it 
mechanically, you must have detailed transliteration rules between every 
pair of languages that don't share a script.  Just consider the 
different transliterations needed for Paris in French and Paris (Texas?) 
in English.

You could reduce the problem to O(n) rather than O(n**2) by having a 
phonetic transliteration, but note that, as normally used, IPA is an 
approximation, designed to distinguish phonemes within a single 
language, so you would need a more detailed IPA markup than most people 
are used to.

In this particular case, I think there is almost certainly an element of 
Ukrainian nationalism, attempting to expunge all traces of Russian.  If 
you started  transliterating based just on Russian pronunciation of 
Cyrillic, even if you called it a pseudo-language called Cyrillic, it 
would probably never be acceptable to the Ukrainians in the current 
political climate.

Especially with Chinese, there are large numbers of homophones with 
different meanings, and words are often composed from two characters. 
Approximate transliteration requires a lot of knowledge, to avoid 
unintended meanings in the transliteration. Also the script is used with 
widely different spoken languages, so a mechanical transliteration would 
have to choose one of those languages.

I'm not suggesting that it is a good idea to translate every street 
name, although I would note that that will already have been done, 
outside OSM, for all the central London (tourist area) streets, into 
Chinese.

I think place names are so fundamental to any geographical map that 
contains any text at all, that alternative names should be part of the 
core database (although one could have special structures for them.

Any mechanical process needs to be predicated on having phonetic 
transcriptions available (probably two: the one used in the place 
itself, and the one in the country's equivalent of received pronunciation).

English street names largely come from a small vocabulary, which 
strongly overlaps with place names.  If you are afraid of a 
proliferation of ad hoc street name transliterations, maybe the map 
database should contain a list of names used for streets within a 
country.  (In the USA, there is also a lot of re-use of place names 
(Roswell in New Mexico is not the only Roswell).



More information about the Talk-GB mailing list