Slashdot Mirror


Russia Weighs Going Cyrillic For DNS

An anonymous reader writes "The Guardian reports that the Kremlin may start an alternate top-level domain, .rf. According to the story, .ru in Cyrillic translates to .py, the top-level domain for Paraguay, which the Russian government claims leads to confusion. This is similar to a move by China, which has their own .net and .com top-level domains in their native character set along with .cn, .com, and .net in ASCII." Hindering Paraguayan hackers may matter less to the Russian government than establishing greater control over a walled-off Internet.

223 comments

  1. The end of the DNS as we know it by loadrunner · · Score: 0, Redundant

    It was only a matter of time...

  2. Great!!! by Anonymous Coward · · Score: 2, Interesting

    It's great that nations can use their own languages instead of being forced to use alien Latin-English characters.

    1. Re:Great!!! by AKAImBatman · · Score: 1, Informative

      It's great that nations can use their own languages instead of being forced to use alien Latin-English characters.

      In this case, the characters are exactly the same. It's just that 'p' (pronounced 'pee' in English) is the letter 'er' in Russian, and 'y' (pronouced 'why' in English) is the letter 'oo' in Russian. So .ru to us is literally .py to them.

      Cyrillics has a number of Greek letters sprinkled in, but in this instance it is of no help.
    2. Re:Great!!! by Arthur+B. · · Score: 2, Interesting

      they are not the same, they just look very similar
        != py

      --
      \u262D = \u5350
    3. Re:Great!!! by Anonymous Coward · · Score: 2, Informative

      No, the characters only look the same to a human eye. To a computer they would look quite different:

      English "py" is keycode U+0070, U+0079
      Russian "py" is keycode U+0440, U+0443

      Of course, the whole internationalization issue wouldn't be an issue if ICANN didn't have their head up their collective ass.

    4. Re:Great!!! by Sigismundo · · Score: 5, Interesting

      Not sure why the parent has been modded flamebait. It's probably the phrase "alien Latin-English characters", but it's actually an accurate description of how a domain name might appear to speakers of non-European languages.

      I wasn't aware that China had already began experimenting with Chinese characters in domain names, so I did some Googling. Here is a link (in English) that describes how to register a Chinese Domain Name (CDN). It makes for a pretty interesting read. It includes the predictable clause that you can't register CDNs that "harm the glory of the state." Users of CDNs are encouraged to use "Official Client-end CDN Software" to make access more convenient. I wonder exactly what this does.

      In general I think it's pretty cool to be able to have non-ASCII characters in domain names, but it seems to introduce a lot of extra compexity into DNS. Also, it seems like it could open the door for more governmental control of the internet, as TFA mentions.

    5. Re:Great!!! by adrianbaugh · · Score: 1

      I take it there is some good reason why a new but backwards-compatible version of DNS can't be released that uses unicode? Never mind Cyrillic, or Chinese characters, I want a domain name in Tengwar!

      --
      "'I pass the test,' she said. 'I will diminish, and go into the West, and remain Galadriel.'"
      - JRR Tolkien.
    6. Re:Great!!! by Dogtanian · · Score: 5, Funny

      Actually one of big advantages of Microsoft was internalization. You mean that it was possible to shove them up your ass?
      --
      "Slashdot - News and Chat Sites Deviant". (Click "homepage" link above for details).
    7. Re:Great!!! by AKAImBatman · · Score: 1

      I don't think you understand what I mean. I'm not saying that they are the same computer code. I'm saying that they are literally the same characters. Just used differently between Cyrillics and English. The fact that computers have different character codes for the languages is beside the point. In an international environment py is going to equal py. Which can create a bit of a problem. Did I just receive a legitimate email from bankofrussia.py or a phishing attempt from bankofrussia.py?

      Can you tell the difference? I sure as hell can't. The only clue I have is that the browser encoded ErOo in the URL, while PY was spelled out in the URL. Otherwise I'd need to start looking for subtle differences that some character sets provide over others. (Sorry, in Arial they are the same pixel for pixel. Try Courier New.)

    8. Re:Great!!! by msuarezalvarez · · Score: 1

      You mean, something like the '96-proposed IDN <http://en.wikipedia.org/wiki/Internationalized_domain_name>?

    9. Re:Great!!! by Arthur+B. · · Score: 1, Informative

      The characters are not displayed in the same way, I cannot paste cyrillic in slashdot, but the difference between the y and the russian u is visible, the tail of the y is rounder.

      Of course, it leaves room for phishing attack, but they are not the same character. Not historically, not linguistically, not in encoding, not in display.

      --
      \u262D = \u5350
    10. Re:Great!!! by saforrest · · Score: 1

      I take it there is some good reason why a new but backwards-compatible version of DNS can't be released that uses unicode? Never mind Cyrillic, or Chinese characters, I want a domain name in Tengwar!

      And you can get it if you can get Tengwar in Unicode, with the exception of the top-level domain. Unicode characters are already supported and used, but no top-level domains using non-Latin Unicode characters yet exist. Russia is proposing a new top-level domain.

      Thinking about it, there's no need to reinvent the wheel here, is there? The existing IDNA system can be used: if Russia wants "" (or some other abbreviation of the Cyrillic ) as a top-level domain, just give them whatever that maps to in the ToASCII conversion (described in the article I linked). Then the existing software that supports international domain names will work without trouble.

      Of course then Japan will want , and China (maybe Taiwan will want it too), and this may make the top-level domain system balloon out of proportions. But these sorts of top-level domains make a hell of lot more sense to me than dot-biz or dot-museum!

    11. Re:Great!!! by AKAImBatman · · Score: 2, Informative

      The characters are not displayed in the same way

      As I said, it depends on your font. In Arial, they are pixel for pixel. In Courier, they have slightly different shapes. Either way, it doesn't really matter. Very few people will notice the font differences. Why? Because they are the same characters. The fact that a computer provides two copies of the same character, actually causes as many problems as it solves.
    12. Re:Great!!! by ajs · · Score: 2, Informative

      Slashdot is lame like U**x in 1980 and ate the characters you typed. Actually, Slash (the engine behind Slashdot) does exactly the right thing, converting any out-of-latin-1 characters into HTML-encoded characters such as &#041F;

      However, it also eliminates these from display because of the confusion that people use them to inject (e.g. mis-spelling a domain name with Cyrillic characters so that when someone cuts-and-pastes it, their session can be hijacked). It's a specific security feature used on MANY sites which are intended for English-language discussion.

      Actually one of big advantages of Microsoft was internalization. MS jumped on the internationalization bandwagon VERY late in the game, but they were the first to incorporate Unicode into the filesystem which made up for a lot of their delays... better late than never, I guess. Prior to Unicode the approach was typically to have multiple versions of the text associated with an application, in multiple character sets which would be loaded on-demand. These features worked in Unixes that I was using as early as 1987.,

      I could use national characters without any problem in 1994 on NT. "Use" is an interesting term. Most uses of Unicode outside of a Word Processor in vintage NT would result in system crashes and/or corruption.

      Good luck with Linux or most of Unices then. Well... Linux didn't really exist as a commercial OS at that time, so I guess you're right by default. What's more, the Unicode standard had JUST been published in 1991. It took years for most software to adapt to using Unicode, and even longer for the interoperability features to be worked out. Even today, new releases of, for example, Gnome continue to adapt to the ways other cultures use the desktop and OS with their native characters (e.g. with vertical or RtL script).

      You seem to have this rosy view of the world that involves Microsoft products solving the hard problem of internationalization from day one, and everyone else staring dumbly... this is far from the case.

    13. Re:Great!!! by saforrest · · Score: 1
      Unicode characters are already supported and used, but no top-level domains using non-Latin Unicode characters yet exist

      Apparently top-level domains support Unicode characters in URLs, but Slashdot chokes on them! (In links, anyway). Here are some attempts, all failing:

      bücher.de (UTF-8 or ISO 8859-1)

      bücher.de (HTML entity u-uml)

      bücher.de (Unicode character 00FC as entity)

    14. Re:Great!!! by AKAImBatman · · Score: 2, Interesting

      they are not the same character. Not historically

      And yes, they are the same character, historically speaking. Both characters were borrowed from a common Greek/Semitic ancestry. Cross pollination of Latin and Cyrillic languages have lead to Cyrillic renderings of the letter that are more or less the same as the Latin rendering.

      http://en.wikipedia.org/wiki/Y
      http://en.wikipedia.org/wiki/%D0%A3

      http://en.wikipedia.org/wiki/P
      http://en.wikipedia.org/wiki/%D0%A0
    15. Re:Great!!! by Maimun · · Score: 3, Informative
      They ARE the same. Trust me, I am Bulgarian and we also use the Cyrillic alphabet. The Cyrillic alphabet was created in the 9th century by Constantine, a Byzantine friar (I dunno if this is the correct term) serving the emperor in Constantinopol. The church name of Constatine was Cyrill, that is where the name of the alphabet came from. At that time, both Rome and Constantinopol were trying to convert the Slavic states to Christianity. The Eastern Roman Empire, a.k.a. Byzantia, was more flexible than the Catholics: she offered Christianity in the native Slavic languages, while the Catholics insisted on using Latin. The Cyrillic alphabet was introduced precisely for that purpose. It was modified Greek alphabet (Greek was, of course, was the language of the East Roman Empire) with symbols added for those Slavic sounds that had no Greek equivalent. Intially it was adopted in Bulgaria and after about a century or two it was adopted by the Russian proto-state -- in contrast to the Russian myths that the Cyrillic alphabet was first introduced in Russia and even invented in Russia.

      The initial Cyrillic alphabet looked quite different from what is used today in Russia and Bulgaria; the appearance of the modern Cyrillic alphabet is due to a reform by Tzar Peter I of Russia. Peter I imposed visual style similar to the one of the Roman font.

      BTW, the Cyrillic alphabet was not the only creation of Constantine-Cyrill. He had invented another alphabet to be used by the Slavs which was called "glagolitsa" and visually was totally different from the Cyrillic one. This radical design was not very successful, although I've heard it had been used in Croatia until 2-3 centuries ago.

      Here is a four-column table of the original Cyrillic alphabet and the Glagolic one ("glagolitsa"). The first column is the name of each letter (yes, each one had a name; if the names are read sequentially they form a saying, quite deep and meaningful at that), the second is the cyrillic glyph, the third is the glagolic glyph, the fourth is the numeric value.

    16. Re:Great!!! by Arthur+B. · · Score: 1

      The fact that they share ancestry does not mean they are historically the same character. Historically French and Spanish have always been distinct, although they both come from latin.

      --
      \u262D = \u5350
    17. Re:Great!!! by argiedot · · Score: 1

      As far as I know, the top level domain is the .de part.

    18. Re:Great!!! by Maimun · · Score: 4, Interesting

      No, the characters only look the same to a human eye. To a computer they would look quite different:
      This is precisely why Cyrillic symbols are not used in DNS. It is possible to have two URLs, one having latin letters only, the other one latin and cyrillic, that look exactly the same in most fonts but are completely different as strings, so if they are resolved by DNS they'd resolve to distinct IP addresses. This is just perfect for phishing attacks: you can't tell whether www.mybank.com is the URL of your bank "MyBank", or it has a Cyrillic "a" and is registered by the attacker, by simply lookong at it. To tell if the URL is genuine one must examine it with hex editor ro something...
    19. Re:Great!!! by Arthur+B. · · Score: 1

      The initial Cyrillic alphabet looked quite different from what is used today in Russia and Bulgaria; the appearance of the modern Cyrillic alphabet is due to a reform by Tzar Peter I of Russia. Peter I imposed visual style similar to the one of the Roman font.

      You just proved my point. If he needed to impose a visual style *similar* to the one of the Roman font, it means precisely that the characters are different. A character is more than it's different representation with different fonts, a character is a logical unit and the cyrilic er is not the latin p, although they may have similar and sometimes identical representation.

      --
      \u262D = \u5350
    20. Re:Great!!! by Cyberax · · Score: 1

      BTW, it's amusing that you translated the word 'bank' as 'kren'. It means 'list, careen, roll', and not 'place where you put money' :)

    21. Re:Great!!! by AiToyonsNostril · · Score: 1

      The difference between /. and Fark: in /. you get a discussion on the history of an alphabet; in Fark you get people wondering if the Bulgarian nurses imprisoned in Lybia are hot. Glad to see more Bulgarians here.

      --
      "I'm not good. I'm not nice. I'm just right."
    22. Re:Great!!! by Maimun · · Score: 1
      I hardly proved your point. You said:

      The characters are not displayed in the same way, I cannot paste cyrillic in slashdot, but the difference between the y and the russian u is visible, the tail of the y is rounder.
      As someone said, that depends on the font you use. In the fonts I use most of the glyphs are the same pixel for pixel with their visual counterparts. Though they maybe treated differently by the displaying system. I mean the spaces between them may be (very slightly) different.
    23. Re:Great!!! by AKAImBatman · · Score: 1

      I was actually wondering about that. I'm well aware that "Bank" is "baHK" (bahnk) in Russian, but Babelfish insisted that Kren Russee was a proper translation for "the bank of Russia". I wonder which "bank" it was thinking of. Stupid Babelfish. :-/

    24. Re:Great!!! by Anonymous Coward · · Score: 0

      I really doubt allowing unicode in non-US TDLs is somehow going to open up the American public to more phishing attacks.

      What you describe is a problem of dumb users clicking random links, and unicode isn't going to change that. Really, there are better ways to "fake" user links (e.g. I've been getting emails with links that look something like www.mybank.com/blah/blah/@rwww.hackerserver.com for years)

      Your banks should be educating the users to either manually type in the address each time, or bookmark it, and always look for a valid certificate (which phishers will not use since that makes them much easier to trace).

      Maybe ICANN has a valid reason for being hardassed, but this is not it.

    25. Re:Great!!! by alexhs · · Score: 1

      Can we all agree that they have similar glyphs but are different characters / graphemes ?

      --
      I have discovered a truly marvelous proof of killer sig, which this margin is too narrow to contain.
    26. Re:Great!!! by zsau · · Score: 1

      Have you been to countries that use Cyrillic alphabets? I have. I frequently saw URLs written in Cyrillic, even though they were meant to be transliterated into Latin. So someone advertising "bank.kg" (I was in Kyrgyzstan) would write (fancy looking b)a(small captial h)(small capital k).(small capital k)(small capital gamma). They might have written the URL on the computer, but when it's a poster the character codes aren't there any more, it's just ink. I can understand the confusion that would come in Russia, where someone might right "bank.ru" as (fancy looking b)a(small captial h)(small capital k).py.

      This isn't (entirely) about internationalised domain names and punycode; this is about being able to print domains in a cultrally acceptable, unambiguous way, even when character codings aren't around.

      --
      Look out!
    27. Re:Great!!! by oldhack · · Score: 2, Insightful

      You guys are failing to communicate because you have different premises. Batman defines character by the appearance, Arthur by its semantic (as does Unicode). Semantic definition is clearer than the visual one, especially since the appearance of the same character varies depending on the font used. The possible problem due to similar appearances remains, although I don't know how big of a problem it is/will become.

      --
      Fuck systemd. Fuck Redhat. Fuck Soylent, too. Wait, scratch the last one.
    28. Re:Great!!! by AKAImBatman · · Score: 1

      Y and Y are fully equal in the historical sense. They derive from the same letter, and even convey the same sound in their respective languages. English overloads Y with a few different allographs, but pronouncing Y as "oo" is a common form.

      P and P are different graphemes due to the evolution of the modern glyph by way of the Latin language. The Romans had already evolved a P symbol from Greek/Semitics, so when faced with P "Er" they chose to add a line to differentiate it; thus forming the modern letter R. P (ru) and R (en) are compatible graphemes, but incompatible symbols. P (ru) and P (en) are compatible symbols, but incompatible graphemes.

      Don'tcha just love allography and etymology? =D

    29. Re:Great!!! by saforrest · · Score: 1

      As far as I know, the top level domain is the .de part.

      Sorry, I meant "domain names support..." not "top-level domain names support...".

      Even that doesn't really make sense: I should have said "Unicode characters in domain names are supported."

    30. Re:Great!!! by AKAImBatman · · Score: 1

      Actually, to correct myself, I'm confusing Y with U. (I don't know what the heck I'm smoking on that one.) U gets overloaded with an "oo" sound. Y is overloaded with an "ee" sound. Both U and Y have a common ancestry, having evolved from the same character (Upsilon). So Y and P sort have the same compatibility status.

      Sorry about the brainfart.

    31. Re:Great!!! by ctzan · · Score: 1

      Users of CDNs are encouraged to use "Official Client-end CDN Software" to make access more convenient.
      last time I've tried you couldn't query their bloody registry with a simple whois client: you had to go through a website with cookies, captcha and click-wrap agreement, or use a windows-only binary. They're probably advertising the latter.
    32. Re:Great!!! by ardle · · Score: 1

      Think it's "bank" as in "fighter plane".
      Interesting ;-)

    33. Re:Great!!! by shutdown+-p+now · · Score: 1

      Still different characters, sorry. They happen to look familiar for certain fonts (and even then you were wrong there - for "y", capital letters look quite different in Latin and Cyrillic, for example), but (standard) handwriting is different for both. Also, quite a few Cyrillic letters are noticeably different when rendered in italic.

    34. Re:Great!!! by Dakkus · · Score: 1

      Well, the Russian bureaucrats seem to think of them as the same character, at least. If it wasn't so, the Russian licence plates probably would also use other letters than only A, B, C, E, H, K, M, O, P, T, X ;)
      BTW, those letters transcribe to latin as A, V, S, YE, N, K, M, O, R, T, H.

      I don't think it's a coincidence that the licence plates only use letters that can be recognized both in Russia and in west. It is impossible saying whether they are using cyrillic or latin letters in the licence plates, because they are only considered characters by the bureaucracy. And now, somebody will probably dig up an old soviet plate and tell that for example the Leningradskij Oblast plates have letter in them. That is true. However, Soviet Union != Russia. Hopefully it's not a surprise that interoperability with west wasn't top-priority in CCCP/SSSR.

    35. Re:Great!!! by CaptainCarrot · · Score: 1

      Ah, yes. Wikipedia is the ultimate authority on such matters. Sorry, but I can't take any reference seriously where "basically" is used to introduce an assertion of fact. Yes, the letters all have a "Greek/Semitic ancestry". So do nearly all alphabets in the world other than Korean.

      That doesn't mean you can grab any two similar looking letters and say they're the same historically. Latin P and Cyrillic P are not. Latin P is descended from Greek pi as transmuted through Etruscan and ultimately derives from the Phoenician pe; Cyrillic P (er) is from Greek rho, which is from Phoenician resh. In some typefaces they look the same. In other typefaces, and most certainly in handwriting, they do not. (In the above, Cyrillic P is represented with Latin P, since if I type the Cyrillic letter, stupid /. changes it to a numerical entity and then won't display it. Idiocy. Note to Slashcode maintainers: How about we gallop into the 21st century and support UTF-8 encoding?)

      Cyrillic U has the same origin as Latin Y only by a roundabout route. Greek upsilon had already been borrowed into Latin as V to represent the sound now written with U in most European languages. Y was a re-borrowing of the same letter to spell Greek loanwords after the pronunciation of upsilon changed. Upsilon is from Phoenecian waw, which also gave us F. (F through Greek digamma, which is a secondary Greek borrowing from waw to represent the sound of our W, which later disappeared from the language. It survives only as a numeral.) Cyrillic U is short for an archaic digraph, uk -- in many fonts this is shown using the modern form of the letters; it was originally o-izhitsa with a ligature -- from the Greek digraph omicron-upsilon which has the same sound. (Izhitsa, which had been borrowed directly from upsilon, is no longer used in Russian.) So to the extent that the form of Cyrillic U is based on that of izhitsa, yes, it's from upsilon. Typographically, uppercase Cyrillic U isn't identical to the Latin Y in any font I know of, including Arial, and they are again dissimilar handwritten in both capitals and small letters. (Lowercase Cyrillic u is identical to y in most fonts -- but not Courier New, as it happens, since it exaggerates the serifs and knows perfectly well the lower one doesn't belong on the Cyrillic letter.) From a data processing perspective, even when they look identical they're a mismatch in a string comparison. So if there's any situation where it's a mistake to use one for the other, it's on a computer.

      The editorial addition to the post is needlessly alarmist. Whether they're going to heavily filter web traffic or not, it's perfectly reasonable for a nation of 140 million people to prefer their own alphabet to a foreign one. Ditto for China's 1.3 billion.

      --
      And the brethren went away edified.
    36. Re:Great!!! by CuriousCuller · · Score: 3, Interesting

      I live in Poland, more specifically in Przemysl on the Ukrainian border so I'm exposed to both alphabets more or less daily. I must confess, I envy the Easterners! The Latin alphabet is really not suited to Slavic tongues and I think the Cyrillic one is a far superior way to render them. For example, in Cyrillic you get one nice little letter looking like w with a tail, whereas we get szcz... if you're an English speaker, it'd be something like the sh ch between freSH CHeese. Anyway, the inadequacies of the Latin alphabet is why Polish sometimes ends up looking like a cat walked across the keyboard and totally bewildering to anybody living west of the river Oder. Consider this little gem: w Szczebrzeszynie chrzaszcz brzmi w trzcinie i Szczebrzeszyn z tego slynie - and that's without actually using any of the eight accented letters. Basically, horrible things were done in the past to squeeze a square peg into a round hole and that's why Polish has ended up with rather random letter combinations like cz, ch, rz, sz, szcz etc. in order to get 36 sounds out of a measly 23 letters (Polish doesn't use v, x or q)... Cyrillic is far more efficient all things considered - with one letter for each distinct sound. Alas, we're stuck with what we have now... a pity.

    37. Re:Great!!! by Hal_Porter · · Score: 1
      Actually there's an irony here. Dos used 8 bit character sets based on the original IBM PC. Actually Dos ran on other 8086 hardware so the Dos character set is called the OEM character set for "Original Equipment Manufacturer". Since IBM clones gradually drove out other hardware Dos ended up using the IBM PC character set in the US and Europe. In Asia there were national character sets based on MBCS. They encoded thousands of characters by using a Lead Byte scheme. I.e. characters above 0x7f combined with the next byte to give more character combinations.

      But MBCS was hard to deal with. String operations are complicated by the variable width encoding. But Dos needed to do this, since Asian governments and software vendors essentially insisted on this scheme.

      When Windows was being designed Unicode had just been standardised. The benefit was fixed width encoding at 16 bit, UCS2. Which meant you could use C string operations like ptr-- and ptr++ again rather than a mess of AnsiNext and AnsiPrev, assuming ptr was a PTCHAR. In Windows a TCHAR is a 16 bit word if you build for Unicode and a byte otherwise.

      The irony is that Unicode has bloated to the point where it can no longer be encoded as a 16 bit fixed width encoding. That gives 65536 characters, the Basic Monolingual Plane. But there are more than 65536 characters which means that you need to use a variable length encoding, UTF16, again.

      http://en.wikipedia.org/wiki/UTF-16/UCS-2#Use_in_major_operating_systems_and_environments

      UTF-16 is the native internal representation of text in the Microsoft Windows NT/2000/XP/CE, Qualcomm BREW operating systems; the Java and .NET bytecode environments; Mac OS X's Cocoa and Core Foundation frameworks; and the Qt cross-platform graphical widget toolkit.[citation needed]

      Symbian OS used in Nokia S60 handsets and Sony Ericsson UIQ handsets uses UCS-2.

      Older Windows NT systems (prior to Windows 2000) only support UCS-2. The Python language environment has used UCS-2 internally since version 2.1, although newer versions can use UCS-4 to store supplementary characters (instead of UTF-16). Which raises the question of why Windows didn't use UTF8. Mostly because it wasn't standardised when Windows was being designed. But there's a deeper issue that applications often infer a locale from the active code page and they would break if it wasn't the one they expect. So the main advantage of Unicode has been lost. The disadvantage that ASCII characters bloat to 16 bits is still there though.
      --
      echo -e 'global _start\n _start:\n mov eax, 2\n int 80h\n jmp _start' > a.asm; nasm a.asm -f elf; ld a.o -o a;
    38. Re:Great!!! by forgotten_my_nick · · Score: 1

      As I understand it the browser would handle this.

      so your URL would look like.

      www.myb%10%72nk.com

      or something like that.

    39. Re:Great!!! by andreyw · · Score: 1

      Cyrilllic actually originated from the Greek alphabet.

      Looking at the capitals (lower case greek being a medieval invention... nevermind cyrillic lower case)

      A - A
      B - B
      'G' - 'G' (written the same way, that Slashdot cannot show because it sucks)
      'D' - looks very similar
      E - Same
      X ('H' sound) - same
      P ('R' sound) - same
      'P' sound - same
      'T' sound - same
      'L' sound - same
      'Th' sound - was present in pre-revolution russian
      'OO' sound - Looks a lot like 'ipsilon', which used to be uepsilon, and in modern greek still sounds like a 'OO' when written like 'ou'.
      'O' sound - same
      'M' sound - same
      'EE' sound - came from the greek 'H' (eeta)
      'K' sound - same
      'N' sound - looks like the latin 'H', came from the Greek 'N'.
      'F' sound - Same
      'S' sound - Came from the the Greek sigma, and still looks a lot like the 'at-the-end' version of it.
      'Psi' - Same, still used in church slavonic
      'Ksi' - Same. still used in church slavonic
      'Z' sound - came from Dzeta
      'Omega' - Used to be present in cyrillic as well.

    40. Re:Great!!! by famebait · · Score: 1

      As I understand it

      You don't. If unicode domain names becomes commonplace, why would not browsers adapt to displaying them 'properly'?

      --
      sudo ergo sum
    41. Re:Great!!! by forgotten_my_nick · · Score: 1

      "You don't. If unicode domain names becomes commonplace, why would not browsers adapt to displaying them 'properly'?"

      Why don't they now?

      However even if your correct, it would be easy to make anti-phishing systems to show the difference.

    42. Re:Great!!! by Richard+W.M.+Jones · · Score: 1

      This is precisely why Cyrillic symbols are not used in DNS. It is possible to have two URLs, one having latin letters only, the other one latin and cyrillic, that look exactly the same in most fonts but are completely different as strings,[...]

      This is not the reason why Cyrillic symbols are not used in (old, non i18n) DNS. It's because DNS was invented before people took i18n seriously. If phishing was a real concern then ASCII characters such as '1/l', 'O/o/0' would also be banned.

      Rich.

    43. Re:Great!!! by CRCulver · · Score: 2, Interesting

      Sts Cyril and Methodius did not invent the Cyrillic alphabet. They invented only the Glagolitic alphabet. The Cyrillic alphabet was invented in the Kingdom of Bulgaria nearly a century later.

    44. Re:Great!!! by petermgreen · · Score: 1

      It's great that nations can use their own languages instead of being forced to use alien Latin-English characters.
      It may seem great but the internet is supposed to be a worldwide network.

      How will a chineese person type the email address of thier russian friend and vice-versa?

      How will you report spam when the abuse addresses are in an alphabet you don't recognise?

      How will you write down the address for a friend of a helpfull page that just happens to be on a russian site?

      For better or worse the basic form of the latin alphabet is the one thing that can be easilly typed by most computer users worldwide.

      --
      note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
    45. Re:Great!!! by petermgreen · · Score: 1

      1/l and 0/o (domain names are generally shown in lowercase at least in browsers) look fairly different in most fonts (the main difference between 1 and l is not so much the shape of the character as the different spacing.

      Unlike say a and it's crylic equivilent which look pixel for pixel identical in many common fonts.

      --
      note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
    46. Re:Great!!! by Anonymous Coward · · Score: 0

      I agree with almost all of your points, I would just like to point out that Cyrillic is not Russia's "own" alphabet - many other countries use it, and it was certainly not invented in Russia neither. I agree with the rest.

    47. Re:Great!!! by 21mhz · · Score: 1

      in contrast to the Russian myths that the Cyrillic alphabet was first introduced in Russia and even invented in Russia

      I'm from Russia and I'm unaware of anyone who would spread such myths.
      To the contrary, there's much reverence to Cyril and Methodius.

      --
      My exception safety is -fno-exceptions.
    48. Re:Great!!! by jc42 · · Score: 1

      In this case, the characters [English "py" and Russian chars that look like "py" are exactly the same.

      Actually, in Unicode they are not the same. The Russian TLD we're talking about looks like the English ".py", but that is encoded in Unicode as the hex chars 70 79. The Russian (Cyrillic) chars are encoded as hex 0440 0443, or in UTF-8 as D1 80 D1 83. At the bit level, which is what a DNS server or browser sees, they are totally different character strings. Their glyphs look the same, but they are different bit strings.

      If you understand a bit of Russian, you might enjoy the wikipedia article on what is sometimes called Volapuk encoding. (And note the omission of the umlaut over the 'u'. ;-) Russians have long been playing fast and loose with computer encodings of their language. Not surprising, considering the total disregard that the American-dominated computer industry has had for language that use non-Roman alphabets, plus of course the typical Russian sense of humor. Some of the links in this article lead to truly hilarious examples of Russian written with ASCII characters.

      The wikipedia Mojibake article is also worth reading.

      And note that, unlike slashdot, wikipedia has no problems with mixing languages and writing systems within a single article. Hint, hint ...

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
    49. Re:Great!!! by CaptainCarrot · · Score: 1

      "Own" as in the one they use natively, not "own" as in exclusively theirs. Most Americans think of English as their "own" language, although it's used in a number of other countries and we didn't invent it. The word doesn't imply exclusivity in this context.

      --
      And the brethren went away edified.
  3. It's not really translation by mr_mischief · · Score: 4, Informative

    You can't really translate between 'r' and rho. It's a character set issue. It's a straight equivalency of sounds. Cyrillic is based on the Greek alphabet and the English alphabet is based on the Latin alphabet. It could be confused with Paraguay because of the character encoding, but it's not really the same letters.

    1. Re:It's not really translation by lb746 · · Score: 1

      I'm not sure if translation is just used due to the lack of a word to describe the process. Maybe more like transcribing?

      This whole process makes sense to me at least with my familiarity from browsing Chinese websites. They can 'translate' or transcribe the characters into the pinyin system of roman characters, but this in no way really explains the character or definition. It actually forces people to learn a middle language that helps make it easier to translate between the Chinese character set and the Roman character set.

      As a quick example with link to graphic of Chinese characters:

      Ni Hao == ni2 hao3 == Hello
      So while putting in the characters with a dot com at the end makes sense to Chinese speaking individuals, it makes no sense to those that use the roman system. Thus we find websites translated to pinyin first(with the tone numbers removed).

      Personally I would much rather type in Chinese characters to go to a specific Chinese website than typing in pinyin that doesn't really do the translation justice

    2. Re:It's not really translation by Cctoide · · Score: 2, Informative

      I'm not sure what you're asking, but I've always heard of conversion between scripts (i.e. writing systems) being called transliteration.

      --
      "Let's face it, it's a good story. Accuracy would kill it."
    3. Re:It's not really translation by lb746 · · Score: 1

      Yeah thats the word I was looking for. I knew it wasn't really translating or transcribing but I just couldn't remember. Thanks!

    4. Re:It's not really translation by _|()|\| · · Score: 1

      From the user standpoint, it's a distinction without a difference. In most fonts, Latin "py" is not readily distinguishable from Cyrillic "ru." However, I would argue that confusion is more likely with the proposed Cyrillic domain names than with the current all-ASCII system. I am sympathetic to the desire for more localization, but the ramifications of a change like this should be considered very carefully.

    5. Re:It's not really translation by mr_mischief · · Score: 1

      If there's no difference in the words we use, then we should stick to just one word. I propose "Oof". ;-)

      Seriously, though, I think you've struck on the right issue here. It's not a problem caused by the current system. It's a problem encountered when expanding the current system to include other character sets. For the people designing the DNS had considered this change way back when they designed the initial system and assigned the ccTLDs, it would have been nice but would've required an extreme case of forethought.

      These are the types of snags extending a system beyond its original design will encounter. As for the proper ccTLD, perhaps .ru should be replaced entirely with .rf in the standard and current domain holders grandfathered into the new system. It seems like a good fix for this specific problem.

    6. Re:It's not really translation by forkazoo · · Score: 1

      You can't really translate between 'r' and rho. It's a character set issue. It's a straight equivalency of sounds. Cyrillic is based on the Greek alphabet and the English alphabet is based on the Latin alphabet. It could be confused with Paraguay because of the character encoding, but it's not really the same letters.


      Well, it's both wrong, and not wrong. "Translation" is often used in a very broad sense to say things like source code is translated into an executable form by a compiler. Programmers who work with text might well have a background in things like compilers, and do i18n using that sort of terminology. Translation also means to take some semantic meaning and express it in another natural language, which is obviously not what this is. So, depending on what exactly the author meant, it may or may not have been an incorrect usage.

      And, best of all, it's completely valid to also have the argument about whether identical letters from different writing systems are the same letter or not. In some contexts they are. In other contexts, they have almost no relation. From the perspective of a person looking at a letter, the shape and appearance is absolutely the only thing that matters. For the DNS server, the only thing that matters is naturally the sequence of bits that appear in a query.

      The best term in place of translation is probably "transliteration" which refers to changing between two different writing systems without concern for meaning, but trying to preserve pronunciation. Even if it's the word I'd use, I can hardly argue it ought to be the only word that is correct. You know, some days, you just want to conquer the world and make all but one writing system illegal and burn the rest. I don't really acre which one we wind up with.
    7. Re:It's not really translation by Anonymous Coward · · Score: 0

      It's a character set issue. It's a straight equivalency of sounds. ...

      You completely missed the point of the article.

      English/Latin letter "P" looks exactly as Russian/Cyrillic letter equivalent to Latin "R". Similarly for "Y" and "U".

      If a Russian had to write "RU" in Rissian while using Latin letters that look exactly as Cyrillic, he would have wrote "PY".
    8. Re:It's not really translation by temcat · · Score: 1

      This holds true only for lowercase letters. Uppercase Russian "U" is different from "Y" - namely, it looks exactly like lowercase "y", only larger.

      Russian is my native language.

    9. Re:It's not really translation by mr_mischief · · Score: 1

      The fuck I missed the point. Translation is not the right word. Sure, I only took one semester of Russian, but we did learn the alphabet, thankyouverymuch.

      Russians who learn English don't use Latin characters for their resemblance to Cyrillic letters any more than Americans learning Russian use Cyrillic letters based on the Latin letters they look like. They are both phonetically spelled languages, and learning the different letters is part of learning the languages.

      The confusion isn't in using the wrong letters in the other language. The confusion is in not being aware at any given moment which language's encoding is in use for the domain. The problem of the characters being similar in appearance between the two languages is a transliteration problem, as many people have said in this thread. Transliteration and translation are two very different things.

    10. Re:It's not really translation by jonadab · · Score: 1

      > I'm not sure if translation is just used due to the lack of a word to describe the process.
      > Maybe more like transcribing?

      There is a word for taking a word and importing it phoneme-for-phoneme into a different writing system, but most non-linguists don't know the term. It's called "transliteration". Translation, technically, means that you move from one *language* to another, so as a general rule more changes than just the writing system. So, for instance, if you're going from Greek to English, translating gives you "messenger" where transliterating gives you "angelos". (Yeah, we get the English word "angel" from this Greek word.) With close cognates sometimes not *much* more than the writing system changes. For instance, going from Russian to English transliterating will give you "avtobiografiya" where translating gives you "autobiography". That's pretty similar, but only because the words in both languages have a shared etymology, i.e., they both get it from the same original words in Greek (literally, "self life message"; autos also has other meanings (primarily, it is the third-person personal pronoun), but here it means self).

      Anyway...

      Introducing a parallel system wherein sites can be registered with domains containing either Cyrillic *or* Latin characters will only exacerbate the confusion. Under an all-ASCII domain system people just have to learn that in domains p is a Latin pi, not the same letter as rho, and if they see what they think is a rho it's really a Latin pi. Under a parallel system that supports both, if you see a p you don't know if it's a Latin pi or a Cyrillic rho; they look so similar that they can be hard to tell apart even when you see them side-by-side, much less in isolation from one another. In some font faces (especially sans-serif ones) they may even be exactly the same glyph, visually identical. There are a number of other letters that will have this problem, too, e.g., Cyrillic &#1042; (i.e., v) and Latin B (capital b), Cyrillic &#1052; and Latin M (same phonetic value), Cyrillic &#1053; (n) and Latin H, Cyrillic &#1054; (omicron, i.e., short o) and Latin O, and so on and so forth.

      The Mozilla folks have put a lot of thought and effort into figuring out the best ways to deal with these issues.

      --
      Cut that out, or I will ship you to Norilsk in a box.
    11. Re:It's not really translation by jonadab · · Score: 1

      > (literally, "self life message"

      *smacks forehead*

      That should read "self life writing". If it were "self life message" it would be autobiology, but we use -ology to mean the study of something, so I guess autobiology would be the study of your own life. Or something.

      --
      Cut that out, or I will ship you to Norilsk in a box.
    12. Re:It's not really translation by jonadab · · Score: 1

      I should note too that Chinese is a special case, because it isn't a phonetically-defined language in the same way as English, Russian, and for that matter most other languages. When you transliterate from one phonetic writing system to another, you aren't really changing very much (except in cases where the target writing system doesn't have letters for all the phonemes you need, which admittedly does happen). It's still fundamentally the same word, with the same pronunciation and the same meaning, just written in a different alphabet.

      Chinese is different because the shapes of the characters carry meaning beyond the sounds they make. In a phonetic writing system the letters just stand for sounds, so when you go from one of them to another, as long as you transliterate in a way that gives you the same sound, the meaning is still carried in exactly the same way as before. Some languages even have more than one different writing system that it's normal to write them in, e.g., some east-European Slavic languages are routinely written in either Latin or Cyrillic.

      --
      Cut that out, or I will ship you to Norilsk in a box.
    13. Re:It's not really translation by rtb61 · · Score: 1
      It would not really be a global parallel system as the only people likely to use the Russian DNS servers would be Russians, just as for the Chinese DNS servers. Being that the choice of DNS server configuration is yours to make, you can of course use them or any other DNS server.

      This would mean that your browser should clearly show and define exactly which DNS server you are actually using to avoid confusion.

      Of course the other big thing is that the .gov, .mil and .edu can all immediately get reassigned to local addresses rather than some foreign imperialistic state ;).

      --
      Chaos - everything, everywhere, everywhen
  4. soviet russia bait by savuporo · · Score: 3, Funny

    i think this is a specially engineered news post to bring out the lamest "in soviet russia" jokes of slashdot. bring it on!

    --
    http://validator.w3.org/check?uri=http%3A%2F%2Fwww.slashdot.org Errors found while checking this document as HTML5!
    1. Re:soviet russia bait by Avohir · · Score: 1

      In Soviet Russia, you may have slight confusion over the proper TLD!...

      ... I dont think i get this game :(

      --
      To err is human, to really foul up requires a computer
    2. Re:soviet russia bait by Anonymous Coward · · Score: 0

      In Soviet Russia acrylic internets also blocks crossbows.

  5. cyrillic("ru") != "py" by Anonymous Coward · · Score: 0

    While "ru" written with cyrillic letters may look confusingly similar to "py", it is not the same.

    1. Re:cyrillic("ru") != "py" by David+Chappell · · Score: 1

      >While "ru" written with cyrillic letters may look confusingly similar to "py", it is not the same.

      Confusingly similar is an understatement. In many fonts the same glyphs are used. Yes, the same glyph is printed for both latin p and cyrillic r and the same glyph is printed for both latin y and cyrillic ew.

      Scammers could exploit the fact that people in Russia pronounce the domain "ru" as "rew". If they see the glyphs "py" and do not remember that the glyphs should be interpreted according the the latin alphabet, they will pronounce them as "rew" as well. Thus, one could easily fail to notice the difference between "rambler.ru" and "rambler.py".

    2. Re:cyrillic("ru") != "py" by mr_mischief · · Score: 1

      Which, really, is no different from allowing Cyrillic in the second-level domain. Having one site in the same ccTLD as the one it spoofs is no more secure than spoofing across ccTLDs.

  6. In Soviet Russia ... by trolltalk.com · · Score: 5, Funny

    In Soviet Russia, DNS blocks YOU.

    ... which is the whole point of "greater control".

    1. Re:In Soviet Russia ... by Anonymous Coward · · Score: 0

      Funnily enough, there is no slashdot.ro ... but guess what there is? A slashdot.cn!

    2. Re:In Soviet Russia ... by trolltalk.com · · Score: 1

      If they can block links to myminicity, I for one would welcome our new Russian DNS Overlords!

      Better yet, lets give them myminicity.ro or rf, or whatever, and let let them build their own gulags.

    3. Re:In Soviet Russia ... by adinu79 · · Score: 1

      Oh, how us Romanians (.ro domain) would love a Slashdot version in our language, and I'm also sure a Russian (.ru) site would be great for the russians.

    4. Re:In Soviet Russia ... by fm6 · · Score: 2, Funny

      In Soviet Russia, they are so tired of this joke.

    5. Re:In Soviet Russia ... by Random-words-writer · · Score: 1

      In ex-soviet Russia, there is a http://www.slashdot.ru/ ! (and they haven't myminicity links) I hope that their Slashdot won't be slashdotted by our.

  7. Domain names by Anonymous Coward · · Score: 0

    The desire to have national domains might be understandable, but way too many tools depend on fairly strict domain name validity checking and having non-Latin/non-numeric/non-dash/non-dot chars is going to make all those tools barf. Hardly a consideration for the Govt that presided over murder of millions of its own citizens.

  8. Well... by gibbdog · · Score: 2, Funny

    In Soviet Russia, the domains name you!

  9. Just to spike the ball..... by edwardpickman · · Score: 5, Funny

    and prevent foreign outsourcing of Russian web site construction they plan to launch a version of HTML in Cyrillic. Soon to be followed by C++ in Cyrillic. Microsoft decided it was a niffty idea so they plan to start a Pig Latin based coding language called "Squeal Like".

    1. Re:Just to spike the ball..... by Anonymous Coward · · Score: 0

      The sad thing is, this has already happened. If you bought a localized version of Microsoft Office you used to get a localized version of the macro language as well. Honestly. I've had to work with it. (And I derived a lot of humor from it.) On the bright side, it was interoperable with English Office; if you opened it you saw the English names instead of the localized statements, or vice versa. Nowadays Microsoft just seems to ship the English macro language.

    2. Re:Just to spike the ball..... by Animats · · Score: 1

      HTML in Cyrillic...

      You can write XML with Cyrillic tags. XML with tags in Mandarin Chinese shows up now and then.

    3. Re:Just to spike the ball..... by techpawn · · Score: 2, Interesting

      This is why we need "common" as a language choice! Go ahead and keep your individual languages (English, French, Goblin) but also have a "Common" language for all people. Like in Firefly everyone spoke a little English and a little Chinese to create a language of the people...

      I fear that it would create more and bloodier Wars than ever before though.

      --
      Ask not what you can do for your country. Ask what your country did to you
    4. Re:Just to spike the ball..... by megaditto · · Score: 1

      Funny you should mention outsourcing. Last time I checked, Russia annually issues about 7,000,000 work visas for foreigners.

      Now compare that to all the bitching about 60k H1Bs...

      --
      Obama likes poor people so much, he wants to make more of them.
    5. Re:Just to spike the ball..... by Anonymous Coward · · Score: 0

      Would the Microsoft Pig Latin Database be called C Squeal?

    6. Re:Just to spike the ball..... by Anonymous Coward · · Score: 0

      Like in Firefly everyone spoke a little English and a little really badly spoken Chinese to create a language of the people...

      I fixed that for you. They might as well of used a made up language instead of speaking pretend Chinese.
    7. Re:Just to spike the ball..... by Nerdfest · · Score: 2, Funny

      The database oriented variation would be called "SQL Like a Pig".

    8. Re:Just to spike the ball..... by Cyberax · · Score: 1

      Actually, the most-used programming language in Russia is the language of 1C:Enterprise (http://en.wikipedia.org/wiki/1C_Company), it has Russian keywords and system variable names - it's the only sane way, because some terms of Russian accounting do not translate well into English (and transliterated Russian is _ugly_).

      Though, Russian text in computer programs looks very weird.

    9. Re:Just to spike the ball..... by mdahl · · Score: 0

      We already have that, actually. Esperanto is designed to do that. Esperanto on Wikipedia for more info

    10. Re:Just to spike the ball..... by Anonymous Coward · · Score: 0

      I fear that it would create more and bloodier Wars than ever before though.

      I seem to be having tremendous difficulty with my lifestyle. -- A. Dent
    11. Re:Just to spike the ball..... by Hoi+Polloi · · Score: 1

      6,900,000 of them are working on sending spam.

      --
      It is by the juice of the coffee bean that thoughts acquire speed, the teeth acquire stains. The stains become a warning
    12. Re:Just to spike the ball..... by zoogies · · Score: 1

      Any video clips of this on, say, Youtube? I'm interested in seeing it. Speaking "a little English" and "a little Chinese" does not create a language of the people...it sounds pretty outlandish, if you ask me.

    13. Re:Just to spike the ball..... by Anonymous Coward · · Score: 0

      FWIW, there already is a "common" language that's been designed from the ground up to be as easily learned as possible by as many people as possible. It's called Esperanto. Guess what...no one's learned it. No one cares to learn it because there's no one that it enables them to talk to.

      Rather than that, I'd advocate i18n-izing programming languages. Just require an ASCII line at the top of each file that's something to the effect of:

      @en;UTF-8;
      or:
      @ru;iso-8859-5

      After that, the file is parsed according to the indicated language and character set and all language keywords are expected to be in the indicated language.

      Though this would create issues for open source projects that have developers all over the world who speak many languages (i.e. each project would likely have to mandate a specific language).

    14. Re:Just to spike the ball..... by jc42 · · Score: 1

      XML with tags in Mandarin Chinese shows up now and then.

      Yeah; a few months ago I got involved in testing "i18n" stuff on a bunch of machines. One sample file is a multi-lingual dictionary, with wildly different writing systems for each language. Just for fun, I decided to write some CSS, and for each language, I used class names in that language's alphabet. Thus the Chinese fields used CSS class names that were Chinese characters. I tested it against the dozen browsers on this Mac, 8 browsers on my linux box, plus IE and FF on a XP and Vista Windows machines. I was impressed by the fact that it worked just fine on all of them.

      Not that this should be a surprise. The text was encoded as UTF-8, of course, and most software doesn't actually need to be modified to handle this. If the chars in the range 0x80 to 0xFF are treated as unknown "letters" and just carried along as unanalyzed words, it everything should work for everything except code that is concerned with actual rendering of the glyphs. Problems usually arise only if code tries to do something "intelligent" with chars that have the high bit set.

      I have wondered whether this success means that the programmers who implemented CSS have been too dumb to even consider non-ASCII character sets, or they were smart enough to realize that they didn't need to do anything to make UTF-8 (and the ISO 8859-* encodings, for that matter) work correctly. If it's the former, I hope they don't try to get smart about i18n and mess things up as a result.

      IMO, the folks who decree Web standards should just declare UTF-8 the standard encoding for everything inside URLs. Implementing it is essentially trivial, except for rendering, which mostly just takes some rather big tables. (And memory is getting cheap, right? Right? ;-)

      The only real problem, as mentioned above, is things like the Cyrillic .py suffix (the first two letters of the Russian spelling of "Russian") looks in most fonts exactly like the Roman ".py" suffix for Paraguay. But this is nothing new. We have that problem in ASCII. Try distinguishing "0" from "O" in most fonts. Then consider "1", "l" and "I" in any sans-serif font. Actually, "1" and "l" look identical in the serif font I'm using here. I keep seeing examples where "modem" and "modern" look alike, as to "clear" and "dear" in many fonts. I'm sure you can make up lots of other examples. The spam and phishing crowds use this as one of their tools for getting past most filters.

      Frankly, I think it'd be fun watching the Net tackle this problem seriously, rather than pretending that it's just a problem with Cyrillic and using it as an excuse to block full internationalization of the Web. Not being able to spell non-English names right is a growing annoyance.

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
    15. Re:Just to spike the ball..... by Animats · · Score: 1

      There are some security issues with Unicode URLs. See this article. There's been an attempt to define a matching process for Unicode domain names such that homoglyphs compare equal. This deters spoofing by using similar-looking Unicode characters, and makes it possible to type in domains.

  10. Make the damn furiners learn English! by Anonymous Coward · · Score: 0

    Translations just slow everybody down.

    It's time we get our tongues in order and build a tower to the heavens!

  11. Further Proof by Urger · · Score: 0, Troll

    This is further proof that everyone should just speak English.

    1. Re:Further Proof by jacquesm · · Score: 5, Insightful

      Hm, troll ? Maybe, maybe not. When I was 14 or so one of my main motivations in learning english was to be able to work better with computers, all the books I could find where in english. In the early 80's when everybody was too busy solving problems instead of customizing their desktop and putting the right accents on letters that are unambiguous anyway.

      The PC, the web and the laser printer changed all that. Mainframe printers were mostly 'chain' printers with a very limited (EBCDIC) character set, not much chance to get your fancy local script there, so people worked around it and on the whole were ok with the solutions.

      Now we get top level domains with all kinds of accents in them and completely local scripts. This 'internationalization' of computing is a good thing for many people because they can now access the digital world in their own language, but at the same time it removes us one step from having a universal language, and the web could have easily given us that holy grail. Because not to be part of the cyber community or learning English ? It would have been an easy choice for most, one or two generations and English would have become a de-facto world standard.

      The situation we have right now will long term probably mean that the amount of content on the net will be proportionally spread out over the various languages, with English only being a (slightly) disproportionally high fraction.

      That universal language window of opportunity is probably lost for a long time, whether it ever was a serious possibility if of course open to debate, I for one had some hope that it was.

    2. Re:Further Proof by SQLGuru · · Score: 1

      I agree.....granted, I speak English natively (and have no secondary language), I would have liked to see a universal language (not required to be English, but it is sort of the de facto standard) emerge and the web would have been the perfect vehicle for forcing it.

      Layne

    3. Re:Further Proof by Sigismundo · · Score: 1

      I agree that having a universal language is a nice idea. But I don't see how the web was a missed opportunity in this respect. Do you think that early on the web should have been restricted to English-only content?

    4. Re:Further Proof by SQLGuru · · Score: 1

      I don't think that it should have been a rule, but in the early stages of development, it was dominated by the English language (most of the participants were either native English speakers or those like the parent who learned English in order to participate). Even today, there is a large English influence in "things Internet".....just less so percentage wise compared to those early days. If, instead of easing the restrictions in place, they remained as they were, I think English would have dominated the web to such a point that it would have become that universal language.

      Layne

    5. Re:Further Proof by MMC+Monster · · Score: 1

      Can you even do HTML markup in anything other than English?

      Not that I think that a single language is good for the world in general. (Reminds me of how the Babel fish removed the barriers to communication and became the cause of more wars as people began to understand one another.)

      --
      Help! I'm a slashdot refugee.
    6. Re:Further Proof by Anonymous Coward · · Score: 0

      And use 7-bit ASCII.

      I know that I will need a firefox extension that blocks such URLs soon (based
      on a character whitelist).

    7. Re:Further Proof by Dasher42 · · Score: 1

      There's a downside to a universal language when it pushes aside other languages, though. The cultural and intellectual diversity in language is a good thing; they ought to go on. Just as you don't want Lisp with its expressive macros or Perl with its regular expressions to just go away so everyone writes in C, you don't want a universal language to displace all others in the real world, nor do you want to halt evolution or innovation.

      For example, written Korean beats any other language for writing distinct vowel sounds. This is because it was devised for the specific case of replacing borrowed Chinese characters that were awkward for Korean language with a cleanly organized system devised by royal scholars and actually put into wide use. For its purpose, you can't beat that. And yet, reading that link, you can see shortcomings that Korean has.

      Similarly, if you want to talk about snow, you can't beat the Inuit language, and if you want to talk about time as relative rather than discrete units Hopi is a good choice, and if you want to talk about burning things, you can't beat Latin - damn those Roman pyromaniacs.

      The universal language will surely happen to the degree that it is merited, but hopefully, no more.

    8. Re:Further Proof by Estanislao+Mart�nez · · Score: 1

      Similarly, if you want to talk about snow, you can't beat the Inuit language, and if you want to talk about time as relative rather than discrete units Hopi is a good choice, and if you want to talk about burning things, you can't beat Latin - damn those Roman pyromaniacs.

      And if you want to talk about urban legends and half-remembered bits of Whorf, English is just perfect.

    9. Re:Further Proof by Dasher42 · · Score: 1

      That's why I didn't incorrectly specify "50 words for snow" or try to say the Hopi had no words for specific units of time like Whorf did. :)

    10. Re:Further Proof by Anonymous Coward · · Score: 0

      Just do a Google video search for a film....and watch the chines sites pop up with the annotation : ????????my left nut???????is it there ???????????????

      So much for indexing alternative character sets.

  12. hackers by zyzzx0 · · Score: 1

    Hindering Paraguayan hackers may matter less to the Russian government than establishing greater control over a walled-off Internet.

    That was my first thought. Lookout for a flood of new registrants for the equivalent of sites like alfabank.ru. (maybe?)
  13. Another attempt by geekoid · · Score: 1

    to put a wall up across the internet.
    Also the reason I do not want changes to how the internet 'works'.
    It seems every change someone comes up with is designed to put a wall up someplace.

    --
    The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    1. Re:Another attempt by kwerle · · Score: 1

      How?
      It is just a nameservice. If russia decides they want a top level .manyspecialcharacters, google will buy/register the domain name google.manyspecialcharacters, just like they bought/registered google.ru. Russia will get some money, and everyone is happy - especially Russia. You can still call it google.com, or just 72.14.207.99.

    2. Re:Another attempt by rs79 · · Score: 1

      Here's why it won't work. Note that alternative top level domains have been around for a *cough*decade*cough* while. And were making progress, that is, reasonable people, ISP's (earthlink) and companies (GE, etc...) used them. Not because they were cool or had new names, that was sort of icing, but because they were faster.

      But, this didn't sit well with some folks; here's what they did.

      Enter the "transparent" proxy cache.

      In a true end to end internet you type, say, yahoo.com into the browser address bar, your computer looks up the IP address of it, then sends a web request to that IP.

      If you used alternative DNS servers and had typed in, say, http://free.tibet/ then it would dutifully look up the name, and a web request to that IP and youd get the page rendered on your browser window.

      Enter the "transparent" proxy cache.

      It sits at your isp. It intercepts the web request for a website, does a DNS lookup - using the ISP's nameservers - then IF the name resoves does the web request and caches it. ISP's love this because it saves them outbound bandwidth. In many *cough*oversubscribed*cough*allofthem*cough* ISP's it makes the difference between reasonable performance and "uh, this is so slow it's not working".

      But the problem is it ignored the fact your computer has already looked up the name and successfully got back an IP address. That doesn't matter. If it doesn't resolve in the proxy caches/ISPs dns, you won't see that page despite the fact - again, your computer was able to resolve it. And of course in this day and age a DNS lookup error is gonna give you a frigging yahoo searchg page - at least that's what my satellite connection does.

      So, my computer can in fact resolve free.tibet. But because of all this tomfoolery (and frankly blatent disregard for end-to-end) I get yahoo instead. Yay.

      The Russians can set up all the alternative root servers they want. But if your ISP has a "transparent" proxy cache, you're never going to be able to see them.

      Plus there's the email issue. If the receiving mail server dosn's use the same alternative DNS as the sender, that mail is gonna bounce. Ask me how I know.

      Of course other protocols like ftp and irc are immune. I can ftp to ftp://free.tibet all day long. It's just web and mail that won't work. Yes, this was done on purpose by staunch critics of alternative DNS.

      You'd think this might be some sort of net.neutrality issue, no?

      --
      Need Mercedes parts ?
    3. Re:Another attempt by kwerle · · Score: 1

      So it'll work fine for the folks who want it to work most: people in Russia, with Russian ISPs.

      It turns out it will also work for me. I set my DNSServer to be someone who does opennic resolving. I was immediately able to visit http://www.opennic.glue/ You're right that I would not be able to send them email without configuring a different smart mailserver.

      But I don't think it is reasonable to say that the whole thing won't work because your ISP sucks and transparent proxies you.

    4. Re:Another attempt by rs79 · · Score: 1

      " But I don't think it is reasonable to say that the whole thing won't work because your ISP sucks and transparent proxies you"

      You're guessing. I've got 10 years experience with this and I'm telling you it's not practical.

      I'm not sure you understand what percent of transit providers uses these caches.

      --
      Need Mercedes parts ?
    5. Re:Another attempt by kwerle · · Score: 1

      You're guessing. I've got 10 years experience with this and I'm telling you it's not practical.

      I'm not sure you understand what percent of transit providers uses these caches.


      Pissing contest? Not much interested. Can you tell me where I can find the statistics for how many ISPs or users are stuck behind a transparent proxy online?

      But really, if *Russia* offers an alt top level, and it works for *Russians* (ie. Russian ISPs support it), and the workaround is trivial and well known (use a non-russian TLD, too), what is the problem?

  14. How long? by A+beautiful+mind · · Score: 5, Funny

    How long until someon registers rm.rf ?

    --
    It takes a man to suffer ignorance and smile
    Be yourself no matter what they say
    1. Re:How long? by ScotlynHatt · · Score: 1

      Done. Internet squatters picked up your post with domain text and have it in their perpetual registration pipeline.

    2. Re:How long? by sootman · · Score: 2, Funny

      My first thought was 'tm.rf'--in Soviet Russia, The Manual, um, Reads... wait...

      --
      Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.
    3. Re:How long? by megaditto · · Score: 2, Insightful

      Bah, I can think up some that are way cooler. Let's see here:

      rt.fm
      poop-s.coop (a real TLD by the way)
      pen.is (BIC's homepage in Iceland?)
      vagi.na
      got.root (also real)
      Eat-sh.it
      sniff.co.ck (real TLD)
      Give-a-fu.ck
      por.no
      s.cat
      free.blow.jobs
      felat.io
      sc.um

      goat.se (deserves an honorable mention I guess).

      --
      Obama likes poor people so much, he wants to make more of them.
    4. Re:How long? by ed.markovich · · Score: 1, Funny

      How long until someon registers rm.rf ? You know you've been studying for the CFA too intensely when rm.rf makes you think of 'market risk premium' * * E(Rm-Rf) where Rm is the return of the market and Rf is the risk-free rate, it's the extra return investors demand for investing in the risky market rather than earling the risk-free rate.

  15. Just me by Rinisari · · Score: 3, Interesting

    Is it just me, or does it seem like the article is really blowing this out of proportion? From my understanding, the Russian government just wants to add a .rf (well, . if I'm remembering Cyrillic correctly). That's it. Users with Cyrillic keyboards will be able to access those sites without a problem, and those of us with non-Cyrillic keyboards will have to either use a character map program or temporarily switch keyboard layouts (as I just did).

    Is that it, or am I missing something?

    1. Re:Just me by Rinisari · · Score: 1

      Well, I did have the fancy Cyrillic characters there, but apparently, Slashdot hates UTF-8.

    2. Re:Just me by morgan_greywolf · · Score: 1

      I thought they wanted .&#1088;&#1091;

    3. Re:Just me by Escogido · · Score: 1

      From my point of view (I live in Moscow) it looks like this.

      The problem of having a Latin TLD that translates visibly to something different in Russian is a perfectly valid issue, which could (and quite probably will) be used as grounds to (attempt to) establish more governmental control over the Russian part of the Internet, or as Runet as it is often references to by the media here.

      I do not expect them to really erect some kind of Great Chinese Wall or something, it both will not work and is not needed either. While some authorities are quite likely thinking that this is desirable, I believe the reasons are more practical: they are more interested in control over the Russian Internet as a market. Administrating top level domain registration means they will have a way to influence who will get to be present in the new Runet and who will have trouble with that, and this translates into both monetary profits and political influence. If they do it 'right', transition may take a couple of years, maybe as much as five.. but after that, basically the powers that be will decide what Russians can read and what we can't.

      This is quite in line with all the state corporations they've been creating lately to seize (or regain, from a certain view) control over the state economics. Also I'd like to remind that they bought LiveJournal recently, the most popular Russian intelligentsia (yep, we still have that) playground.

      The Big Brother is back, and this time he's technologically armed.

  16. why not .po? by Anonymous Coward · · Score: 1, Interesting

    It's true that the cyrillic .(.py) looks very similar to .py, but as someone else pointed out, they are different character sets.

    Also, why did they want . in the first place? that's just "roo" in cyrillic, which is the English spelling for Russia. Wouldn't the Russians themselves rather have .? A .po-equivalent makes more sense for them, since in Russian they call their country Rossiya.

    (sorry if you can't see the Russian chars)

    1. Re:why not .po? by Zontar_Thing_From_Ve · · Score: 1

      Also, why did they want . in the first place? that's just "roo" in cyrillic, which is the English spelling for Russia. Wouldn't the Russians themselves rather have .? A .po-equivalent makes more sense for them, since in Russian they call their country Rossiya.

      For those who don't understand this point, "Russia" spelled in Cyrillic is written something like "POCCIR" with the "R" being backwards. I can tell you that .ru has been in use for so long that EVERYBODY there on the internet knows about it, so they just transliterated it to ".py" in Cyrillic, even though you can make a case that it might not be the 2 letter code that a native speaker would have chosen.

    2. Re:why not .po? by $0.02 · · Score: 1

      Actually, why not .KOM, .HET ?

      --
      If enithin kan gow rong it whil. (Murfey)
  17. Lost in translation by Ethanol-fueled · · Score: 1

    Why not impliment a babelfish translation across domains?

    *Ducks*

  18. In Soviet Russia ... by morgan_greywolf · · Score: 4, Funny

    In Soviet Russia, py ("pie") is confusing to ru ("roo")!

  19. In 20 years... by wysiwig3 · · Score: 1

    "Mr. Putin, tear down your digital wall!"

    1. Re:In 20 years... by skoaldipper · · Score: 1

      "My fellow Americans, I'm pleased to tell you today that I've signed legislation that will outlaw .py forever. We begin google bombing in five minutes."

      --
      I hope, when they die, cartoon characters have to answer for their sins.
    2. Re:In 20 years... by trolltalk.com · · Score: 1

      "My fellow Americans, I'm pleased to tell you today that I've signed legislation that will outlaw .py forever"

      What, no more "American Py?"

      While you're at it, how about reviving the old "We'll legislate pi to be 3.0"?

    3. Re:In 20 years... by Midnight+Thunder · · Score: 1

      "Mr. Putin, tear down your digital wall!"

      Ah he's too busy trying to beat bush to the next iteration of the KGB to do that.

      --
      Jumpstart the tartan drive.
    4. Re:In 20 years... by tsotha · · Score: 1

      I was wondering when the Bush-bashing would start, this being slashdot and all. The reality is Putin doesn't need a next iteration of the KGB, since the FSB is the KGB with a new name. It's even run by the same people.

      You aren't likely to find Polonium in your cornflakes if you piss Bush off. Hell, you'll be an honored guest on Larry King. Angering Putin, on the other hand...

  20. And THAT's the problem Einstein by El+Lobo · · Score: 0, Informative

    The problem is not that they are not the same or even different charsets. The problem is that they are near enough for the naked eye to confuse a russian user. Lets say they have a real bank with the address www.baHk.py (baHk = bank in russian but I'm not using cyrillics here so use your imagination). A pisher could easily setup a domain www.bahk.py (using latins py = paraguay) and this should be very difficult for a naormal user to catch the error.... This is a phishers wet dream, actually.

    --
    It's time to realise that Abble's products are the biggest abomination these days. Just say NO to the dumb iAbble way!!
    1. Re:And THAT's the problem Einstein by KinkoBlast · · Score: 1

      of course, if it displays as .ru, it's not a problem, now is it?

    2. Re:And THAT's the problem Einstein by dasunt · · Score: 1

      The problem is not that they are not the same or even different charsets. The problem is that they are near enough for the naked eye to confuse a russian user. Lets say they have a real bank with the address www.baHk.py (baHk = bank in russian but I'm not using cyrillics here so use your imagination). A pisher could easily setup a domain www.bahk.py (using latins py = paraguay) and this should be very difficult for a naormal user to catch the error.... This is a phishers wet dream, actually.

      The phishing problem would also exist if DNS was converted over to unicode, due to duplicate characters in unicode.

      Plus, what about similar characters? Would straße.com be the same as strasse.com? Or häuser.com be the same as hauser.com? What about encyclopedia.com and encyclopædia.com?

  21. A big issue for the rest of us ... by gstoddart · · Score: 3, Insightful

    As it is I see spam which has Chinese characters embedded in what appears to be a google URL, but which I strongly suspect isn't.

    I fear the more we see unicode bytes in URLs the more it will open up people to vulnerabilities as they click on very innocent looking links.

    Hopefully the browsers can keep up with this.

    Cheers

    --
    Lost at C:>. Found at C.
    1. Re:A big issue for the rest of us ... by cnettel · · Score: 1

      There have already been some browser fixes, mainly triggering cases where characters from different scripts appear next to each other. That certainly breaks some valid cases as well, but I guess it's bearable. (So you can't just switch a single o in some domain name to a Cyrillic o and get it to show almost indistinguishably... or at least that's the idea.)

    2. Re:A big issue for the rest of us ... by lofoforabr · · Score: 1

      That's something to be thought, even more if you can mix character sets on domain registrations. Don't the URLs below all seem the same?

      http://www.google.com/
      http://www.google.com/
      http://www.g/#1086;&%231086;gle.com/

      Cyrillic and latin alphabets have a few letters that overlap:

      a and
      c and
      e and
      H and
      k and
      m and (ok, almost, but upper-case still goes: M and )
      n and (kinda)
      o and
      p and
      T and
      x and

      I hope they take this into account when making other characters encodings into dns.

    3. Re:A big issue for the rest of us ... by lofoforabr · · Score: 1

      And how nice... just noticed slashdot is ISO8859-1 encoded, so my previous post won't display correctly.
      Hey, Slashdot, why not use UTF-8?? Being a (mostly) english site wouldn't show a problem, since US-ASCII and UTF-8 overlaps nicely.

    4. Re:A big issue for the rest of us ... by TheNarrator · · Score: 1

      How about putting a big hammer and sickle soviet flag icon next to the URL if the url is encoded in cryllic. :)

    5. Re:A big issue for the rest of us ... by RealGrouchy · · Score: 1

      If a link has the text "www.google.com", but actually links to "www.g00g1e.com", then what stops the browser from being able to compare the two and take effective action to warn/protect the user?

      - RG>

      --
      Hey pal, this isn't a pleasantforest, so don't waste my time with pleasantries!
    6. Re:A big issue for the rest of us ... by Anonymous Coward · · Score: 0
      And t/m, i/u, d/g, p/n when written in cursive scripts.

      I still remember the 'pogonu' cigarettes (that was actually 'rodopi').

    7. Re:A big issue for the rest of us ... by hcdejong · · Score: 1

      Should be easy enough to add a filter:
      if (char) ASCII, then nuke(url)

  22. Politically speaking by athloi · · Score: 5, Insightful

    It's a smart move. Russia has already demonstrated that it wants to be a superpower again, which means that its main competition is China and the USA.

    It has to keep up with China's level of control, and not leave the internet in the hands of the USA, if it can.

    Again Putin demonstrates a smart interpretation of Machiavellian Realpolitik while no one else yet realizes the Cold War is back on.

    1. Re:Politically speaking by dusanv · · Score: 3, Insightful

      Or maybe, just maybe, they only want Cyrillic characters in URLS. ASCII isn't suitable for majority of the world so brace yourself for more of this in future.

      The article is loaded with bs like this brownish pearl:
      Kleinwachter says the speculation is that people will need a password authorised by government agencies to use the global internet.

      How the fsck did he deduce that from introduction of Cyrillic DNS?

    2. Re:Politically speaking by Anonymous Coward · · Score: 0

      Nonsense.

      Russia will **NEVER** be a superpower again. Russia needs to worry about its existence, let alone any delusions of superpower status. Its demographics and geography doom it. Russia had the lowest birthrate in Europe and one of the largest Muslim populations. More importantly, it's a resource rich nation that shares a 4,000 mile border with a country that has a insatiable appetitie for said resources and a 60 million boy surplus.

      Once Islam gets a sizable minority in European Russia, they'll take over, as the actual Russians will be too few and too old to stop the imposition of Sharia. At trhe same time, the Chicoms' 60 million young men will blitz Siberia in about 2 weeks.

      Again Putin demonstrates a smart interpretation of Machiavellian Realpolitik while no one else yet realizes the Cold War is back on.

      Putin has the IQ of a piece of toast; he is Russia's Jimmy Carter. Ignorant of his own country's history, He has signed another mutual nonaggresion pact. This time with Islam (Iran) and China. He is blind to the fact that he gets to be Poland this time around.

    3. Re:Politically speaking by Anonymous Coward · · Score: 0

      Cold war against who? If it is against China that is ok.
      Because, with the US economy and military mighty going down the drain so fast as they are going now, should be better if Russia picks Liechtenstein as enemy, as we won't have much left from the USA on a couple of years to fight back the country that Putin is building.
      The USA are just the butt-joke of the world now.

    4. Re:Politically speaking by mike2R · · Score: 1

      Once Islam gets a sizable minority in European Russia, they'll take over, as the actual Russians will be too few and too old to stop the imposition of Sharia. At trhe same time, the Chicoms' 60 million young men will blitz Siberia in about 2 weeks.

      Meanwhile, back in reality..

      --
      This sig all sigs devours
  23. Re:Why is /. always late with stories? by jacquesm · · Score: 2, Informative

    minicity spam

  24. Icons for Victory by Doc+Ruby · · Score: 3, Interesting

    I'd like the URLs in my GUIs to be displayed in their frame with an icon indicating their character set, and colored if in a character set different from my GUI default. If I had that, I'd like to see "native" glyphs without fear that they're decoys. Even though such a system would no longer force most content publishers to deliver content in my own privileged native character set.

    --

    --
    make install -not war

  25. internet walls by pembo13 · · Score: 2, Insightful

    Hindering Paraguayan hackers may matter less to the Russian government than establishing greater control over a walled-off Internet.

    I don't really have a problem with government's filtering the internet of their own citizens -- let their citizens deal with that. When I don't like it is when a government want to control/monitor the the internet usage of other citizens.

    --
    "Thanks for all the money you paid to us. We've used it to buy off ISO among other things" -Microsoft
  26. Language in Star Wars. by UseTheSource · · Score: 1

    Or, like in Star Wars, everyone should learn to understand everyone else's language. Then, in conversation, one would speak their own language regardless of what the other person is speaking. Han's exchange with Greedo comes to mind.

    --
    "Ein Volk, ein Reich, ein Führer." -Adolf Hitler
    "We are one Nation, we are one People." -The One 'leader'
    1. Re:Language in Star Wars. by TooMuchToDo · · Score: 1

      Mod parent up. As languages have started dying out in an evolutionary way, only a few will remain. Which ones? Easy. It'll be based on population and commerce. I think Chinese (mandarin or traditional I'm not sure) and English will win out, with perhaps 4-5 others.

    2. Re:Language in Star Wars. by Hoi+Polloi · · Score: 2, Funny

      When the computers take over we'll all be forced to speak in binary.

      101101011101....

      --
      It is by the juice of the coffee bean that thoughts acquire speed, the teeth acquire stains. The stains become a warning
    3. Re:Language in Star Wars. by techpawn · · Score: 1

      01001001001000000110010001101111011011100010011101110100001000000111010001 10100001101001011011100110101100100000011011110111010101110010001000000111 00100110111101100010011011110111010000100000011011110111011001100101011100 10011011000110111101110010011001000111001100100000011101110110100101101100 01101100001000000110011001101111011100100110001101100101001000000111010001 10100001100001011101000000110100001010

      --
      Ask not what you can do for your country. Ask what your country did to you
    4. Re:Language in Star Wars. by Altus · · Score: 1

      01001001001000000110010001101111011011100010011101110100001000000111010001 10100001101001011011100110101100100000011011110111010101110010001000000111 00100110111101100010011011110111010000100000011011110111011001100101011100 10011011000110111101110010011001000111001100100000011101110110100101101100 01101100001000000110011001101111011100100110001101100101001000000111010001
      1 0100001100001011101000000110100001010


      2

      --

      "In America, first you get the sugar, then you get the power, then you get the women..." -H. Simpson

    5. Re:Language in Star Wars. by Hoi+Polloi · · Score: 1

      01001001011101000010000001110111 0110100101101100 011011000010000001101111 0110111001101100 011110010010000001100010
      01100101001000000111001001100101 0111000101110101 011010010111001001100101 0110010000100000 011011110110111000100000
      01010011011011000110000101110011 0110100001100100 0110111101110100

      --
      It is by the juice of the coffee bean that thoughts acquire speed, the teeth acquire stains. The stains become a warning
  27. Re:Solution to all of this is real simple by spaceyhackerlady · · Score: 1

    While the TCP/IP protocol suite was largely developed by DARPA, much of what the Internet is today (WWW) started at CERN in Switzerland.

    So there.

    ...laura

  28. Trouble ahead? by Duncan+Blackthorne · · Score: 2, Interesting

    I may not be looking at the whole picture here, but isn't this sort of decision going to have a tower-of-babel-like effect? Are search engines going to be able to index sites using the alternative character sets? Isn't there at least some risk of two different sites at least appearing to have identical URLs? Or is this really an attempt by countries like Russia and China to selectively cut their populations off from the public internet while not in actuality doing so? Don't get me wrong, I'm not saying that American English should be imposed on the rest of the world (I'm not that guy!), but the system in place was founded on such and I see this really mucking up the works..

    1. Re:Trouble ahead? by Estanislao+Mart�nez · · Score: 1

      Are search engines going to be able to index sites using the alternative character sets?

      What do you think?

      (Though I've heard that for some languages, Google's indexing technology is not as smart as that of smaller players who specialize in that language. But come on, it's not really harder than indexing English.)

      Isn't there at least some risk of two different sites at least appearing to have identical URLs?

      Yup, this is an acknowledged risk. Is the appropriate response to that risk to ditch the whole concept of URLs in non-Latin scripts? Hell no. What about developing actual solutions to the problem instead?

      Or is this really an attempt by countries like Russia and China to selectively cut their populations off from the public internet while not in actuality doing so?

      So, by providing internet content that doesn't require the use of an alphabet or language that most Russians or Chinese don't master, Russia and China thereby "cut their populations off from the public internet." Good one.

      Don't get me wrong, I'm not saying that American English should be imposed on the rest of the world (I'm not that guy!), but the system in place was founded on such and I see this really mucking up the works.

      Well, suppose you were Chinese, and the only language you could speak and read is Chinese. They taught you a bit of pinyin in school, sure, but you have to look at Latin letters very carefully to tell apart a 'd' from a 'b', a 'p' from a 'q', or an 'r' from a 'v'. You don't speak any English beyond phrases like "hi, how are you?".

      Wouldn't you think that the fact that you can't use the Internet in Chinese was "really mucking up the works"? As in, making it fundamentally and needlessly hard to use?

    2. Re:Trouble ahead? by Duncan+Blackthorne · · Score: 1
      So, by providing internet content that doesn't require the use of an alphabet or language that most Russians or Chinese don't master, Russia and China thereby "cut their populations off from the public internet." Good one.

      Last time I checked, isn't China censoring the living daylights out of the internet already? Is it really that far of a reach to say that they'd like to create their very own version of the internet, that they 100% control? If you can accept that then is it that much further to go to believe that some other government might think about doing the same?

    3. Re:Trouble ahead? by Estanislao+Mart�nez · · Score: 1

      Last time I checked, isn't China censoring the living daylights out of the internet already? Is it really that far of a reach to say that they'd like to create their very own version of the internet, that they 100% control?

      Let's grant your points. How the hell would that be relevant to whether we should have non-Latin domain names?

      And now, let's assume otherwise, that there was no censorship of the Internet in China. Have you considered the possibility that the fact that, gee, most Chinese don't speak English, might be the biggest obstacle to their use of what you call "the public internet"?

      Whether Russia or China are police states with heavy censorship is irrelevant to the question of whether there should be Cyrillic or Chinese domain names. It's a matter of whether people can easily use a computer in the first place.

    4. Re:Trouble ahead? by jc42 · · Score: 1

      [I]sn't China censoring the living daylights out of the internet already? Is it really that far of a reach to say that they'd like to create their very own version of the internet, that they 100% control?

      True. But consider the viewpoint of much of the world: The Internet is currently controlled in great part by the US government. Maybe not totally, of course, but close enough to make for good arguments for Doing Something About It.

      As an example of the US's power, consider the incident last year when the US government succeeded in ordering Swedish police to raid the piratburan.se site and cart off their computers. The charge was violation of US copyright laws. It was agreed by everyone that the site wasn't in violation of Swedish law, while the Swedish police's actions were illegal. But this didn't matter; when it comes to Internet activity, the US government can and does give orders to agencies of other supposedly "independent" governments.

      I'd think that a lot of the world has to be getting very nervous about this sort of US control.

      And the fact is that anyone can set up their own root DNS servers. I've helped do it on a couple of projects, where we wanted an DNS setup with our own .local domain that was visible only inside our organization. It's actually quite easy, and very useful to anyone who wants a semi-isolated chunk of the Internet for some purpose. Our purpose was software testing, and the motive was to create a few domains that weren't visible outside our labs. But in fact anyone with an internal DNS server can easily do the same thing. And there are several open-source DNS servers available.

      Unless the US government gives up on its recent arrogance toward the rest of the world (which is very unlikely within the next year or so ;-), I'd think that people all over the world have to be thinking of ways to declare network independence. Yeah, it's a bad idea for everyone. But the only way to prevent it is for the Internet authorities to become truly international and not beholden to any single government, and it doesn't look like that's going to happen soon, either.

      The other 95% of the world isn't likely to tolerate many more actions like what was done to piratburan.se before they take action. Any geek that has ever set up a local DNS server can tell them how to do it. And tweaking a server's code to allow UTF-8 URLs is essentially a beginner-level programming job.

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
  29. More control? Doesn't seem like the Russian M.O. by damburger · · Score: 0, Flamebait

    The Russian government is quite openly murdering critics of the government, both at home and in foreign countries. Sneakily playing with TLDs to censor the internet doesn't seem like their style. If they want to clamp down on the internet, there won't be much doubt what they are doing. The fucking psychos will probably just bomb uncooperative ISPs or something.

    --
    If we can put a man on the moon, why can't we shoot people for Apollo-related non-sequiturs?
  30. That does it! by Quiet_Desperation · · Score: 4, Funny

    I'm registering my next domain in Klingon.

    1. Re:That does it! by Ster · · Score: 1

      I'm registering my next domain in Klingon.

      Please! I'm registering mine in Betacrypt 3!

      -Ster

    2. Re:That does it! by edwardpickman · · Score: 1
      I'm registering my next domain in Klingon.

      As a cheer of communal defiance rang out from basements around the country........

    3. Re:That does it! by Quiet_Desperation · · Score: 1

      Hey! I *OWN* the basement I hide in, thank you very much!

      Actually, I've hated Trek since the middle of DS9. :-\

  31. Redundant? Strange by iminplaya · · Score: 1

    This could bring an end to DNS. Fine by me. The system, like email, is very fragile. But it shouldn't hurt the internet at all. Only the commercial aspect would suffer any real disruption, and even that should be temporary. Just fix a permanent address to every device. And create your own hosts file. Could make spoofing a bit more difficult, and it could make tracking a bit easier for you government bureaucrats out there.

    --
    What?
  32. Re:Solution to all of this is real simple by Anonymous Coward · · Score: 0

    The Internet is an invention of the United States. True.

    We read and write in English using ASCII. If you want to connect to our Internet, then use our rules. Too late! At best, you can disconnect your *part* of the (now worldwide) Internet from the rest of the world. And that's not going to happen.

    If you want to do different, then, use your own but don't call it the Internet because that is our invention. If you don't like this solution....tough. This is our Internet. No; on the contrary, you foolish person. If people don't like your "solution" and ignore it, there's precisely *nothing* you can do about it :-P

    And if you don't like that... tough. :)
  33. Programming in Russian by mi · · Score: 2, Funny

    Soon to be followed by C++ in Cyrillic.

    When we studied programming in high school, we used a language called "Ershov" (last name of the textbook's author), which was really Pascal translated to Russian.

    I don't think, there was an actual compiler, though — nor did we have (enough) computers. Our little code-snippets were checked by the teacher by hand...

    "One laptop per child"? Right...

    In the American college, our professor was quite fond of (then brand new) Java. Among the advantages, he listed the ability of using non-ASCII characters. The poor man had to read my programs with variable-names in Ukrainian for the rest of the semester...

    --
    In Soviet Washington the swamp drains you.
  34. Re:Solution to all of this is real simple by Anonymous Coward · · Score: 0

    except for the fact that much of what the internet is today was actually developed here:

    http://en.wikipedia.org/wiki/University_of_Illinois_at_Urbana-Champaign

    What got started in Switzerland doesn't really conCERN us that much.

  35. Oblig by Elenthalion · · Score: 1

    In Soviet Russia, Paraguay hacks you.

  36. .su by Anonymous Coward · · Score: 0

    Soooo unfair. They already have .su (soviet union) and .ru. One more?
    And why not a cyrillic TLD?

  37. Re:More control? Doesn't seem like the Russian M.O by BattleCat · · Score: 0

    Care to back your statement with facts ? Not Western media hype akin to Iraqi WMD, but cols hard facts ? Censoring the Internet ? What a joke. No need to - information exchange is harmless contrary to popular belief.

  38. Actually In Soviet Russia ...ICANN .su YOU by HighOrbit · · Score: 1

    In Soviet Russia, ICANN .su YOU. You will .ru the day ICANN .su you.

  39. Easy solution to the problem by DaleGlass · · Score: 2, Insightful

    If the domain name contains characters not from the system's character set, highlight them (with another color say), and warn the user.

    It's not a new problem either, "slashdot", and "sIashdot" will look the same in many fonts.

  40. worlds largest oil producer by peter303 · · Score: 1

    They've topped Saudi Arabia the past couple of years. Saudi has more reserves but nto the incentive to greatly increase production. Both are raking it it in.

  41. Re:More control? Doesn't seem like the Russian M.O by yoprst · · Score: 1

    Could you please enlighten me on how exactly introducing a top level domain is clamping down on internet? What exactly is wrong with great firewall of China and how an .rf domain would be a better solution (or a solution at all)?

  42. Who are you? by Anonymous Coward · · Score: 0

    And why do you want other people to live on terms that are so obviously for your convenience, and not theirs?

    We'll never get a "universal language," and that's good; it means that the people who speak the universal language natively will have that much less power over those who don't.

    1. Re:Who are you? by jacquesm · · Score: 1

      I don't speak english natively, I overcame that difficulty, and so can anybody else. If the world would standardize on chinese tomorrow that would be just fine by me too...
      Culture is a great thing, and I hope that everybody will be able to retain their own somehow, but wouldn't it be nice if there was a way to communicate with everybody ?

    2. Re:Who are you? by Anonymous Coward · · Score: 0

      I don't speak english natively, I overcame that difficulty, and so can anybody else.

      I see. People who don't speak English have nobody to blame but themselves, according to you.

      Culture is a great thing, and I hope that everybody will be able to retain their own somehow, but wouldn't it be nice if there was a way to communicate with everybody?

      Sure. However, wouldn't it be awful if, because everybody was expected to know English, people who didn't speak it natively were an underclass? And don't you think that that would lead to the death of all those languages and cultures that you say would be nice to preserve?

      And what's worse is that minority ethnic groups that lose their language do not thereby become integrated with the mainstream society whose language they adopt. Most Native American groups have lost their native languages to a colonial language, and they are still marginal.

      The point is that what "universal language" really means in practice is "forced integration of other people into my culture and society, where we do not value them and have no place for them."

    3. Re:Who are you? by Anonymous Coward · · Score: 0
      Hey old buddy, did you learn yet that light bulb filaments are wound to preserve heat?


      http://invsee.asu.edu/Modules/lightbulb/history.htm


      "Winding the wires into fine coils, as used in modern incandescent filaments, reduced convective heat loss, allowing the filament to operate at the desired temperatures."


      And that directly heated cathodes are not spiral wound?

    4. Re:Who are you? by jacquesm · · Score: 1

      I'll ignore your strawman with your permission, and stick to your main points:

      > Sure. However, wouldn't it be awful if, because everybody was expected to know English, people who didn't speak it natively were an
      > underclass? And don't you think that that would lead to the death of all those languages and cultures that you say would be nice to
      > preserve?

      That is a real risk, but there are plenty of examples of peoples living in one country right now where they are separated in a caste system, I don't think the language in and of itself is to blame there.

      Cultural poverty is a real risk though, and that would be something not to take lightly.

      > "And what's worse is that minority ethnic groups that lose their language do not thereby become integrated with the mainstream society whose
      > language they adopt. Most Native American groups have lost their native languages to a colonial language, and they are still marginal.

      Agreed. I've lived in Canada for a while and the treatment of minorities there is absolutely shameful. They are literally bought off. There is this weird collective guilt in Canada that the Canadian government seems to think can only be absolved with large amounts of money. Which is strange because the problem is not rooted in lack money but in something much simpler, lack of respect.

      > The point is that what "universal language" really means in practice is "forced integration of other people into my culture and society,
      > where we do not value them and have no place for them.""

      That's an excellent point, I had not thought it through that far, I think my idealism got the better of me.

      But somehow there must be a way to do such a thing in a way that it doesn't lead to someone losing out. I don't want anybody to lose anything, I would like for all of us to gain something.

      As they say, the road to hell is paved with good intentions...

    5. Re:Who are you? by jacquesm · · Score: 1

      Hey as well!

      No I had not spent any time on it actually, too busy recovering from the bloody flu and trying to keep my business afloat between odd jobs :)

      But it's nice to see you're still willing to spend time on educating me!

    6. Re:Who are you? by Sigismundo · · Score: 1

      wouldn't it be awful if, because everybody was expected to know English, people who didn't speak it natively were an underclass?

      If that were the case, wouldn't the corollary be true? That those who, regardless of their background, could manage to achieve a degree of fluency in English could be a part of an upper class? Is that such a bad thing?

      don't you think that that would lead to the death of all those languages and cultures that you say would be nice to preserve?

      It's not necessarily true. In China, there are dozens if not hundreds of dialects local to a particular city or region. These dialects are spoken at home between family members, and by locals to one another. Mandarin (the official language) is taught in schools, and is pretty effective in allowing all Chinese to communicate with one another, for the most part. The local dialects do not appear to be in any danger of dying out.

      I agree that language can be used as a tool of oppression, for instance, if a particular group is forbidden from using their native language, as the Kurds were in Turkey for a time. Provided that a universal language is promoted not by discouraging minority languages, but by promoting it in a positive way, through education, what is the harm in more people being able to communicate with one another?

    7. Re:Who are you? by Anonymous Coward · · Score: 0

      my business afloat between odd jobs

      Mr. CEO of this and that needs odd jobs... I guess there really isn't any weight in title anymore.

  43. Yes. To their detriment. by Anonymous Coward · · Score: 0

    Russia is the number one oil producer. For now. Until the Chicoms come and take it from them. See your sibling post.

  44. Welcome! by Anonymous Coward · · Score: 0

    I, for one was going to welcome our new domain-naming overlords, and in Russian, but Slashdot apparently doesn't care for Cyrillic in comments.

  45. Paraguayan phishing? Not a very good excuse. by sombragris · · Score: 1

    I live in Asunción, Paraguay, South America. Here, the ccTLD .py is managed by NIC.py. NIC.py specifically states that Paraguayan domain names must be third-level domains only, from the second-level domains ORG.PY, EDU.PY, MIL.PY, GOV.PY, NET.PY, COM.PY.

    Straight from the horse's mouth (my rough translation):

    Second-level domains determine the nature of a domain name. A company or institution may request a certain third-level domain name only under one of the second-level domains (emphasis mine) , i.e., the one that best identifies or reflects its goals, structure, services, or products.

    On the other hand, Russian domain names are usually second-level domain names such as alphabank.ru, etc.

    Therefore, and given that Paraguayan domain names are third-level domain names such as mybank.com.py, it would be very difficult for a Paraguayan phisher to set up Paraguayan sites to phish Russian users.

    NIC.py states that it might consider creating other .PY second-level domains, so names such as www.python.py could be theoretically possible; but for that you must make a very good and convincing case to the NIC.py folks.

    --
    -- Look to the Rose that blows about us--"Lo, Laughing," she says, "into the World I blow..."
  46. What does "the same" mean? by Estanislao+Mart�nez · · Score: 1

    While "ru" written with cyrillic letters may look confusingly similar to "py", it is not the same.

    What does "the same" mean in this context, and does it always mean that?

    Unicode certainly has separate codepoints for Latin and Cyrillic characters that look "confusingly similar." This is a technical choice that it adopts in that case, but not in all cases; the Unicode CJK unification, for example, takes the opposite approach, assigning the same codepoint to characters that are regularly written differently in Chinese, Japanese and Korean.

  47. Could anyone rephrase TFA in technical terms? by Anonymous Coward · · Score: 0

    I have no idea what the article is really about, because it is dumbed down to the point of inintelligibility. What happened to punycode? What encoding (or transliteration - pinyin?) is "the Chinese Internet" currently using for third-level domain names? How can there possibly be a confusion between .ru and .py if .[Cyrillic r][Cyrillic u] isn't a valid TLD? Or is it? How would Cyrillic domain names be encoded? How would a fixed IP address, which would go with you wherever you approach the internet (RFID implant?) solve any DNS problems?

  48. US dominance of the "Internet" by G3ckoG33k · · Score: 1

    Sorry, but the grand parent probably is correct here. The US dominance of the information highway "Internet" is probably not looked favorably upon by the governments mentioned.

    You don't have to be very bright to see that Cyrillic and Chinese are perfectly legitimate reasons for acquiring their own DNS systems, and that they seem prepared to use those reasons. Despite the trouble it sadly WILL create, both in the short and long run.

  49. Re:Why is /. always late with stories? by Anomolous+Cowturd · · Score: 1

    Dammit, slashdot should unlink links in -1 messages. I got goatse'd yesterday, with innocents in the room. Luckily my goatse reflex is down to about 100 milliseconds so nobody consciously suffered.

    --
    Software patents delenda est.
  50. www.slashdot.ru by Derling+Whirvish · · Score: 1

    http://www.slashdot.ru/

    Of course it exists.

  51. TFA is clueless by Tonik,+the · · Score: 1

    There are reasons why Russians want a cyrillic .rf domain, and it has nothing to do with who's got control.

    1. Imagine the Internet has been developed in China and you have to enter all URLs in Chinese characters. Well, that's how it currently feels to a lot of Russians right now.

    2. There is a problem in simply allowing cyrillics in the .ru domain because of the possibilities for phishing. Consider 'paypal.ru': any of the first five letters in 'paypal' could be substituted with absolutely identically looking cyrillic letters. That makes 2^5 = 32 possible domain names that all look the same but potentially resolve to different addresses; maintaining all such domains to avoid fishing may not be an option for small businesses, let alone individuals.

    3. You could of course impose some restrictions on how one can mix cyrillics and latin in the same domain name; indeed, you could allow only one script in a domain name. But there are two problems still: a) spoken communication and b) layout switching. a: When someone says "go to image dot ru", it won't be unclear if they mean the domain name in cyrillics or in latin. b: Russians use two separate keyboard layout for cyrillics and latin. To type an URL that starts with cyrillcs and ends in the latin suffix, you'll have to switch the keyboard layout midway (normally the Alt-Shift keyboard shortcut, but the less tech-savvy ones have to use the mouse!)

    4. An cyrillic .rf TLD solves all this elegantly: when someone says "image dot rf", it is obvious they expect you to type cyrillics; the entire url is in cyrillics so you don't have to change the keyboard layout. (The .rf will be in cyrillics of course; see this to get an idea how it looks)

  52. that's no slashdot by Tonik,+the · · Score: 1

    There doesn't seem to be a Russian equivalent to Slashdot. (slashzone.ru was a good try, but it never became popular and is sorta abandoned now)

    1. Re:that's no slashdot by Derling+Whirvish · · Score: 1

      That's a good question. Where do Russian techies hang out? I know Live Journal is very popular there. Maybe somewhere there? Anyone know of a good Russian social tech site like Slashdot?

    2. Re:that's no slashdot by gr8dude · · Score: 1

      nnm.ru is where you can find some interesting information; it is not /.-like, it never will be. A drawback is that there are a lot of lame comments posted by trolls and flamers, and perhaps too much advertising?

      The good part is that with time the quality of the content does improve. This is the only Russian site that has a broad audience, and which sometimes manages to spawn interesting discussions.

  53. Re:Solution to all of this is real simple by Anonymous Coward · · Score: 0

    What got started in Switzerland doesn't really conCERN us that much. How's gopher doing these days? Life must be hard without a web browser :-P
  54. it would be much more useful... by Tonik,+the · · Score: 1

    It would be much more useful to allow non-ASCII characters in the identifier names.

    As it is now, if you look at a program written by Russians, it'll usually be a mix of transliterated Russian and bad English.

  55. And Gibraltar? by Anonymous Coward · · Score: 0

    .gi written in Cyrillic looks almost like .ru! Poor Russkis, the whole world is trying to phish them.

  56. Did you think your response through? by Anonymous Coward · · Score: 0

    Wouldn't it be awful if, because everybody was expected to know English, people who didn't speak it natively were an underclass?

    If that were the case, wouldn't the corollary be true? That those who, regardless of their background, could manage to achieve a degree of fluency in English could be a part of an upper class? Is that such a bad thing?

    Um, your response here suffers from two key defects:

    1. It violates elementary logic: "If A, then B" does not entail "If B, then A."
    2. It ignores relevant material from the post it responds to. To quote myself (you know, the post you are supposedly responding to):

      And what's worse is that minority ethnic groups that lose their language do not thereby become integrated with the mainstream society whose language they adopt. Most Native American groups have lost their native languages to a colonial language, and they are still marginal.

      The adoption of a common language, either through bilingualism or through the death of a native language, usually occurs with the subordination of one group of people to another.

    In China, there are dozens if not hundreds of dialects local to a particular city or region. These dialects are spoken at home between family members, and by locals to one another. Mandarin (the official language) is taught in schools, and is pretty effective in allowing all Chinese to communicate with one another, for the most part. The local dialects do not appear to be in any danger of dying out.

    You're not very knowledgeable about Chinese linguistics.

    1. Passive understanding of Mandarin is common across China, but literacy and spoken skill in Mandarin in non-Mandarin areas can be quite spotty.
    2. Many Chinese dialects have been displaced by the bigger dialects. You think
    3. China is home to many non-Han peoples with endangered languages.

    I agree that language can be used as a tool of oppression, for instance, if a particular group is forbidden from using their native language, as the Kurds were in Turkey for a time.

    Note that the post you're responding to does not raise that kind of scenario.

    Provided that a universal language is promoted not by discouraging minority languages, but by promoting it in a positive way, through education, what is the harm in more people being able to communicate with one another?

    If you assume that your proposal is going to achieve what you want it to achieve, you can easily prove that your proposal will achieve what you want it to. Of course there is no harm in "more people being able to communicate with one another." The problem is the unintended consequences that are brought to light when you evaluate the proposal in the light of sociolinguistics and linguistic anthropology.

    And, what the hell do you want everybody to have the same language for anyway? You're just taking it for granted that that would be any better than now. Would it really be so? Would it be so for everybody? Would it be better for some people, but worse for others? Can we get some serious, realistic reasoning about social relationships, instead of cheerful proclamations about how great it would be to live in Utopia?

  57. Please, MOD PARENT UP by Anonymous Coward · · Score: 0

    This undermines their whole excuse about confusion with the .py ccTLD.

  58. Mixing politics and Unicode considered harmful by AndyElf · · Score: 1

    I'd suggest Mr. kdawson next time to take a slightly less political look at the stories he's putting up. This has nothing to do with "Hindering Paraguayan hackers" and even less so with "establishing greater control over a walled-off Internet." TLD '.RU' comes from ISO two-letter country code for Russia which is 'RU' (Notice that it matches first two letters of country name. Coincidence?). When you move to ITLDs, which presumable will allow us all to use domain names written in any language, translating 'RU' to a literal equivalent in Russian '' makes *no* sense at all. Forget about Paraguay here -- it simply makes 0 sense. Options that make a little more sense are '' or '' -- but they contain thee letter each.

    Now, official name of the country is not 'Russia' (just as US is not 'America'), it is 'Russian Federation'. In Russian this is ' ', and that is where ITLD '.' is coming from.

    --

    --AP
    1. Re:Mixing politics and Unicode considered harmful by AndyElf · · Score: 1

      Oh, man... One would think that by 2008 Slashdot should fully support multiple character sets, but it does not. Shocking. Feeling totally lazy to try and cheat it to accept either unicode entities or spelling it out long hand, here's a link to the same (almost) text, but with all Cyrillic letters kept intact

      --

      --AP
    2. Re:Mixing politics and Unicode considered harmful by AndyElf · · Score: 1
      --

      --AP
  59. Re:Why is /. always late with stories? by jacquesm · · Score: 1

    that's an excellent idea !

  60. wrong by Maimun · · Score: 1

    That is not true.

    1. Re:wrong by CRCulver · · Score: 1

      Of course it is true. Look at any Old Church Slavonic primer: Lunt, Nandris, Schmalstieg, Vojlova, Gardiner, whatever, they all give a good chronology of the rise of Slavonic letters.

      Obolensky argued that St. Methodius wanted to empower the Slavs against Byzantine control. It may be that part of this was creating an alphabet--Glagolitic--that was completely unlike Greek (though vaguely based on Greek forms). Cyrillic, on the other hand, was created by elements in the Kingdom of Bulgaria that saw no problems with Byzantine influence.

    2. Re:wrong by Anonymous Coward · · Score: 0

      It's been thirty years since I studied Old Church Slavonic; the only thing I remember from Lunt is
      that in interpreted Staroslavyanskii all code blocks must be terminated with a tvyordii znak.

  61. Thats why! by MrJones · · Score: 1

    Thats why I was getting too many hit to my .py site :)

    Greetings from .py

    BTW, in .py you can not get a .py TLD without a .com.py, .org.py, etc
    Also, it cost 40$ a domain name!

    --
    Get my e-mail after a captcha test in: http://tinymailt
  62. You are not entirely right by gr8dude · · Score: 1

    Cyrillic is far more efficient all things considered - with one letter for each distinct sound
    The Cyrillic alphabet is suitable for sounds that are typical to the Russian language. However, it is not suitable for languages such as English. For instance, there is no equivalent for 'th' in Russian, so you hear people pronouncing it in funny ways.

    I even wrote a short story about it, "In SOVIET Russia, Bluetooth mispronounces YOU!!". Most folks tend to say "s" instead of "th", even though a plain "f" sounds nicer, IMHO. The Hamming distance between "s" and "th" is greater than between "f" and "th", so "f" should be a "compatibility-mode sound for 'th'", if I can express myself that way.

    The final point is that you need a right tool to do the job right.

    Note: I am a fluent speaker of both, Russian and English. I also happen to be a fluent speaker of Romanian. In this case we also have some special letters - diacritics: {, , , î, â} /*slashdot didn't render the first three characters, they were 's' and 't' with a tail, and 'a' with a curved line on top*/. There are no words in English that use either of these - {î,â}.

    To support your point, I do think that Cyrillic is a bit more flexible than Latin. Back in the days when my country was occupied by the soviets (I live in Moldova), they forced a transition to the Cyrillic alphabet (to speed up the "rusification" of the locals). As a result, Romanian (Moldova and Romania used to form a single state; and in the future we will be united again) was spelled with Cyrillic letters, this artificial language was called "Moldovan" (or "Moldavian"). I am not going to criticize the fact that they forced people to switch to a different writing; but this does show that Cyrillic can be used for languages other than Russian.



    I must also add that you too are correct, this: "w Szczebrzeszynie chrzaszcz brzmi w trzcinie i Szczebrzeszyn" is a weird linguistic artifact. I don't speak Polish, but in my mind it is known as the "szcz-language" :-) I am not familiar with the history of Poland, so I don't know why Latin characters were chosen.
    1. Re:You are not entirely right by CuriousCuller · · Score: 1

      Sure, I wouldn't dare advocate the Cyrillic alphabet for English. But for Polish, which is a West Slavonic language and thus not a million miles away from Russian, it would be a great step in the right direction - indeed the right tool for the job! Actually, proposals were made to change to Cyrillic in the 1850s... the reason it never happened are myriad, but mainly Political. Which is also why it'll never happen! Poles are none to fond of their neighbours, for much the same reasons as Moldovans if I can hazard a guess. It's a real shame, because szsz is the classic case in point, it's really just one letter... and that's what you'd get with Cyrillic, namely a (and /. won't allow it, but the W with a tail, which no Russian would have a problem pronouncing)

      Anyway, the adoption of the Latin alphabet in Poland is most likely due to the hegemony of the Roman Catholic church and the prevalence of Latin in the early church. Indeed, nothing much has changed in that respect. Every town in Poland worth it's salt has at least one street, school or square named Jan Pawel II... Polish is also a bit of an oddity, because whereas most Slavic languages using the Latin alphabet picked up the Czech orthographic system, Polish orthography developed independently.

      By the way, if you speak Russian, Polish should be reasonably easy to pick up once you get past the orthography and learn to gadac[talk]. After all, spell piwo however you like, the best thing to do is still to drink it :)