Slashdot Mirror


ICANN Under Pressure Over Non-Latin Characters

RidcullyTheBrown writes "A story from the Sydney Morning Herald is reporting that ICANN is under pressure to introduce non-Latin characters into DNS names sooner rather than later. The effort is being spearheaded by nations in the Middle East and Asia. Currently there are only 37 characters usable in DNS entries, out of an estimated 50,000 that would be usable if ICANN changed naming restrictions. Given that some bind implementations still barf on an underscore, is this really premature?" From the article: "Plans to fast-track the introduction of non-English characters in website domain names could 'break the whole internet', warns ICANN chief executive Paul Twomey ... Twomey refuses to rush the process, and is currently conducting 'laboratory testing' to ensure that nothing can go wrong. 'The internet is like a fifteen story building, and with international domain names what we're trying to do is change the bricks in the basement,' he said. 'If we change the bricks there's all these layers of code above the DNS ... we have to make sure that if we change the system, the rest is all going to work.'" Given that some societies have used non-Latin characters for thousands of years, is this a bit late in coming?

471 comments

  1. Changing a system by Kamineko · · Score: 5, Insightful

    Changing a system which works is a very, very bad idea.

    Wont this open up the system to many more phishing attacks involving addresses which include non-latin characters which look similar to latin ones?

    1. Re:Changing a system by Daniel_Staal · · Score: 4, Insightful

      That's one possible problem. Then there are characters that are technically equivilent but have different representations. (Accented vowels for instance: you can code them directly, or you can code the accent and the vowel seperate.) You need some way to make sure they both go the same place, no matter UTF-8, -16, -32 or whatever else people throw at it.

      And, of course, you need to make sure when someone types this into a browser some major DNS server someplace won't crash.

      I'm all for adding non-latin characters. But I do recognize that it should be a slow process.

      --
      'Sensible' is a curse word.
    2. Re:Changing a system by MindStalker · · Score: 0

      Definitely the truth, the system works, why change it. What I envision as a potential workaround is a layer that works above DNS. Keep DNS strictly as it is, but introduce an approved maybe even required in IPv6 system that works above the DNS system. Something akin to a search engine but that requires registration. So that you could register and endless array of weird character names, that all map down to a simple latin based DNS name and future browsers could show you the latin version as well for security verification purposes.

    3. Re:Changing a system by KingJoshi · · Score: 5, Insightful

      But it's not working. Mainly for all those people that want non-latin characters. It's been broken from the beginning. Sure, there is historical reasons why we have the system we do, but change is definitely needed. Twomey is right that a change can't be rushed and it needs to be done right (for reasons of security, compatibility, stability, etc). However, the change does need to occur and there needs to be some level of pressure to ensure that it happens.

      --
      In times like these, it is helpful to remember that there have always been times like these. - Paul Harvey
    4. Re:Changing a system by jmorris42 · · Score: 4, Insightful

      > Wont this open up the system to many more phishing attacks involving addresses which include non-latin characters which look similar to latin ones?

      Even worse, although your problem is reason enough to postpone doing this change. It will break the very idea of the Internet as a common when URLs can't even be typed in on all keyboards. There are good reasons why DNS didn't even include the whole ASCII set. Least common denominator is a good design decision. Every character currently allowed is easy to generate on ALL keyboards, can be printed in an unambigious way by EVERY printing system, etc. Remember that a lot of wire services aren't even 7-bit ASCII clean, email addresses on a lot of news wires have to use (at) instead of @.

      More bluntly, of what use is the parts of the Internet I can't even type the domain name for? As things now stand I CAN, and have, snarfed firmware directly from .com.tw sites where I couldn't read any of the text. Learned things from sites where I couldn't read anything but the code text and command lines. Seen images and understood even when the captions were meaningless to me. I'm sure the reverse is equally true, that those who do not speak English still benefit from the English majority of the Internet the same way. All this because DNS is currently universal. Break that universal access feature and, frankly they can just as easy ingore ICANN and just get the hell off the Internet and make their own walled garden network based in IPv6 technology.

      At a minimum, unicode DNS should be restricted to IPv6 ONLY. No sense wasting scarce IPv4 resources on supporting walled off ghettos.

      --
      Democrat delenda est
    5. Re:Changing a system by krell · · Score: 1

      "More bluntly, of what use is the parts of the Internet I can't even type the domain name for?"

      There might be some places that would like to block going to sites that don't have certain character sets as their name.

      --
      Where were you when the voynix came?
    6. Re:Changing a system by voice_of_all_reason · · Score: 1

      And, of course, you need to make sure when someone types this into any application ever made that access the internet Fixed.

    7. Re:Changing a system by imbaczek · · Score: 2, Insightful

      Except that it doesn't. Being allowed to use 37 characters as a domain name is not what many people consider "working".

    8. Re:Changing a system by ericlondaits · · Score: 4, Insightful

      Accented vowels would be a problem, at least in spanish. Though their use is "mandatory", people with mediocre spelling don't use them in the internet. Even people who use them don't always do it: even though the use of accents is mostly regular, there are many (and very common) irregular placements.

      Let's say for instance we have an online shop for tea called "Sólo Té" (Tea Only). Both accents are due to irregular rules ("Sólo" = "Only" and "Solo" = "Alone", "Te" is a personal pronoun and "Té" = Tea). Some people would try the current www.solote.com, others would try the correct www.sóloté.com, some would try www.sólote.com and yet others www.soloté.com depending on their spelling capabilities.

      What this basically means is that in order to make sure everybody finds your domain and to avoid phishing you have to register four different domains.

      A solution to this problem could be what Google does right now with accents: map them to the unnacented vowel. Thus "Solo Te" and "Sólo Té" would both find the "Sólo Té" store.

      --
      As a Slashdot discussion grows longer, the probability of an analogy involving cars approaches one.
    9. Re:Changing a system by Exocrist · · Score: 1

      I'm all for adding support for non-latin characters. However, I do realize the problems with different encodings, similar looking characters, etc. Perhaps there should be a system that knows that certain characters are equivalent? I imagine that might cause some significant overhead...

      There are already some systems that use this, for example: http://centralops.net/co/DomainDossier.aspx?addr=v rsn-end-of-zone-marker-dummy-record.root&dom_dns=t rue
      Click some of the links on the bottom.

    10. Re:Changing a system by Tet · · Score: 2, Insightful
      Wont this open up the system to many more phishing attacks involving addresses which include non-latin characters which look similar to latin ones?

      Potentially, yes. But I'm not too bothered about that. Protecting people from their own stupidity is rarely a good long term strategy. However, i18n for DNS is a particularly bad idea for purely pragmatic reasons. Currently, anyone anywhere in the world can go to any URL in the world in their web browser. If we allow the full range of unicode characters, that simply ceases to be true. When URLs start containing unicode characters, many people are simply not going to be able to enter them into their computer (with current input methods, anyway). True, many of those sites will not be of interest to the average person that doesn't have a convenient way to enter the URL anyway. But there will always be those that need to grab a data sheet from a Taiwanese electronics manufacturer, or look at live results from a sporting event in the middle east. That will cease to be possible with i18n. As you say, the system currently works. Changing it for political reasons is just stupid.

      --
      "The invisible and the non-existent look very much alike." -- Delos B. McKown
    11. Re:Changing a system by Anonymous Coward · · Score: 1, Funny
      Changing a system which works is a very, very bad idea.


      George? Is that you? We really need to talk about this Gitmo thing openly.

    12. Re:Changing a system by BJH · · Score: 1

      It already works, at least mostly (and assuming /. doesn't fuck up that URL).

    13. Re:Changing a system by Bwian_of_Nazareth · · Score: 1

      Well, in many ways, the system does NOT work. It does not work for me when I want to register the name "zluounký" or "". It may work for people in the US, but it does not work for most of the world. It works in what it does but it does not do what people need.

    14. Re:Changing a system by BJH · · Score: 1

      Which it did. Well done, Slashcode.

    15. Re:Changing a system by benoitg · · Score: 1

      I think it can be argued that as a naming system DNS doesn't work very well if it prevents most of the planet form using common, everyday words in their native language, or forces them to mis-spell them.

      "But it works now, why change it?" Well, the same could have been said of IP addresses, and that didn't stop us from moving to DNS!

    16. Re:Changing a system by coastwalker · · Score: 1

      They are going to have to wait for the system to be capable of using more characters.

      Its just a fact of life that the encoding scheme implemented has a limited set of characters that is readable by the technically adept people who built the thing.

      Its a great idea to enable lookup by character strings using alphabets from other languages but if it takes time to implement a global standard thats just too bad.

      If they are desperate to implement lookup in their own character sets then let them get on with it - but don't expect the lookup to work outside their own countries if it causes problems.

      --
      Facts are history now plebs have politics for religion on social media.
    17. Re:Changing a system by CRCulver · · Score: 1

      When URLs start containing unicode characters, many people are simply not going to be able to enter them into their computer (with current input methods, anyway).

      Windows, Mac OS X, and Linux distros all have a character map. There's no way it can be "impossible" to enter such URLs.

    18. Re:Changing a system by GeorgeS069 · · Score: 2, Funny

      "37?!!"
      "...in a row??"

      --
      I'd rather have a bottle in front of me than a frontal lobotomy
    19. Re:Changing a system by Sin+Nombre · · Score: 5, Insightful

      'when URLs can't even be typed in on all keyboards'
      As far as Japanese go, there are very usable technologies that allow to type in kanji. Using a standard latin keyboard. It works pretty well, and i'm not sure what other languages have such options available, but since most of Asia uses the same kanji system I'm pretty sure that at least Asia has viable typing options.
      'of what use is the parts of the Internet I can't even type the domain name for?'
      Its of no use... to you. But then again, can you read Japanese, Korean, Arabic, Sanskrit or any other non-latin language? no? Then your usability isn't in question here.

      --
      "Im such a nonconformist I'm going to not conform to the rest of you!"
      "Dude I think we just got goth-served"
    20. Re:Changing a system by salec · · Score: 1

      I agree. The characters are grapheme, so we should treat them that way.

      Perhaps even whole idea of encoding alphabets is a relict (and biased to phonetic alphabets, as well)? Today computers have enough power to operate on pictures as UI, so why don't we switch to shape-based data processing?

      That would instantly break digital divide between present and history (think digitization of ancient documents) and between various cultures.

      As bonus, we get to ditch keyboard-induced RSI, that feeling of being constrained in what you can do "on computer" and plus my kids will get a motivation to learn to handwrite nice (and to speak laud and clear)!

    21. Re:Changing a system by teh+kurisu · · Score: 4, Insightful

      Just because the letters aren't printed on your keyboard doesn't mean it won't type them. Have a look at the list of keyboard layouts in your OS. Sure, it's an inconvenience for you, but less of an inconvenience than it is to the people for whom it is a barrier to entry. Or you could use Google - a lot of people don't even bother typing in domain names any more, they just search.

      The whole point about this is that it avoids walled gardens, because the DNS records are still held by ICANN. The alternative is that China decides it's had enough, and creates its own root servers, causing a very real split.

    22. Re:Changing a system by voice_of_all_reason · · Score: 1

      And what happens when the owners of "sol ote" (sol as in the sun), who already have a website, find people attempting to access it "helpfully" redirected to your example?

    23. Re:Changing a system by stunt_penguin · · Score: 1

      I call dibbs on slashdöt.org

      --
      When the posters fear their moderators, there is tyranny; when the moderators fears the posters, there is liberty.
    24. Re:Changing a system by Colin+Smith · · Score: 1

      You're right. It'd just be extraordinarily difficult to choose the letters you need. In fact, it would be easier and more reliable to simply to use the IP address.

      --
      Deleted
    25. Re:Changing a system by Cygnus78 · · Score: 1

      Every character currently allowed is easy to generate on ALL keyboards

      Not at all true, the pirates (http://www.craphound.com/images/piratekeyboard.jp g) are still struggeling.

    26. Re:Changing a system by LoonyMike · · Score: 1

      That wasn't unicode DNS, that was an unicode path for the slashdot.org server.

    27. Re:Changing a system by pla · · Score: 1

      As perhaps the better question, does registering "www.førÐSüçks.org" count as domain-squatting or a trademark issue? How about the same thing in Kanji that, purely by accident (ie, no historical or functional similarity to the Latin characters, only superficial appearance) looks very much like the same phrase?

      C'mon, people... Get serious. English had all those stupid little accent marks once upon a time, and they went away because the language worked better without them! Take the hint. Standardize or get left behind (and I don't mean that as flamebait, just the way the world works).

    28. Re:Changing a system by Anonymous Coward · · Score: 0

      In the immortal words of Sir Abraham Lincoln: "I seem to have unwillingly linked to a completely bizarre site trying to give a good example."

    29. Re:Changing a system by Hamled · · Score: 2, Insightful

      This could certainly be a big win for Google.

      I'm all for people using and having websites and domains in their native language and alphabet, however it would be very difficult for me to find traditional Persian music (which I happen to be fond of) if the domain were .sa (if that doesn't show up, it was a simple translation of persianmusic.sa into Arabic). On the other hand, I could probably find that site through Google, and largely would have to go through Google or some other search engine, to find and visit websites and domains in another alphabet.

      On the other hand, I suppose that's how I do it now.

    30. Re:Changing a system by jb.hl.com · · Score: 1

      But then again, can you read Japanese, Korean, Arabic, Sanskrit or any other non-latin language? no? Then your usability isn't in question here.

      However if you can read those languages, and you want to go to a website in that language using your Latin-alphabet keyboard, you may be a little screwed.

      --
      By summer it was all gone...now shesmovedon. --
    31. Re:Changing a system by Znork · · Score: 1

      "A solution to this problem could be what Google does right now with accents:"

      An even better solution to this problem would be for the user to simply use a search engine and query for solo tea store, which would get him to the right place whatever their domain name is.

      Dont make more of domain names than what they are. These days they're usually cut'n'pasted, linked or found through searches anyway, so the problem just isnt that big of a deal.

    32. Re:Changing a system by Anonymous Coward · · Score: 0

      What you and many others don't get, is that it DOESN'T work, not for all. It works for you. Imagine if things were different and only cyrillic characters were allowed, you would have to switch keyboard layout every time you want to go to .. (ok, no one would *want* to go there, but you get the idea, I hope). Very few languages can be written just with ASCII, and some have some horrible orthography quirks because of that (like Swahili and ng', that's a trigraph! is so much nicer in my opinion). Just to name a few languages that aren't written in ASCII (even though they're written in Latin script): every romance language (you can't get more 'Latin' than that), all north Germanic languages, German, Dutch and Afrikaans, Finnish and other Finno-Ugric languages (I don't know if all of them, but most, I'm sure), Slavic languages that don't use cyrillic often have non-ASCII characters as well. Actually, off the top of my head, I can only think of two languages that can be written with ASCII only, and that's English and Swahili (its orthography looks designed by an Englishman, if you ask me).

    33. Re:Changing a system by Alioth · · Score: 1

      Phishing attacks can be prevented with some simple rules:

      com/org/net/edu/mil etc. are Latin-1 only

      Unicode characters can only be used within the country code top level domain where that character set is used - for example, cyrillic can only be used in .ru/.ua/.su (and all the other cyrillic using countries) - and not .com/.uk/.org/.us etc.

    34. Re:Changing a system by minion · · Score: 0, Flamebait

      The whole point about this is that it avoids walled gardens, because the DNS records are still held by ICANN. The alternative is that China decides it's had enough, and creates its own root servers, causing a very real split.
       
        And that would be a travesty how? They don't want to have the internet anyway - it has too many things they deem "inapporiate". I think they it'd just be better off if they did have their own internet. Might make the people rebel a bit faster and overthrow that communist government and get something in there that represents real human rights.

      --

      -- If we don't stand up for our rights, now, there will be no right to stand up for them later.
    35. Re:Changing a system by ericlondaits · · Score: 1

      I actually think mostly the same... but I still sometimes get to places through URLs. Particularly if our hypothetical on-line tea store had a stone and mortar store they'd probably hand out cards or flyers with their URL. In cases like this the URL sometimes is all you've got because the store is new and Google hasn't indexed it yet, or the site has a low PageRank (lower at least than other search results), or because the store's name uses words that are too common.

      So, the store would have to pay Google so whenever someone searches for "Solo Te Store" their link comes up... which would be like a DNS, except for the fact that I can pay for my competing store to show up on the same query and probably steal some customers.

      Domain names are a really poor solution to identifying and accessing web sites, but alternatives seem to have their share of problems too.

      --
      As a Slashdot discussion grows longer, the probability of an analogy involving cars approaches one.
    36. Re:Changing a system by Zaatxe · · Score: 3, Insightful

      As far as Japanese go, there are very usable technologies that allow to type in kanji. Using a standard latin keyboard. It works pretty well, and i'm not sure what other languages have such options available, but since most of Asia uses the same kanji system I'm pretty sure that at least Asia has viable typing options.

      I wonder how you got +4 mod points... this makes no sense at all!!

      Let's suppose you are are a japanese person and you travel to Brazil. Nevermind if can speak portuguese or not, but then you need to send an e-mail using your company's webmail server from a computer at the hotel. And suppose this webmail server has kanji characters in its URL. How are you going to type them? Believe me, brazilian portuguese Windows has no support for asian languages (at least not by default, and actually I don't know if it's even possible with a regular brazilian Windows XP). What now?

      --
      So say we all
    37. Re:Changing a system by Anonymous Coward · · Score: 5, Informative

      What's this? I've been able to use the Norwegian characters in domain names for a long time. There are screetshots over at http://en.wikipedia.org/wiki/Internationalized_dom ain_name

    38. Re:Changing a system by EvilIdler · · Score: 1

      This westerner frequently uses two 29-letter alphabets.

    39. Re:Changing a system by Anonymous Coward · · Score: 0

      Then I guess maybe it's time for those people to fix the problem. We set things up, and they work for us. You now think we should help you out as well. Too bad. Learn some self-reliance, nancy boy.

    40. Re:Changing a system by amorsen · · Score: 1

      In fact, it would be easier and more reliable to simply to use the IP address.

      Using the IP address rarely works. All but the largest sites depend on virtual hosts.

      --
      Finally! A year of moderation! Ready for 2019?
    41. Re:Changing a system by Zaatxe · · Score: 1

      Anyway... aren't there more than 37 characters? I can think of the 26 letters (A through Z, both included), 10 digits (0 through 9, both included), dot, slash and dash/hyphen. I don't know if I should include '&', '=' and '?', but that doesn't matter much, because even without them, we would already have more than 37 characters.

      --
      So say we all
    42. Re:Changing a system by dingDaShan · · Score: 1

      We already have enough problems in spelling already just try out gogle.com, google.co, or any other variant.

    43. Re:Changing a system by amorsen · · Score: 1

      Its just a fact of life that the encoding scheme implemented has a limited set of characters that is readable by the technically adept people who built the thing.

      It's not particularly a fact of life. It's just a policy that ICANN has decided on. Domains in .dk are available with the Danish letters æ, ø, and å, admittedly by using a harebrained encoding scheme. ICANN could easily allow more letters in domain names, either using the same harebrained scheme or by simply using UTF-8.

      --
      Finally! A year of moderation! Ready for 2019?
    44. Re:Changing a system by operagost · · Score: 1, Informative

      If you can explain to me how you register a domain with a space in it, I'll try to answer your question.

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
    45. Re:Changing a system by MrNougat · · Score: 3, Interesting
      Though their use is "mandatory", people with mediocre spelling don't use them in the internet.


      I don't have mediocre English spelling, and I would use the correct accented characters in English words like "naive" - except I don't know how to type those characters. Like many people, I know how to type the characters that are on the keyboard. Additionally, because there's no need for me to type characters outside the ones printed on the keys on my keyboard to make the internets come down my tubes, I have no incentive to learn how to type any differently than I already do.

      It's not necessarily a matter of spelling ability.
      --
      Web 2.0 == Giant Blogspam Circle Jerk
    46. Re:Changing a system by jZnat · · Score: 1

      They register sol-ote.com (or the other site registers solo-te.com instead). Ain't punctuation great?

      --
      'Yes, firefox is indeed greater than women. Can women block pops up for you? No. Can Firefox show you naked women? Yes.'
    47. Re:Changing a system by dasunt · · Score: 4, Insightful
      As far as Japanese go, there are very usable technologies that allow to type in kanji. Using a standard latin keyboard. It works pretty well, and i'm not sure what other languages have such options available, but since most of Asia uses the same kanji system I'm pretty sure that at least Asia has viable typing options.

      I must have missed where Japan conquered 51%+ of the area east of the Ural mountains.

      AFAIK (and I'm not an expert), China, Japan, Korea and Vietnam used very similar writing system decended from Chinese Hanji characters. Vietnam and Korea (South Korea at least) later adopted other alphabets. So really, only China and Japan commonly use Hanji/Kanji, and even then, the CJK unification of hanji/hanja/kanji characters really annoyed a few purists when similar hanji/hanja/kanji were merged in unicode.

      So, other than hanji/kanji, there is hangul (S. Korea), hana/kana (Japan -- yes, they have more than one writing system!), the Thai alphabet, the Cyrillic alphabet (former USSR), the Arabic alphabet (Middle East), Hebrew (Israel), the Brahmic scripts (India) and the Georgian alphabet. (And this is just off the top of my head, I wouldn't be surprised if there were a few more writing systems in use in Asia!).

      And then, just to confuse the problem, there are the various forms of encoding. Admittedly, unicode would probably be one of the better methods, but there are a lot of pre-unicode encodings in common use.

      When you expand the problem to be worldwide, there's also the Ethiopian and Greek alphabets that are used in their respective regions. There's also a ton of latin-based alphabets, which introduces many more characters than are currently used in the DNS system. (Including characters that look a lot like existing characters!)

      And then you have the problem of alphabets used only by very small groups, such as Cherokee (Oh, I'm going to get flamed!). There are very few people who can write in Cherokee, but does that mean that the Cherokee language shouldn't be part of the DNS system?

      Now, can you see why this is a mess?

    48. Re:Changing a system by ichigo+2.0 · · Score: 1

      Obviously you enter the ASCII/UTF codes (that you've memorized) for the characters! What kind of nerd are you? ;)

    49. Re:Changing a system by jZnat · · Score: 1

      A domain name can only contain [-a-z0-9]; a URI can include all the characters that you were describing. Domain names are split by full stops, so a FQDN would match something like ([-a-z0-9]+\.)*[-a-z0-9]+

      --
      'Yes, firefox is indeed greater than women. Can women block pops up for you? No. Can Firefox show you naked women? Yes.'
    50. Re:Changing a system by voice_of_all_reason · · Score: 1

      While that will work somewhat, it doesn't address the GP's original dilemma -- you want to register the max number of permutations for your brand name. A space in the name simply doubles that number.

      In this case, what happens when one of these companies sues the other over a combination they didn't think to register that is now supposedly infringing on their trademark? Even with latin-only characters we don't know how to handle this -- afaik mikerowesoft.com was settled.

    51. Re:Changing a system by Anonymous Coward · · Score: 0

      Got to hell imperialist pig. You might note the web wasn't invented in the US. It's just effectively controlled there. And like in every other endeavour on earth the americans are doing a great job of getting themselves hated even more because of their goddamed "fuck you everyone else this is about us" attitude.

    52. Re:Changing a system by Znork · · Score: 3, Insightful

      "It will break the very idea of the Internet as a common when URLs can't even be typed in on all keyboards"

      You know, when one sees comments like that, it's not strange that non-7bit ascii countries find themselves rather exasperated with the rate of progress. If you take a few seconds to actually research the issue you'll find both a suggestive lack of multi-thousand key keyboards, as well as a whole host of solutions to that problem.

      I mean, I can cut'n'paste chinese and japanese into vi, save the file with a unicode filename, and it'll just work. Earlier valid technical reasons are gone, everyone else has solved this; now the excuses start sounding really hollow.

      It's time to drag DNS kicking and screaming out of the dark ages.

    53. Re:Changing a system by ericlondaits · · Score: 2, Informative

      If you are spanish-speaking (which was my example) not knowing how to place accents is not an excuse. They're a fundamental part of the language, unlike in english where they're only required for foreign words written in their original form.

      In Argentina some people have keyboards with spanish language distribution (that is, with extra letters) and some learn the ASCII codes and use the ALT key (along with the code typed in the Numpad) to place accents and the letters Ñ and ñ (which are mandatory as well and can't be replaced by N or n... specially when Año means "year" and Ano means "Anus").

      I know of many people that know how to place accents and are just lazy... but I consider that a sign of poor spelling as well, since the best spellers I know use all accents and get a bit of pain every time they find an omission (which normally changes the meaning of the word, makes fluent reading a bit more difficult, and it's just ugly).

      --
      As a Slashdot discussion grows longer, the probability of an analogy involving cars approaches one.
    54. Re:Changing a system by Doctor+Memory · · Score: 1
      I must have missed where Japan conquered 51%+ of the area east of the Ural mountains.

      They didn't &mdash China did. They just adopted the name "Japan" as it had better brand recognition. Unfortunately, due to censorship issues, the news hasn't gotten out yet...
      --
      Just junk food for thought...
    55. Re:Changing a system by BJH · · Score: 1

      No, it was a Japanese URL that Slashcode decided it didn't recognise inside an a tag and thus converted to a local URL.
      Try it and see.

    56. Re:Changing a system by aardvarkjoe · · Score: 1
      Imagine if things were different and only cyrillic characters were allowed, you would have to switch keyboard layout every time you want to go to

      No, I wouldn't. I'd type "slashdot" into google, and then click on the cyrillic-character domain name to get here. And, of course, such things would be done automatically in the next generation of browsers.

      All DNS really is is a method of translating readable names into IP addresses. I don't see any real need to shove this stuff into the existing system when it could be done more easily and less riskily with a new parallel or higher-level system.

      --

      How can we continue to believe in a just universe and freedom to eat crackers if we have no ale?
    57. Re:Changing a system by amorsen · · Score: 1

      Believe me, brazilian portuguese Windows has no support for asian languages (at least not by default, and actually I don't know if it's even possible with a regular brazilian Windows XP). What now?

      You're screwed already, if you don't happen to have a link somewhere. Try Korsør (Ha! Slashdot is messed up and can't handle æøå in domains. The link is supposed to be www.korsør.com).

      But what is it with Slashdotters these days? Luddites and conservatives, all of you! There is such a thing as POSITIVE change, you know.

      --
      Finally! A year of moderation! Ready for 2019?
    58. Re:Changing a system by Zaatxe · · Score: 1

      Hey, nice!!! This site has some information in english, some in portuguese and some in a language I can't identify (probably norse/swedish/icelandic/atlantean, I happen to know a few languages and that is none of them).

      Anyway, I'm not against change, either positive or negative. Actually, first time I saw about non-latin characters in URL's I thought it was a wonderful idea. But I foresee some problems with this, that's all. Japanese people won't have problem with japanese characters in URL's in Japan, same about all other languages. But problems will happen.

      --
      So say we all
    59. Re:Changing a system by tacocat · · Score: 0

      How do I type in a character for a domain name that isn't supported on my keyboard?

      No matter what you do, I'm still limited to the keys on my keyboard. I think that's ~104 by last count. But I certainly don't use that many characters.

      I admit that there are some people who are going to bitch about the internet being english. But does that give me a right to bitch about classical music being defined in French and Italian terms like a fugue, sonata, adiago, allegro... I think not. In the past 400 years we've all managed very nicely to adopt to these terms in order to converse with each other an a common basis.

      Perhaps there are some terms that these anglicans can adopt from the middle east besides Jihad?

    60. Re:Changing a system by Anonymous Coward · · Score: 0

      And, with the modifying of part of one pixel worht of data, we can phish limitlessly without being noticed. Woo!

    61. Re:Changing a system by Melfina · · Score: 1
      That's more of a webmaster's problem than anything else. Even with standard english we come across difficulty like this.

      Anyone remember pen island?

      --
      :3 rawr.
    62. Re:Changing a system by shadowdodger · · Score: 1

      It really would not be that hard to 'expand' the characters on your keyboard to allow more possible options.
      You could simply code the url line of your web browser to accept more complex text imputs like and good text program.
      Most work editors can accept something of ctrl+354 scheme to enter other non-latin characters. Why can't my browser?..

      As to the comment a few people down talking about misspellings of webnames and the like, you're going to have that problem no matter if you use latin or non latin characters. You're going to have stupid people that mispell your name no matter what, so that doesn't really affect the nature of what character set we should use.

    63. Re:Changing a system by tank3544 · · Score: 1

      "Its of no use... to you. But then again, can you read Japanese, Korean, Arabic, Sanskrit or any other non-latin language? no? Then your usability isn't in question here." You're logic doesn't work for all of us. In my job I have to access sites that I surely can't read several times a month. That's why things such as Babelfish and other page translators are so important.

    64. Re:Changing a system by DrVomact · · Score: 1

      In German, the "special" characters (ä, ö, ü and ß) can be replaced by ae, oe, ue, and ss, respectively. This isn't orthographically orthodox, but it's generally recognized as an expedient for people who don't have proper keyboards. Google seems to recognize these substitutions. So German isn't a problem.

      I'd like to know exactly who is insisting on this "inclusiveness". Do the Chinese really want to include their thousands of characters in domain names? Are the Japanese clamoring for Hiragana, Katagana, and Kanji domain names? If it's just some stupid UN agency or the European non-Union, I'd say ignore them.

      Sometimes, compromises have to be made for the sake of practicality, even if those compromises aren't culturally sensitive. For example, the internationally accepted language for civilian air communications is English. Can you imagine the result if everyone insisted talking only their own language? I can just see the black box transcript:

      Pilot: What the f*ck does "ändern Sie sofort Höhe zu 3000 Meter" mean? He wants me to climb another 3000 meters? Or is it lower? Or..hey Jack, can you bring up Babelfish on your cellphone?
      Airplane: Crunch

      If we're going the internationalization route with domain names, I'd say we'd be better off just using IP numbers. Domain names were supposed to make navigating to (and remembering) web sites more convenient; this change would make domain names more cumbersome than just a string of numbers. Well, that'll work until someone insists on using Roman numerals for IP addresses...

      --
      Great men are almost always bad men--Lord Acton's Corollary
    65. Re:Changing a system by bernywork · · Score: 1

      Just to add fuel to a flame war, this could potentially work, fortunately we have all been there and now we are in the process of correcting this with another system. IPv4 and NAT. Anyone else see the similarities to what's been suggested here?

      We all know that at the end fo the day we are going to run out of domain names.

      --
      Curiosity was framed; ignorance killed the cat. -- Author unknown
    66. Re:Changing a system by jacksonj04 · · Score: 2, Insightful

      So when foreign stores open their UK or US branches, complete with accented characters, how in God's name are UK and US citizens going to suddenly learn how to type accented vowels, umlauts etc?

      --
      How many people can read hex if only you and dead people can read hex?
    67. Re:Changing a system by dillee1 · · Score: 1

      Chinese characters are exactly a counter example for internationalization of the DNS. Almost every nation in the chinese cultural influence circle have their own variation(s) of kanji, and own version(s) of kanji coding and input methods.
      e.g. the char (war)
      traditional chinese(big5):
      simplified chinese(gb):
      kanji(jis/euc):

      Most people who can read chinese character will be abled to read all variations without trouble, thus mix used of different regional variations might not be apparent. The different variations of the same character however have distinct unicode and computer treat them differently. Obvious problem will be e.g. 1)User confusions because their local IME input wrong variation of the character 2)Phishing with char variations

      And the fun doesn't stops here. With Chinese char DNS possible, users will not hesitates to use chinese path/file name as well. Many OS does not supports unicode properly, and thus use local coding on the URL instead. Disasters like http://unicode.domain/gb3212/big5/euc.filename will spread like wild fire. Even with an unicode OS, aforesaid char variations problem will still be there.

      So yeah, welcome to our new internationalized DNS and see IT support cost explodes.

    68. Re:Changing a system by dsanfte · · Score: 1

      The forced use of Latin DNS characters is a normalizing force towards the western alphabet, and should be embraced for being such. It can only be better for everyone involved, in the long run, if we all end up using the same alphabet.

      --
      occultae nullus est respectus musicae - originally a Greek proverb
    69. Re:Changing a system by DragonWriter · · Score: 1
      I'd like to know exactly who is insisting on this "inclusiveness".

      Countries "in the Middle East and Asia", probably in part because they use non-Latin alphabets without a single universally accepted mapping to the Latin alphabet so that Latin-alphabet names are extremely inconvenient.

      Sometimes, compromises have to be made for the sake of practicality, even if those compromises aren't culturally sensitive. For example, the internationally accepted language for civilian air communications is English. Can you imagine the result if everyone insisted talking only their own language?


      Familiarity of domain names (and URLs more generally) to people who speak a different language from the domain operator is not anything at all like consistency of safety messages in air traffic.

      If we're going the internationalization route with domain names, I'd say we'd be better off just using IP numbers. Domain names were supposed to make navigating to (and remembering) web sites more convenient


      And internationalization does that: not all users are, after all, English users, and internationalization allows those selecting domains to select domains that are convenient for their intended audience.

    70. Re:Changing a system by miller60 · · Score: 1

      Back in 2005 Firefox briefly disabled its support for IDN after The Shmoo Group demonstrated the ease of using Internationalized Domain Names (IDNs) to spoof existing domains, including those for major retailers or banks. At the time, Mozilla said domain registrars were ignoring ICANN guidelines on IDN, and developed a list of problematic Unicode characters that could be banned in domain names to limit homographic attacks. Not sure if this is still current.

    71. Re:Changing a system by fbartho · · Score: 1

      Use ALT+[Number pad code] on windows. ALT+130 is acute-accented-e é é (I use it all the time when I write French, because that's the accent I most run across, and I have a Dvorak layout on a QWERTY keyboard, and I never bothered to learn all of the different program's shortcut keys to print accented characters each OS) you can also use the System tool available on most OSes that lets you mousehunt to pick from a table showing many characters, from the keyboard accessible through the range of unicode formats.

      --
      Gravity Sucks
    72. Re:Changing a system by Kirkoff · · Score: 1

      I think the point is that I may want to visit a site in Khmer, Laotian or Thai - all Asian languages that don't use the kanji you mentioned - unless I have a keyboard or key layout that supports that alphabet, I cannot type it in. Say, for example, that you know how to read Chinese and you get a computer with an English Keyboard in an Internet cafe. Try going to that website.

      The point isn't that it doesn't cater to those who don't know the language. The point is that not all people will be able to get to all websites due to different entry systems. Beyond that, not all writing systems are in Unicode yet.

      --
      There are exactly 42,935,718 letter sized sheets in a square mile.
    73. Re:Changing a system by HomelessInLaJolla · · Score: 1

      > I'm all for adding non-latin characters

      In theory, yes. In practice, no. Here's why: the system has already been standardized and stabilized. The system currently works. If people are upset because their character set isn't supported in DNS then please cry a river and then go get a box of kleenex and get over it. If those societies had set up the DNS system in the first place perhaps they could have done it perfect to their hearts' desire. Since they didn't then they can't. The system is what it is. As a network society we shouldn't expand something so close to the core of the operation of the global internet until a comprehensive majority of downstream apps are written to conform with protocols and specifications which will handle an expanded DNS character set.

      A more important problem to address at this time is farming of the existing DNS name space. Too many domain names are being held--for ransom, just for the sake of holding them, or to spite the guy down the street that wants the domain name--by business units (companies, divisions of companies, departments of divisions of companies, conglomerates, associations... anything that isn't private interest) and are no longer available to people who may legitimately have a reason to put them to use.

      --
      the NPG electrode was replaced with carbon blac
    74. Re:Changing a system by sjf · · Score: 1

      I'm not sure what IPv4 vs. IPv6 has to do with this. In fact, DNS is a solution to the IPv4 problem, not a cause. (IP addresses are scarce, not domains.)
      On the other hand, I think that there is a case to be made for handing DNS over to Google. Let them handle unicode to address translation, they all-but do this today.

    75. Re:Changing a system by Anonymous Coward · · Score: 0

      No thanks. My keyboard may someday cause me RSI. But until then, it is *FAST*. It's faster than writing, it's faster than drawing, hell sometimes it's faster than thinking (can be a problem when posting, but hey it seems normal around here). The only input device I'd honestly consider upgrading to would be a brain-reader and/or transmitter.

    76. Re:Changing a system by putte_xvi · · Score: 1

      The Brazilian computer without a Japanese method probably doesn't have Japanese fonts either, so even if the hypothetical webmail server had an ASCII URL, you wouldn't be able to read any of the text.

      If it does have the fonts, but for some reason not an IME, I suppose you could Google for the company's webpage, and cut and paste. Or find the characters with WWWJDIC.

      So kanji in the URL doesn't present any additional obstacle.

    77. Re:Changing a system by coaxial · · Score: 1

      Yes keyboards exist to type those extended characters, but the fact is that not everyone that is a potential user of the site with the accented domain has the correct keyboard. You mention spanish, and that's a great example. Spanish is the second most popular language in the United States, yet I have never seen a keyboard in the United States that has the keys to type spanish words correctly. Simply calling people lazy or stupid for not typing accents is frankly lazy and stupid. For a large number of users, it's impossible to do the right thing.

      Unicode domain names sound nice, but frankly they're an unworkable solution.

    78. Re:Changing a system by coaxial · · Score: 1

      I don't have mediocre English spelling, and I would use the correct accented characters in English words like "naive" - except I don't know how to type those characters.

      To be pedantic, naive is properly spelled with an umlaut, not an accent mark. The general term is diacritical marks. And don't call the diacriticals in pinyin "accent marks." They're tonal marks damn it. ;)

      You're point is dead on though.

    79. Re:Changing a system by sfurious · · Score: 1

      More bluntly, of what use is the parts of the Internet I can't even type the domain name for? As things now stand I CAN, and have, snarfed firmware directly from .com.tw sites where I couldn't read any of the text. Learned things from sites where I couldn't read anything but the code text and command lines. Seen images and understood even when the captions were meaningless to me.

      Where did you find these sites? I'd guess via a search engine, in which case you didn't need to type the domain name at any point.

    80. Re:Changing a system by MrNougat · · Score: 1
      You're point is dead on though.


      I know you've got to be smacking yourself about that one, but I'll read it as extremely funny intended sarcasm.
      --
      Web 2.0 == Giant Blogspam Circle Jerk
    81. Re:Changing a system by minus_273 · · Score: 1

      "but since most of Asia uses the same kanji system"

      holy shit, you stepped on one there. Most of asia DOES NOT use Kanji. Kanji it derived from Chinese. Chinese is fragmented in two forms since the communists took over, simplified and traditional and neither map to Kanji 1 to 1. I am not sure about Hanja but i doubt it as well. Japanese also has kana and korean has hangul which are completely different from chinese characters.

      Much of South East Asia uses a modified latin alphabet south asia uses a sanskrit based alphabet. Add arabic and hebrew and thats a ton of new characters. The two chinese, korean and Japanese alone push you into hundreds of thousands.

      --
      The war with islam is a war on the beast
      The war on terror is a war for peace
    82. Re:Changing a system by From+A+Far+Away+Land · · Score: 1

      I thought of the same phishing problem, which has been known about in the tech world for some time now. Kids discovered it through 1337 speek too, using numbers that look like letters to fill in for names they otherwise want to claim. Will we see more 3Bay.com sites because of the move to include other languages? I think certainly we will.

      In a way it seems like adding buttons to a phone's keypad, instead of having the world settle on a single standard. I suppose I'd feel differently if it was my language that wasn't the standard, though.

    83. Re:Changing a system by ericlondaits · · Score: 1

      Read my words a little bit better, and with a little bit of good will (just enough to not assume I'm a complete idiot)... would you? I never stated that americans would be lazy or stupid for not typing accents.

      What I DID say is that even among people that SHOULD type the accents (that is, people in spanish speaking countries), it's not consistently done. Some are lazy, some have poor spelling.

      I certainly never called you either lazy or stupid, but you managed to state I was both. Nice way of having a healthy discussion...

      --
      As a Slashdot discussion grows longer, the probability of an analogy involving cars approaches one.
    84. Re:Changing a system by Anonymous Coward · · Score: 1, Interesting
      Perhaps there are some terms that these anglicans can adopt from the middle east besides Jihad?


      http://www.krysstal.com/borrow_arabic.html

      http://www.krysstal.com/borrow_farsi.html

      http://www.krysstal.com/borrow_hebrew.html

      HTH
    85. Re:Changing a system by coaxial · · Score: 1

      You are too kind. :)

    86. Re:Changing a system by mibus · · Score: 1

      It gets worse. A domain name can only start with a letter, and can only end with a letter or number.

    87. Re:Changing a system by dcam · · Score: 3, Insightful

      You mean: It can only be better for me, in the long run, if we all end up using my alphabet.

      --
      meh
    88. Re:Changing a system by coaxial · · Score: 1

      But see you missed my point. There's litterally millions of Spanish speakers in the United States, almost none of which can type the characters. You used Argentina for example, so let's use that. Say there's an expatriate Argentinian that wants to read about what's going on back in Argentina. He goes to the website for the newspaper Clarín. Let's say Clarín's jumps on the unicode dns bandwagon and registers clarín.ar, and for some reason discards their current clarin.ar domain. The expatriate just got screwed because he has the wrong the keyboard and can't find the correct keyboard anywhere in the country he now resides.

      The key problem with your position is that you believe that at some level not using diacriticals is a choice. For a very large number of people, it's impossible for them to do so. That is the problem. You can't dismiss these people. That's balkanization. You've gone from people self-selecting to throwing up a velvet rope and saying that certain people because of where they live and what technology they have access to aren't wanted. That is a stupid.

    89. Re:Changing a system by cortana · · Score: 3, Informative

      It depends on your operating system. The "standard" way is to hold Ctrl+Shift and then type the hexadecimal representation of the unicode code point that you want, but that conflicts with a lot of keyboard shortcuts that people use and so implementors often alter it a bit (for example, with GTK+ you press Ctrl+Shift+U and then type the code point).

      If your keyboard has a compose key then you can often compose a glyph from two similar looking glyphs. For example, for an o with an umlaut, " o -> ö (though I expect Slashdot will filter that character out).

      Macintosh users have an Option key that they can use to make weird glyphs (option-8 for the infinity symbol, option-g for the copyright symbol, etc). On most operating systems, various other combinations of the Ctrl/Shift/Meta/Alt/AltGr modifier keys and regular keys will allow you to type more glyphs. Most desktop environments also have an on-screen keyboard type program that ease experimentation in this area.

      Users of complex (e.g, Asian) scripts have a host of input methods to choose from and configure.

      Finally, if all else fails, create a text file full of your faviourite non-ascii characters and resort to the tried and tested method of copying and pasting! :)

    90. Re:Changing a system by Sin+Nombre · · Score: 1

      Well for one, if you want to go to a website that uses kanji, your browser has to support them in the first place. If you're in a hotel, chances are your browser is IE. If IE supports the language, there is a keyboard map in place for it. I guarantee it.

      Seriously, look at the alternate languages support for windows. They may or may not be pre-installed, but there is support for hundreds, if not thousands of languages already in place. The packages may require installation, but the support is available, and all of these languages are already usable.

      --
      "Im such a nonconformist I'm going to not conform to the rest of you!"
      "Dude I think we just got goth-served"
    91. Re:Changing a system by Sin+Nombre · · Score: 1

      So maybe I'm making a mistake with this, but I'm an Asian Languages and Literatures major. Korea, China, and Japan all use the same kanji. They call them different things, and they all have alternate written languages, but if you can read Chinese, you can read Japanese. And I never said that Japan conquered all of Asia; I said they used the same writing system. My mistake for forgetting the Middle Eastern languages. My fault. However, South Asia and Russia have other things more important to them than getting their languages supported by DNS. Like feeding people. These are the countries that I think are interested in this project. Chinese, Korean, Japanese, Greek and Ethiopian alphaphabets all already have support in Windows at least. My point was typing them is not a problem.

      --
      "Im such a nonconformist I'm going to not conform to the rest of you!"
      "Dude I think we just got goth-served"
    92. Re:Changing a system by Anonymous Coward · · Score: 0
      You might note the web wasn't invented in the US.
      I don't even know how to respond to such a misinformed user except with http://en.wikipedia.org/wiki/ARPANET
    93. Re:Changing a system by kchrist · · Score: 1

      3com begs to differ. As do the millions of people out there with domains beginning with numbers (myself included).

    94. Re:Changing a system by Matthias+Wiesmann · · Score: 1
      Indeed a brazillian windows computer might not be configured to input japanese text, but I don't think this is a reason for discarding the concept, this is merely a software limitation. I'm routinely typing japanese text using a swiss-french keyboard, you just need to have a proper OS. Interestingly, my keyboard does not have the square brackets, the @ or the ~ characters directly accessible (they are reached using the alt-key), but I'm not clamoring for e-mail addresses to stop using those caracters in e-mail addresses or urls.

      At any rate there might be technical hurdles, but the core problem is social. Asian people want a system that supports their own text system for addresses because using roman, non-accented characters for the job is really a kludge. Converting kanjis into plain ASCII leads to ambiguous texts or multiple possible transcriptions. For instance the city of Tôkyô can be transcribed as Tokyo, Toukyou, Tohkyoh. Actually, the correct romanization involves the use of the macron accent which is not supported by slashdot's engine (and I don't feel like mucking with escape sequences).

      So using their own character system for designating servers is my opinion a valid wish. While some urls might not be typable directly, this simply means a fallback mechanism is needed. At any rate, many url are so long as to be very difficult to type. So either we have a common international system to handle this legitimate need, or every country/company will hack around it, which will be much worse.

    95. Re:Changing a system by Anonymous Coward · · Score: 0

      You are wrong. Any character can be typed on a standard keyboard. You just need to educate yourself.

    96. Re:Changing a system by pablo.cl · · Score: 1
    97. Re:Changing a system by McNihil · · Score: 1

      My perfect example would be...

      http://www.m/crosoft.com

      I would like that to be possible maybe?

      How many will even notice the difference?

    98. Re:Changing a system by McNihil · · Score: 1

      Ok slashcode fubared my devious little plan...

      the i in Microsoft should be a Spanish upside down exclamation point.

    99. Re:Changing a system by pablo.cl · · Score: 1
      To be more pedantic, naïve is spelled with a dieresis
      a sign (upper part of ö) placed over the second of two adjacent vowels to indicate separate pronunciation, as in one spelling of the older forms naïve and coöperate: no longer widely used in English.
      An umlaut is visually the same
      a mark (upper part of ö) used as a diacritic over a vowel, as ä, ö, ü, to indicate a vowel sound different from that of the letter without the diacritic, esp. as so used in German.
      On the other hand, calling upper part of ö an accent is not really wrong (see number 4).
      3. a mark indicating stress (as (upper part of á), or (')), vowel quality (as French grave `, acute , circumflex ^), form (as French la "the" versus là "there"), or pitch.
      4. any similar mark.
      Note: Slashdot doesn't allow non-ASCII floating accents. I added "upper part of x" to the dictionary definitions quoted.
    100. Re:Changing a system by pablo.cl · · Score: 2, Informative
    101. Re:Changing a system by sydneyfong · · Score: 1

      Ever heard of search engines? hyperlinks? Bookmarks? Copy and paste? Keyboard layout switching? Input methods?

      Tell me you actually type things like this: http://slashdot.org/comments.pl?sid=207712&op=Repl y&threshold=0&commentsort=3&mode=thread&pid=169325 84 into your browser URL textbox, and only then maybe your point is valid.

      Perhaps you'll find it inconvenient, but if you can't find a way to enter (for example) Chinese characters into your system, chances are that you probably don't read it anyway, and the intended audience of that website is not you.

      --
      Don't quote me on this.
    102. Re:Changing a system by Fatalis · · Score: 1

      What's this? I've been able to use the Norwegian characters in domain names for a long time. There are screetshots over at http://en.wikipedia.org/wiki/Internationalized_dom ain_name

      Wrong. It doesn't work in IE6, try pasting an address from the Wikipedia article in it's address bar and see. Then try the same with Firefox, and see that it translates the address to some ascii garble starting with xn--.

      --
      Deus est fatalis
    103. Re:Changing a system by Anonymous Coward · · Score: 0

      ...since most of Asia uses the same kanji system...

      Who besides the Japanese uses this "kanji system?"

    104. Re:Changing a system by Hadlock · · Score: 1

      Indeed. The solution? Re-write the IPv6 protocol to allow non latin characters, and leave IPv4 alone. Kill two birds with one stone. Leave the existing IPv4 DNS servers working, and force the rest of the world to upgrade to IPv6, thus jumpstarting Internet 2.

      --
      moox. for a new generation.
    105. Re:Changing a system by dsanfte · · Score: 1

      Yup.

      --
      occultae nullus est respectus musicae - originally a Greek proverb
    106. Re:Changing a system by dasunt · · Score: 1
      So maybe I'm making a mistake with this, but I'm an Asian Languages and Literatures major. Korea, China, and Japan all use the same kanji. They call them different things, and they all have alternate written languages, but if you can read Chinese, you can read Japanese.

      My understanding of it is that there is a common subset of symbols used in both Chinese and Japanese. There are also symbols in Japanese that are not in Chinese (kokuji), and symbols with different meaning in Japanese (kokkun). (I wouldn't be surprised if the reverse was true.) There are also some symbols that have changed slightly when they came from China to Japan -- for example, the glyph for 'bone' has changed slightly. And sometimes Japanese and Chinese use entirely different symbols -- 'one' is different between Chinese and Japanese. Unicode "solves" this by relying on the user to have the correct font to display the symbols they are used to.

      In addition, there are 'kana' in Japanese, which could be considered the Japanese alphabet (I'm probably oversimplifying). These are decended from the Chinese symbols, but they are massively simplified. These aren't part of the Chinese language, and I suspect that someone who reads Chinese would be utterly puzzled when attempting to read Japanese kana. (Kana has two forms as well -- katakana and hiragana.)

      But Korea doesn't use Hanja (Chinese characters) that much anymore. Korea (South Korea, that is) uses its own alphabet called Hangul.

      *queue 'the more you know' music*

      PS: Feel free to correct me if I'm wrong, I'm not an expert.

    107. Re:Changing a system by adolfojp · · Score: 1

      Qué?

    108. Re:Changing a system by Anonymous Coward · · Score: 0

      Did you personally invent internet? It is ridiculous that such a small minority as english speaking persons define the one and only allowed character set. Character sets are not defined by any deity.

      The system does not work, there is absolutely no argument about that. For instance, several different meaning words must be compressed into a single word simply because one cannot use proper characters (think of peas written as piss). To get an idea how well this system works, you may imagine that you would type english without e, t and a. Do you really call this a working system?

    109. Re:Changing a system by identity0 · · Score: 1

      Right now, Japan, Mainland China, Taiwan, and S. Korea use similar characters descended from the ancient Chinese script which are different enough to make mutual comprehention very difficult. In addition, Korean and Japanese mixes the Chinese-derived characters with native characters not found in Chinese.

      The ancient Chinese character set was used throught China by people who had different spoken languages, like Mandarin and Cantonese. The symbolic nature of the characters meant that people who had different spoken languages could have the same written language, which would be pronounced differently but have the same meaning across China. This character set was then exported to Korea, Japan, Vietnam, and other neighboring countries, who adapted it to their own languages.

      In time, the Koreans and Japanese added different native characters, which were syllabic and easier for commoners to learn. The originally Chinese characters also changed a bit over time. Vietnam changed to a western-style script after it became a French colony.

      After WW2, China's communists won the mainland and the nationalists took Taiwan. On the mainland, the Chinese Communist Party instituted a massive change to the character set designed to simplify it and make it easier to teach to peasants. Taiwan, and the foreign Chinese populations like Hong Kong, Singapore, etc. continued to use the traditional characters.

      So at this point, there are at least 4 major Chinese-style character sets. Each has enough similarities with the other that many basic characters are the same. There are also many common complex characters between any two of them, however they usually have different common characters with the other sets, so it's probobly not possible to have one "East Asian" character set. That and the political difficulties in getting the countries to agree on one set would make it impossible for now.

    110. Re:Changing a system by Ben+Hutchings · · Score: 1

      Last I checked, they weren't proposing to stop you using Copy-Paste or following links from a Google results page.

    111. Re:Changing a system by Ben+Hutchings · · Score: 1
      Then there are characters that are technically equivilent but have different representations.

      IDN requires normalisation which deals with most such problems, but not the trickier one of similar-looking characters. Unfortunately that's font-dependent and isn't even specific to IDN - some fonts don't adequately distinguish I, l and 1, or O and 0, for example.

    112. Re:Changing a system by Ben+Hutchings · · Score: 1
      Its just a fact of life that the encoding scheme implemented has a limited set of characters that is readable by the technically adept people who built the thing.

      That problem has been solved by "Punycode" - see the links near the bottom of this page for examples (Slashdot won't let me post IDN URLs). However, the phishing threat from similar-looking characters seems to have stalled full support for IDNs; Mozilla disabled them by default after the paypal.com demo.

    113. Re:Changing a system by salec · · Score: 1

      Well, ... no. Quite opposite - how it looks, that's what it is. Single pixel should be thrown away as noise.

      The idea is that i.e. if you write in Cyrilic alphabet: H, E, R, O, or upper case Chi, Eta, Rho, Omicron in Greek alphabet and it shows written like "XEPO" it should *be* recognized as exact equivalent to X, E, P, O "XEPO" in latin.

      So, in the end, each machine would have internal inferred encoding of each graphic symbol it encounters, only not a predefined one. Of course, that would require training of UI (or "ghosting" initial configuration to new ones).

    114. Re:Changing a system by Fred_A · · Score: 1
      you want to register the max number of permutations for your brand name
      I know that this is the current trend (and has been for some time), but I still find it stupid as it exhausts the namespace, especially as in the Real World (tm) several businesses can have the same name in various places or even in the same place as long as it's in different areas of activity. I know that this has been said a million time before and there's no real solution, but the Spanish example given above just means that even *more* domains will now have to be registered to cover a brand. If only to provide a "universal" (i.e. latin) domain, since most keyboards have a latin mode even if the locale doesn't use it, so the Internet at large can access the domain, in addition to the "real" (i.e. localized) domain.

      Theoretically I'm pretty much in favour of a move to Unicode wherever possible. In this case I'm beginning to wonder if the drawbacks don't outweigh the advantages.
      --

      May contain traces of nut.
      Made from the freshest electrons.
    115. Re:Changing a system by joto · · Score: 2, Insightful
      How do I type in a character for a domain name that isn't supported on my keyboard?

      Why do you need to type in a character for a domain name written in a language you don't even understand the alphabet of, and certainly can't read or write?

      No matter what you do, I'm still limited to the keys on my keyboard. I think that's ~104 by last count. But I certainly don't use that many characters.

      And how do you think a chinese keyboard looks? Do you think they have hundreds of thousands of keys? There are three reasons why you can't enter chinese characters into your keyboard, and none of them has to do with hardware:

      1. You don't know chinese
      2. Your computer software may lack an input method for chinese text
      3. Even if you knew chinese, and your computer had an input method for chinese text, you still need to learn to type with it

      I admit that there are some people who are going to bitch about the internet being english.

      Yes, you are one of them. The people who want non-latin characters are not wanting them because they want to communicate with other english-speaking people on the Internet. They want them because they want to communicate between themselves, in their own native language. Imagine that you only had hebrew letters available for domain names in the US. The hebraic alphabet is relatively easy to learn, and most english words can be written in it. But it's cumbersome for english-speaking people to communicate with the hebrew alphabet. And that's why people speaking different languages than english, want to be able to write their domain names in different alphabets than english.

      But does that give me a right to bitch about classical music being defined in French and Italian terms like a fugue, sonata, adiago, allegro... I think not. In the past 400 years we've all managed very nicely to adopt to these terms in order to converse with each other an a common basis.

      Perhaps there are some terms that these anglicans can adopt from the middle east besides Jihad?

      Sorry, you are not making sense.

    116. Re:Changing a system by JohnFluxx · · Score: 1

      Surely alt gr+; e is easier: é

    117. Re:Changing a system by Raffaello · · Score: 1

      You bring up an interesting point. I think the best that ICANN can do is to have TLD specific conversion rules to and from Latin (an possibly other scripts). That is to say, each TLD (.es in this case) would define how it's unicode domain names would map to ascii/Latin. This way each nation could define their own domain name standard and others could enter these multi-byte characters directly or use the standard means of converting to UTF8/ascii/Latin specified by that country.

      Alternatively, each individual domain name owner could specify a second ascii/Latin domain name (just as they use now) rather than relying on a TLD specific conversion standard.

    118. Re:Changing a system by Raffaello · · Score: 1

      To be even more pedantic, naive is properly spelled with a diaeresis. See the Wikipedia article.

      A diaeresis causes what would otherwise be a single syllable diphthong to be read as two separate syllables (e.g., na-ive rather than rhyming with "knave" or "nave"). An umlaut is the German term for a diacritical mark which looks the same as a diaeresis, but has a completely different phonetic function. An umlaut causes the pronunciation of the vowel so marked to be shifted forward in the mouth. Unlike a diaeresis, an umlaut does not cause a vowel sound to become two separate syllables.

    119. Re:Changing a system by fbartho · · Score: 1

      don't have that modifier key on my keyboard.

      --
      Gravity Sucks
  2. What? by Aladrin · · Score: 5, Funny

    Wait, so it's not tubes... It's a 15 story building?

    Anyone else getting more lost every day?

    --
    "If you make people think they're thinking, they'll love you; But if you really make them think, they'll hate you." - DM
    1. Re:What? by Anonymous Coward · · Score: 0

      I think it's that the building is supported by tubes. Wait... no, it's supported by bricks in the basement. Um... so the tubes deliver por... I mean, spam throughout the building? Or maybe they have replaced elevators for transportation? I bet there's just one big waterslide from the top to the bottom.

    2. Re:What? by rubycodez · · Score: 2, Funny

      those that live in 15 story buildings made of glass tubes should not throw brick laptop power supplies.

    3. Re:What? by jmyers · · Score: 2, Funny

      No, there are only 15 stories about the internet that are just retold with slight modifications. One is about tubes, one about bricks, etc, etc, etc...

    4. Re:What? by morgan_greywolf · · Score: 2, Funny

      It's a 15-story building made of tubes and supported by a brick basement, on a flatbed truck headed down the information superhighway.

    5. Re:What? by UbuntuDupe · · Score: 1

      Well, it's not a truck, that's for damn sure.

    6. Re:What? by andphi · · Score: 1

      But the OSI model only has seven layers! Where did the other eight floors come from? Don't tell me they're redirected layer two to layer eight. Grandfathered IRQS are bad enough.

    7. Re:What? by Joebert · · Score: 1

      It works when you try thinking of them as shit bricks.

      --
      Wanna fight ? Bend over, stick your head up your ass, and fight for air.
  3. not the whole internet! by syrinx · · Score: 5, Funny

    It won't break the whole Internet! Just DNS. DNS is overrated anyway. Now if you'll excuse me, I need to finish reading all the new posts on 66.35.250.150.

    --
    Quidquid latine dictum sit, altum sonatur.
    1. Re:not the whole internet! by Aladrin · · Score: 1, Interesting

      And mail. And ... Hmm, yeah, the whole thing.

      Seriously... How many mail servers are going to freak out because they can't handle unicode?

      --
      "If you make people think they're thinking, they'll love you; But if you really make them think, they'll hate you." - DM
    2. Re:not the whole internet! by David+McBride · · Score: 1

      Philistine! I'm off to http://[2001:200:0:8002:203:47ff:fea5:3085]/.

      So nyeeer.

      (Unfortunately, SlashCode mangles IPv6 addresses, so don't bother clicking.)

    3. Re:not the whole internet! by chipster · · Score: 1
      +5 Funny?

      It'll be funny when you try putting an IP in a browser to access a site using name-based virtual hosts.

      Enjoy. ;-)

    4. Re:not the whole internet! by syrinx · · Score: 1

      pah! I don't use this newfangled HTTP 1.1 anyway.

      --
      Quidquid latine dictum sit, altum sonatur.
    5. Re:not the whole internet! by Ben+Hutchings · · Score: 1

      IDN uses a "Punycode" encoding which doesn't use any bytes outside the range currently used. In fact the only thing that makes IDN domains invalid in the old scheme is that they include two consecutive hyphens (they all begin with "xn--").

  4. Yes and No by Aadain2001 · · Score: 4, Insightful

    Yes, countries that use non-English characters should be able to interact with the rest of the world using their natural language. No, they shouldn't rush the change and risk a possible crash of a large portion of the Internet. Be patient young patawans, soon you will be able to have DNS names with any character you can think of, but it will be reliable and actually work.

    --
    Space for rent, inquire within
    1. Re:Yes and No by RobertCorsaro · · Score: 1

      You already can use illegal characters for DNS. Just don't use their system. Some of us have already become part of the undergroun.d.

    2. Re:Yes and No by Anonymous Coward · · Score: 0

      Patawans? d/t?

    3. Re:Yes and No by jamar0303 · · Score: 1

      I've already seen systems where Chinese characters can be directly used in the address bar. That was a browser add-on that only worked in IE, though.

      --
      OSx86 FTW
    4. Re:Yes and No by emmagsachs · · Score: 1
      As if the non-English speaking world is barred from using the internets now? It's just DNS, which to most people is nothing more than typing in English letters into the address bar. In which countries do keyboards not come with the usual Qwerty/Azerty/Qwertz Latin layout, plus a second and third character set?

      The resulting problems will outnumber any benefit from said revision, I reckon.

    5. Re:Yes and No by BJH · · Score: 1

      Works in Firefox right now. I can type http:///#35501;&%2322770;.jp in the address bar and I get redirected to http://www.yomiuri.co.jp/ just fine.

    6. Re:Yes and No by BJH · · Score: 1

      ...which should have appeared correctly in Japanese if Slashcode didn't fuck with it...

  5. Break the whole Internet? by GBWisc · · Score: 2, Funny

    Plans to fast-track the introduction of non-English characters in website domain names could 'break the whole internet', warns ICANN chief executive Paul Twomey

    Luckily for us, GWB knows that we have some redundancy with the Internets, so if one breaks we can just use another.
    1. Re:Break the whole Internet? by PFI_Optix · · Score: 1

      Thank goodness Al Gore built it in. But what if one of the tubes leaks and the basement floods?

      --
      120 characters for a sig? That's bloody useless.
  6. Maybe it's time to get rid of Bind? by Frosty+Piss · · Score: 1
    Given that some bind implementations still barf on an underscore, is this really premature?
    Maybe it's time to get rid of Bind? The Model-T of DNS implementations...
    --
    If you want news from today, you have to come back tomorrow.
    1. Re:Maybe it's time to get rid of Bind? by huguley · · Score: 0


      Right.... And use the wonderful MS DNS? The one that went against RFCs and introduced a non-standard character that horked up everything? I am sure MS would have this whole il8n thing sorted out in no time....

      What exactly would you replace bind with? Oh wait you don't care your just a troll.

    2. Re:Maybe it's time to get rid of Bind? by igb · · Score: 1
      Actually, back in the day bind _did_ tolerate underscores. I remember the anguish we had flushing machines called things like fileserver_one out the day that Vixie et al decided to enforce the standards. The DNS standards say [-a-z0-9], with dot as a delimiter. It's not the place for implementations to play hooky with that. For years there were hacks in bind to allow you to choose between accepting underscores in master zones (bad idea), secondary zones (quite bad idea) and recursive queries (sometimes unavoidable). Had the standards been followed in the first place this issue would never arise.

      ian

    3. Re:Maybe it's time to get rid of Bind? by Frosty+Piss · · Score: 1
      I am sure MS would have this whole il8n thing sorted out in no time....
      Who said anything about Microsoft? Not me...

      What exactly would you replace bind with? Oh wait you don't care your just a troll.
      There *are* non-MS implementations such as djbdns.
      • cr.yp.to/djbdns.html
      • http://en.wikipedia.org/wi ki/Djbdns
      --
      If you want news from today, you have to come back tomorrow.
  7. That would be a good reason to get the UN in by aadvancedGIR · · Score: 2, Funny

    The ICANN tries to give a technical reason to a political problem, although this reason may be valid, it is not a very good idea. With the UN, it will be handled by international comitees and we will all be long dead before they finally agree on which country will be in that comitee.

  8. Late in coming? by grasshoppa · · Score: 2, Insightful

    Perhaps, but I can't fault ICANN for this one, as much as I might like to. Like it or not, most internet technologies have their roots in latin speaking countries, which means systems developed there may not be tweaked to work with outside language schemes.

    If the fault lies with anyone, it's with the individual contributers of the tech. Or better, with the non-latin countries appearent lack of interest in some of the core projects needed to push this through ICANN ( specifically DNS, httpd ).

    --
    Mod me down with all of your hatred and your journey towards the dark side will be complete!
    1. Re:Late in coming? by radja · · Score: 1

      although you're right, many of the latin-alphabet countries have at least some 'unique' letters, mostly ligatures, like the german ringel S(long S/short S ligature) and the dutch 'long y' (i/j ligature and different from the y). many of these letters don't appear in the alfabet itself.

      --

      No one can understand the truth until he drinks of coffee's frothy goodness.
      --Sheikh Abd-Al-Kadir, 1587
    2. Re:Late in coming? by 1u3hr · · Score: 1
      Like it or not, most internet technologies have their roots in latin speaking countries

      Yes, the Vatican State, back in the MCMLX's.

    3. Re:Late in coming? by Anonymous Coward · · Score: 0

      "with the non-latin countries appearent lack of interest"
      The domination of English on the internet is not due to the fact that only English-speaking people are interested in technology, is it? The latin alphabet is dominant because the technology was developed in the west. The reason for this is not that other countries weren't interested, but because western Europe and northern America has WAY more resources.

      And now for something completely different: the three unique Scandinavian vowels Æ Ø Å (not sure if anyone from outside Scandinavia will see these characters properly) can be used in domain names, though some browsers need extra support to avoid being confused by them. Even though they are closely related to it, they are not really a part of the standard Latin alphabet, but they work just fine.

    4. Re:Late in coming? by Anonymous Coward · · Score: 0

      Like it or not, most internet technologies have their roots in latin speaking countries

      Did the trolls go around saying "Primoris stipes" and "Vos deficio is"

    5. Re:Late in coming? by T.E.D. · · Score: 1
      Like it or not, most internet technologies have their roots in Latin speaking countries


      I believe you meant to say something like "in countries that use a Latin-derived script". The only "Latin-speaking" country left is the Papal State, and I think the pope's been too busy with matters of faith the last few decades to spend a lot of time developing internet technologies.

      There are quite a few countries around that speak languages descended from Latin, but I think a lot (if not most) of the development was done in countries that speak Germanic languages instead. (eg: English and German)
    6. Re:Late in coming? by Grishnakh · · Score: 1

      Or better, with the non-latin countries appearent lack of interest in some of the core projects needed to push this through ICANN ( specifically DNS, httpd ).

      I agree. If the non-latin-character countries want this capability, why don't they develop the software to do it themselves instead of complaining about it? BIND and all the other internet tools are open-source.

  9. When you've built on a foundation of straw- by Bonker · · Score: 4, Insightful

    - Don't be too surprised when people around you start building their own houses rather than choosing to pay rent.

    DNS upheaval has been a long time coming, and the current anti-American sentiment worldwide isn't exactly helping to stabilize it. We're already seeing all sorts of adhoc routing setups that deal with shortcomings of an ameri-centric DNS. My guess is that within the next few years, ICANN's 'control' of the internet will be in name only as everyone else in the world will have moved on to alternative routing and domain systems.

    --
    The next Slashdot story will be ready soon, but subscribers can beat the rush and slashdot the links early!
    1. Re:When you've built on a foundation of straw- by Ingolfke · · Score: 1

      Cool... sounds like a good solution to me. If someone can develop a better system that works for people who WANT to use it great.

    2. Re:When you've built on a foundation of straw- by Aadain2001 · · Score: 1

      I think you are confusing anti-American GOVERNMENT sentiment with anti-American PEOPLE sentiment. Oh, and don't forget, we built the Internet. We were there first. We laid the groundwork and did the first R&D. It was only later that other countries started to get involved. And at any point in those phases, they could have suggested these changes. Instead, they wait until the house it built and everyone else it hanging their family pictures to complain about the choice of land and demand everyone start over. The changes will come, but they will have to be slower and more tested now since they didn't speak up in the beginning.

      --
      Space for rent, inquire within
    3. Re:When you've built on a foundation of straw- by t0tAl_mElTd0wN · · Score: 3, Insightful

      I think that might be jumping the gun. American or not, the internet plays a huge role in the functionality of the modern world. Just imagine the chaos if international office networks went from "I can't open this word document you sent me because it's in a different format" to "I can't get email from you because you're on a different internet". American DNS control or not, decentralizing the internet like you suggest might happen could be one of the worst things that could happen for global communications.

    4. Re:When you've built on a foundation of straw- by archen · · Score: 1

      Seems like a good time to hype IPv6. These ancient bind implementations probably can't resolve in the IPv6 address space anyway. So all we need to do is give country X an address block of XXXX. The main reason I point out IP blocks, is due to the fact that if a country has a domain name of citybank.com (with some crazy characters that appear the same), and the ip block is from a country that may be suspicious, then it allows phishing implementations to warn people from various countries.

      And lets face it, in order for this stuff to work, the DNS portion is going to need a complete overhaul anyway. Really all you need is a translation table based system for character 'X' means 'xfg' in the legacy DNS entries - even if it makes no sense and doesn't form words. It sounds like a huge pain in the ass, but if these countries really want to do this (and I'm assuming they do) - then it's probably time we accommodate them before this grows into some screwed up system with no standards.

    5. Re:When you've built on a foundation of straw- by benoitg · · Score: 3, Insightful

      Please, there have been complaints about DNS not supporting most language's (even latin) character sets since the birth of the web, so it's completely untrue that we waited till everything was built. After well over a decade of patient waiting, it seems that actual pressure was required to get this change through.

    6. Re:When you've built on a foundation of straw- by IHateAllofYou · · Score: 1

      I'd mod you up but I don't have any points :p

      The "american-centric" DNS system serves as a standard just like he said. There's no american conspiracy to keep any culture from using their traditional characters for DNS names it simply wasn't built for that. I personally can't wait for the day when I can get DNS in characters I've made up using my etch-a-sketch.

    7. Re:When you've built on a foundation of straw- by Trespass · · Score: 1

      Bitching and moaning about something is far easier than building a working alternative.

    8. Re:When you've built on a foundation of straw- by davidsyes · · Score: 1

      FORKIN' AYE!

      Personally, if China "forks" the Internet, then good for them. The country is burgeoning with people clamoring for access to information from everywhere, whatever the source. At some point, if lucky, some of them will become entrepreneurs. At the very least, a few hundred thousand new DNS entries per day could ensue. Why should ICANN be the gatekeeper of OTHER NATIONS' domestic registries? When those registries are to be sought and interacted with, why should ICANN solely control the registries. Domestically, I am sure a country can wall off itself, but internatonally, it could find itself isolated, unless it very, very technically and successfully side-steps ICANN and ends its single-source control. Does it still make sense to have only ONE sole source for TLD access? (Aside from "TLD")...

      But, I suppose that if China DOES fork things, the US would probably "retaliate" by domestically blocking any and all in and outbound traffic using the "rogue" DNS apparatus. I know that if I had an information addiction that was fed by a rogue DNS or TLD system and the US politically cut it off, I'd be PISSED. Anyway, I feel it is only a matter of time before numerous countries and locales join up and for the thing. I suspect it would be instigated by the current anti-US sentiment that most likely won't be going away any time soon. Besides, a forked Internet would just make things more interesting. Might even create a few thousand more domestic IT/programming jobs.

      --
      Previously: "Linux... Toward the Sunrise..." Now: "Linux... Toward the-- No, now, part of Every Sunrise"
    9. Re:When you've built on a foundation of straw- by Joey+Vegetables · · Score: 1

      There has long been worldwide distrust and hatred of the U.S. government, apart from the people. However, when the U.S. people consistently vote for politicians who expand the government's aggression against the rest of the world, then some of the people most badly affected by that aggression - particularly, but not exclusively, Arabs and Muslims - tend to begin to blame the people as well. That isn't entirely fair, as we honestly don't have a lot of choices, nor a system that encourages choice. And most of us frankly have no idea what awful suffering we inflict on people elsewhere, especially in those nations with whom we have had declared or undeclared wars. I'd like to think we would act and live differently if we did. But to the admittedly limited extent to which we govern ourselves, we are responsible for the actions of the government, and will be perceived accordingly by those elsewhere.

  10. Stupid question by VENONA · · Score: 3, Insightful

    "Given that some societies have used non-Latin characters for thousands of years, is this a bit late in coming?"

    No.

    Zonk either knows zero about the histories of the Internet or DNS, or is so enamored of finishing stories with questions that he'll tack on the truly ridiculous.

    --
    What you do with a computer does not constitute the whole of computing.
    1. Re:Stupid question by Ingolfke · · Score: 1, Offtopic

      Read the news. Is organized religion currently a net win, or a dead loss?

      I like your sig... it's just not accurate. You've focused to much on a particulary component of the larger problem and have failed to recognize the actual whole of the issue. Here's a correct understanding of the problem.

      Read the news and some history. Is organized humanity currently a net win, or a dead loss?

  11. Watch out for attacks by Agelmar · · Score: 5, Insightful

    For all you people saying "There's no problem, just do it" - I say watch out... there will be a rush of attacks and spoofs as soon as this is opened up. The letter "a" appears in the unicode character set multiple times, and some of the variants are almost indistinguishable. I'm not just talking about someone registering släshdot.org, I'm talking about someone reigstering slashdot.org (the a is FF41 instead of the normal a). Good luck telling the attacks appart from the real sites.

    1. Re:Watch out for attacks by gsasha · · Score: 5, Informative

      It's called a "Homograph Attack". See http://en.wikipedia.org/wiki/IDN_homograph_attack

    2. Re:Watch out for attacks by archeopterix · · Score: 1
      I'm talking about someone reigstering slashdot.org (the a is FF41 instead of the normal a).
      Yikes! You almost tricked me into thinking it's 0430!
    3. Re:Watch out for attacks by tonigonenstein · · Score: 2, Insightful

      As a human you might be fooled, but a well designed browser could tell the difference and alert you. So this shouldn't be a problem.

      --
      The sooner you fall behind, the more time you have to catch up.
    4. Re:Watch out for attacks by Anonymous Coward · · Score: 0

      You seriously underestimate the difficulty of what you propose. Please, tell me more about this design...

    5. Re:Watch out for attacks by Anonymous Coward · · Score: 0

      Yeah, well, that wouldn't be a problem if unicode wasn't broken.

      I mean they started out to create an encoding for characters and immediately created an exception for lower case (which ultimately is just a different glyph for the same letter - but okay, I can get that one).

      Then they added loads of duplicates to "facilitate" processing by computers. Like roman numerals. Because you see "when everything's encoded properly" (read: never), computers will have it easier understanding that it really is a number. Or to facilitate CJK back and forth translation. Oh - and because CJK was broken too and encoded half width and double width latin glyphs (ie, formatting) as different characters, we got that too!

      Then there's the accents... You could have unified separate accents - pro: one simple, straightforward and versatile encoding, con: it's a bit harder to process because you got 2 characters glyphs, and in some language an accent makes for a different character (but that didn't stop them before..). Or you could have different characters for all the accented options - pro: simpler to process, con: that's loads of them, and in some language, an accent doesn't a character make. In their infinite wisdon, they chose neither.. So now we've got this monstrosity which has all the inconvenients and none of the advantages.

      I understand countries asking for native dns encoding, I also understand that it's not simple. But it would have been much easier if unicode wasn't such a broken mess.

    6. Re:Watch out for attacks by HiThere · · Score: 1

      I seem to remember that this was tried a couple of decades ago, and there was an immediate attack based on substituting a Russian character that looks like a "c", so it was recanted. Fortunately, it was still so new that it was easy to recant. At the time there was talk about possible fixes, but I don't think an agreement was ever reached on how to proceed.

      OTOH, I was definitely FAR out at the periphery, being a mere /. reader.

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    7. Re:Watch out for attacks by psydeshow · · Score: 1

      Fairly straightforward solution: don't allow mixed character sets. Your proposed domain name includes characters from two different alphabets, therefore it is illegal and un-registerable.

      See, that's not so bad, is it?

    8. Re:Watch out for attacks by Agelmar · · Score: 1

      Actually, it is bad. See my previous replies as to why you would want to allow mixed sets (hello.jp), plus the fact that you don't need to leave the CJK sets to get roman characters, as they're duplicated in multiple sets all over the place in Unicode.

  12. Sure, go 'head by kahei · · Score: 4, Insightful


    I'd be in favor of the change just because anything that undermines the Unix Tower of Babel -- the dependency on ASCII which complicates text handling sooooo much even when Windows solved the problem soooo long ago -- is good. Even Java gets it. Even Apple (finally) get it. Unix Is Teh Problem.

    And the ASCII problem isn't just bad because it forces people to use inefficient encodings like UTF-8 (THREE bytes per character?) It's bad because it allows people to write code like:

    if(string[index] == '.' || string[index] == '?' || string[index] == '!') sentenceEnd = true;

    (a line repeated, with subtle variations, several hundred times in the code of a certain ubiquitous editor).

    And, lo and behold, the above does not work, but once it appears in a few thousand places it's impossible to fix, and a vast towering structure of fixes made by people who don't really understand why it's an issue is built.

    So, even though the proposed change would be hugely inconvenient for a huge number of people, I'm in favor, because I want the world to grow the fork up and understand that text != byte array some time while I'm still alive.

    --
    Whence? Hence. Whither? Thither.
    1. Re:Sure, go 'head by reed · · Score: 1
      ... the dependency on ASCII which complicates text handling sooooo much even when Windows solved the problem soooo long ago ... inefficient encodings like UTF-8 (THREE bytes per character?)


      What the hell are you talking about??

    2. Re:Sure, go 'head by SilentGhost · · Score: 1

      why on earth did you decided that world needs to understand smth.? it needs simple and reliable system to work with, and ascii completely fulfils this role for this particular task. Let the world burn for sake of Arabic alphabet? nope, thanks

    3. Re:Sure, go 'head by wumpus188 · · Score: 1

      UTF-8 is variable length encoding and uses one to four bytes per character. ASCII will fit in just one byte, Latin, Cyrillic, Greek and likes...
      Maybe you should stop talking out of your ass and learn something first.

    4. Re:Sure, go 'head by tokul · · Score: 1
      And the ASCII problem isn't just bad because it forces people to use inefficient encodings like UTF-8 (THREE bytes per character?)

      Other side of story. Some people use efficient character sets like GB2312, Big5, euc-jp. These efficient character sets are very difficult to parse, because you have to parse text byte after byte. Code can't see difference between first and second byte.

      UTF-8 is variable length charset. One symbol can take from one to six bytes.

      It's bad because it allows people to write code like:
      if(string[index] == '.' || string[index] == '?' || string[index] == '!') sentenceEnd = true;
      Use regexp or switch.
    5. Re:Sure, go 'head by Anonymous Coward · · Score: 0

      can someone mod the above poster down as "-1 idiot"? You have to wonder about the technical knowledge of the people who modded this interesting. UTF - variable length character set. Obviously if you want to encode the world's character sets you need more than 8 bits. UTF (which I think comes from the Unix world from Bell labs) is an elegant solution to the problem

    6. Re:Sure, go 'head by davidwr · · Score: 1

      if(string[index] == '.' || string[index] == '?' || string[index] == '!') sentenceEnd = true;

      If that code is replicated throughout with subtle variations, it sounds like someone really needs to learn to use functions or macros.

      You aren't the only one to read code like that. Sadly, I've probably written some of it.

      --
      Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
    7. Re:Sure, go 'head by kahei · · Score: 1


      To clarify, I realize that UTF-8 isn't necessarily three bytes per character. It's usually three bytes per character for me, but it can be as few as 1 and as many as 7 -- which is a lot less efficient than even a constant 7 bytes would be, for many tasks. Of course, if you then handle the UTF-8 with software that follows the Unix Way, and treats strings as byte arrays -- I'm looking particularly at Ruby here but there are a lot of programs to be named and shamed -- then even collecting together a whole character as opposed to a byte may involve a layer of code that you have to worry about.

      So, your first step is to convert it to UTF-16, or, more likely in reality, 'UCS-2'. Now it has zero bytes in it. CRASH! goes the Tower of Babel, which likes things to start at 1 -- much in the manner of VB5 programs.

      --
      Whence? Hence. Whither? Thither.
    8. Re:Sure, go 'head by hclyff · · Score: 1

      You would speak otherwise if you were ever making a localized application with anything more than Western European character sets. That means Eastern European characters, Cyrilic, Hebrew, Arabic, Hindu, Japanese, Chinese, Thai and countless others.

    9. Re:Sure, go 'head by hclyff · · Score: 1

      Or maybe it's just my sarcasm meter broken.

    10. Re:Sure, go 'head by FrostedChaos · · Score: 1

      All engineering is a tradeoff. In the case of UTF-8 versus UTF-16, UTF-32, Shift/JIS, etc., the tradeoffs mostly involve compatibility, space, and speed.

      As you've noticed, for non-English languages, UTF-32 often takes up less space in memory than UTF-8. However, converting applications to use UTF-32 usually requires an expensive rewrite. Also, UTF-16 and UTF-32 have endianness issues. Basically, in order to achieve real interoperability, you must either mandate that one endianness or the other be used (which slows down the CPU if it has the "wrong" endianness), or tell programmers to plop in conversions in wherever they feel appropriate (compatibility hell, BOMs.)

      UTF-8 sidesteps all of those endianness issues by being a byte-oriented standard. It also allows existing C code to work well with foreign text. You may look down your nose at C code, and admittedly some of it is bad, but it is not going away any time soon. Both the Linux kernel and the Windows kernel are written in C, and there are no plans to change! Keep in mind that the computer industry is dominated by compatibility concerns. There are still people hacking FORTAN or COBOL code from the 70s. Telling people they need to throw away their own code is like telling people they need to stop using gasoline engines and start using your new, slightly better petroleum fuel. People will nod, smile and keep on doing what they were doing. Sorry, Charlie, it's called installed base!

      UTF-8's only disadvantage, the space issue, is overhyped anyway. It's very likely that Asian or Eastern European text encoded in UTF-8 will compress very well with gzip or another free compression algorithm. Also, English text, the common case for most people, does better with UTF-8 than under any other encoding! No matter how you feel about England or the United States, you cannot deny that English has become the new lingua franca of commerce, and most organizations support it in one way or another.

      Also, text is a very small part of the total size of most applications. If every string constant in Linux started taking double the size, would anyone even notice? 18 months later, hard disk sizes and memory densities would double, and it would be a big yawn for everyone involved. On the other hand, rewriting every piece of software from the ground up, which is basically what you're proposing with UTF-32, is an enormous and ongoing cost.

      As for your comments about UNIX... those who don't understand UNIX are doomed to reimplement it, poorly.

      --
      "Any connection between your reality and mine is purely coincidental." -Slashdot
    11. Re:Sure, go 'head by DragonWriter · · Score: 1

      And the ASCII problem isn't just bad because it forces people to use inefficient encodings like UTF-8 (THREE bytes per character?)


      UTF-8 can actually take up to four bytes per character (or 6 using the apparently-common, from the Unicode FAQ, but nonstandard, conversion of characters that take "surrogates" in UTF-16 into pairs of three-byte sequences instead of single four-byte sequences in UTF-8.)

      But ASCII dependency has little to do with the number of bytes required, representing lots of characters does; if you can find a way to communicate characters in a 21-bit character set with an integer number of bytes less than 3 in all cases, that's pretty impressive.

      UTF-8 is actually particularly efficient because of ASCII compatibility, when the data is mostly latin text, since in that case it is mostly 1 byte per character.
  13. Once again we are just western savages... by Anonymous Coward · · Score: 0

    > Given that some societies have used non-Latin characters for thousands of years, is this a bit late in coming?

    Had Hammurabi stored his laws on silicon instead of stone then perhaps there would be a point to that question.

  14. Can't trust your browser's address bar anymore. by Anonymous Coward · · Score: 2, Interesting


    Unicode has many characters that look almost exactly like characters in Latin-1.

    For example, if "www.microsoft.com" is shown in your browser's address bar, how would you know for sure that the "c" is not from the Cyrillic alphabet, or the "o" is not from the Greek alphabet?

    You simply won't be able to trust your browser's address bar anymore. The possibilities for phishing attacks are endless.

    1. Re:Can't trust your browser's address bar anymore. by NoMoreNicksLeft · · Score: 4, Insightful

      Why not have the browser fail to render them outside of the user's preferred alphabet?

      Cyrillic users would see www.**c******.com, latin users would see www.mi*rosoft.com?

      Or better yet, put up a big warning that it's using mixed alphabets?

    2. Re:Can't trust your browser's address bar anymore. by Anonymous Coward · · Score: 0

      For example, if "www.microsoft.com" is shown in your browser's address bar, how would you know for sure that the "c" is not from the Cyrillic alphabet, or the "o" is not from the Greek alphabet?

      Because that's not what I typed in? Because that's not what my bookmark points to?

      The trust issue only arises when you follow links from untrustworthy sources. The only people who'll get caught out by this are the people who click links to PayPal in spam emails and check the address bar to make sure they really are on PayPal. I don't think there's that many people doing that, do you?

    3. Re:Can't trust your browser's address bar anymore. by reed · · Score: 2, Insightful
      Or better yet, put up a big warning that it's using mixed alphabets?


      In general, browsers ought to make users more aware of the parts of their current URL, and maybe also of link destinations (also mail client).

      For example, seperate the URL into its parts (scheme, host, path). Display some of the WHOIS info below the hostname, and some info from the SSL certificate if it has one.

      This would help people spot phishing scams or other suspicious activity.

      Reed

    4. Re:Can't trust your browser's address bar anymore. by Srin+Tuar · · Score: 2, Insightful

      Thats a good start.

      Registrars shouldnt accept such names in the first place though: Is there a valid reason to ever have a domain name with stray characters mixed in from different languages?

      If a standard were to specify that a domain name must use a subset of unicode that is self-consistent, and that browsers should turn the address bar red to warn anytime a domain uses characters not in the users selected languages subsets, that would go a long way towards minimizing the phishing problem.

      There would still be issues between users of the same orthography, but in general there is no way to prevent phishing style attacks completely, which fundamentally rely upon people to be careless. Even the current DNS system is vulnerable:
      spoofing "cnn.com" with "cnn-news.com" or "cnn.newsnetwork.com" doesnt need i18n support to work at all.

    5. Re:Can't trust your browser's address bar anymore. by Anonymous Coward · · Score: 0

      It is quite common for non-Latin companies to use Latin in their names. For example, the Japanese site for Sony's PSP is full of Japanese characters and "PSP". Odds are their international domain name would be something like [Japanese]PSP[Japanese].jp, or something like that. And Coca-Cola in Isreal would probably have a domain name like Coke[Hebrew].il, where the Hebrew is written from right-to-left rather than left-to-right like Latin is.

      While you have a good idea, it is not a perfect solution because many people not only have multiple scripts available, but would expect to see multiple scripts in a single domain name.

      dom

    6. Re:Can't trust your browser's address bar anymore. by psydeshow · · Score: 1

      Or you could require all of the letters in a given sub-domain to be in the same alphabet.

      This would allow [Japanese].PSP.[Japanese].jp without mixing character sets and setting us all up for phishing schemes.

    7. Re:Can't trust your browser's address bar anymore. by Bogtha · · Score: 2, Informative

      Is there a valid reason to ever have a domain name with stray characters mixed in from different languages?

      You're assuming that characters belong exclusively to one language. Try telling a French guy that he can't register café.com because 'c' 'a' and 'f' are English, not French.

      --
      Bogtha Bogtha Bogtha
    8. Re:Can't trust your browser's address bar anymore. by pablo.cl · · Score: 1
      'c' and 'f' are Latin and English and French. 'é' is French and Spanish. 'ü' is German and Spanish. 'ß' is German. 'ñ' is Spanish.

      So Müller can be regarded as Spanish, or pingüino could be valid in German, but Eßpaña isn't valid anywhere.

  15. punicode? by Speare · · Score: 1

    Whatever happened to Punicode (Unicode in a special dns-characters-only encoding format)? There was some hoopla about the scheme, which would require browsers to show punicode-encoded URLs in the appropriate characters on the screen, but some naysayers said that it was a phisher's dream since many glyphs throughout Unicode looked alike. I figure this issue has nothing to do with Unicode per se, but with phishing vs certified sites in general, but I haven't heard a peep from the Punicode camp for over a year.

    --
    [ .sig file not found ]
    1. Re:punicode? by BeeRockxs · · Score: 1

      AFAIK it's used already for domain names to contain non-ascii characters.

  16. It's a little late, don't you think? by krell · · Score: 1, Funny

    Prince no longer goes by that strange symbol as his name anymore.

    --
    Where were you when the voynix came?
  17. Woot! by Anonymous Coward · · Score: 0

    Domain names in Tengwar, Yay!

  18. Threat to Google? by Anonymous Coward · · Score: 0

    So, how does Google work in an international Internet? If each content contributer is submitting in their native language, will I be able to search for terms anymore?
    I don't think English SHOULD be the default language, but there is certainly some advantages to one language for all content. Related to that, weren't all computers susposed to be using Japanese or something by this point? There was a prediction "back in the day" along those lines for a while, something about that character set being more efficient for machines to parse...

  19. URL goldmine. by emmagsachs · · Score: 4, Insightful
    Imagine the land rush that'll ensue if DNS will allow non-Latin characters. Trademark transliteration ? A heaven for domainsquatters and an upcoming surge of legal fees for trademark lawyers, if you ask me.

    Nice for localising, sure, but how usable will Japanese, Indian, or Arabic script URLs -- for example -- be for those who do not have access to the respective sets or keyboard layouts?

    1. Re:URL goldmine. by dodobh · · Score: 1

      http://en.wikipedia.org/wiki/List_of_Indian_langua ges_by_total_speakers

      With ~ 350 million English speakers, and a literacy rate of 65%, that's 400 million people who can't use the current form of the DNS. About as big a population as the US.

      Just because it isn't useful to you doesn't mean it isn't useful to anyone else.

      --
      I can throw myself at the ground, and miss.
    2. Re:URL goldmine. by dkleinsc · · Score: 1

      From the domain registrar's point of view, what's the problem?

      --
      I am officially gone from /. Long live http://www.soylentnews.com/
    3. Re:URL goldmine. by emmagsachs · · Score: 1

      By your own argument, even if DNS was revised to include the Indian languages, the illiterate 35% of the population would still be unable to use the internets, 'cause whether English or Hindi, they still can't read it.

      Of course, I forgot the manifold uses the illiterate have for the Web.

      This isn't second language acquisition, but a case of pressing symbols on a keyboard. I know non-English speaking -- let alone reading -- children that can manage such a feat.

      You were saying -- ?

    4. Re:URL goldmine. by DragonWriter · · Score: 1
      Nice for localising, sure, but how usable will Japanese, Indian, or Arabic script URLs -- for example -- be for those who do not have access to the respective sets or keyboard layouts?


      The only case they'd be less usable is if someone didn't have access to the appropriate input method and was getting the URLs in some non-electronic form where they couldn't, say, cut-and-paste into the browser address bar or wherever else they needed the URL.

      Frankly, I think this is pretty much a non-problem.

    5. Re:URL goldmine. by dodobh · · Score: 1

      While the people in India work to solve that little problem of illiteracy, why ignore the needs of the 400 million people who can actually use the Internet to do things, and possibly generate enough local demand for technology that jobs get exported back to the US?

      The problem isn't merely of pressing buttons on a keyboard, the problem is with interacting with a computer in the first place. And it isn't merely domains in a browser, what do you do about other things relying on DNS, including VoIP?

      --
      I can throw myself at the ground, and miss.
  20. compounding one mistake with another by tverbeek · · Score: 2, Insightful

    Of course it's late in coming.

    But that doesn't mean it should be done hastily and badly.

    --
    http://alternatives.rzero.com/
  21. why???? by Anonymous Coward · · Score: 0

    why would you want a domain name the biggest users (and hence customers) on the internet worldwide cannot type on their keyboard.

    As there is no current danger of the current DNS address space running out...
    sticking to ASCII seems sane to me but then again this is a political not a technical problem....

  22. Caprican and Tauron letter systems by krell · · Score: 1

    "One source of the pressure was Adama..."

    And he will not rest until the script of each of the 12 Colonies is properly represented with ICANN. I hear he's not too keen on Cyrillic, however.

    --
    Where were you when the voynix came?
  23. Re:Make 'em all speak english by TrappedByMyself · · Score: 1

    Many of them actually do speak English

    --

    Help me take back Slashdot. When did 'News for Nerds' become 'FUD and Conspiracy Theories for Extremist Nutjobs'?
  24. English, not latin languages by pubjames · · Score: 2, Insightful

    Given that some societies have used non-Latin characters for thousands of years, is this a bit late in coming?

    Let's be clear. The domain name system only uses English characters. There are lots of languages in Europe (Italian, Spanish, French...) which are closer to latin than English (which isn't really a latin language at all) which are not currently represented, because you can't use accents in domain names, or other letters such as the spanish Enye (n with a squiggle, actually a distinct letter). English speakers often think accents aren't important but they can completely change a word's meaning.

    1. Re:English, not latin languages by brusk · · Score: 3, Insightful

      True, but the English subset of the alphabet has another feature that matters in this regard: it's a lowest common denominator that all computers on the planet are capable of producing. I can type any letter easily on a computer in China, Israel, Jordan, Russia, Spain, India, etc. I can't necessarily input a given Chinese character, Arabic letter, or Cyrillic letter.

      Why does this matter? Well, one argument is that it doesn't, much: if I want to view a Chinese website I'm probably in China and can input Chinese characters on my computer. But what about a Chinese person visiting an English-speaking country and surfing at a public computer (e.g. in a web cafe)? If the computer isn't set up for input of Chinese, he/she won't be able to view certain sites if they can only be accessed by inputting a non-latin URI. Thus to serve all possible customers, the computer would need dozens of input systems installed. That simply isn't going to happen. The alternative of just inputting Unicode codes is unworkable.

      Hence it makes more sense to have a requirement that any non-Latin DNS registration ALSO be accompanied by a pure ASCII one, so that any computer will be able to access it. This also helps people who don't know a given language very well: if you don't know Chinese well, and are just learning it, you may find it hard to type in a web address with unfamiliar characters, even if your computer has Chinese input enabled. That shouldn't keep you from visiting a site.

      In fact, there are some Chinese systems that do this, by creating a registry of Chinese names for websites. But they involve kludgy workarounds like browser bars that are not universal and are otherwise evil.

      --
      .sig withheld by request
    2. Re:English, not latin languages by kamapuaa · · Score: 1
      . English speakers often think accents aren't important but they can completely change a word's meaning.

      Yes, I am an English speaker, and throughout the day I often stumble across the recurring idea that accents are of no particular use to determing the meaning of a word. I would go so far as to say that I often think accents just aren't important. I'm glad Slashdot has you around to set things straight.

      --
      Slashdot: providing anti-social weirdos a soapbox, since 1997.
    3. Re:English, not latin languages by syrinx · · Score: 1

      Latin *characters*, not the Latin language. The whole point of alphabets is that you can use them to write different languages. It's irrelevant that English isn't really a Romance/Latin language.

      Though, the later Latin alphabet only has 23 letters... if we're really sticking to that, you can't use U, J, or W. :P So they really do mean the English alphabet.

      --
      Quidquid latine dictum sit, altum sonatur.
    4. Re:English, not latin languages by pubjames · · Score: 1

      Well, I mentioned it because I have some experience with languages and have found it to be the case that some English speakers think they are not important. Sorry if that annoys you.

    5. Re:English, not latin languages by wolfsrudel · · Score: 1

      I guess it doesn't matter to you simply because English if the only language you're able to understand. The world stretches a bit farther than English-related stuff...accents do matter ;-) BTW, the main purpose of encoding *Latin* is simply that it's much easier to implement than multibyte encodings.

    6. Re:English, not latin languages by LoonyMike · · Score: 1

      In Brazil for instance, there are domain names with accented chars. I'm not sure if this is some sort of tweaked usage, though.
      Go to this page and search for the domain (www.)pãodeaçúcar.com.br - it works.
      If you try removing accents from only some of the letters, the query still works, so probably the browser or the DNS server converts each letter to an non-accented one before performing the query.

    7. Re:English, not latin languages by jargon82 · · Score: 1

      But of course! Think of the impact. How would you have www, without w?

    8. Re:English, not latin languages by faraway · · Score: 1

      Accents don't matter as much as you claim. In fact, one can easily deduce a word from context even when dropping most of the vowels from it. They do it in Arabic. Accents can change the meaning of a word, but are not entirely necessary.

    9. Re:English, not latin languages by pablo.cl · · Score: 1

      http://vvvvvv.blox.pl/html/, that's 6 "v"s.

    10. Re:English, not latin languages by Anonymous Coward · · Score: 0

      Really, I doubt very much you have ever once in your life heard somebody say that accents don't matter. -kamapuaa

    11. Re:English, not latin languages by Anonymous Coward · · Score: 0

      Why does this matter? It matters for the one and only reason which you mention: you. You can type, you can do, it does not matter whether chinese or arabic can do or not, but you.

      I bet you are an American, nobody else can be so completely arrogant of other languages and cultures.

      The chinese visiting cafe here is a fallacy: s/he would easily get to the page by google, should the owner of the server allow it. And if he were able to use the non-chinese input system.

      Last: "not going to happen" is just because people like you.

  25. Not a trivial job by turnipsatemybaby · · Score: 4, Insightful

    The internet was originally conceived, designed, and implemented in the USA at a time where hardware was at a premium, and corners were cut to conserve that limited resource. DNS was just one of the results of that era. However, it is the most visible because it is the front end means for people to find each other. That means there is now a very well established standard, used by people across the entire globe, that is very difficult to change.

    Changing all the DNS servers in the world to switch from ASCII to Unicode is NOT trivial. The fact that some societies have used non-latin characters for thousands of years is completely and utterly irrelevant. THEY didn't make the internet. They simply bolted themselves on to an existing infrastructure.

    I agree that progress needs to be made to accomodate non-latin characters, but to have people whining about "how they want it, and want it now"... That's just ridiculous. It's like waltzing into a house that was built 40 years ago and having a tantrum because the stairs are too steep and the house is too squished. Major structural renovations take time, effort, and careful planning. And there is nothing you can do to avoid that, short of implementing cheap stop-gap measures that are virtually guaranteed to cause even bigger unintended headaches later on.

    1. Re:Not a trivial job by Adam+Hazzlebank · · Score: 1
      Changing all the DNS servers in the world to switch from ASCII to Unicode is NOT trivial. The fact that some societies have used non-latin characters for thousands of years is completely and utterly irrelevant. THEY didn't make the internet. They simply bolted themselves on to an existing infrastructure.
      Yea, bloody towel heads coming on to our INTERNET. Stealing our jobs and our women! Let them make their own Interweb in sandnigger language. MAKES ME MAD!

      Seriously, you need to sort yourself out and adopt a slightly less racist attitude. It's not "your Internet", it's a standardised global network. These countries are asking for an technical problem which I imagine has significant economic impact for them to be addressed through a standards body. The standards body is coming back and saying "well that's going to be really hard". Doesn't mean they shouldn't push for this to be a priority, after all it's better than them adopting a proprietary solution.
    2. Re:Not a trivial job by Anonymous Coward · · Score: 0

      Who is the racist here?

    3. Re:Not a trivial job by Anonymous Coward · · Score: 0

      Racist yourself, you silly clot! And why is that always the first card played when non-Western attitudes are called into question? It's moronic!

      GP made a reasonable point -- and very cogently too. Those countries are NOT "asking for an technical problem" (so eloquently put, BTW); they're looking for a solution to a problem which is of their own creation. There's a perfectly workable mechanism in place right now. Dick with it at your own peril, but don't screw me up too while you're at it.

    4. Re:Not a trivial job by spiritu · · Score: 1
      It seems that Hazzlebank has missed a key point of the earlier post. His racist slurs don't help his point, but I'll put aside Adam's anti-Arab racism for the moment and focus instead on the technical and political merits of his and the earlier discussion.

      I think what the previous poster was attempting to say was something along the lines of, "We built this house. It is open to change. Please feel free to rewrite BIND to fit your particular society's needs and also feel free to peacefully and democratically convince the rest of the world to make the (huge, difficult) adjustments needed in order to accommodate your particular needs. Please note that given the inherent limitations imposed by the immense deployed base of software that depends on DNS, it may take a while, so be patient."

      I highly encourage those from non-English-speaking societies (and their allies within said societies) to adjust DNS (and everything that depends on it) to account for the particular writing systems that have arisen out of non-English-speaking societies. It seems that ICANN is considering imposing this change upon the 'net by fiat, however. Such an imposition would be abhorrent.

      One source of the pressure was Adama Samassekou, president of the African Academy of Languages in Mali, who said that the Anglo-centric internet left people isolated and marginalised.

      "I think the digital divide is not as important as the linguistic divide. And that's the one we should be bridging in order to guarantee the democratic governance of the internet," said Samassekou.

      Democratic governance of the internet? By whom? Is there some sort of requirement that, like Mali, countries involved in the governance of the internet themselves be democracies? Is there a press freedom requirement for countries involved in the governance? Is there a court to resolve disputes? Will this "Internet Government" pay for the writing of DNS and routing software that can handle all of the requirements imposed upon it by a legislative body?

      [satire]
      Further, why should we stop at DNS? Won't the code be written in obviously discriminatory languages that utilize poorly-obscured ENGLISH for their function names? Why must a Chinese programmer use "printf" when she should be able to choose to use something written in her own language? Clearly "print" was put into programming languages to keep down the non-English-speaking people in the world.
      [/satire]
    5. Re:Not a trivial job by Anonymous Coward · · Score: 0

      >Changing all the DNS servers in the world to switch from ASCII to Unicode is NOT trivial. The fact that some societies have used non-latin characters for thousands of years is completely and utterly irrelevant. THEY didn't make the internet. They simply bolted themselves on to an existing infrastructure.

      this is precisely the reason why IDNA has deen developed. With IDNA URLs contiaining unicode will work seamlessly with the existing hardware infrastructure. This debate is only for internationalizing TLDs.

    6. Re:Not a trivial job by turnipsatemybaby · · Score: 1

      I know this thread is cold at this point, but I wanted to reply.

      If you think my post is racist, you completely missed the point of my post. This is how the internet was born. It was concieved and built in the US decades ago, originally under DARPA. Then it slowly expanded. As any expensive computer technology, it was expanded in a way that maximized the infrastructure already available. That is how we've come to where we are today. The east DIDN'T make the internet. They came on much much later, when all this infrastructure was already established.

      The fact that you think this is all racist says more about yourself than about my comments. I recommend educating yourself before making baseless accusations on others.

  26. thousands of years? by sexyrexy · · Score: 3, Insightful

    Given that some societies have used non-Latin characters for thousands of years, is this a bit late in coming?

    Those societies did not build an entire economic and social infrastructure using all 50,000 of those characters in a few decades, though.

    --

    Rex is 09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
  27. Basement by faqmaster · · Score: 1, Funny

    "What we're trying to do is change the bricks in the basement."
    It's the Internet; so it's more accurate to say we're changing the bricks in your parent's basement.

    --
    Are you...Are you some kind of genius?
    No, ma'am, I'm just a regular Slashdot reader.
  28. Re:Make 'em all speak english by Anonymous Coward · · Score: 0

    Why English? Why not, say, Spanish, or Mandarin?

  29. Better idea! by EvilRyry · · Score: 2, Funny

    How 'bout we all just speak English and forget about all those weird letters.

    (It was a joke... well sort of)

  30. Huh? by writermike · · Score: 4, Funny

    ICANN Under Pressure Over Non-Latin Characters

    You mean white people?

    --
    If Nalgene water bottles are outlawed, only outlaws will have Nalgene water bottles.
  31. Base-Ten BIGOTRY, I say!!! by mosel-saar-ruwer · · Score: 3, Funny


    Now if you'll excuse me, I need to finish reading all the new posts on 66.35.250.150.

    Base-Ten CHAUVINIST!!!

    What about societies that use Base 2 [binary], or Base 8 [octal], or Base 16 [hexadecimal]?

    Or entire societies, like the British empire, which use no base at all?

    12 inches in a foot. 3 feet in a yard. 1760 yards in a mile...

    60 seconds in a minute. 60 minutes in a hour. 24 hours in a day. 7 days in a week. 52 weeks in a year [give or take]...

    Or how about base 12?

    12 keys in a chromatic scale: A 440, then, logarithmically [give or take a little well-tempering]: A#, B, B# == C [kinda sorta], C#, D, D#, E, E# == F [kinda sorta], F#, G, G#, and finally A 880.

    Except that on the continent, things are often just a little sharper - say A 443/444/445 & A 886/888/890...

    And let's not even get into water freeezing & boiling at 32 & 212 versus 0 & 100...

    1. Re:Base-Ten BIGOTRY, I say!!! by Anonymous Coward · · Score: 0

      "Or entire societies, like the British empire, which use no base at all?"

      Technically speaking this is "base inconvenient".

      (Metric FTW!)

    2. Re:Base-Ten BIGOTRY, I say!!! by jargon82 · · Score: 1

      I buy that, except the last part. basing a temperature scale on water really has nothing to do with anything, so on that point, F is just as good as C.

    3. Re:Base-Ten BIGOTRY, I say!!! by amliebsch · · Score: 1

      I'll vote for base-12. Sure, it isn't any more convenient in binary calculations, but it's way better than base-10 for everyday use, being evenly divisible by 2, 3, 4,and 6 instead of just 2 and 5. Lots of people are already used to thinking in dozens and grosses. All it would take is a couple of extra numerals, say a square, stylized "X" for 10 and a backwards 3 (a stylized "E") for 11.

      --
      If you don't know where you are going, you will wind up somewhere else.
    4. Re:Base-Ten BIGOTRY, I say!!! by LunaticTippy · · Score: 1

      OK, let me see if I understand.

      C = based on water
      F = based on nothing

      I don't think F is just as good as C.

      --
      Man, you really need that seminar!
  32. Re:Make 'em all speak english by igb · · Score: 1, Troll
    Amusingly, when the issue is non-UTF8 character sets, or censorship, or anything else that upsets the non-Western countries, they start shouting threats like ``Turkey will start its own top-level domains'' or ``Iran will disconnect from the Internet''. Which I'm sure is terribly impressive in UN-type meetings where we're supposed to pretend that all countries' opinions matter, but in the real world is an entirely hollow threat.

    Were some random non-UTF8 country to make interworking with the rest of the Internet harder, it would be cutting its nose off to spite its face. For the G7 countries (yes, G7, not G8), the value of Internet connectivity to random minor countries is minimal. The value to those countries of Internet connectivity is large. Do US users care if Uzbekistan is on the Internet? No: it has zero impact on 99% of them, minimal impact on 0.9% of them, etc. Do people in Uzbekistan care about being able to access Google, Wikipedia, Amazon, CNN, the BBC? I rather think they do.

    No one likes pointing out to random minor countries that their presence on the Internet is far more in their interest than it is in anyone else's. But that doesn't make it any the less true. So, in general terms, the choice they're getting is ``largely anglophone, largely UTF-8, or nothing''.

    ian

  33. What kind of stupid idea by jlebrech · · Score: 1

    Who idea was to use cyrillic character where theres and exactly identicaly non-cyrillic character on the ascii set. They just decided to stick cyrillic characters in a totally seperated grid rather than adding the additional characters form the cyrillic.

  34. Use a simple eight dot three kludge by tempest69 · · Score: 2, Interesting
    Set up a private latin name prefix for the non-latin names i.e. NONLATINPREFIX and then a UUEncode of the non-latin name.. IE (arabic word for horse in arabic script)=AER5ER8EDG so you would have NONLATINPREFIX-AER5ER8EDG.com as a domain name, that would resolve correctly if someone typed in (arabic word for horse in arabic script).. 1. This allows for simple web-extention to serve non-latin countries

    2. Doesnt require any change to the DNS system. (other than some name policy changes)

    3. Allows links to be imbedded in normalweb-pages so that they can be cut and pasted by anyone with latin functionality. So a Japanese person could cut and paste the link to some arabic site that they dont have the font for.

    4. While this is a kludge it has some major advantages over rebuilding the DNS system.

    Storm

    1. Re:Use a simple eight dot three kludge by tempest69 · · Score: 1
      Oh there is a funky problem... letters that look identical in different languages. could allow for spoofing.. so if the link had a dual-language character-set it would need to multicolor the thing so that it would look pretty odd.. so that microsoft.com and mîcrosoft.com would be clearly distinguishable (reverse the î character so that it's white on black) but I still it's workable.

      Storm

    2. Re:Use a simple eight dot three kludge by Anonymous Coward · · Score: 0

      Horsesh*t! Something like that already exists and is called IDN and it uses punycode.

  35. DNS won't break by zdzichu · · Score: 2, Informative

    DNS won't break. In fact, it already works! The thing is called IDN and is supported by all modern web browsers (including IE). Try for yourself - http://www.kozowski.pl (I hope Slashcode won't caniballize letter "").

    So DNS and Web is OK. Any breakage I can think of may appear in email systems or other domain-based forms of communication.

    --
    :wq
    1. Re:DNS won't break by zdzichu · · Score: 1

      It ate the letter. So I'd add Slashdot to possible broken systems.

      --
      :wq
    2. Re:DNS won't break by BJH · · Score: 1

      Yes, it's irredeemably broken.

    3. Re:DNS won't break by dcam · · Score: 1

      IDN has implications for phishing.

      --
      meh
  36. Re:Make 'em all speak english by voice_of_all_reason · · Score: 1

    Because those languages are dirka, obviously

  37. Air Traffic Control by zomgitsnev · · Score: 1

    I don't see anyone complaining that air traffic control "should include non-english words", so why should the internet include non-latin characters? The system works, and I see no reason not to leave it as it is.

    1. Re:Air Traffic Control by Anonymous Coward · · Score: 0

      Good god! Would someone mod this up? The entire basis of international flight is English. It works. It works well.

      Another area where there are likely to be problems (hold the flames) is PKI. The Verisigns of the world will have to adjust their vetting process to properly handle unicode in domain names. Thawte (Verisign) offers a "lower"-priced certificate that is based solely on domain verification. If they cannot distinguish between two similar looking characters, we're fooked even more than we are now.

  38. Apple is a "unix" by Anonymous Coward · · Score: 0

    maybe you're stuck in os9, dunno...
    Apple just somehow made "their unix" do all kinds of fancy stuff

    m10

    (let's keep ourselves from the argument if macosx/darwin is to be classified as a *real* unix or not - it's a completely different discussion)

  39. Political policy again trumping practical sense? by jgercken · · Score: 1

    It's so frustrating how often people or groups with great political power but little technical insight frequently force changes without the capacity to truly weigh the risks involved. So very bad things continue to happen unnecessarily.

    Case in point: The US publicizing advanced engineering documentation on how to build a nuke because some shmuck, who didn't grasp the consequences, ordered it posted thinking that somehow it would help his political party.

    As technology becomes more ingrained in our lives, politicians will salivate even more over the being able to claim party to any major breakthroughs. There is more or less a separation of church and state in most countries. Can't we do the same with technology?

    --
    Never ascribe to malice what can be adequately attributed to ignorance. -Napoleon
  40. This will change the Internet into ... by salec · · Score: 1

    ... parallel multi-nets. I guess servers will have multiple domain names for same IP address, one for each culture they wish to address.

    No matter what, english-language net will continue to be *the* Internet, a global Forum, direct connection between common people from all parts of the world ( Hey there! :) ).

    All the other nets will have quite a marginal significance. Nations will try to boost them in order to keep their citizens indoctrinated with own traditional values, but things that do not fly by themselves usually have short age, lose appeal and fade away. Internet as we know it will take only a mild hit, so no worries.

    All this is needed for final globalization of internet - reaching people of the world with only as much as elementary literacy in their own mother's tongue. That is something that native english speakers take for granted - "Your grandma can use Internet". Well, most grandma's of the world still can't, or have difficulties with it.

  41. I smell a hack! by Anonymous Coward · · Score: 0

    "we have to make sure that if we change the system, the rest is all going to work.'"

    Pfttt. To quote Rocky Squirrel "that trick never works". Especially if they drop bind, and come up with something new. It's positively guaranteed that there will be a crack somewhere, waiting for a smart cracker to take advantage of it. History has well shown that the odds are extremely high there will be a back door vulnerability somewhere in the system.

    *rubs his hands in greedy anticipation*

    Oh yeah, here's a hint. Beware of accepting code from China (the allusion to the Trojan Horse is appropriate).

  42. An alternative by Anonymous Coward · · Score: 0

    Perhaps *they* might have spent some of those thousands of years inventing computers and internetworks? If *they* want non-latin characters, let *them* build *their* own root nameservers, *their* own implementation of DNS, and go for it. Who's stopping *them*?

  43. Why shouldn't this be "easy?" by kabocox · · Score: 1

    I thought this was what unicode was for. The only 3 scared characters that I wouldn't want messed with are the ":", "/", and "." How come we don't have a unicode DNS solution so countries could use the entir unicode address pool for domain names? I've read postings basically bashing the non-English world for not being invovled with the original tech so being left out. So that's a valid reason to discrimnate now? What used to get me excited about slashdot was the unquie solutions that you could find in the comments for real world problems. Used to be, on slashdot at the mere suggestion of a unicode DNS solution some one would either find one or write one. Now a days we get bitching and moaning regionalism about how either the US is behind the rest of the world due to our policies or that the US is better than rest of world because we speak English. Ug. Slashdot drives me batty sometimes.

    I say that non-English countries should just do it. If it breaks Standard US or European IE or FireFox by putting the domain name in the address bar, no loss to your country.

    1. Re:Why shouldn't this be "easy?" by pedestrian+crossing · · Score: 1
      I say that non-English countries should just do it. If it breaks Standard US or European IE or FireFox by putting the domain name in the address bar, no loss to your country.

      It would break a hell of a lot more than just web browsing and email.

      Essentially all internet-aware software would have to be updated to avoid breakage.

      That is a lot of software...

      --
      A house divided against itself cannot stand.
    2. Re:Why shouldn't this be "easy?" by EvilIdler · · Score: 1

      Modern slashdotters are mostly technology-obsessed nerds who don't actually
      produce any code of their own ;)

      Scandinavian countries are already starting to support 'æ', 'ä', 'å', 'ö' and 'ø' in domain names,
      although I'm not sure if they're going about it the correct way. There is a lot of demand for it,
      though, because not every company in the world is international, or has an English name..

      If a couple of hundred million people want to use their local language to represent their domain
      names, that's a good enough reason for the ameri-centric slashdotters to shut the fuck up >:)

  44. Re:Make 'em all speak english by Shaper_pmp · · Score: 0, Troll

    At the risk of sounding like a cultural chauvenist... because we invented the damn internet, and we speak English, and use the Latin-1 character set.

    If individual countries want to implement their own DNS-equivalents in their national character set them more power to'em, I say. However, they'll also have to deal with upgrading every DNS-capable application on every machine in the country, then find a solution to the massive problem of phishing they've just caused by introducing two identical-looking (but numerically different) characters... and then find a way to enable other nationalities to type and use those URLs without necessarily having the characters on their keyboards or character-sets on their machines.

    I honestly don't see a way around this.

    --
    Everything in moderation, including moderation itself
  45. Re:Yes and No - thinking long term by bbernard · · Score: 1

    Besides, think of how well prepared DNS will be to start supporting lookups in extra-terestrial languages when the time comes if we do this now! We'll be completely compatible with Martian, Klingon, Mimbari, and Vulcan networking systems the day we meet them! We should be able to view each other's pron almost immediately!

    --
    ----- Connection reset by beer
  46. Re:Make 'em all speak english by Anonymous Coward · · Score: 0

    The internet was developed in the US, and therefore US people designed the early protocols. The internet has been around for how long now, and these countries are just now really trying to push their character set. Couldn't you have thought about this before we had so many webpages and domain names? And with non-latin domains, aren't we setting ourselves up for something like www.@(öö)@.com? I wouldn't mind it in a hyperlink, but just try to type it.

  47. While we are at it ... by Anonymous Coward · · Score: 0

    Why URI in non-English speaking countries should start with "http://", "ftp://", ..., instead of the accronym of the protocol's translated name (which shoud be something like "ptht://", "ptf://" in French :) ) ?

  48. .cn by hey · · Score: 2, Interesting

    Does ICAN control .cn (China)? Or other national TLDs? Why don't they just start registering
    domain in their local language. Leave .com, .org, .mil (ie the USA TLDs) English.

    1. Re:.cn by DaveCar · · Score: 1


      All the searches for those domains will still start off from the root servers controlled by ICANN, so depending on policy/implementation the names could be refused as containing invalid characters. I hasten to add that I don't know if there is actually something that would cause them to be refused, just that the searches will still hit ICANN servers.

    2. Re:.cn by Anonymous Coward · · Score: 0

      .com and .org are not USA TLDs. USA TLDs are .us, .gov and .mil.

  49. What About HTML? by Anonymous Coward · · Score: 0

    God forbid these foreigners "view source" and realize that html is largely broken English. That would really put a bee in their bonnet.

  50. Pléåsé ñø by bugnuts · · Score: 2, Funny

    Tht ìs thê £äst thïñg wë ñèêd

    Dibs on ©óm

  51. What's this going to do for security .. by rs232 · · Score: 3, Interesting

    What's this going to do for security. Didn't we have phishing attacks receintly that consisted of unicode characters being inserted into e+bay.com for instance that didn't get displayed. the domain e+bay.com being different than ebay.com.

    "A domain name is a unique address that allows people to access a website, for example, smh.com.au"

    No,a domain name is a sequence of characters mapped to an IP address. It was designed so as you won't have to remember 66.35.250.150 instead of slashdot.org. This wasn't a problem while the original Internet consisted of just four computers. DNS was never designed to provide identity. There was also the case of a stock trader hacking a DNS server and redirecting traffic from a legitimate finantial site to his own where he had duplicated the real site only with bogus information.

    "He said that this could create problems where, for example, a character in Urdu looks identical to one in Arabic"

    It sure could. How about totally replacing DNS with a system of online identities.

    --
    davecb5620@gmail.com
  52. Um... why? by Colin+Smith · · Score: 2, Informative

    "Yes, countries that use non-English characters should be able to interact with the rest of the world using their natural language."

    Why... No really. You speak as if this is a good thing. Why should they be able to use their natural language rather than English? Why shouldn't they be restricted to a limited area of local language speaking people?

    The reason the Internet is useful is because everyone speaks TCP/IP. Incompatible protocols are to be actively discouraged because they balkanise the network. Language is exactly the same. The reason the Internet is useful is because everyone speaks English, the more divided it becomes the less useful it becomes.

    Languages are anachronisms, the only reason we have more than one is the physical distance between locations and difficulty travelling allowed them to evolve independently. Well that isn't the world we live in any more and the different languages actually make communication far more difficult now. They're no longer beneficial. So get rid of them, insist on a common language. The most popular happens to be English at the moment. I could live with Spanish, but for those of you about to suggest Chinese, read this before deciding: http://www.pinyin.info/readings/texts/moser.html

    We should be using this opportunity to actively get rid of languages.

    --
    Deleted
    1. Re:Um... why? by CRCulver · · Score: 4, Interesting

      Languages are anachronisms, the only reason we have more than one is the physical distance between locations and difficulty travelling allowed them to evolve independently.

      So why does every language have strata of slang and jargon that may well be incomprehensible to outsiders? In south-east England, a fairly small area, one has a wide range of speech depending on economic status and social circle. If one has a few people speaking a common language, it won't stay uniform for long, even if everyone's still in the same place.

      So get rid of them, insist on a common language.

      Sure, and why don't we just all wear the same clothes, just because different styles or colours can be taken too seriously (on gang turf, for example)? And let's all eat the same food, no need for various cuisines when flavourless mush can keep us alive.

      Languages make the world more interesting. I enjoy very much traveling about and seeing how the local communicate, the phonological inventory and morphological quirks they employ, the different judgements on eloquent speech they hold. If all this disappeared, it would be very dull.

      And your claim that languages are "too difficult" is a peculiar opinion of some in first world nations. The vast majority of human beings are multilingual, see e.g. Edwards, John. Multilingualism (London: Penguin, 1994). It should only take a person a couple of weeks to acheive a basic conversational level in a foreign language, which can easily be done before each time you set off on vacation. I've never had a problem learning enough of the language to talk with the locals about their culture and mine, and I think my language skills are actually fairly humdrum in comparison to a lot of people I've met.

      And if all national tongues disappear in favour of some world language imposed by fiat, what would happen to all the literature written in them? Poetry translates infamously poorly. People have spent millennia composing art in words, one of the skills that makes us the unique species we are. Are we to throw all of those great monuments away?

    2. Re:Um... why? by Colin+Smith · · Score: 1
      So why does every language have strata of slang and jargon that may well be incomprehensible to outsiders.


      I'm glaswegian. I'm perfectly capable of being completely incomprehensible to all non glaswegian english speakers while still speaking english. I'm also perfectly capable of being completely comprehensible to virtually all other english speakers. Whether there's slang or not, english speakers can make themselves understood to other english speakers should they choose to, the basic language is the same. Not true with different languages.

      Sure, and why don't we just all wear the same clothes, just because different styles or colours can be taken too seriously (on gang turf, for example)? And let's all eat the same food, no need for various cuisines when flavourless mush can keep us alive.


      Strawman, neither of the examples are communication protocols which benefit from the network effect. Language is.

      Languages make the world more interesting.


      Sorry, irrelevant, and wrong. People make the world more interesting. It's nice to be able to talk to them.

      And your claim that languages are "too difficult" is a peculiar opinion of some in first world nations.


      Nope. Spanish, Italian, German or other romanic or germanic language I could probably pick up as required. Chinese is apparently particularly difficult.

      And if all national tongues disappear in favour of some world language imposed by fiat, what would happen to all the literature written in them?


      It would be consigned to academia, where all dead languages go.

      --
      Deleted
    3. Re:Um... why? by CRCulver · · Score: 3, Interesting

      Strawman, neither of the examples are communication protocols which benefit from the network effect. Language is.

      Language may be employed in various ways. Not only to communicate, but also to obfuscate (as some Roma do with their use of Romani) or to explore new possibilities of form (conlangers, bits of Sandor Weores and James Joyce).

      People make the world more interesting. It's nice to be able to talk to them.

      People aren't solitary individuals, they belong to larger societies that shape them. Understanding his language is part of understanding a person.

      Nope. Spanish, Italian, German or other romanic or germanic language I could probably pick up as required. Chinese is apparently particularly difficult.

      Chinese's difficulty is mainly at the level of official orthography. I studied Chinese at Defense Language Institute while in the Navy, where we concentrated only on the spoken language and learnt but a few characters, and after the first two months I no longer felt any barriers. Granted, I occasionally had to ask a person to explain what they meant, but still in Chinese of course, and I employed many circumlocutions, but it's not hard at all to learn enough Chinese to talk to Chinese people about themselves and their culture.

      It would be consigned to academia, where all dead languages go.

      The Finno-Ugrian minorities of Russia, which are my chief object of study now, do not want their languages and literature "consigned to academia". They want their works preserved, they desparately seek more funding of publication (and an end to local government censorship), and they experience great pain over the monolingual policies of the Russian state--most of the Mari men of letters, for example, were murdered under Stalin. Are you to tell those suffering peoples to "just get over it"? One finds in Russia that the locals who did "get over" the loss of their language also have higher rates of suicide, alcoholism, and existential crisis, while those who are fighting to preserve their language and feel a connection to the past have a much more positive outlook.

    4. Re:Um... why? by pangloss · · Score: 1

      It should only take a person a couple of weeks to acheive a basic conversational level in a foreign language, which can easily be done before each time you set off on vacation.

      I actually agree with the general thrust of your post and until recently, even the bit of your post I quoted above. However, my most recent foreign language experience (visiting China), now leads to me strongly disagree (just with the bit I quoted above). A couple of months isn't adequate, let alone a couple of weeks. Most of the language students I met in Beijing, after 8 weeks of 20 hrs/week of classroom study had yet to achieve a "basic" conversational level. Of course, we could argue about what constitutes basic =) I was pretty happy to be able to order a beer and say thank-you, but I would call that essential, not basic ;)

      I met people who could speak two or three other (Western) languages, but after over a year of living/working in China, still couldn't explain over the telephone in Mandarin that they might be late for a meeting without assistance from a bilingual friend. I'm willing to bet there are other languages that impose a similar level of difficulty.

    5. Re:Um... why? by Colin+Smith · · Score: 1
      Language may be employed in various ways. Not only to communicate, but also to obfuscate (as some Roma do with their use of Romani) or to explore new possibilities of form (conlangers, bits of Sandor Weores and James Joyce).


      All of which can be done in english.

      People aren't solitary individuals, they belong to larger societies that shape them. Understanding his language is part of understanding a person.


      And if there was just one language, wouldn't it be much easier for everyone in the world to understand one another. Languages are barriers to communication, they don't aid understanding, they make it more difficult. The fewer languages there are in the world, the better off humanity will be.

      Chinese's difficulty is mainly at the level of official orthography.


      And yet the DNS uses written language which makes chinese less than ideal.

      The Finno-Ugrian minorities of Russia, which are my chief object of study now, do not want their languages and literature "consigned to academia".


      That'd be tough then. Neither nature nor economics are particularly sympathetic.

      One finds in Russia that the locals who did "get over" the loss of their language also have higher rates of suicide, alcoholism, and existential crisis, while those who are fighting to preserve their language and feel a connection to the past have a much more positive outlook.


      As I pointed out earlier, I'm glaswegian, my native language is long gone and I don't feel any loss. They will get over their nostalgia and will benefit hugely from the use of a common language.
      --
      Deleted
    6. Re:Um... why? by curunir · · Score: 1
      We should be using this opportunity to actively get rid of languages.
      What you don't seem to be groking is that language is an integral part of people's identities. The french are proud of their language...losing it would be like losing a part of themselves. Your link even mentions that Chinese people take the difficulty of their language as a badge of honor. Every language has some measure of this.

      Ignoring how foolish it would be to try to ensure that there is only one language, the natural candidate for "universal language" would be the one that was created specifically for that purpose, Esperanto. It was specifically designed to be easy to learn and pronounce for as many people as possible. And, more importantly, adopting it would mean that everyone would be forced to learn a new language. It would make far more sense to try to ensure that eveyrone speaks two languages, their native langauge and Esperanto. This would allow people to contextually use language depending on whether it was intended for consumption by everyone on earth (passport, street sign, technical manual, etc) or whether it was intended specifically for people who speak their language. Most sci-fi visions of the future where a common language is realized adopt this scenario of a single, common language that everyone speaks without eliminating the other languages entirely.

      However my personal feeling is that we should be actively using pre-school and early grade school (ages 3-7 or so) to teach only languages and reading to children. Children are so receptive to learning languages at that age, that it's very possible to teach 4+ languages during that time span. And when people speak that many languages natively, they can pick up languages far more easily as adults. I only wish I had been given this sort of education...I know it would have made it far more easy for me to get to the point where I am now (can speak and read 4 languages fluently and a number of others with varying degrees of ability).
      --
      "Don't blame me, I voted for Kodos!"
    7. Re:Um... why? by CRCulver · · Score: 1

      All of which can be done in english.

      No, to explore different forms means that the language is no longer English. To use the two authors I cited above, Joyce's Finnegans Wake is only nominally English, knowledge of something like 60 languages is necessary to fully understand Joyce's prose. The first poem of Sandor Weores' "Barbar dal" or the last stanza of "Nema zene" are in languages of his own making. Even if a person only has access to one language, he or she may develop other languages out of pleasure. It's part of being human. And this happens quite often, many countries have had national language reforms where they intentionally set their language apart from others, from Estonian (loss of much German stock) to Romanian (eradication of many Slavonic loans) to the languages of the former Yugoslavia, now trying each to be distinct. You're a tiny minority here if you think that peoples having their own lingo is nice.

      Languages are barriers to communication, they don't aid understanding, they make it more difficult.

      If many cultures exist, there is more to talk about. If one has a single universal language, that limits the amount of information that can be communicated. If I can talk about something as simple as the weather with someone who uses a system of communication exotically different than the one I grew up speaking, that's a much richer experience than speaking just my own.

      As I pointed out earlier, I'm glaswegian, my native language is long gone and I don't feel any loss.

      You weren't part of the generation of Scots that lost Scottish Gaelic, or the generation long before that that lost Pictish. Generations that face the loss of their language experience immense sadness, why wish that on them?

      They will get over their nostalgia and will benefit hugely from the use of a common language.

      And as I already pointed out, this is not true for the former speakers of indigenous languages of Russia nor, I daresay, for most places on Earth where the local language has been replaced.

      That'd be tough then. Neither nature nor economics are particularly sympathetic.

      Social Darwinism has been nearly universally recognized as a horrible philosophy for at least the past six decades. And many minority languages actually have economics on their side, look at the Basques (the region is a powerful banking capital with money to go around).

    8. Re:Um... why? by Colin+Smith · · Score: 1
      What you don't seem to be groking is that language is an integral part of people's identities.


      No I do understand. As a Scotsman I completely understand it, in fact I probably understand it better than you and I reject it completely. I'm far, far better off speaking English rather than Gaelic or Scots. I'm still Scottish despite the fact that I speak English and Americans are still American despite the fact that they speak English, Australians are still Australian despite the fact that they speak English, even the Welsh are still Welsh despite the fact that they speak English.

      Sorry but Esperanto is a politically correct joke, the numbers of speakers is infinitesimal and the network effect will ensure it remains that way. If anybody wanted to learn it they would have done by now. What will happen instead is that English will continue to spread, replacing all but a few other languages. And this is a good thing, the world will be a better place and we should encourage that spread, not attempt to hinder it.

      --
      Deleted
    9. Re:Um... why? by doshell · · Score: 1

      What I find most disturbing in your post is that you hint it would be a good thing to standardize on English (or any other widely spoken language, for that matter). In my opinion that would be a very bad idea because it would not only give an unfair advantage to those who already speak it natively, but also open up the door for a dangerous world-scale cultural dominance by the countries in which that language is spoken (something of that kind is already hapenning at a smaller scale at the moment with the "invasion" of American culture pretty much everywhere).

      The thing is you can't look at a human language as if it were purely a communication tool, like computer languages or internet protocols are. A human language carries a culture around with it, which is that of the countries it is spoken in (either presently or historically). That happens not only through literature but also through everyday spoken language (as an example consider the phrase "pardon my French" and how well it would bide with french people, were they to standardize on English).

      In order for your plan to function adequately you would first have to get rid of all cultural references in your language of choice -- which I reckon is quite an impossible task. Or you could "start from scratch" and use a constructed language like Esperanto (which, incidentally, would suit the role quite well since it is easy to learn and has very consistent rules). Even then you could argue there already exists some kind of an Esperanto culture you'd have to get rid of. In my view it would simply be safer not to go down that path at all.

      --
      Score: i, Imaginary
    10. Re:Um... why? by doshell · · Score: 1
      I'm still Scottish despite the fact that I speak English and Americans are still American despite the fact that they speak English, Australians are still Australian despite the fact that they speak English, even the Welsh are still Welsh despite the fact that they speak English.

      Except that English has been spoken in those parts of the world for centuries. Compare that with the attempt to implant the English language (through cultural items such as movies, music, etc) in countries where it has never been spoken natively, such as Japan, Germany, Spain or Brazil. What worries me is: will the cultures of those countries be preserved after a long exposure to the English/American culture, which the attempt to standardize the world on the English language will undoubtedly bring? (read my reply to your original post.)

      --
      Score: i, Imaginary
    11. Re:Um... why? by ObsessiveMathsFreak · · Score: 1

      And yet, we all use the same mathematics. Digits, structure and symbols.

      --
      May the Maths Be with you!
    12. Re:Um... why? by CRCulver · · Score: 0, Offtopic

      There are still languages that use non-Base 10 systems, and if the reports are right (and, to be honest, they probably aren't) Piraha speakers don't use higher numbers.

    13. Re:Um... why? by Colin+Smith · · Score: 1
      No, to explore different forms means that the language is no longer English.


      And Lewis Carroll made up nonsense words. Literary snobs would continue to explore from an english base.

      If many cultures exist, there is more to talk about. If one has a single universal language, that limits the amount of information that can be communicated.


      I'm sorry, but, bullshit. There are many cultures within the English language and communication between them is greatly facilitated by that one language. The world becomes a wider more exciting place.

      Generations that face the loss of their language experience immense sadness, why wish that on them?


      Because the alternative is isolation and decline. Despite your protestation, a single language is far better than the alternative.

      Social Darwinism has been nearly universally recognized as a horrible philosophy for at least the past six decades.


      And survival of the fittest is particularly harsh for the individual unable to adapt but guess what, that's the way the world works, even today in human society. You can either adapt and be happy or don't and die out. You can stick your head in the sand and try to pretend it isn't happening or you can embrace change.

      Are you saying the Basques don't speak French or Spanish, and most probably English as well? There are some Gaelic speakers in Scotland too, about 60,000 they all speak english and the number is in inevitable decline just like Basque.
      --
      Deleted
    14. Re:Um... why? by CRCulver · · Score: 0

      Ok, let's see. You consider popular literary works (Weores' poetry is known by heart by several generations of everyday Hungarians) as products of "snobbism". You seem to think Basque is on the decline when it's use has actually been growing impressively over the last quarter-century (and non-ethnic Basques in the area have started learning it in large numbers), and you think that the basest human cruelty should be tolerated, even encouraged, since "that's how it goes". Yet another Slashbot with no idea what he's talking about.

    15. Re:Um... why? by Colin+Smith · · Score: 1
      Except that English has been spoken in those parts of the world for centuries. Compare that with the attempt to implant the English language (through cultural items such as movies, music, etc) in countries where it has never been spoken natively, such as Japan, Germany, Spain or Brazil.


      It has to start somewhere. Business, movies and music are a good start.

      What worries me is: will the cultures of those countries be preserved after a long exposure to the English/American culture, which the attempt to standardize the world on the English language will undoubtedly bring? (read my reply to your original post.)


      You'd rather they were anachronisms preserved in amber for your amusement? You don't think the members of those cultures deserve to benefit from the wider world?

      --
      Deleted
    16. Re:Um... why? by Colin+Smith · · Score: 1
      What I find most disturbing in your post is that you hint it would be a good thing to standardize on English (or any other widely spoken language, for that matter).


      It absolutely would be a good thing.

      And as to the rest, culture is independant of language, certainly there are influences but the English are identifiably English, Americans are identifiably American with different attitudes, different politics, different history and a single language. Culture is dynamic, not static, it varies from generation to generation in the same location never mind between different locations. If everyone spoke english, American would be a minority culture, french idioms would enter the language and everyone would be richer for it.

      The sooner the better.
      --
      Deleted
    17. Re:Um... why? by doshell · · Score: 1
      You'd rather they were anachronisms preserved in amber for your amusement? You don't think the members of those cultures deserve to benefit from the wider world?

      I would rather preserve what you call "anachronisms" than have American culture fed down my throat to supplant my own. I have a right to preserve my culture and language as a righteous part of the legacy of mankind. So does everyone else in the world.

      What you don't seem to understand is that going down that path leads not to a wider world but a narrower one, culturally speaking. It is not so much a matter of opinion since it is happening right now, as you can see with your own eyes at least in Europe.

      Besides, if you are so convinced that language has little to do with culture, then certainly it isn't hard for you to agree that the ability to have contact with and benefit from other cultures does not require mastery of their respective languages?

      --
      Score: i, Imaginary
    18. Re:Um... why? by coaxial · · Score: 1

      So why does every language have strata of slang and jargon that may well be incomprehensible to outsiders? In south-east England, ...

      Well that's you're problem right there. :)

    19. Re:Um... why? by doshell · · Score: 1
      And survival of the fittest is particularly harsh for the individual unable to adapt but guess what, that's the way the world works, even today in human society. You can either adapt and be happy or don't and die out. You can stick your head in the sand and try to pretend it isn't happening or you can embrace change.

      Or you can realize that "change" is not imperative in this case, but rather a matter of choice. You're the only one here who is saying that converging to a single language is inevitable.

      --
      Score: i, Imaginary
    20. Re:Um... why? by Grishnakh · · Score: 1

      If many cultures exist, there is more to talk about. If one has a single universal language, that limits the amount of information that can be communicated. If I can talk about something as simple as the weather with someone who uses a system of communication exotically different than the one I grew up speaking, that's a much richer experience than speaking just my own.

      Huh??? If I try to talk about the weather with someone who uses an exotically different language than my own, then I simply can't communicate with them at all! How is this useful? Sorry, but not everyone is a linguist, or cares to be.

      If I want to talk to someone about anything besides their language, it's pretty much impossible if we don't both speak the same language. Why you're advocating that people not be able to communicate with each other is incomprehensible.

    21. Re:Um... why? by coaxial · · Score: 1

      Languages are anachronisms, the only reason we have more than one is the physical distance between locations and difficulty travelling allowed them to evolve independently. Well that isn't the world we live in any more and the different languages actually make communication far more difficult now. They're no longer beneficial. So get rid of them, insist on a common language. The most popular happens to be English at the moment. I could live with Spanish, but for those of you about to suggest Chinese, read this before deciding: http://www.pinyin.info/readings/texts/moser.html

      Languages are more than a means to convey information. They're identity. Why do you think Ireland, Wales, and Scotland are dusting off dead languages are nameing all their cities and trying to teach it school? (The irony that literally no one knows how to spell these words or even what words should be used, should not be lost.) It's a big ol' wankfest to try and to assert national identity to nations that either already exist, or ceased to exist more than 1000 years ago. The Jews did the same thing with Hebrew in the 19th century and then again in the 20th when they needed words like "telephone,"

      Yeah, I speak English. Yeah, I'm an American. And yes, trying to dictate a language through legislation and "academies" is a pointless attempt at nationalism. When you're own people don't even speak the language you've lost. Give it up.

      Hasta la vista.

    22. Re:Um... why? by CRCulver · · Score: 0, Flamebait

      Huh??? If I try to talk about the weather with someone who uses an exotically different language than my own, then I simply can't communicate with them at all! How is this useful? Sorry, but not everyone is a linguist, or cares to be.

      As I mentioned before, with a useful citation you're sure to find in your university library, most people on Earth live in multilingual communities. They grow up speaking the languages of their neighbours--or learn gradually later in life without all the bitching and moaning you show here--and so there is not necessarily a barrier to understanding. The poster to whom I was responding, however, asserts that even this stable multilingualism isn't tolerable, but rather we have to get rid of all languages but one.

    23. Re:Um... why? by An+ominous+Cow+art · · Score: 1

      The Esperanto language was created to be everyone's common language, so that people with different native tongues could interact equally, while not replacing regional languages or dialects. I'm surprised neither of you have mentioned this.

    24. Re:Um... why? by CRCulver · · Score: 0, Offtopic

      Oh, I've indeed got some impressions about Esperanto.

    25. Re:Um... why? by sydneyfong · · Score: 1

      The mod who gave you +1 is smoking crack.

      Your suggestion is fundamentally flawed on moral, cultural, and practical grounds.

      There have been a number of good replies from fellow slashdotters, so I'm not going to repeat all that.

      A few things that I'd want to add:

      Firstly, you must understand that there are some expressions which could be easily expressed in one language and extremely difficult in another, due to the grammar, available vocabulary, perceptions of aesthetic for the language and so on. If you don't believe me, try translating a page of complex mathematical symbols (a specialized niche language) into English, and tell me English is better for all applications.

      Using your TCP/IP analogy, you must be aware that above TCP/IP there are a multitude of different protocols for different applications. HTTP for the "WWW", FTP for lame file transfers, SMTP/IMAP/POP3 for email (three!), telnet/SSH for remote shell access (two!), and so on. Your suggestion that everybody ought to speak English is like the efforts to move everything onto port 80 -- yeah things become more "standardized", but having everything served in HTTP isn't necessarily a good thing.

      Secondly, on what grounds do you prohibit people from speaking whatever language they want? For one, it's against free speech. And why English? Some fellow slashdotters have suggested Esperanto. Why not that instead?

      Thirdly, as I am sure you have noticed, your proposal is absolutely impractical. Good luck on getting the 1.3 billion people in China to agree with you. Actually they are having trouble in getting those 1.3 billion people to agree on a single dialect (Mandarin), and people do speak their local dialect from time to time. EVEN if you have overcome that hurdle, language changes and evolves, and unless there is a strong link of communication between different local regions, the language will eventually evolve into different paths.

      --
      Don't quote me on this.
    26. Re:Um... why? by An+ominous+Cow+art · · Score: 1

      You make good and interesting points.

      (I don't actually speak it, I've just always like the idea behind it)

  53. There are plenty of sites already by Anonymous Coward · · Score: 0
  54. Horrible indeed by unity100 · · Score: 2, Interesting

    Im in a country that is based between europe and middle east, we have a few non-latin characters in the alphabet, still it creates problems when conferring domain names.

    no wonder the middle east (arabic) countries are especially wanting this, because the majority of the inexperienced internet users there will be more likely to easily use these domain names, hence the sites using those domains will be greater incentive for controlling what they see, because these domains will be under their control nationally.

    not only this, but we as it people will be very unwilling to change all our software to adapt with the new situation because of the horrible development/testing/implementation involved, and hence wont be accepting these domains as valid in our network traffic, which will create a second internet which is as described above, less free.

    this should not be allowed.

  55. Internet Layers by 0xABADC0DA · · Score: 2, Funny

    1. Physical
    2. DataLink
    ...
    6. Presentation
    7. Application
    8. Tubes
    9. Bricks
    10. Porn
    11. Google
    12. YouTube
    13. ??
    ...
    16. Profit

    It was hard enough remembering them all back when there were only 7.

  56. Oblig by Saikik · · Score: 1

    I do not know of these "patawans" of which you speak. I felt a great disturbance in the DNS as if thousands of URLs cried out in anguish, and then were silenced.

  57. you couldnt be more wrong by Srin+Tuar · · Score: 5, Informative

    much even when Windows solved the problem soooo long ago

    i18n on windows is far from "solved".
    I do admit that MS had a huge benefit when they started pushing unicode.
    (It takes a company with microsoft's level of clout to push around national governments )


    And the ASCII problem isn't just bad because it forces people to use inefficient encodings like UTF-8 (THREE bytes per character?)


    Perhaps you don't realize that UTF-8 is moving on to become the most dominant character encoding,
    and the legacy cruft such as UTF-16 (designed to deal with design flaws in windows) is being phased out.

    Even languages that would end up as mostly 3 byte characters tend to benefit from the savings on single byte
    characters for control and formatting markup.

    I'm not going to harp on about it, but a few basic web searches could enlighten you here.

    if(string[index] == '.' || string[index] == '?' || string[index] == '!') sentenceEnd = true;

    Code like that *works* in UTF-8, which is one of the things that makes it beatiful. (among many others)

    It allows you to deal with world characters sets when it matters, and allows you to ignore them when it does not.
    (for example, a lexical analyzer that specifies its tokens does not want to support punctuation from every language ever conceived)

    And if you think code like that doesnt exist in the windows world, you are sadly quite naive.
    In my experience internationalizing applications, its typically far easier to upate unix applications, which
    on occaision need nearly no changes at all, compared to the laborious grind and near total re-write often needed
    for ms-windows applications.

    1. Re:you couldnt be more wrong by HiThere · · Score: 1

      I'm not sure that Dr. Johnson would accept your definition of where this sentence ended.
      If I were to say "Hey!" would this sentence have ended?
      Perheaps(sp?) you need to rethink this a bit.

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    2. Re:you couldnt be more wrong by curunir · · Score: 1

      Even languages that would end up as mostly 3 byte characters tend to benefit from the savings on single byte characters for control and formatting markup.

      To further your point, compression techniques like gzip make the difference between UTF-16 and UTF-8 even smaller since they recognize which characters or character sequences are being used and allow them to be encoded using less bytes. A 3-byte UTF-8 character and a 2-byte UTF-16 character (as part of a document of any reasonable size) are represented by gzip in basically the same number of bytes.

      But even this is a minor concern...with the size of hard drives and the bandwidth capabilities we have these days, the number of bytes used to represent text is pretty insignificant. Even in the worst case scenario of UTF-8 (IIRC, a 5-byte character), my broadband connection can still transfer 160,000 of those characters per second. Trading the backwards-compatible nature of UTF-8 for the few bytes saved by UTF-16 makes very little sense.

      --
      "Don't blame me, I voted for Kodos!"
    3. Re:you couldnt be more wrong by rabiddeity · · Score: 1
      if(string[index] == '.' || string[index] == '?' || string[index] == '!') sentenceEnd = true;

      Code like that *works* in UTF-8, which is one of the things that makes it beatiful. (among many others)

      It allows you to deal with world characters sets when it matters, and allows you to ignore them when it does not.

      It's not always true. What about code that scans for lowercase letters and capitalizes them? That will royally screw up any UTF-8 string that isn't based off a Latin alphabet, like CJK languages. And it even has the potential to screw up languages like Czech. Oh, you say, just use the builtin functions. Regardless it's something you really should keep in mind when coding. Don't assume everyone uses English, and don't assume that single character conversions will work. Quite often, they don't.

  58. Mod parent up by TrilateralRegression · · Score: 1

    Virtual keyboards, character maps, and the Optimus keyboard, just to name a few points...

  59. Arab, Persian, Asian nations need more security th by OldHawk777 · · Score: 1

    China, Persian, and Arab nations need more security through obscurity to prevent kiddy-script cracking with Latin/Russian/English fonts and control the internet in their sovereign totalitarian nations for god and corruption to freely grow in their two-cast corporatist-communist (haves and nots, just like the US and EU) cultures.

    Remember: An exploitable-labor force is never a slave-labor force, I mean like, for real, exploitable and slave are two totally different words ... they aren't even spelled the same.

    This should help China, Persian, Arab nations and Yahoo, Google, Microsoft ... to provide better tools for building "The Great Digital Divide Wall" for themselves, the Terrorist Defense Network (TDN), and other two-cast totalitarian corporatist-communism nations. God bless US one and all!

    Also, the collapse of the internet will be useful to developing the "New World Order" that everyone in charge really wants.

    --
    Unaccountable leaders are masters, and unrepresented people are slaves. How do US and EU fare?
  60. So, Run The New DNS in Parrallel by arthurpaliden · · Score: 1

    Run the new one in parallel, same port just different host, and let the user decide which one to use, which is what they do now. Eventually everyone will be using the new system.

  61. Should be insightful rather than funny by Colin+Smith · · Score: 1

    Because it'll be the easiest way to be sure you're hitting the correct server.

    --
    Deleted
  62. These Asians should simplify their writing systems by Anonymous Coward · · Score: 0

    If Europeans and other Westerners manage to write with a few dozen symbols why cant Asians do the same? It would be easier for everbody (including their children). Some Asian nations are already on the right track, the Turkish and the Vietnamese people already switched to Latin writing; why cant other Asian nations do the same? Anyway the Alphabet is an Asian invention (Phoenicians were Asians - Lebanon is in Asia). The switching nations wont adopt a Western writing system, but an Ancient Asian one! The sooner the better!

    Regarding switching, there are discussions about backward compatibility (pepole wont manage to read old books, etc). Sure the Vietnamese and the Turkish peoples solved this problem in some way; other nations should ask for advice from them.

  63. Re:Internet: Made in the US of A by leon.gandalf · · Score: 0

    The only thing inflamitory about that statement is how true it is...

  64. I'm Just a Stupid American by 8ball629 · · Score: 1

    But all this will lead to is more domain hijacking and phishing... these people that are complaining about not having non-latin characters in a domain have just run out of domains to hijack.

  65. Bad for phishing by AaronW · · Score: 2, Interesting

    Adding unicode to DNS names would make phishing much more difficult to detect unless all the browsers, email clients and other tools are modified to indicate that a URL may not be what the user thinks it is. It is bad enough as it is, and remember, most Internet users are not as savvy as those of us on Slashdot. I forsee a lot of security implications by adding this.

    --
    This post is encrypted twice with ROT-13. Documenting or attempting to crack this encryption is illegal.
  66. A few things need to happen first by davidwr · · Score: 1

    1) the infrastructure needs to support unicode-16 and -32 DNS names.
    2) collisions need to be found and identified, and name-registrations that collide with existing names not allowed.
    A collision is any letter that is easily confused with another, the very things scammers use to trick you. Even in the existing 37 characters, 1 and l collide in some fonts. Add uppercase, and 0 and O will collide. Unfortunately, any existing collisions will probably have to be grandfathered.
    3) some standard needs to exist for accents - are they a single code or two?

    Trivia: In the days of yore, ancient devices called "typewriters" did not have "!" or "1". You used "l" (lowercase l) for "1" and for "!" you used "'" (single-quote, apostrophe) then backspace then "."

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  67. Microsoft, apple no better.. by Anonymous Coward · · Score: 0

    From your post it appears that you think that *someone* has this solved.

    They don't.
    Microsoft, for examples, uses UTF-16. Long long ago UTF-16 and UCS-16 were the same, so you could encode all unicode characters in two bytes. Sure, it's inefficent for the bulk of text out there (which is in ascii) and totally incompatible with the bulk of systems out there (which use ascii).. but it avoided the problems with varible length characters. ... except it didn't last.

    You see, long before Microsoft managed to actually ship a real production ready product with their wonderful commitment to two-byte fixed length characters it was realized that 16 bits isn't enough to encode all the characters that people care about (in particular a lot of asian language characters were left out). As a result UTF-16 broke away from UCS-16. UCS-16 is the fixed lenghth representation which can't capture all the unicode characters, and UTF-16 became a variable length representation which is two bytes at a minimum and four bytes at maximum. So users of UTF-16 still have all the bugs from mishandling variable length encodings, but they just happen in less obvious ways.

    Of course, you could use UCS-32 ... but even microsoft can't abide by its inefficency.

  68. The only compromise left by CupBeEmpty · · Score: 1

    This is just getting out of hand. What we need to do is replace Latin characters in DNS with numbers. That way there will international unity through the language of math. Every DNS entry will be a number that points to current IP addresses. I don't know why this hasn't already been implemented. It completely solves "the problem."

  69. Building, Tubes, What's Next, Hamster Wheel? by fernandoh26 · · Score: 1, Funny
    "The internet is like a fifteen story building"
    Whoa whoa who, wait a minute ... I thought it was a series of tubes? Just when I was beginning to comprehend it! I believe the correct analogy is hamster wheel.
    --
    Chums up, let's do this!
  70. Re:Make 'em all speak english by Anonymous Coward · · Score: 0

    And we, dickwad, invented the computer -- so you are
    henceforth required to spell words like honour, colour
    and aluminium, correctly when using one.

    You are not at the risk of sounding like a cultural
    chauvenist, you are a cultural chauvinist.

  71. Wouldn't it be safer... by Adam+Hazzlebank · · Score: 1

    To build a standardised layer on top of DNS that translates the native alphabet in to whatever subset of ASCII DNS allows?

    That way none of the rest of the protocol stack would be disrupted and everybody would be able to enter all URLs on a standard (English) keyboard?

  72. Too troublesome... by MaWeiTao · · Score: 1

    While it would be neat to be able to have addresses using non-latin characters I think there are some fundamental problems with this.

    First of all, what happens to those unable to type non-latin characters. Windows and OS X both support such text entry, but how the hell would a non-Chinese speaker know how to type anything, assuming they know how to set it up or even choose the proper language? Automatically the vast majority of the World's population is for all intents and purposes blocked from Chinese sites. In most cases it may not matter, but the fact is that people will be indirectly denied access to some websites.

    Secondly, aren't there fundamental problems interpreting non-latin characters at the OS-level? As I'm sure most people here know, non-Latin characters are formed by a string of what essentially looks like nonsense characters. If even one of those is lost for whatever reason that character goes missing. How will the browser, let alone a server know that a string of characters is an actual character and not gibberish. And what if it happens to coincide with something in some other language? Then there are other problems like Traditional Chinese versus Simplified Chinese, versus Japanese. These languages all share characters, but they don't interact meaning you can't copy a character typed in Japanese and paste it as Traditional Chinese.

    Then there are all the forms of encoding which add to the problem. Forcing everyone to use UTF-8, for example, would cause huge problems in Taiwan because few people use it. And I think compared to some other forms of encoding it tends to have problems.

    So are all non-Latin languages included? What about languages like Mongolian which are written vertically? I guess they could use Cyrillic, but then if they're going to do that they might as well just stick with latin.

    The Chinese the writing system, and any logogram-based language, is not suited to computers. It's far too complex to be practical. At least no English-based operating system isn't. But I've yet to see anyone try to make an OS specifically designed for Chinese or Japanese. What for? They've adopted the Latin system fairly well.

    Not to get into wacky conspiracy theories, but I can't help but think that this is more of a political move to undermine Western control of the internet.

    I think a more practical solution would be to device a larger character set that can accommodate most major languages but mainly derived from the Latin character set, which nearly everyone already uses with no problem. Perhaps some day we'll have a universal writing system, but we're a long way off from seeing that implemented.

  73. Welcome to 2006, guys! by Schraegstrichpunkt · · Score: 1

    Geez! It's like nobody on Slashdot has ever heard of UTF-8 or Punycode.

    1. Re:Welcome to 2006, guys! by Alcari · · Score: 1

      Guess I'll see you soon on moc...www good luck.

    2. Re:Welcome to 2006, guys! by Alcari · · Score: 1

      ok, insert random arabic words in place of ... for proper snipe remark. it seems slashdot is just as bad as the ICANN

    3. Re:Welcome to 2006, guys! by Schraegstrichpunkt · · Score: 1

      +A50DtQOnA8Q +A8QDuQOcA74 +A7wD2yEu UTF-7

  74. Re:Make 'em all speak english by Anonymous Coward · · Score: 0

    > largely anglophone, largely UTF-8, or nothing

    You probably mean Latin-1 (or a subset thereof in the case of DNS) not UTF-8 which is a 1 to 4 byte character encoding capable of representing non-latin languages.

  75. Re:Internet: Made in the US of A by erikvcl · · Score: 1

    It's a good thing my preferences are set to give trolls +3... otherwise I would have missed your post. So true, so true, what you say.

  76. Latin characters, not English! by Anonymous Coward · · Score: 0

    Why are they called English letters? In 700-600 BC when people of Rome used them regularly, the inhabitants of British Islands were backward illiterates! For that matter if you call them English why dont name them Spanish, French, Czech, Croatian, or even Turkish! Most of Europe use them, AND THEY DID NOT GET THEM FROM THE ENGLISH!

    Did the Roman Empire leave the Latin characters in its will to the British Empire or what?. Maybe all nations using Latin charaters should pay a license fee to Britan and/or the US!

    1. Re:Latin characters, not English! by daverabbitz · · Score: 1

      Almost all other countries which use latin characters have accent marks which are not valid in DNS.

      To my knowledge english is the only language to use the latin character set without accents.

      --
      What could be better than a jet powered motorcycle? http://www.youtube.com/watch?v=u8l6GTHLSWE
  77. Punycode? by Anonymous Coward · · Score: 0

    I thought this was solved by Punycode, RFC 3492 http://www.ietf.org/rfc/rfc3492.txt, over three years ago. It is "Standards Track"; all major browsers support it; and it does not break the entire Internet.

  78. don't go to s1ashdot.org it's poison by davidwr · · Score: 1

    It's pop-up hell.

    One of the pop-ups says:

    callto://JOIN_THE_GNAA__2005_RECRUITMENT_DRIVE

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  79. We don't want any foreign temperatures here. by Anonymous Coward · · Score: 0

    Begone with those those foreign temperature scales, say I. We'll have no Celsius, Fahrenheit or Réaumur here. And Centigrade sounds suspiciously metric, and we'll have nothing to do with Bonapart's schemes.

    Stick to a proper British Scale, named after a Scottish Lord, no less, with water freezing at 273.15 and boiling at 373.15!

    1. Re:We don't want any foreign temperatures here. by AK+Marc · · Score: 1

      What about another Scott, William John Macquorn Rankine? Not a lord, so you have something against him? What do you have against us commoners? Screw you and your elitist measurements.

  80. Go get your own internet! by Bob-taro · · Score: 1
    I'm not being sarcastic. There's nothing to stop these countries from building their own networks that support unicode (or whatever character set) domain names. How many western users can display arabic, say, in the URL, much less read it or type it in? If there is really a lot of demand for this, that is what will probably happen.

    My first thought would be to use unicode, but then TFA pointed out a big problem: URLs that LOOK exactly the same but are not (as in, Oh, that's "ebay" in Latin Extended-A, not the Basic Latin "ebay"). Unicode is great for displaying things, but bad for uniquely identifying things. How many domains would you have to register?

    --
    Prov 9:8 Do not rebuke mockers or they will hate you; rebuke the wise and they will love you.
  81. Paul doesn't know what he's talking about by brunes69 · · Score: 1

    'The internet is like a fifteen story building, and with international domain names what we're trying to do is change the bricks in the basement.

    Who ever heard of building a fifteen story building out of tubes?!?!

  82. We already have a solution... by brownsteve · · Score: 1

    ...it's called Punycode. It's just a way of encoding Unicode into the 37 characters supported by normal DNS. Firefox, Opera, Safari, and IE7 give a transparent implementation, all with different protections against homograph attacks.

  83. A bit late? by 1729 · · Score: 1
    Given that some societies have used non-Latin characters for thousands of years, is this a bit late in coming?
    Yes. DNS should have supported non-Latin characters thousands of years ago.
  84. Easy to fix. by rumith · · Score: 1

    Just introduce a restriction according to which a valid URL can only contain symbols from one alphabet. I believe it's not too hard to determine http://www.unicode.org/charts/ which character set does a UTF-8 code belong to, and if the URL uses more than one.

    1. Re:Easy to fix. by Agelmar · · Score: 1

      That might work, except that the original 256 characters (all the roman characters, e.g. a-z) are duplicated all over the place in unicode, and so you can still wreak havok while staying within a single section of unicode.

      But why do you want to restrict people to a single section? That's a hack, not a good thing to do. What if I want to have www.hello.co.jp? I'll give you a hint - Japanese companies think it's a great idea to throw random English words everywhere, even if many people have no idea what the hell it means (including english speakers), and you often see english mixed right in the middle of hiragana and kanji. This is the way things are done, and you shouldn't restrict it just on a whim.

  85. more than 37 characters available by joenobody · · Score: 0

    Currently there are only 37 characters usable in DNS entries

    Wrong. The usable characters are 0-9, a-z, A-Z, period (.), underscore (_), and dash (-), so 65 characters are usable in DNS entries. I know the 37 number came from TFA, but it's still flat wrong.

    --

    1. Re:more than 37 characters available by Anonymous Coward · · Score: 0

      Hostname lookups are case-insensitive. While you can use both a-z and A-Z, they do not differentiate hosts.

    2. Re:more than 37 characters available by psmears · · Score: 1
      Wrong. The usable characters are 0-9, a-z, A-Z, period (.), underscore (_), and dash (-), so 65 characters are usable in DNS entries. I know the 37 number came from TFA, but it's still flat wrong.
      At best you're twisting the truth, and you know it :-). Domain names are case-insensitive, so 'A' and 'a' are, for these purposes, the same character. RFC1035 forbids the underscore character in domain names, and many registries follow that practice. The period (.) has a special meaning as a separator/terminator, and so doesn't really count either. So the 37 number is reasonable, even if the definition of what it's counting wasn't made 100% precise...
    3. Re:more than 37 characters available by brusk · · Score: 1

      Period? Aside from case insensitivity (you clod), a PERIOD can't be part of the domain name, excet as a separator.

      --
      .sig withheld by request
    4. Re:more than 37 characters available by mkcmkc · · Score: 1
      Hmm. 'a' and 'A' are not distinguished in hostnames, and '.' is a special character. '_' didn't used to be valid, even though a lot of people used it anyway. Maybe it's now RFC?

      By my count, that gives 37 or 38 unique characters.

      --
      "Not an actor, but he plays one on TV."
  86. Attitude problem by iamdrscience · · Score: 1
    "Plans to fast-track the introduction of non-English characters in website domain names could 'break the whole internet', warns ICANN chief executive Paul Twomey
    Now that's not the right attitude, he needs to stop being so pessimistic -- he works for ICANN, not ICANN't, right?
  87. The next step in anti-phishing tech? by khasim · · Score: 1

    Since we know it's going to happen, and we know how it will happen, all it will take is planning to defeat it before it is implemented.

    And it shouldn't be too difficult. Just compare the preferred unicode of the browser to the unicode of the URL and put a banner or something at the top saying that there may be a phishing problem when they don't match.

    Even better, have an option to auto-deny any javascript or java or anything else for sites that don't match the unicode of the browser.

    In fact, get the big players in the browser market together TODAY and get them to agree on the standard response that will be generated when the unicode doesn't match. That way everyone will only have to learn ONE warning for this attack.

    ICANN should tie that standard to the release of their standard. Until the browsers agree, there will not be any change in the domain names. That way the various countries can put the pressure on the browser people to get the standard out.

    1. Re:The next step in anti-phishing tech? by Agelmar · · Score: 1

      But how do you want to compare everything to make sure it matches? Is www.metro..jp the same thing is www.metero.tokyo.jp? It's not quite as simple as you make it sound. Even for (Tokyo), there are characters in the chinese codepage that are almost identical, and really are identical at the font / resolution most people use...

  88. Re:Yes and No - thinking long term by Lemmeoutada+Collecti · · Score: 1

    We are already compatible with Klingon porn, just do a search for knives, swords, batleth (no unicode on Slashdot working for me), etc.

    For Vulcan compatibility, open your favourite raw file viewer and set it to Binary mode.

    Martian porn is (apparently) covered by the Periodic Table of Elements, specifically the permutations of Illudium and Pu.

    Enjoy your new worlds of entertainment!

    --

    You can have it fast, accurate, or pretty. Pick any 2.
  89. Why is everyone... by Sinbios · · Score: 1
    ...trying to make it seem enforcing English-only URLs are forcing people who cannot understand English to learn a new language? All they have to do is learn to reproduce the URL with their keyboard; nobody really needs to understand the meaning of the URL to do that. When you think about it, is google an English word? Let's pretend it's a word in Martian, does that mean we need to learn Martian to get to www.google.com?

    An URL is just a representation, anyway. Sure, if its meaning actually has some relation to the actual content, it might serve someone who tries to find websites by entering www..com better, but most people I know use search engines, anyway. In fact, as a native Chinese writer, I find typing the occasional Chinese character more time consuming because I have to switch to the Chinese IME; I'd rather URLs are all English.

    --
    Anyone can "stand up for what they believe", but it takes a very brave individual to change what they believe. - Loundry
    1. Re:Why is everyone... by PitaBred · · Score: 1

      Can someone mod the parent up? It's not Amero-centrism, it's just pragmatism.

    2. Re:Why is everyone... by DerPflanz · · Score: 1

      I'd rather URLs are all English.

      Of course you mean: in the latin charset. We (dutch) use that same alphabet and we (dutch) have our own country namespace (.nl; having relatively the most registered names in a country namespace in the world) and we use our own language (dutch) for that.

      --
      -- The Internet is a too slow way of doing things, you'd never do without it.
    3. Re:Why is everyone... by Sinbios · · Score: 1
      No.

      1) What encoding do URLs use? ASCII. What does ASCII stand for? American Standard Code for Information Interchange. What do Americans use? English. ASCII was specifically created with English in mind, as it includes the 26 letters used in English, but none of the other fancy/accented characters from other popular Latin-derived languages, such as French or Spanish (where are the ç or ñ?).

      2) Dutch is not a Latin-derived language, but German-based. Also, doesn't Dutch have 27 letters?

      3) The classic Latin alphabet contained only 23 letters, not counting uppercase.

      And as PitaBred kindly pointed out, I'm not being Ameri/Anglo-centric; I was born and partially educated in China, and naturalized, as well as currently residing, in Canada. But you really can't deny that the DNS system was created with the English charset in mind.

      --
      Anyone can "stand up for what they believe", but it takes a very brave individual to change what they believe. - Loundry
    4. Re:Why is everyone... by DerPflanz · · Score: 1

      1. The 'American' in ASCII says something about where the standard was made, nothing about the information it conveys.
      2. No. It has 26 letters. German has 27; it includes ringel-s, which is being used less and less now. Dutch is a Germanic language, but has a lot of influences from Latin as well. Besides, English isn't a Latin language as well. The 'latin' in latin charset refers to the alphabet. The letter symbols I am using right now are called 'Latin characters' (as opposed to Greek, Hebrew, etc).

      The DNS system might be designed with the English language in mind, it most definately is not limited to that language.

      --
      -- The Internet is a too slow way of doing things, you'd never do without it.
  90. FUD ... just implement IDN everywhere by jhermans · · Score: 2, Informative

    see http://en.wikipedia.org/wiki/Internationalized_dom ain_name

    IDN is backwards compatible with existing DNS-servers, and has been in use for several years. Mozilla, Firefox, Safari and Opera support it. So does Internet Exploder 7.

  91. Wrong! I have a typewritter with ! by Anonymous Coward · · Score: 0

    I just checked, I found it in my cellar, and it has ! (it was produced in the thirties in Germany)

  92. Re:Changing a system ORRR by davidsyes · · Score: 1

    As inconvenient as this may be, create a response form from which the person chooses the desired or intended URL. For example, one seeks a site with no accent, but 3 variations exist. The look or search would bring them all back and prompt the user to select one. Helpfully, their email entry or search term will precede or follow the domain or URL as applicable (in a locale or language rules set).

    Right now, when e-mailing, people still incorrectly enter company names, even in English. So, what is their response? They fall back on a business card, a PDA entry, some scrap of paper, or even a search engine, or a phone call, or out of laziness, they give up. But, a mailing interface smart enough to seek the known registry of companies and non-commercial sites could bring back the options and let the user decide.

    Currently, any foot-dragging on the issue smacks of "pre-eminence" or "we had it first and WE'LL decide what and when." Obviously, that didn't hold back on Japan or China. Now, these other countries want technology to be used to resolve their exclusion in what is a language-set exclusive club. By keeping the web to a handful of languages, particularly in English, it almost could be seen as wanting to ensure that English is the default language for business and international communications. If accented languages enter the scene, then all sorts of unforseen permutations in business, communications, and economics might occur, much to the dismay of some (certain governments?).

    Personally, I think there is PLENTY of technology and brainpower to have solved this issue 5 years ago. There just was no pressure. Now, I don't by any means think ONE single person is holding this up. I think there are many forces and "interests" asking that this be deferred as long as possible. Normally, geeks and techies like challenges. This is a big-as-hell challenge if there EVER was one, yet it's being stonewalled, just like Asian languages are or have been. If Linux can support dozens of languages at the desktop (heck, Mandriva offers multitudes from which to select not only for install, but for desktop use), then why cannot ICANN and the registries follow suit? Oh, umm... yeh...

    --
    Previously: "Linux... Toward the Sunrise..." Now: "Linux... Toward the-- No, now, part of Every Sunrise"
  93. Hey ASSHOLE, who's the fucking TROLL? by Anonymous Coward · · Score: 0

    Hey ASSHOLE, who's the fucking TROLL? Why don't you pull your FUCKING panties out of your ASSCRACK? Or is that the only way you can get sexual gratification?

  94. Re:Misunderstanding Air Traffic Control by Anonymous Coward · · Score: 0

    Air traffic control is not something that every other luser engages in. It is not meant for real communications. There are approximately 300 phrases that you are allowed to use, and because Americans got there first, it's in English. It could be in Korean, and there would be no difference. If you ever fly an airplane, you will see that clear communication is paramount to your safety. The internet has many purposes, and it makes no sense to continue to use the same character set.

  95. Re:Make 'em all speak english by benoitg · · Score: 1

    Well, even restricting to the G7, 4 out of 7 have characters missing from the 37 allowable in DNS:
    Canada: à â ç é è ê ë î ï ô û ù ü ÿ missing
    France: à â ç é è ê ë î ï ô û ù ü ÿ missing
    Germany: ä, ö, ü and ß missing
    Italy: All letters covered (as far as I can tell)
    Japan: Most letters missing
    United Kingdom: All letters covered
    United States: All letters covered

    But restricting to the G7 is very convenient. How about the most popular languages by native speakers (taken from wikipedia)
    Mandarin: 672 million native (old statistic)
    English 425 million
    Spanish: 390 million
    Arabic: 272 million
    Indonesian: 222 million native
    Portugese: 210 million
    Bengali: 194 million
    Khariboli: 180 million
    Russian: 145 million
    Japanese: 130 million
    French: 120 million
    Persian: 101 million
    German: 100 million native

    The hilarious thing is that English is the only language out of these 13 to have no issues at all with only having the 37 dns characters.

  96. charset by tobyvoss · · Score: 1

    man! could we, as a single species, possibly agree on a single charset?
    redundancy is such a drag when paired with über-specificity (talk about two bazillion words for snow, or sand, or pr0n).

    1. Re:charset by joto · · Score: 1
      man! could we, as a single species, possibly agree on a single charset?

      No. (And neither can we agree on a single language, a single style of haircut, or a single preferred flavour of ice-cream)

      redundancy is such a drag when paired with über-specificity (talk about two bazillion words for snow, or sand, or pr0n).
      • snow: slush, sleet, powder, ice, frost, rime, (snow)drift, (snow)storm, (snow)flake, etc...
      • sand: dust, earth, soil, (dust)storm, (sand)dune, etc...
      • porn: erotica, adult entertainment, scat, rimming, ass-to-mouth, etc...

      If you can't be bothered to be as specific as that, I guess you can't be bothered to specify whether you want potato or rice for dinner either, as they both count as vegetables.

  97. Re:Um... why? THAT... by davidsyes · · Score: 1

    ... was very, very nicely put. It, to me, just underscores that any foot-dragging on the accented and Chinese/Asian character sets adoption would be an unacceptable denial of additional color and nuance to the ways of accessing the Internet.

    --
    Previously: "Linux... Toward the Sunrise..." Now: "Linux... Toward the-- No, now, part of Every Sunrise"
  98. Security & Compatibility Problems. by ezh · · Score: 1

    1. Since lots of characters in different languages are look-alike, this would create lots of security problems with character substitutions. There was a demonstration a year ago or so, with the registration of paypal.com and obtaining a legal ssl certificate for it, where a where non-latin. when user cannot distinguish between the two, how can he or she trust any site on the internet?

    2. Creating domains in different languages is also bad idea for collaboration. It will create unnecessary internet segments. Essentially, now people of the world use latin characters for accessing websites and sending emails. How in a world an English speaker suppose to type chinese or arabic characters of a domain to send an email if he or she doesn't know the alphabet or doesn't have a keyboard support installed? No way... so all these domains should really be a supplement to normal latin domains if you want to collaborate with the rest of the world...

  99. some do, some don't by davidwr · · Score: 1

    Some only had uppercase even. I guess those did have a "1".

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  100. What are you willing to learn? by Anonymous Coward · · Score: 0

    I've read peoples arguments about needing change, as it's a barrier to people who don't use Latin characters as the standard for their language.
    However, we're looking at multiple thousands of characters potentially being allowed.

    Personally, I think it's not much to ask the non Latin based to learn 37 new characters, rather than asking the whole planet to learn thousands. Not to mention the fact that massive amounts of information will likely become unavailiable to large numbers of people.

    With the current system, even if I end up at a site with foreign language, I can at least babel it, and get a general idea of what's going on. A change would likely prevent me from getting there in the first place.

  101. Re:Make 'em all speak english by igb · · Score: 1
    The German beta-symbol can be, and is, replaced with ss. The umlauts can be, and are, replaced with a following e. Canada and France definitely, Germany I think also have the property that accents aren't used in runs of capitals, signwriting or other non-continuous text. So in fact, of the G7 countries, only Japan has a significant problem.

    ian

  102. The GNS System? by Kadin2048 · · Score: 5, Interesting

    Kind of an interesting point. Maybe we should just let Google run the DNS system, and just replace it with a giant search engine. If we make actually typing in a web address hard enough, then that's what we're effectively doing anyway: people will just start typing everything (including the domain name of sites they want to go to) into the Google Search box at the top of their browser window, instead of the actual address bar.

    Actually, DNS arguably is a giant search engine, which simply works on a 1:1 relationship and uses a distributed database (you input one piece of information, and it gives you some corresponding piece of information back). Replacing it with a 'fuzzier' search engine that would give you back a number of results, ranked by relevance, isn't that huge a leap.

    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
  103. Yes, why? by Infonaut · · Score: 1

    ... insist on a common language.

    Why has this not been modded up as +5 Absurd yet?

    I think we should also standardize all programming languages. Let's just use C++ and be done with it.

    I also find comedies to be useless. All fiction should be tragedy.

    Currently there are more Chinese than anyone else, so we should just wipe out all non-Chinese. It would make things so much more efficient.

    --
    Read the EFF's Fair Use FAQ
    1. Re:Yes, why? by Colin+Smith · · Score: 1
      I think we should also standardize all programming languages. Let's just use C++ and be done with it.


      Computer languages are instructions to a computer, they aren't a human communication medium and so serve a different purpose. It's a strawman argument as is your comedy one.

      The purpose of human language is communicate with other human beings and the more languages there are the more difficult that is. Let me make it clear. Languages make communication more difficult. Not easier. They are not beneficial. They are not good things unless you think additional barriers to communication are beneficial or additional ways of dividing us from them is beneficial.

      I'd happily use Chinese if it was the only one, I don't really care which it is. However there are more english speakers than chinese speakers and written english is orders of magnitude simpler than written chinese.
      --
      Deleted
    2. Re:Yes, why? by Cro+Magnon · · Score: 1
      I'd happily use Chinese if it was the only one, I don't really care which it is. However there are more english speakers than chinese speakers and written english is orders of magnitude simpler than written chinese.


      Do you have a source for that? Everything I've read says that Mandarin has far more speakers than English. I've also read that spoken Chinese isn't any harder than any other language, though the lack of common vocabulary makes it a challenge for English-speakers.

      I do agree with you about written Chinese. Keeping all those pictures straight would be MUCH harder than writing any language with an alphabet.
      --
      Slow down, cowboy! It has been 4 hours since you last posted. You must wait another few hours.
  104. Back in my day by Anonymous Coward · · Score: 0

    Consider yourself lucky they allow anything other than ones and zeros!

  105. Re:Make 'em all speak english by Colin+Smith · · Score: 1
    The hilarious thing is that English is the only language out of these 13 to have no issues at all with only having the 37 dns characters.


    Um, that'd be because it's a meritocracy rather than a democracy. The johnny come lately others couldn't be bothered inventing the system in the first place.

    --
    Deleted
  106. Re:Anything's possible. by gd23ka · · Score: 1

    I didn't find anything in the WP article. I can only guess, but are you saying that
    they used Office 97 to create false documents?

  107. Latin??? Not even that! by dcs · · Score: 1

    The present character set, actually, does _not_ cover Latin languages! In fact, it only covers English. I don't know of a single other language that doesn't have either accents or characters not present in English.

    You should rewrite the topic to say "ICANN Under Pressure to Include Other Languages Besides English", but that might give the impression that what we actually have doesn't really cover over half of the Internet, right?

    --
    (8-DCS)
  108. Re:Changing a system ORRR by AK+Marc · · Score: 1

    Obviously, that didn't hold back on Japan or China.

    And how did you come to that conclusion? Do you know what the most popular email/blogging sites are in China? Many of them are numbers. Try something like www.163.com. This is easier than having some romanticized Chinese (and no, Pinyin, the official romanticized version, is not allowed either, as it requires accents, so it would be some bastardized Pinyin or other such setup). Are they still doing business? Sure. Are they inconvenienced at all (held back)? Yes. It didn't stop them from getting online, but it is restricting their ability to use the Internet as freely as we do.

  109. Never touch a running system. by aix+tom · · Score: 1

    I would leave DNS for IPv4 as it is. Build a new DNS for IPv6 from the ground up, with all things implemented new and no backward compatibility. Keep both of them separate, and phase the old DNS system out when IPv4 is phased out.

  110. Re:Misunderstanding Air Traffic Control by PitaBred · · Score: 1

    the Internet is meant to enable communication. Communication requires common standards. When you start including uncommon things (Unicode), then you end up breaking the communication.

  111. Why not? by khasim · · Score: 1
    But how do you want to compare everything to make sure it matches? Is www.metro..jp the same thing is www.metero.tokyo.jp?

    It's the URL.

    If the unicode of the URL does not match the browser's default unicode, then throw up the standard warning.

    I don't care about "almost identical". We have that already with "l1" and "O0". I'm talking unicode.

    If the unicode of the URL does not match the default unicode of the browser (because people should be most familiar with the language they browse in) then throw up the standard warning, disable java and javascript and activex and anything else until the user approves them.

    It is that simple.
    1. Re:Why not? by Agelmar · · Score: 2, Informative

      What do you mean by "if the unicode of the URL does not match the default unicode of the browser"? The point of unicode is that it is uniform - there's only one. It is broken up into sections, and perhaps that's what you meant to say, but even that won't work.

      Let's take Japanese as an example, and I will give you two reasons why it won't work.

      Perhaps if you assume I am Japanese, you will assume that my "default unicode section" is the section containing the Japanese characters. So this works fine if I go to URLs that use hiragana / katakana / kanji, but what if I go to www.google.com? Or www.washingtonpost.com? Or www.citibank.com? (Yes, there are Citi offices in Japan). Are you going to throw up a phishing warning simply because I'm browsing an international site? Because if you do that, you're going to make people so used to seeing those warnings that they will just ignore them and/or turn them off.

      Even if your method did work, however, this would still be easy to get around. The original 256 characters are repeated many times, and it just so happens that in the full-width forms (in the CJK sections) they are repeated again. I.e. I can use the letters a-z while still staying within the Japanese section of Unicode, and although these letters are the same visually, they are a different character in the Unicode charset, so you could easily have www.google.com and www.google.com registered entirely in the first 256 characters of Unicode or entirely in the full-width form section of Unicode, and there would be no discrepancy whatsoever.

      The problem is a lot more complicated than you make it out to be.

  112. But *IS* there a decent way to enter Chinese chara by HiThere · · Score: 1

    But *IS* there a decent way to enter Chinese characters? Last I heard people in China (& Japan) used Roman letters because as clumsy as the were, they were easier to enter than the Hiragana. (I think I got that right. Of course, even if I did that would be Japanese only, and it doesn't include Kanji.)

    With most languages the problem is different. European languages all have a small and well defined set of characters..so small that they could ALL have been handled by a one-byte code. Arabic languages are more difficult, as they have multiple different context sensitive forms for certain letters. (And that's the extent of my Arabic.) Hebrew is more amenable to computerized representation...but it still means that you need to go beyond the one-byte limit (if you want to include the other languages).

    OTOH! Sanskrit, Javanese, Minoan, Burmese, etc. 32 bits isn't quite enough to handle everyone. So now you're using 32 bits for each character transmitted, unless you use a culturally biased form. You've just quadrupled the overhead on the DNS.

    Yes, it's possible. I don't really think it's a good idea, though. If you were just proposing the UTF-8 subset then this argument wouldn't apply, but it would then STILL be culturally biased, and it would have all sorts of "false twin" URL mappings. And various other problems (see the other posts).

    --

    I think we've pushed this "anyone can grow up to be president" thing too far.
  113. Resist! by smcdow · · Score: 1

    One character == one byte, dammit!

    Don't let them change that.

    --
    In the course of every project, it will become necessary to shoot the scientists and begin production.
  114. Well said! by Anonymous Coward · · Score: 0

    It's exactly the point I wanted to make -- but you got there first.

  115. Finland has used it for more than a year by grimJester · · Score: 1

    here.

  116. Why ICANN? by Anonymous Coward · · Score: 0

    Why does ICANN need to do this? The task is supposed to go to the national domain registries, which control country-level domains.

  117. Re:Late in coming? Æ Ø Å in domain by HiThere · · Score: 1

    That's not at all what people have been saying. I'll admit I'm not sure why not, but that's not at all what I would expect, either. None of those are part of ASCII-7 (though they're in lots of the 8-bit expansions).

    --

    I think we've pushed this "anyone can grow up to be president" thing too far.
  118. A bit of a whine? by Anonymous Coward · · Score: 0

    Nobody complains that Java and C keywords are all English, do they?

  119. Arabic on the Internet by DavidApi · · Score: 1

    With all respect to the Arabic speakers out there - but isn't Arabic already supported in DNS? For example I can type in this Arabic address already: http://17.112.152.32/

    Works perfectly fine.

    But seriously, if the world can adopt Arabic numerals as a standard for counting things, perhaps it should consider the humble English language for DNS domains. Even the Asian countries use Arabic numerals.

    But I *do* wish everyone would use good British/International English, not that poorly spelt American cuckoo. :-) (e.g. colour, neighbour, favour etc).

  120. Re:Changing a system ORRR by davidsyes · · Score: 1

    Whupps.. Yep, you now remind me that I have seen numeric sites. I sit corrected. I guess I'd gotten too much of Sina and Baidu...

    But, when I use Google to search for a Chinese Character and get pages back, they have English URLs, true.

    http://www.google.com/search?q=%E5%8A%A9&ie=UTF-8& oe=UTF-8

    http://www.dongailbo.co.kr/docs/magazine/weekly/20 05/01/03/200501030500036/200501030500036_2.html

    http://www.91985.com/
    http://www.chosun.com/

    and I realize now that I'd forgotten that english is STILL in thees sites' names.

    But, honestly, I could swear that in Mozilla I'd seen Asian or Korean fonts in the URL/location bar only a few weeks ago. I'd been playing around with explicitly searching for information using Korean fonts in the URL at:

    http://bemil.chosun.com/

    I'll have to reinstall Mozilla and recreate my steps.

    --
    Previously: "Linux... Toward the Sunrise..." Now: "Linux... Toward the-- No, now, part of Every Sunrise"
  121. Re:But *IS* there a decent way to enter Chinese by AK+Marc · · Score: 1

    But *IS* there a decent way to enter Chinese characters? Last I heard people in China (& Japan) used Roman letters because as clumsy as the were, they were easier to enter than the Hiragana.

    Have you ever watched someone type Chinese? They enter in the romantic letters, but then select the Chinese character from the list of ones that match the possibilities. There is no one-to-one match, so what is typed in letters does not indicate the character, it is the person that picks from a list that is narrowed by what is typed in.

    and it would have all sorts of "false twin" URL mappings.

    If I were Emperor of the Universe, I would map all the visually similar characters to a single character, and map all incoming requests to the same single character so that the English A and [whatever other alphabet with the same look but different code for A] would all be mapped to the single code. No one would be able to have multiple codes for the same character image because they'd be "cleaned" in DNS and web browsers (cleaned in web browsers for convenience, cleaned in DNS for "security" by restricting ambiguity). That doesn't seem hard or time consuming. One person in a week and a couple lines of code and we're done. And I only came up with that while constructing this response, if you gave me another 10 minutes, I'm sure I could come up with 10 more ways to prevent the same problems others are complaining about. It always kills me how people presume that if they can't think of a fix to a problem that there must not be a fix to a problem.

  122. Re:Make 'em all speak english by benoitg · · Score: 1

    As a native french speaker, I can tell you that accents ARE most definitely used in signs and capitals. Even in acronyms now (pretty much always have been in Canada, relatively recent in France)

  123. Let's redefine the byte while we're at it ! by wtarreau · · Score: 1

    I'm really fed up with those stupidities.

    The DNS addresses alphabet was defined as 37 latin chars. And people complain that to be able
    to enter this system, they want to extend it. That's as shameful as to define that there
    are not enough IPv4 addresses for everybody, so let's extend the alphabet of IPv4 addresses to
    1000 values for each byte, so that we will be able to have addresses from 000.000.000.000 to
    999.999.999.999. Ah, it doesn't fit a byte ? let's extend the byte to 10 bits! After all, it's
    already what has been done with this stupid UTF8 which breaks every mailer, including
    those handling it natively !

    There's a system which works as a whole and which relies on sensible technical specifications.
    Changing some of them can have huge impacts on everything. Look at all the junk mail you all
    receive with unreadable chars. We really don't need to get unreadable addresses.

    And yes, I do already use accents and some characters that don't enter the ASCII map in my
    every day life, but when I use the net, I conform to established standards. What will happen
    IMHO is that people using such new an unsupported features won't have an easy access to the
    rest of the net, so they will finally avoid it. On the other hand, spammers and phishers will
    intensively abuse the new mechanisms. I'm sure we'll see new security options in all common
    tools to allow or block usage of such domains...

    That's becoming very very sad.

  124. Speculators Buying Up IDN Domains by miller60 · · Score: 1

    There's already an active market in Internationalized Domain names (IDN) with many domain speculators buying up names for investment and/or parking purposes. Sites focused on this niche include IDN Forums, the IDN Blog and the IDN Domain Blog. A lot of companies and web sites will go to register their trademarked terms in these multilingual domain names and find that they're long gone, and getting them back could mean international litigation. I think it hasn't even occurred to many corporations to register IDN domains.

  125. Unicode worries by dovgr · · Score: 1

    Using RTL (right-to-left) characters in DNS names is a bad idea. In a Unicode context, once you start using RTL characters in an environment that is not exclusively RTL you need to do bidirectional reordering converting from logical order to visual display order. This causes neutral characters like the dot character to jump to the left side of the word or the right side of the word depending on what character logically proceeds it. Sometimes the only way of forcing the display to look as you intended is to insert various zero width characters, like Right-Left-Mark, Right Left Mark etc. Another problem occurring e.g. in joined languages like indic languages and arabic languages is that you sometimes need to solve ambiguities by using other zero width characters like the ZWJ (Zero width joiner) and the ZWNJ (zero width non joiner). Is the proposal suggesting to allow these zero-width characters in DNS names as well? I certainly hope not... Though English is not my native language, I certainly hope that the current DNS subset will remain.

  126. It's a keyboard problem too by caseih · · Score: 1

    There is at least one reason, why non-latin letters in DNS names are a bad idea: International keyboards always support at least two character sets: latin (english) and whatever native character set. Thus even if the user is localized in Cyrillic, if I publish my DNS name using latin characters, I know that he and other people all over the world can reach it. If, on the other hand, I was a businessman in China, I could possibly create a nice domain name that was entirely in chinese, even if it was well-known name that was recognized across the world. Now suddenly only people who happen to a) read chinese and b) have chinese character support turned on in their OSes (input support, not display) can access my site. Whereas I could simply transliterate my name into latin characters and reach everyone, without requiring special input methods or skills on the part of the end user.

    Certainly I don't want to have add big-5 support to my Linux install and all the various input methods just to visit a site of guy in China that wants to sell me radio-controlled electronics for my hobby (I do want to buy from him though).

    1. Re:It's a keyboard problem too by joto · · Score: 1

      Look. If he wants his domain name in chinese, he is going to write his fucking website in chinese. And if he is selling something, he won't fucking care about one fucking american customer who believes he is the center of the world, and can't be bothered to learn chinese first.

      Ask yourself: How would an american company react to someone sending in an order for some cheap electronic toy, if the order was written in chinese?

      If you absolutely want to buy electronic toys from a chinese company, serving chinese customers, selling chinese goods, shipping to chinese addresses, and accepting payment in chinese yuan, then get yourself a chinese buddy to order for you, and ask him to send it to you.

      And if you are still in doubt about this, I can find plenty of american shopping websites, who simply refuse to ship orders to Europe (even if the europeans ordering stuff from them probably knows american-english better then their average customers). And even for a small country like Norway (where I'm from), you'll find plenty of shopping websites that refuse to ship outside of Norway too.

    2. Re:It's a keyboard problem too by caseih · · Score: 1

      Sorry but you're completely missing the point. If a country wants to have its own domain names in its own character set, internal that country and not globally available, then that is fine, since no one outside that country (non-speakers anyway) will be able to reach the site anyway (because they cannot type it). But to lobby the entire DNS system to change in a way that will fundamentally break the interconnected nature of the internet is silly. I also never said that English should be the language. I said that the Latin character set should be the default, for obvious reasons. Your comment about an American company reacting to an order in Chinese is somewhat of a red herring. If an American company truly wanted to do business with the Chinese (and if Asian characters were the basis of the default technology) then they would do well to find someone who could read it.

      The point is that the fundamental, underlying technology (the computer and keyboard itself) is based on the Latin character set. Hence it forms the lowest comment denominator of the basic, core technologies that drive computing and the internet, across the entire world. Thus to change DNS on a global scale just isn't practical or even desirable.

      So stop thinking that I'm just saying we all should be like Americans and speak English. This seems to be a knee-jerk reaction by many in the world to assume this kind of thing. For every close-minded American I know, there seems to be at least one equally close-minded non-American.

      As for speaking English well, I would not have thought your English was any different from any other profane, vulgar American, had you not said you were from Norway.

  127. Re:Make 'em all speak english by dovgr · · Score: 1

    Ehhmm... The whole of unicode can be encoded using UTF8. You mean ASCII probably. Please see wikipedia.

  128. No, it's not late by Anonymous Coward · · Score: 0

    Last time I checked, the US created it. We didn't need non-english domains, so implying that it's late is abit stupid - it was never part of the deal. I'm not so sure it's a good idea in any case - it's been trucking on along just fine without it and the thought of adding a whole bunch of different domains where some dumbass is simply adding an accent mark to a letter - this is BEGGING to making phishing even worse (it makes me wonder how much organized crime is behind the push on this)

  129. Re:But *IS* there a decent way to enter Chinese ch by Jamie+Lokier · · Score: 1

    I've watched people enter Chinese texts on cellphones in China, and it's amazing to watch. Whatever entry method they're using seems remarkably efficient.

  130. Re:Make 'em all speak english by Hognoxious · · Score: 1
    we invented the damn internet, and we speak English, and use the Latin-1 character set.
    ... and if Latin-1 was good enough for Chaucer, Shakespeare & Jesus Christ, it's good enough for you!
    --
    Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  131. Crack-smokers? by clacke · · Score: 1

    This is nonsense.

    The technical solution already exists today, it already works in both Firefox and IE and some other applications, and it is already used in i.e. the .nu and .se domains. It's up to each TLD to decide what profile of domain names they accept, and no modifications of the DNS protocol is needed.

    The ICANN has nothing to do with this. Move along.

  132. Re:Make 'em all speak english by Hognoxious · · Score: 1
    Canada: à â ç é è ê ë î ï ô û ù ü ÿ missing
    France: à â ç é è ê ë î ï ô û ù ü ÿ missing
    Hmmm, what a coincidence.

    Italy: All letters covered (as far as I can tell)
    There are accented versions of e and o.

    The hilarious thing is that English is the only language out of these 13 to have no issues at all with only having the 37 dns characters.
    Another coincidence, go figure.
    --
    Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  133. What about the current IDN system? by donaldGuy · · Score: 1

    Seriously, there is already a system for simulating the presence of non-latin characters in domains... its called the Internationalized domain name system

    names are basically translated from Unicode to an ASCII encoding and prefixed with "xn--"

    anyway .. if you want more info the Wikipedia Article is pretty good.

    Why isn't this system adequate?

  134. I think the whole idea is a mistake by msobkow · · Score: 4, Insightful

    Instead of changing the fundamental DNS which is a programmer's and administrator's tool, not an advertising medium. It is founded, like programming languages, on a fundamental 7-bit ASCII character set, and is not intended to be used for NLS text.

    A far better solution is some form of VDNS that translates NLS text names into the proper domain name at the system level. That also allows the same domain to have multiple language translations to reflect localized product and service names.

    We seriously need to kick the general political community in the arse. They keep trying to impose technical decisions, and it fails as miserably as any corporate PHB's uninformed decisions. ASK the techies to propose solutions instead of shoving ill-conceived ideas down our throats.

    For example -- once you mandate multibyte domains, you implicitly mandate multibyte URL components. Goodbye direct mapping of names to the directories, file systems, and servers.

    Bad idea. Very bad idea.

    --
    I do not fail; I succeed at finding out what does not work.
    1. Re:I think the whole idea is a mistake by Anonymous Coward · · Score: 0
      For example -- once you mandate multibyte domains, you implicitly mandate multibyte URL components. Goodbye direct mapping of names to the directories, file systems, and servers.

      But this already exists - and works. IE has sent the HTTP GET request in UTF-8 for several versions now, and it works fine. I can link to Japanese wikipedia as such: http://ja.wikipedia.org/wiki/JAPANESECHARACTERSHER E (Slashdot does not allow entering Unicode characters though), and everything works like a charm. (The UTF-8 encoded string can further be url-encoded for compatibility, if necessary.)

      However I still don't advocate non-ASCII domain names. It's just asking for trouble.

  135. No kidding by Petrushka · · Score: 1
    Given that some societies have used non-Latin characters for thousands of years, is this a bit late in coming?

    No kidding. However, how come you're making a comment like that when Slashdot still refuses to allow non-Roman characters in posts, and insists on using charset=iso-8859-1? ... Pot, meet kettle.

    1. Re:No kidding by n6kuy · · Score: 1

      Yes indeed. A little bit late.
      If iso-8859-1 was good enough to encode the KJV Bible that the Apostle Paul used, it's good enough for me.
      Any other charsets are obviously tools of Cthulhu, and are meant to deceive the very elect!

      --
      If you disagree with me on social issues, then it's pretty clear that you are a narrow-minded bigot.
  136. fuck you world by minus_273 · · Score: 1

    Seriously, this is one of those times i really want to say fuck you to the world. Considering the DNS is based in the US, Latin it is. Make yout own DNS that no one uses if you want another. Seriously having the same character in maultiple chracter sets visually indistuingishable from eachother is the best thing that will happen for phishers in ages. How do you know yourbank.com is actually yourbank.com in latin and not in some other characterset. Its not just for latin either. Imagine Japanese Kanji and Chinese in simplified then in traditional encodings. Nice! you can have nihongo(kanji) .com written in 3 different encodings and can send you to three differnt sites. How do you know you are going to nihongo.com and not ribenyu.com.

    Jeez this is the most retarded thing i have heard of since the UN threat to split the net a few months back.

    --
    The war with islam is a war on the beast
    The war on terror is a war for peace
  137. Even IDN doesn't support English by argent · · Score: 1

    If the Domain Name System supported English it would allow my name to be a domain name, but not even IDN allows me to put a space in a domain name.

    What, "da Silva" isn't an English name?

    If not, then what is? English is a mongrel language, every name in it comes from a conqueror or settler. Should we go back to the Angles and Saxons, or perhaps to a good celtic name like Cúchulainn?

    Whoops.

  138. Don't need no stinking reading for PRON!!! by tinkerghost · · Score: 1

    Of course goatse is much to easy to sucker people into if they can't read the site. I would recommend only 1/2 points for people who can't read the site.

  139. Ignorants by porneL · · Score: 1

    Imagine Internet had Hawaiian origin - you'd be wanting to break the DNS to use your "odd" characters like "c" and "d". Why would you want to "fix" Hawaiian alphabet if it's not broken for Hawaiians!?

  140. Re:Make 'em all speak english by Anonymous Coward · · Score: 0

    English native speakers = 5% of Humanity and going down. The dictature of English has lasted long enough.

  141. Your strawman... by Infonaut · · Score: 1

    ... is this:

    The purpose of human language is communicate with other human beings

    You are operating under the assumption that human language fulfills no other function beyond functional communication. There is plenty of evidence that language defines culture. Speakers of different languages do often see the world slightly differently. If you want everyone to think the same way, one language is a great idea. Personally, I prefer a more heterogeneous world, despite the friction languages create.

    Computer languages are instructions to a computer, they aren't a human communication medium

    That's an interesting interpretation of computer languages, given that they are created by humans and used by humans to create software. Choice of language informs what the human programmer can create, and has a strong effect on broader human culture. As Lessig argues, code is speech.

    You seem to be looking at language as if it merely is a conduit for information. If we were all computers, programmed merely to pass information between each other, that would be the case. For humans, I think it serves many other purposes, and the profusion of languages is good for humanity. Homogeneousness, while it seems like a cure for our maladies, doesn't necessarily help us. Americans speak English; that hasn't stopped violence, disagreement, or other forms of conflict. I would also argue that American culture is so strong in part because it is continuously enriched by influences from other cultures and their languages.

    --
    Read the EFF's Fair Use FAQ
  142. A Strange Comment by andersh · · Score: 1
    Why should they be able to use their natural language rather than English?
    You do realize that most of the world does NOT communicate in English but their native languages? What is the point to denying us the right to use non-Latin characters in domain names when the content is already in this format? You will not stop Germans, French or Russians from writing their websites in their languages because their domains names are in "English".

    The reason the Internet is useful is because everyone speaks English, the more divided it becomes the less useful it becomes.
    But we dont! Not everyone speaks or writes in English on the web! What kind of argument is that?! The Internet is useful to the local group exactly because it is available in their langauge - and not anything else! Why would a government agency publish in a foreign language? Why would a commercial website list their service in a language the customer does not understand?

    Perhaps I am not seeing your point here, I am quite tired and sleepy, but your comment is really strange.
    -non-latin north european (æøåòóöôéáäâ)
  143. In case you'd like to learn how... by l0b0 · · Score: 1

    In Windows, you can setup Alt-Shift-Number to change to a different keyboard. I've used three different ones (Dvorak, US English, and Norwegian) for three years, no problem.

    In Linux, you can even use the otherwise useless Caps Lock to rotate layouts.

    The Optimus keyboard should be another stepping-stone in making non-English layouts a whole lot more mainstream.

  144. That exists currently!!! by Mr+44 · · Score: 1
    A far better solution is some form of VDNS that translates NLS text names into the proper domain name at the system level.


    Something similar to what you are describing exists, and is called IDN ( http://en.wikipedia.org/wiki/Internationalized_dom ain_name ).

    It exists currently and is supported in all major browsers. I would like to hear more about why IDN doesn't work for international users, and why native 16-bit DNS is needed.
    1. Re:That exists currently!!! by msobkow · · Score: 1

      Another implementation is called a "Search Engine".

      It's how the vast majority of people find things. Go ahead and find the Saskatchewan Abilities Counsel website without a search engine, for example. What would that URL be?

      The push for 16-bit DNS has nothing to do with whether people from other nations can view a URL because the subset of characters valid for domain names are glyphs in every multibyte character set, using codes compatible with ASCII/LATIN1.

      The domain names and IP addressing issues are infrastructure technology, and government wind-emitters and spin-doctors don't belong in the meetings or plans to address those issues. They're not qualified.

      --
      I do not fail; I succeed at finding out what does not work.
  145. Re:Make 'em all speak english by Shaper_pmp · · Score: 1

    To be clear, I would have no problem with a Unicode DNS system if I could see a simple way around the identical glyphs/different letters problem that didn't involve making phishing almost impossible to spot or avoid.

    Alas:
    1. The very nature of the problem seems intractable (how to make two things which look identical look different, without making either one look different)
    2. Computers are already almost entirely latin-1-based - on the command-line/at a low level even if not always in the GUI layer
    3. Thanks in part to this state of affairs most languages already have widely-known rules or guidelines for rendering words in non-Latin alphabets into the Latin-1 character set - ask any foreign person you know if they have much problem communicating with their countrymen using only normal (western) qwerty keyboards - all of my foreign friends seem adept enough at it even they've never highlighted it as a problem.

    This wasn't intended as a racist "fuck 'em for not inventing the computer" rant, but as a genuine hands-thrown-in-the-air regrettably-there's-no-viable-solution-I-can-see and is-this-really-a-problem statement of opinion.

    And incidentally, I already spell honour, colour and aluminium "correctly" (and nice little bit of cultural chauvenism there too). I also buy beer by the pint and understand the meaning of the word "irony". I meant "we" as in "the English-speaking (or even Roman-alphabet-using) western world", not as in "American".

    Now who feels stupid, eh?

    --
    Everything in moderation, including moderation itself
  146. My reality check just bounced by DragonHawk · · Score: 1
    "So get rid of them, insist on a common language."

    What's the weather like on the planet you're on? :-)

    Seriously, while I agree with what you say in principle, it is never going to happen in the real world.
    --

    dragonhawk@iname.microsoft.com
    I do not like Microsoft. Remove them from my email address.
  147. Re:Anything's possible. by krell · · Score: 0, Flamebait

    What does this have to do with Palestine?

    --
    Where were you when the voynix came?
  148. Not everyone wants to talk to you by DragonHawk · · Score: 1
    "More bluntly, of what use is the parts of the Internet I can't even type the domain name for?"

    I'm guessing as much use as the parts of the Internet using languages you don't speak.

    It's a network of networks. The computers can all speak to each other. However, not everyone using those computers is interested in speaking to everyone else. There are people in Japan who just want to email their friends and family (also in Japan), in Japanese. They have as little interest in speaking English with you as you do in speaking Japanese with them.

    Keep in mind that the Internet was not designed, and is not being used, to serve you in particular. Or anyone else in particular. As long as the computers can reach each other, the Internet is doing its job. What we use it for is up to us.

    There are billions of people on this planet who do not speak English. They are not going away, no matter how much it inconveniences you. I bluntly suggest you get adjust your world-view to include the whole world.
    --

    dragonhawk@iname.microsoft.com
    I do not like Microsoft. Remove them from my email address.
  149. Stupid comment by ScrewMaster · · Score: 1

    Given that some societies have used non-Latin characters for thousands of years, is this a bit late in coming?

    I would put it thus: "Given that the Internet has been around for only a tiny fraction of those thousands of years, this is actually happening dangerously fast."

    {sigh} Just another example of politics overriding engineering reality.

    --
    The higher the technology, the sharper that two-edged sword.
  150. Bah! by Tinman_au · · Score: 1

    37 characters ought to be enough for anybody!! /joke off

  151. Extended DNS Aliasing is all you need by Slur · · Score: 1

    Every domain would be required to have a name according to the current limited scheme, as it is now. And then on top of that you add a capability to create domain aliases that use extended character sets. This is the only thing that makes sense. One should never be forced to type in a bunch of cryptic Kanji to reach a domain. It should be broadly accessible to people using older platforms and DNS standards.

    The extension could eventually be wrapped up into DNS, but it would be best to develop it as a separate module for the time being, to absolutely ensure nothing breaks.

    --
    -- thinkyhead software and media
  152. I wish they'd hurry up and fix this... by Monkier · · Score: 1

    ...because I've got money waiting for me at ámäzón.com & çïtîßäñk.com that I can't get thru to!

  153. Latin language itself does not use accesnts by Anonymous Coward · · Score: 0

    You forgot that Latin language itself, which is WAY OLDER THAN ENGLISH and which was for 1700 years the official language for science, does not use any accents or diacritics. I has even fewer letters than English, only 23.

    Only two or three hundreds of years ago big scientists and mathematicians like Euler or Gauss wrote their papers in Latin. Latin in not a dead language because it is still the official language of the catholic church; even though the tridentine mass is not the standard mass anymore the official Vatican documents are all issued in Latin.

  154. Not so much of an issue... by BlueAdept · · Score: 1

    Just for the guy above who must be having a bad day with his "UNIX is the problem" quotes...

    http://www.fsck.co.uk/symbol5.gif

    --
    Who is Seg Fault, and what is he doing with Kernel Space?
  155. Internet != Web, and other IDN technical issues by billstewart · · Score: 2, Interesting
    The Internet is not just the web - you might remember that there are other applications such as email, ftp, ssh, telnet, ping, traceroute, and some people use programs other than browsers to access these things.


    The reason ICANN wants to do lots of testing (after having dragged their feet for years before getting started) is that IDNs fundamentally change how DNS works, and it's really important not to break too much when you do that (not that ICANN traditionally worried about that.) It's *not* simple, and you don't want to get it wrong.

    DNS translates a set of strings of nominally-ascii characters into numbers, or translates numbers into a set of strings of characters, or translates some sets of strings into other sets of strings, depending on which query you run, and uses specific data formats to represent those strings and numbers. There are restrictions on what characters can be in the strings, some for reasons that we could easily declare to be obsolete (7-bit, uppercase-to-lowercase translation), some for reasons that are harder to change (printable characters only, please), and some which are really hard (dots are used as delimiters, and nulls terminate character strings in some popular computer languages. So you can't just plug in arbitrary Unicode two-byte characters instead of pairs of ASCII bytes and skip the case-munging, because some of the bytes will have values that can't be handled, though most of the 8-bit-character alphabets can be used transparently if you don't mind people using incorrect character sets on occasion. 8-bit character sets simply aren't enough - you can handle most Western languages in ISO-8859-1, and UTF-8 is closer but apparently not quite a cigar (too bad - it would have been my preference.)

    The main IDN strategies replace this by adding one more translation layer - character-string-set IDN names are translated into ugly-but-recognizable Punycode strings, which get used with standard DNS character-string-set to number translations in the forward direction, and in the reverse direction, anything that arrived as a Punycode xn-uglystuff string usually gets fed to a Punycode-to-Unicode translator by a user interface.

    Some things can be fixed by recompiling (or relinking, or re-DLLing) all of your programs with a DNS resolver library that guesses whether to convert strings or not - forward DNS knows to punycode non-ascii characters and not to re-punycode xn--uglystuff, though reverse DNS doesn't necessarily know whether to convert it to Unicode 16 or UTF-8 or just pass it on directly, and if you've typed in a domain name using something other than 7-bit lowercase+digits ASCII, it knows to punycode it, and obviously any domain registry supporting punycode ought to allow anybody who registers a name that doesn't need punycode to have both the straight and punycode names. But it's still ugly.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
    1. Re:Internet != Web, and other IDN technical issues by Mr+44 · · Score: 1

      True, the web is not the internet. But the issue here fundamentally _is_ a user-interface level issue. There's no reason that mail clients, etc couldn't support IDN by doing the punycode translation for you.

      The point is, we have a system that addresses the problem currently, without breaking anything.

    2. Re:Internet != Web, and other IDN technical issues by jc42 · · Score: 1

      The reason ICANN wants to do lots of testing (after having dragged their feet for years before getting started) is that IDNs fundamentally change how DNS works, and it's really important not to break too much when you do that

      Well, I've found that almost all of my software works just fine with UTF-8, without any patches at all. The explanation is simple: Most software that does anything at all with text does so by looking at specific special chars; all others are just carried along as "unrecognized" text that is only compared with other text strings. Usually the special chars are all 7-bit ASCII, so any char with the 8th bit set is "unrecognized". It turns out that usually this is equivalent to treating them as "letters", in the sense that all unrecognized strings of letters are just unprocessed text. The only problem is with code that uses the 8th bit for some special purpose, and even mediocre programmers know not to do that.

      I did an instructive experiment a while back, in which I had some Chinese text in a bunch of HTML files, and I defined a CSS class whose name was the obvious Chinese character. (For the curious, it's ""="zhong1", which I've found that slashdot garbles somehow so it probably won't appear correctly on your screen.) I used this class to increase the font size of Chinese text to 160%. I was curious how many browsers would accept a CSS class with a Chinese name. I was unable to find a single browser that didn't accept it handle it correctly.

      I was only slightly impressed by this. All it really meant was that the browsers didn't care what the "letters" in CSS class names are. They just took whatever was there that wasn't a special char in CSS, and used it as the name. So the 8-bit chars making up that Chinese char were just three "letters" to the browsers' parsers, with nothing special about them, and it all worked without even a warning message.

      As far as I can tell, if you use UTF-8, there's no real reason any software except the low-level library code that draws characters on a screen or paper should ever have any problem with names in any language. If a program has a problem, it means that it's doing something clever (read: dumb) with the 8th bit.

      So really, what problems would be caused for DNS if ICANN were to decree that the encoding is UTF-8, and all chars from 0x80 0xFF are to be treated as "letters"? I don't know of anything that DNS does that should be bothered by this. If there's a problem, is there a coherent explanation of it somewhere? Does some obscure DNS rule require using bit 8 for some purpose? They couldn't have been that dumb, could they?

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
    3. Re:Internet != Web, and other IDN technical issues by Anonymous Coward · · Score: 0

      DNS is case-insensitive, isn't it? With ASCII, getting a lower-case representation of a letter is a simple flip of a bit. This gets a bit more complicated with non-ASCII characters (which Slashdot doesn't want to display).

    4. Re:Internet != Web, and other IDN technical issues by billstewart · · Score: 1
      If your definition of "without breaking anything" is "changing the underlying data conversion semantics in ways that can sometimes be implemented by changing a library but sometimes require A Simple Matter of Programming to modify the application, at a cost of relinking or recompiling every piece of software that uses DNS and then doing regression tests on it", then yes, I suppose it addresses the problem :-)


      But the issue is not just user interfaces, unless you're including "make humans type punycode" as part of your solution - you're changing the semantics from a 2-step process to a 3-step process, and it's not always obvious whether the correct semantics include doing the middle step or not, especially in the reverse direction. There are some default decisions that you can implement in libraries that maintain the current programming interface that can usually do the right thing, but sometimes you may need to change the API.


      If we could wait to solve this problem until we were converting everything to IPv6, it would simplify things, because IPv6 usually requires changing all your programs and libraries anyway, but unfortunately it's not that simple.

      --

      Bill Stewart
      New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  156. Sounds like Hellschreiber by Kadin2048 · · Score: 1
    Perhaps even whole idea of encoding alphabets is a relict (and biased to phonetic alphabets, as well)? Today computers have enough power to operate on pictures as UI, so why don't we switch to shape-based data processing?
    There is a data-transmission system which works like you describe, used for transmitting text over radio and other high-noise links. It's called "Hellschreiber." Rather than encoding letters as numbers, with a different number for each possible letter in the alphabet, it represents characters as a series of small (7 pixel tall) bitmaps, and then transmits a black/white value for each pixel.

    It requires more data than ASCII, since you'd probably need several 7-pixel tall columns to communicate a single letter, but its advantage in radio communications is that it is "fuzzy" -- a one-bit error will just mar the letter, but probably won't make it illegible, or silently swap it for another letter. It lets the human eye and brain do the error correction, rather than trying to do it in filters.

    So anyway, good idea; so good, you've been beaten to it by 86 years. :)
    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
    1. Re:Sounds like Hellschreiber by salec · · Score: 1

      My idea would be even closer to worthless then it is if it really was novel :) . It relies on OCR, pattern recognition, pattern classification and other things that are with us for quite some time. It is just a matter of will to evolve to put all of these to better use, to add more intelligence to IT. Hellschreiber is neat, but when you think about it, human reader is essential part of it. That is probably the reason why (dumber) teletype device was UI device of choice for computers.

  157. DNS is *precisely* for NLS text by billstewart · · Score: 2, Insightful
    The problem is that it was designed for natural language text in the US back when some computers could deal with the new fancy feature of lower-case letters and others couldn't, and when humans tended to get confused about that sort of thing even though they all spoke English, and some computers could deal with 8-bit bytes and punctuation while others were very limited. I don't know if the IBM 48-character character sets were still around, but 64-character was still widespread, and EBCDIC was certainly still plagueing many of us in the early 1980s. It's a tool for users, not the programmers and admins who support them - but it's a tool for users of _computers_, so it still has technical constraints.


    It's been obvious since the Europeans got DNS for their ftp and email that there was a problem, even before they invented the web, and even aside from myopic silliness like having .GOV be a US TLD and fortunate accidental decisions about having .COM be viewed as global instead of US-only. Techies have been working on the internationalized-character-sets-for-computers problem for a while. ICANN's finally starting to pay some attention to the IDN issues, but they're not fundamentally a technology organization, they're a trademark protection organization and their approach to non-US domain names was an attempt at World Domination designed to get the CCTLDs to follow their trademark-protection rules, not to worry about fundamental technologies like making DNS work outside the US.


    DNS has a couple of restrictions that may have made sense in 1985, long before Unicode was invented. Some of them are easy to fix, especially since most DNS servers in the world use versions of one of three or four server programs, but there's a lot more resolver software out there that deliberately casefolds (though you could fix most of that in two or three generations of Microsoft releases, if you knew what you wanted it to do), and you can fix some of it administratively, by having the people who register UPPERCASE-EXAMPLE.COM also register uppercase-example.com and maybe Uppercase-example.com and do a few similar things for munged Unicode.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
    1. Re:DNS is *precisely* for NLS text by msobkow · · Score: 1

      Incorrect.

      The specific characters set selected for DNS names is a carefully selected subset that overlaps ALL character sets created by that point in time, including packed 6-bit schemes and such used by barcodes.

      In order to service the widest range of computer systems and devices possible, the character set had to be pruned to include only characters that ALL those systems understood.

      If it were intended to be "readable", even in English, there would have been provisions for decorative characters, case sensitivity, spacing, and the full ASCII7/LATIN1 character set.

      People outside the networking infrastructures also don't realize that domain names and IP addresses comparisons and switching are implemented in hardware by the backbone hardware. This is not just a software change, any more than IPv6 is.

      --
      I do not fail; I succeed at finding out what does not work.
    2. Re:DNS is *precisely* for NLS text by billstewart · · Score: 1
      As you say, the character set is limited to the subsets that were widely supportable on machines in common use in the early 80s - but it's still exactly designed to be human-readable (abbreviations of) natural-language text, just as the machine charcter sets were. If they weren't trying to do that, numerical IP addresses were just fine.

      And while people in the networking infrastructure business know that IP addresses used for switching are implemented in high-performance ASICs and other tuned hardware environments (which does not, alas, perform very well yet for IPv6 on most platforms), I'm rather puzzled about your assertion about domain name comparisons being implemented in hardware. It's mostly BIND and djbdns and a couple of other packages, and while there are "DNS appliances", they're either vanilla PC hardware 1U servers with the appliance-maker's logo on the front, or else they're variants of Unix-on-a-Stick general-purpose computers (possibly with ARM or MIPS instead of x86.) I haven't seen anybody doing a DNS server based on ASICs or FPGAs, and I don't see how it would be a particular win, either for performance (because DNS needs big memory caches, which aren't a good hardware match) or for reliability (because you need really trustable software to manaage the databases, so you'd be using a Unix or possibly WinNT backend even if you had some kind of accelerator widget front-end.

      --

      Bill Stewart
      New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  158. 3Com fixed that problem by billstewart · · Score: 1

    The starts-with-a-letter restriction is gone, mostly because of 3Com, but I think there may still be restrictions against starting with a dash.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
    1. Re:3Com fixed that problem by mibus · · Score: 1

      OK, I didn't know that. I'll have to re-read the RFCs :)

      You still can't have solely numeric domains though?

    2. Re:3Com fixed that problem by billstewart · · Score: 1
      Ok, I haven't read those RFCs in a while either :-)


      There aren't any all-numeric top-level domains, but there are a number of second-level domains in .COM like 163.com, most of which are used by Chinese spammers (though presumably the names also have some meaning when handled as Chinese characters?)


      There's also the whole inverse-DNS space, with names like 1.200.9.192.in-addr.arpa.

      --

      Bill Stewart
      New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  159. Have you tried pinging them? Or emailing? by billstewart · · Score: 1

    Sure, you can have browser hackery that knows to display xn--ugly-punycode-string-ewtr.cn as the Han characters for the web site. But if you're having trouble reaching the site, does your ping or traceroute program have the builtin IE hack? Or if you want to email them, and you're using an email program that's not part of your browser, can you type in their name and email them? Or can you cut&paste from your browser?

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  160. Re:Make 'em all speak english by pablo.cl · · Score: 1
    Italy: All letters covered (as far as I can tell)

    There are accented versions of e and o.

    There's also "à" TuttoCittà, "ì" Così fan tutte and "ù" PiùChePuoi.
  161. Re:But *IS* there a decent way to enter Chinese by jc42 · · Score: 1

    It always kills me how people presume that if they can't think of a fix to a problem that there must not be a fix to a problem.

    It's not so much that people can't think of solutions, but rather to so often people's idea of their own writing system is so entrenched that they refuse to solve problems.

    You can see this even in the limited English writing system. The computer field has been plagued from the start with the O/0 and I/l/1 problems, and there isn't the slightest chance that any solution will ever be accepted. We also keep using fonts that confuse "d" with "cl" (as in clear vs dear) and "m" with "rn" (as in modem vs modern), even when we realize that the font causes a problem.

    If we can't straighten out the ongoing screwups caused by these charset problems in English, how can we preach to the rest of the world about how easy their problems are?

    Of course, the Unicode gang made the east Asian scripts even worse in this regard than even their writing system. Many characters have 2 or 3 different codes, and I think I saw one with the same glyph for 4 codes (but I could just be dreaming this ;-). At least ASCII doesn't have 3 different codes for the letter 'b', for example.

    --
    Those who do study history are doomed to stand helplessly by while everyone else repeats it.
  162. Re:But *IS* there a decent way to enter Chinese ch by HiThere · · Score: 1

    OK, I'm no expert, and my information is 2-3 decades old...however...

    I have this strange suspicion that the stuff they enter into cell-phones is the rough equivalent to "texting" (or whatever that set of peculiar abbreviations is called).

    --

    I think we've pushed this "anyone can grow up to be president" thing too far.
  163. Problem solved long ago. Unicode already in use by Anonymous Coward · · Score: 0

    using the Punycode encoding, which recent versions of all major browsers support.

    If you want to register an internationalised domain name, just convert it to punycode, register the resulting domain as per usual, and you're set to go.

    The major browsers even deal with IDN homograph attacks, by making sure that a url containing characters from different languages will be displayed in raw punycode rather than the internationalised string.

    What's more, strings containing things like combining diacriticals and different accented marks need to normalised using the Nameprep algorithm to reduce a string that could be encoded in several different (but visually equal) ways into a single representation.

    This system has been in place for a couple of years already, and works fine. No need to go breaking anything.

    wikipedia provides a whole host of examples which your browser (if it's recent) should automatically convert to punycode when it makes the request to the page you're after, all the while, displaying the nice internationalised name for you to see.

    So, move along people, nothing to see here...

  164. hello! punycode, anyone? by Anonymous Coward · · Score: 0

    punycode - e.g. http://en.wikipedia.org/wiki/Punycode - is here, and it already works. It's a gigantic tempest in a teapot. (Never mind that figuring out which normalized form of Unicode to use is bad enough as it is.)

  165. WTF by Anonymous Coward · · Score: 0

    Lamest idea ever. How the hell am I supposed to type a non-latin domain in my browser? Do I rely on Google for everything, or do I keep a link to Character Map on my quick launch bar?

    50,000 characters makes for a huge keyboard.

  166. Re:Have you tried pinging them? Or emailing? by jamar0303 · · Score: 1

    That's why I don't use it- it only works for the web browser.

    --
    OSx86 FTW
  167. ascii FTW by sentientbrendan · · Score: 1

    It's high time everyone else learned English anyway.

    Seriously though, having lots of character sets is a stupid idea. Suddenly, you can only even type in a website's name if you have the right language pack/keyboard.

    English is the dominant international language right now, and it is only sensible that there be a standard English, or at least latin character set, interface to everything. It's one thing to provide support for other languages, but moving too far away from the standard will only increase the segmentation of the internet and decrease the information available to any one person.

    Latin domain names insure that there's a standard and simple way for anyone running any OS with any language pack to type in any domain name, if only to run it through babelfish. Most major foreign languages (japanese, chinese) have systems for writing in the latin character set anyways, which are certainly sufficient for the short strings used in *domain names*.

  168. Re:Anything's possible. by gd23ka · · Score: 1

    I just thought it was an intriguing allegation, I have no idea whether this has any bearing
    on Palestine.

  169. display vs. transmission vs. storage by bagwill · · Score: 1

    I didn't see another comment that made the distinction, so:

    Don't confuse display, transmission, and storage encoding. It was a convenience when 7-bit ASCII characters could be used for all three (seemingly). But it was never really true. The encoding on disk was never really the same as the transmission encoding or the display encoding. When you display an ASCII 'A', you don't see '0100 0001'. End users want to see and type in a pretty URL. If it's transmitted or stored as an unreadable hexadecimal hash, who cares? It's like complaining that the magnetic domains on the hard disk that represent the letter 'o' don't form a circle.

    If I wanted a logo domain name, I could register an encoding of the SVG as my domain name, then write and contribute the code for firefox to decode and display my logo domain name. I wouldn't need to lobby ICANN to allow SVG or bitmapped domain names (although I might have to write an RFC for the IETF :-) )

    Actually, since DNS names and IP addresses aren't trustworthy on a massive global scale anyway, we should be using public keys to identify hosts, websites, and other Internet entities. But that's a topic for another discussion.

  170. Re:Anything's possible. by krell · · Score: 0, Flamebait

    It was just another jab at Dan Rather for his "all the evidence for the story is faked, but I tell ya, it's TRUE!!!!" . Not to mention his insistence that the faked Word 97 documents were real for two weeks after they were proven fake. Not the best way to end his long and quite distinguished career.

    --
    Where were you when the voynix came?
  171. Re:Make 'em all speak english by Hognoxious · · Score: 1

    Been a while since I read any Italian, apart from signs & menus, seems I'm getting rusty. Still, as long as I can tell the difference between a casinò and a casino...

    --
    Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  172. Re:But *IS* there a decent way to enter Chinese ch by HeroreV · · Score: 1
    Last I heard people in China (& Japan) used Roman letters because as clumsy as the were, they were easier to enter than the Hiragana.
    That's ridiculous. The biggest internet forum in the world is 2channel (eng nav). The text there consists almost entirely of hiragana, katakana, and kanji. About the only place you see Latin characters at 2channel is in URLs.

    The Chinese and Japanese sections of Wikipedia are likewise nearly absent of Latin characters.
  173. Heck no! by twebb72 · · Score: 1

    America rules the internet and domain names. Why do you think there are only latin chars? GO TEAM!

  174. Re:Anything's possible. by Evilest+Doer · · Score: 2, Insightful

    It's amazing, krell. How do you type so much with Rush Limbaugh's cock rammed down your throat?

    --
    I feel like death on a soda cracker.
  175. it's kind of funny.... by Anonymous Coward · · Score: 0

    ....the fake story was aired on CBS News, and not Rush Limbaugh. What does Limbaugh have to do with it???? Dan Rather torpedoed himself, unless you are one of those that think Limbaugh planted the fake story to trap Dan Rather.

  176. DNS case-insensitivity vs. IDNs by billstewart · · Score: 1

    More precisely, DNS is supposed to be case-insensitive and case-fold requests when appropriate. For IDN purposes, the important issue is that one 8-bit byte may get transformed to a different 8-bit byte, which is fine for 7-bit ASCII characters and usually wrong for bytes that represent half of a 2-byte Unicode character. The fact that the transformation can also be implemented by a bitmask is an implementation detail that's not really in the DNS standards, but it does mean that there are bytes with values 128-255 (such as ISO-LATIN-1 bytes or halves of 2-byte Unicode) that might be undamaged by a lookup table implementation but would be damaged by a bitmask implementation.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks