ICANN Approves Non-Latin ccTLDs
Several readers including alphadogg tipped the news that ICANN has approved non-Latin ccTLDs at its meeting in Seoul. "Starting in mid-November, countries and territories will be able to apply to show domain names in their native language, a major technical tweak to the Internet designed to increase language accessibility. On Friday, the Internet's addressing authority approved a Fast-Track Process for applying for an IDN (Internationalized Domain Name) and will begin accepting applications on Nov. 16. The move comes after years of technical testing and policy development... Currently, domain names can only be displayed using the Latin alphabet letters A-Z, the digits 0-9 and the hyphen, but in future countries will be able to display country-code Top Level Domains (cc TLDs) in their native language. ... 'The usability of IDNs may be limited, as not all application software is capable of working with IDNs,' ICANN said in a 59-page proposal (PDF) dated Sept. 30 that describes the [application] process." Reader dhermann adds, "Great, now even less chance I can identify NSFW links before they are blocked by my work's big brother app and my boss is notified... again."
micrösöft.cöm?
Far too much software makes the assumption that TLDs only contain [a-z0-9-], so if you want to go changing that there needs to be a damn good reason, there is not. There are ~1369 2 letter TLDS to be shared between ~200 soverin states and 49284 3 letter generic ones to be split between uses (.xxx .nws .org .edu, etc), there doesn't seam to be any good reason to expand that and make lots of software more complex.
IranAir Flight 655 never forget!
This will only make phishing attacks easier unless there are SERIOUS checks on domain name registrations. There are letters in the Cyrillic alphabet that have different character codes than their look-alike letters in the Latin alphabet. I'm sure there are other collisions as well. I'm sure they accounted for this in the proposal, but the problem always lies in the implementation. From a security standpoint, this is a VERY bad idea without proper regulation of domain name registrations, and so far it has been demonstrated that we cannot manage them properly even with only the Latin alphabet. From a cultural and usability standpoint, this is a good thing. It will be easier for someone whose native language uses a non-Latin alphabet to recognize the supposed purpose of a web site by its domain name if some of those domain names can be in their native language. A hypothetical native Tamil speaker who speaks no English will be able to recognize the purpose of a site with an appropriate domain name in Tamil, for example
I wonder what impact this will have on the ever decreasing amount of IPv4 addresses available.
This will have absolutely no effect on IPv4/IPv6. This is a DNS change to allow additional characters in domain names.
The domain names get translated to ip addresses by DNS servers.
I doubt that individuals & companies said, "No! We refuse to go on the internet until we can have TLDs with non-Latin characters."
Yay!!! The door is open for an even harder to detect phishing scheme! Imagine the emails linking to http://slashd/öt.org/something...
.es for example)...
I'm all for internationalization, but perhaps limit it to internationalized domain extensions (.jp or
If a man isn't willing to take some risk for his opinions, either his opinions are no good or he's no good
So build your own damn internet.
There are letters in the Cyrillic alphabet that have different character codes than their look-alike letters in the Latin alphabet. I'm sure there are other collisions as well. I'm sure they accounted for this in the proposal, but the problem always lies in the implementation
This is a decision made by ICANN. We've known for some time that they will willingly approve really tremendously bad ideas, if enough money is presented to them. They recently moved on a motion to start selling gTLDs, after all.
From a security standpoint, this is a VERY bad idea without proper regulation of domain name registrations, and so far it has been demonstrated that we cannot manage them properly even with only the Latin alphabet
Security is not of any concern for ICANN. Never has been, never will be. As long as they keep making money they're happy; security, spam, phishing, etc, be damned.
Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
Now those countries, organizations and businesses that wish to become inaccessible to most of the world (except the native speakers of their own language) can finally do so as easily as possible. Create their own little Internet reservations and stay there :)
As long as my software (such as Firefox) obligingly converts these IDN urls into the dash-hex notation making them obviously unreadable, I am ok with that.
Disclaimer: I am a native of non-English speaking country. I am sure a few of my countrymen will use this feature based on misplaced patriotism. I am also sure that vast majority will ignore it just like they ignore potential to use non-latin domain names that exists right now.
A lot of the debate here seems to be about English-speaking countries vs. the rest of the world, but English isn't the only language that uses the Latin. Also, the unavailability of non-Latin scripts hasn't hampered the flourishing of home-grown websites in India and China named in their many local languages - what makes the ICANN think this is even necessary?
Karma fed to this user will be promptly burnt. Be warned; be wary.
Actually, UTF-8 can and is being used in DNS - as long as you stick to basic Latin characters, that is. Also it is Unicode - as I posted earlier, Unicode is a blanket for UTF-8, UTF-16 and UTF-32 which makes it ambiguous.
UTF-8 bits 0-7 is ASCII as long as bit 8 isn't set, so to fully support it you'd need to still exclude bits below 7 that are not valid html characters and include support for multiple bytes and bit 8. The reason existing DNS servers won't work with it is because bit 8 indicates multibyte and the second byte may carry an invalid character from the 0-7 bits and the first byte may have a language encoding for the second byte (indicated by bit 8). For instance character 43 is + and that is invalid in a URL. If character 1 had bit 8 set and indicated the language as French in the language encoding (which I believe is done in the first 7 bits and can in some cases be extended to the second or even third byte, but its been a while since I read the spec - I do know there is an encoding that does this and I'm pretty sure it is UTF-8), the second byte 43 would (probably - I'm not going to look it up) mean something entirely different and be perfectly valid.
There are a lot of websites where the words don't matter.
Uh, yeah, because the keyboard you're using is a clear indicator of which language(s) you understand.
Just because the characters don't show up in the edited text doesn't mean that they won't be handled in anchor tags or Slashdot's URL tag.
Well, Slashdot mangles them anyway. The URL should end in .com.
Slashdot's web interface is quite embarrassing in this respect. Having a non-Unicode-capable page in 2009 is like having one that is optimized for Netscape 0.9, no matter what amount of JavaScript and Web 2.0 bling they put in there.
If international URLs will finally force Slashdot to implement a triviality such as string parsing, so much the better.
As a state gets corrupt, its laws multiply; the most corrupt states have the most numerous laws. (Tacitus, Annales 3:27)
"the majority of the internet users won't either."
Sorry, but that sounds like typical American ethnocentricity. The MAJORITY of internet users actually are people who don't natively speak English. Chinese speakers, Russian speakers, European people, many of whom use cyrilic alphabets, Arabs, South Americans, Indians, and others that I'm surely missing.
How can you possibly speak for "the majority of internet users", when people who speak English as their native language constitute a pretty small percentage of the world's people? I could google, but I'm almost willing to bet that more people on this earth grow up speaking Chinese, than people who grow up speaking English as their first language.
If a guy is more comfortable using his own language, I'm all for him doing so.
"Windows is like the faint smell of piss in a subway: it's there, and there's nothing you can do about it." - Charlie Br
I don't normally browse websites written in a language I can't understand.
1. The link text in the example I provided was in English.
2. I am not aware of any requirement that only one language may be used on a given website. If there is such a requirement, please inform my contacts on Facebook of this, because they post messages there in about 15 different languages using at least 4 different writing systems. (And I've posted there myself in 4 languages, including English.)
I still see an ignorant american that thinks the whole world should read and write english for people like dhermann.
1. See above.
2. So you are saying that you can read my mind? Perhaps this ability of yours needs some fine-tuning, since I never made any such assertion.
3. It's true that I still carry a US passport, but I've not lived there in many years.
NSFW has nothing to do with supporting more internationalization and it's all a cop out.
Nobody is "copping out", and if you seriously think I am opposed to internationalisation, you're barking up the wrong tree.
Nevertheless, dhermann is voicing what I believe is a legitimate concern, even more so for less sophisticated and experienced Internet users.
The answer to such concerns is, of course, education. Many people are not even aware that services like Google Translate are available.
In the meantime, I suggest you remove the chip from your shoulder. Not all Americans are alike, you know.
Il n'y a pas de Planet B.