Asia Next Frontier in Blogging
Lullabye_Muse writes "Japan Today tells us that there are 3 million people blogging in Japan and over 16 million people visiting a blog at least once a month in the country. It also mentions that over the next two years the market for blogs will expand over 40%." Meanwhile, in regards to Chinese blogging, wayfaring stranger writes "A new Wired News article talks about Hong Konger Edwyn Chan's new www.blogkumedia.com Chinese blog network, which aims to make blogging a mainstream reality for the Chinese internet." From the article: "Blogs haven't caught on in China, so even when Chan can hire bloggers, it's hard to market them to consumers, attract advertisers and raise venture capital. The investors he has met don't use blogs as sources of information, so they generally have no clue of what a blog is. 'All they know is that it's something hot which they hope to be able to cash out hopefully in less than a year,' Chan said."
Can't those characters be represented using UTF-8?
Yes they can as in UTF-16 too.
However, Java doesn't use UTF-16 encoding but wide characters (a.k.a. "wide char", or wchar). Wide char is just an extension of the regular one byte "char" to a two bytes value. So it can't store values bigger than 65535.
With UTF-xx, several bytes/int/long/whatever can be combined to create bigger values. Not so with wide characters.
So yes, UTF-xx can encode nearly anything (up to 8 bytes or something) but Java can't.