Slashdot Mirror


Unicode 6.1 Released

An anonymous reader writes "The latest version of the Unicode standard (v. 6.1.0) was officially released January 31. The latest version includes 732 new characters, including seven brand new scripts. It also adds support for distinguishing emoji-style and text-style symbols and emoticons with variation selectors, updates to the line-breaking algorithm to more accurately reflect Japanese and Hebrew texts, and updates other algorithms and technical notes to reflect new characters and newly documented text behaviors."

15 of 170 comments (clear)

  1. Favourite unicode character by Cocodude · · Score: 3, Interesting

    has got to be the Love Hotel.

    Does anyone know why this is even there?

  2. Why Slashdot won't adopt it by tepples · · Score: 5, Informative

    Before anyone chimes in complaining that Slashdot doesn't even support an old version of Unicode, this is for several reasons. For one thing, there was once a fad of posting pornographic ASCII art on Slashdot, so it appears Slashdot disallows any character that would be more useful for glyph art than for English text. For another, there was once a fad of using bidirectionality override control characters for turning text backwards, which would break the layout and allow spoofing a comment's moderation score.

    1. Re:Why Slashdot won't adopt it by BetterThanCaesar · · Score: 4, Insightful

      Raise your hand if you couldn't code a parser that detects those characters and takes appropriate action, such as popping bidi characters.

      I'd love to be able to write IPA when discussing pronunciation, or actually write out words in other languages, ohm character for discussing electronics, pound and yen signs for currency ... Hey, even a bigger whitelist than what we have now would be great!

      --
      "Stop failing the Turing test!" -- Dilbert
  3. Re:Stick to ASCII by cc1984_ · · Score: 5, Funny

    Yeah but can you write a pile of poo in ASCII?

    http://www.fileformat.info/info/unicode/char/1f4a9/index.htm

  4. Re:Zomg by piripiri · · Score: 3, Insightful

    Yes, lolcats are a standard now.

  5. emoticons? by pz · · Score: 3, Insightful

    Seriously, emoticons? Who ever thought it a good idea to include those in a standard? Should we have an encoding for hearts as dots over lower case i as well? And little horseys, too? And y with a big tail that wraps around to the front of the word?

    --

    Put my fist through my alarm clock with its ding-dong death inside my ear. - The Blackjacks.
    1. Re:emoticons? by snowgirl · · Score: 3, Informative

      And little horseys, too?

      U+1F40E ... no, seriously...

      --
      WARNING! This girl exceeds the MAXIMUM SAFE standards established by the FDA for BRATTINESS
    2. Re:emoticons? by Hentes · · Score: 3, Funny

      The next thing will be teenagers building bigger emoticons out of emoticon characters. Then they will have to be included in the standard as well, and so on...

  6. Re:Zomg by fuzzyfuzzyfungus · · Score: 4, Funny

    I believe you mean to say that lolcats are in ur standardz, occupyin ur code-points; but not necessarily prescribing ur particular choice of glyph...

  7. Re:Checking for the release of a new version by Canazza · · Score: 5, Funny

    £ is Shift+3, what are you on about?

    --
    It pays to be obvious, especially if you have a reputation for being subtle.
  8. Re:The next version of the standard by StuartHankins · · Score: 4, Funny

    ...filling pages with sexually explicit ASCII art, such as Goatse, male masturbation, and birds perched on a penis...

    Yeah, the way they are going they might actually *have* these characters in the set now...

  9. Tetris, Chess, Baseball, and gang symbols by tepples · · Score: 4, Informative

    all the Tetris pieces

    The polyominoes up to five squares can be composed from U+2580 (upper half block), U+2584 (lower half block), and 2588 (full block) characters. Unicode tends not to introduce precomposed ligatures except when needed for round-tripping with pre-Unicode encodings.

    glyphs of game pieces of all well known games

    A lot of well-known pre-1923 tabletop games' game pieces already exist in Unicode. Chess is U+2654 through U+265F, and Checkers is U+26C0 through U+26C3. A lot of game pieces are simple enough in form that the Geometric Shapes (U+25A0 through U+25FF) represent them just fine. For example, Othello is U+25CB and U+25CF, as is Connect Four. Even the enemy in Fast Eddie for Atari 2600 is in Miscellaneous Technical (U+237E) as is home plate in Baseball (U+2302).

    heck, instead of just the suit symbols why not 52 glyphs for a standard deck of cards

    Those can already be composed from a Basic Latin letter or number and a suit symbol. Unicode tends not to introduce precomposed ligatures except when needed for round-tripping with pre-Unicode encodings.

    throw the Major Arcana tarot cards in there too

    I don't know about Tarot, but all twelve signs of the zodiac are in Miscellaneous Symbols, even the "69" looking sign of Cancer (U+264B).

    gang symbols

    The symbol of "Folk Nation" gangs is similar to that of Judaism: a Star of David (U+2721). The symbol of "People Nation" gangs is similar to that of Islam: a 5-point star and crescent (U+262A).

  10. Re:Stick to ASCII by Pieroxy · · Score: 3, Informative

    ASCII is just 128 characters.

  11. Re:Stick to ASCII by metamatic · · Score: 3, Funny

    This is Slashdot, I'm sure you can find any number of examples of people who've written a pile of poo in ASCII.

    --
    GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
  12. Re:Obligatory XKCD by marcosdumay · · Score: 4, Insightful

    You know that this is the exact situation that Unicode AVOIDED, doesn't you?

    Now we have one standard with 3 different representation. Those replaced literaly thousands of standards. Yep, sometimes doing that new standard works.