Slashdot Mirror


AI May Have Finally Decoded the Mysterious 'Voynich Manuscript' (gizmodo.com)

An anonymous reader quotes a report from Gizmodo: Since its discovery over a hundred years ago, the 240-page Voynich manuscript, filled with seemingly coded language and inscrutable illustrations, of has confounded linguists and cryptographers. Using artificial intelligence, Canadian researchers have taken a huge step forward in unraveling the document's hidden meaning. Named after Wilfrid Voynich, the Polish book dealer who procured the manuscript in 1912, the document is written in an unknown script that encodes an unknown language -- a double-whammy of unknowns that has, until this point, been impossible to interpret. The Voynich manuscript contains hundreds of fragile pages, some missing, with hand-written text going from left to right. Most pages are adorned with illustrations of diagrams, including plants, nude figures, and astronomical symbols. But as for the meaning of the text -- nothing. No clue. For Greg Kondrak, an expert in natural language processing at the University of Alberta, this seemed a perfect task for artificial intelligence. With the help of his grad student Bradley Hauer, the computer scientists have taken a big step in cracking the code, discovering that the text is written in what appears to be the Hebrew language, and with letters arranged in a fixed pattern. To be fair, the researchers still don't know the meaning of the Voynich manuscript, but the stage is now set for other experts to join the investigation. The researchers used an AI to study "the text of the 'Universal Declaration of Human Rights' as it was written in 380 different languages, looking for patterns," reports Gizmodo. Following this training, the AI analyzed the Voynich gibberish, concluding with a high rate of certainty that the text was written in encoded Hebrew."

The researchers then entertained a hypothesis that the script was created with alphagrams, words in which text has been replaced by an alphabetically ordered anagram. "Armed with the knowledge that text was originally coded from Hebrew, the researchers devised an algorithm that could take these anagrams and create real Hebrew words." Finally, "the researchers deciphered the opening phrase of the manuscript" and ran it through Google Translate to convert it into passable English: "She made recommendations to the priest, man of the house and me and people." The study appears in Transactions of the Association of Computational Linguistics .

24 of 203 comments (clear)

  1. "Finally Decoded" by Anonymous Coward · · Score: 5, Insightful

    STOP using this phrase in each bi-weekly story about this book only to say at the bottom of each article it "isn't really decoded".

    It's "decoded" when the text is readable.

    1. Re:"Finally Decoded" by MrKaos · · Score: 2

      Finally Decoded using this one weird trick.

      --
      My ism, it's full of beliefs.
  2. Re:Indian ... not hebrew by beep54 · · Score: 3, Insightful

    Citations sorely needed...

  3. Lorem Ipsum by houghi · · Score: 5, Interesting

    What if they let loose the same AI on the Lorem Ipsum text that we know to be meaningless. Would it come to a similar conclusion? We humans want to see patters where there are none.

    --
    Don't fight for your country, if your country does not fight for you.
    1. Re:Lorem Ipsum by 91degrees · · Score: 2

      It's not completely meaninglessness though. I mean it's gobbledygook, but gobbledygook with Latin sentence structure and vocabulary.

    2. Re:Lorem Ipsum by Dwedit · · Score: 5, Informative

      The lorem ipsum text actually means something though... (some words were removed)

      Nor again is there anyone who loves or pursues or desires to obtain pain of itself, because it is pain, but occasionally circumstances occur in which toil and pain can procure him some great pleasure. To take a trivial example, which of us ever undertakes laborious physical exercise, except to obtain some advantage from it? But who has any right to find fault with a man who chooses to enjoy a pleasure that has no annoying consequences, or one who avoids a pain that produces no resultant pleasure?

      On the other hand, we denounce with righteous indignation and dislike men who are so beguiled and demoralized by the charms of pleasure of the moment, so blinded by desire, that they cannot foresee the pain and trouble that are bound to ensue; and equal blame belongs to those who fail in their duty through weakness of will, which is the same as saying through shrinking from toil and pain.

    3. Re:Lorem Ipsum by Anonymous Coward · · Score: 2, Informative

      Um, Lorem Ipsum isn't meaningless, it's Latin text copied from Cicero. We already know what it means. There goes your entire post.

    4. Re:Lorem Ipsum by BoogieChile · · Score: 2

      The Lorem Ipsum text, though, is based on something that Cicero wrote, but is definitely not coherent latin.

      where Cicero wrote;

      "Dolorem ipsum quia dolor sit amet, consectetur, adipisci velit, sed quia non numquam eius modi tempora incidunt ut labore et dolore magnam aliquam"

      Which is Latin, the Lorem Ipsum runs;

      "Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua"

      Which has some Latin words in it, but is mostly not.

  4. Lololololol by bluegutang · · Score: 5, Insightful

    Failing to find any Hebrew scholars who could help validate their findings, the researchers eventually resorted to using Google Translate,

    (Source)

    This "research" is a joke.

    1. Re: Lololololol by Anonymous Coward · · Score: 5, Funny

      To be fair, pasting something into google counts as research for millenials.

    2. Re:Lololololol by Luthair · · Score: 2

      Its Gizmodo, what do you expect from a tech blog?

    3. Re:Lololololol by DRJlaw · · Score: 2

      This "research" is a joke.

      I disagree. How do you recruit a classical Hebrew scholar to validate your hypothesis and assist with additional work? Not i the Yellow Pages. You publish your intermediate results and hope that it tickles a suitable person's interest such that they join in the effort.

      You may as will declare Linus' work a joke. It's not as if Linux 0.12 was useful for much. It took a boatload of domain experts to bring it up to the capabilities that made people find it useful.

    4. Re:Lololololol by quantaman · · Score: 4, Insightful

      Failing to find any Hebrew scholars who could help validate their findings, the researchers eventually resorted to using Google Translate,

      (Source)

      This "research" is a joke.

      Why? Because the Hebrew scholars didn't want to participate?

      Google Translate botches modern languages. The fact that running their results through Google Translate gave them meaningful output suggests they have real data.

      --
      I stole this Sig
    5. Re:Lololololol by bluegutang · · Score: 3, Insightful

      But they didn't get meaningful output. They got "She made recommendations to the priest, man of the house and me and people". This makes little sense as the first line of a book on herbology. This is AFTER "making a couple of spelling corrections" (how many is a couple?) and AFTER "de-anagraming" every single word (i.e. arbitrary picking one of the thousands of permutations of letters in the word). Not to mention that Hebrew is written without vowels, so any string of several characters is as likely as not to be a word.

      When I was in high school I used a script to find dictionary anagrams of my name and my friends' name. A few of the anagrams looked pretty cool. Did they have any deeper meaning? Of course not. This is basically the same methodology.

  5. Voynich Manuscript is obviously an elaborate prank by pezpunk · · Score: 4, Insightful

    you would think over time people would become less gullible, not more.

    and sure, if you train an AI long and hard enough, it will probably be able to tickle out something that looks like meaning from that nonsense. just like if you train an AI to see dogs, it can identify weird dogs in literally any image.

    https://www.washingtonpost.com...

    --
    i could live a little longer in this prison
  6. this last one got debunked by Idimmu+Xul · · Score: 3, Informative

    https://arstechnica.com/scienc...

    its the puzzle that keeps on giving!

    --
    The problem with slashdot is that most of its users were bullied and stuffed into lockers as kids!
  7. One Line by NicknameUnavailable · · Score: 4, Insightful

    Is proof this is a fake. They ran their algorithm, got something almost sensible for the first sentence, and the rest was total gibberish but they needed to publish.

  8. XKCD uncovered its meaning long ago by Opportunist · · Score: 5, Insightful

    https://xkcd.com/593/

    It is obvious when you think about it...

    --
    We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  9. Overlooking the Obvious by CodeHog · · Score: 3, Funny

    "Drink your Ovaltine" - a crummy commercial.

    --
    Fat, drunk, and stupid is no way to go through life, son.
  10. Re:Summary of Text by networkBoy · · Score: 2

    actually a *very* possible hypothesis.

    --
    whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump
  11. Re:Amazing AI by smooth+wombat · · Score: 2

    AI can only know what humans know. If humans consider something impossible then so does the AI.

    All AI is doing is getting to answers faster than humans can.

    --
    We will bankrupt ourselves in the vain search for absolute security. -- Dwight D. Eisenhower
  12. Re:Religious Kook Job? by Darinbob · · Score: 2

    A lot of scientific knowledge, especially medical, were secretive at some time. Knowledge was protected, guilds were formed to protect the secrets, and so forth. So texts would be written to be obscure, intentionally.

  13. Re:Indian ... not hebrew by Marxist+Hacker+42 · · Score: 2

    I think you were thinking of this: https://www.newscientist.com/article/dn24987-mexican-plants-could-break-code-on-gibberish-manuscript/, which is about the drawings in the manuscript, NOT the words. I suggest you ask your doctor about age related dementia.

    --
    SJW: a person who perceives an injustice, and while correcting it, commits a greater injustice.
  14. Re:Indian ... not hebrew by Pseudonym · · Score: 2

    I don't know if you knew this, but patent nonsense is also easily found via Google and other search engines. This is especially true of the Voynich manuscript. This is one reason why it is considered courteous for a person making an argument to give some pointers as to which information they believe supports that argument.

    This is a random Internet comment section, so nobody expects a comprehensive literature review. But, you know, something.

    --
    sub f{($f)=@_;print"$f(q{$f});";}f(q{sub f{($f)=@_;print"$f(q{$f});";}f});