Making The Case That Voynich Is A Hoax

My 2 cents by SYFer · 2003-12-30 19:24 · Score: 3, Funny

01001001011000110110100000100000011001000110010101 10111001101011011001010010000001110011011001010110 10010110111000100000011101100110010101110010011011 01011101010111010001101100011010010110001101101000 00100000011001010110100101101110011001010110111000 10000001010100011011110111000001100110001000000110 00010110111000100000010100110110001101101000011001 0101101001110111110110010100101110

--
"...all the labours of the ages, all the devotion, all the inspiration, all the noonday brightness..." yada yada

Re:My 2 cents by Anonymous Coward · 2003-12-30 19:41 · Score: 5, Informative

Translation from binary:
Ich denke sein vermutlich einen

Translation from German from binary:
I probably think its one
Re:My 2 cents by tempfile · 2003-12-30 22:21 · Score: 2, Funny

Actually, it means
"I thinking be probably a pot of slice".

The Salamander Papers by Saeed+al-Sahaf · 2003-12-30 19:28 · Score: 5, Interesting

Somebody is laughing a lot.. Remember way back the Salamander Papers?

--
"Who are in control, they are not in control of anything - they don't even control themselves!" - Glen Beck

Re:The Salamander Papers by Timex · 2003-12-31 01:17 · Score: 2, Interesting

I'd be surprised to find many that even KNOW that the Salamander Papers are related to the Church of Jesus Christ of Latter Day Saints...

(I know this 'cause I was a member, once.)

--
When politicians are involved, everyone loses.

Ershlap? by paul248 · 2003-12-30 19:30 · Score: 4, Funny

Abdook artelly oppetrom uplocty?! Astenboorsley... af arcoolodople!

Bli, Fal.

Re:Ershlap? by decipher_saint · 2003-12-30 19:40 · Score: 5, Funny

Are you my boss, 'cause you sound like him...

Am I fired yet?

--
crazy dynamite monkey

Been there, done that by User+956 · 2003-12-30 19:31 · Score: 4, Funny

Gordon Rugg has demonstrated that it's possible to generate a text like the Voynich manuscript -- containing language-like regularities, despite being potentially meaningless

That's funny. I thought Darl McBride had already proven that with all those open letters he's written.

Mod me down, hippies!

--
The theory of relativity doesn't work right in Arkansas.

Re:There have been many meaningless books... by Zebbers · 2003-12-30 19:31 · Score: 2

RTFA

Its an unknown language or an unseen encryption scheme. This new theory is that it's neither, just cleverly crafted to appear that way. An interesting read...

Library of Babel by Mrs.+Grundy · 2003-12-30 19:31 · Score: 5, Interesting

This reminds me of a passage from Jorge Luis Borges' Library of Babel. In fact a lot reminds me of that story these days.

Five hundred years ago, the chief of an upper hexagon (2) came upon a book as confusing as the others, but which had nearly two pages of homogeneous lines. He showed his find to a wandering decoder who told him the lines were written in Portuguese; others said they were Yiddish. Within a century, the language was established: a Samoyedic Lithuanian dialect of Guarani, with classical Arabian inflections. The content was also deciphered: some notions of combinative analysis, illustrated with examples of variations with unlimited repetition.

Re:Library of Babel by marnanel · 2003-12-31 04:55 · Score: 2, Informative

The full Borges story is here. Like much of his work, it's a good read.

--
GROGGS: alive and well and living in

Missing the fact.... by Zibi · 2003-12-30 19:34 · Score: 4, Interesting

I think this report is missing the fact that if someone really wanted to make a hoax book, they could simply translate any other book (even the bible) into a made up language. If it's an obscure book the likliness that anyone would every figure it out is slim.

--
-Zibi

Re:Missing the fact.... by Anonymous Coward · 2003-12-30 20:08 · Score: 5, Insightful

actually very few people could write on any known topic (such as a topic for which we have a contemporaneous book in a known language) in a consistent but made-up language without being easily decipherable. We couoldn't figure out ancient egyptian because we had no idea what topic they were even talking about.... ALL it took to figure out ancient egyptian was being told (in ancient Greek, which we knew) what topic a couple of sentences of egyptian were talking about...we had no idea, having almost NO idea what various examples of the writing could POSSIBLY have stood for.
Re:Missing the fact.... by 1u3hr · 2003-12-30 20:54 · Score: 5, Insightful

if someone really wanted to make a hoax book, they could simply translate any other book (even the bible) into a made up language.
Making up a language, that isn't just a scrambled version of an existing one, is very, very hard. It takes someone like Tolkien (a professor of Old English who could translate Norse on the fly) to do that convincingly, and I doubt that anyone in the period could have done it in a way that would still defy detection.
Re:Missing the fact.... by Zibi · 2003-12-30 21:22 · Score: 2, Informative

Wow I've never actually made a comment on slashdot and had so many replies. To be entirley honest I don't know much about the document in question. When I scanned through, it struck me that they are looking into complex ways of proving it to be a hoax when it could be something more simple. I do understand the complexities of creating a language, and I didn't really mean to make up a completely new language with new gramar etc., I was more refering to creating your own alphabet. Create your own symbols. If you wanted to make it complex you could add your own rules and extra characters such as what other languages use (i.e., a character for the "th" sound, and another for the different pronounciations of the letter "a" and so forth). It would be very time consuming to translate something like that, but if you have nothing to do for a decade or so and are driven to try and confuse people for many years then I'm sure it could be done (although I personally doubt it). Anyway, just a brain splurge.

--
-Zibi

Beale Papers by Dan+East · 2003-12-30 19:39 · Score: 4, Interesting

Sounds a bit like the Beale Papers.

Dan East

--
Better known as 318230.

Re:Beale Papers by Lispy · 2003-12-30 23:47 · Score: 2, Funny

Wow, now I know whats my quest in 2004....The Treasure is mine...;-)

Ridiculous by SargeZT · 2003-12-30 19:40 · Score: 4, Interesting

I'm sorry, but calling the Voynich Manuscript a hoax is unfeasible. Sure, could it have in theory been a hoax? Yes, but there is no point to this. The "hoaxer" creates this in 3+ months, with very accurate drawings, and probably hangs on to it till he dies, so that it can be sold to a king 100 years later and eventually make it to america? Then again, maybe Nostradamus wrote it.

--
And why did you staple the trout to the RAM?

Re:Ridiculous by Seth+Morabito · 2003-12-30 20:00 · Score: 5, Interesting

The point of a hoax, in my opinion, would most likely have been financial gain.

There is no clear evidence pointing to an exact date that the manuscript was written, and the only firm circumstantial evidence we have to go on is Marcus Marci's letter to Anasthasius Kirchir, which mentions that the manuscript was sold to King Rudolph for 600 ducats. That is a heck of a lot of money. It seems perfectly reasonable to me that someone manufactured the manuscript to extract 600 ducats from the emperor.

This assumes a lot. It assumes that the letter is genuine, and it assumes that the facts mentioned in the letter are true, and it assumes that Rudolph was the first buyer, so it is by no means a sure thing. But a lot of us who lean (gingerly) toward the hoax theory stand by Occam's Razor, which points to a hoax being at least a feasable, and probably even likely solution. Rugg's analysis is just more circumstantial evidence, not proof, but every little bit weights the scale more.
Re:Ridiculous by shaitand · 2003-12-30 20:43 · Score: 4, Insightful

No actually "evidence" THIS broad lends no weight whatsoever. I saw this wholeheartedly as someone who has never even heard of the particular manuscript in question.

Here is what I know, partly assuming what you've said is accurate. Nobody knows when the manuscript was produced, the only evidence that indicates it's existance at a particular point may be suspect (although this is the case with much of the dates we've fixed for events in history and even the basis for several things we believe happened to the degree we call and teach them as facts). Yet this discovery claims at the time the manuscript was produced it was possible to produce fake meaningless gibberish that appears to have meaning.

Am I the only one who finds a problem with that in itself? How can you claim something was possible at the creation date when you don't know the creation date?

Next, giving that magically the date looked into did happen to coincide with the creation date that nobody knows. How exactly does a process being theoretically possible at a date get considered as evidence that is what was done in a particular instance?

Example, my house catches fire. Firefighters are unable to determine the source. The insurance company denies my claim on the grounds that the technology existed to rub two sticks together to generate heat and produce fire.

I wouldn't even call that circumstantial evidence. That isn't EVIDENCE at all. Hell if there were two sticks in the lawn right under the tree, then it would become the most ridiculous circumstantial evidence that should obviously be tossed aside. But it would be the sticks that are the evidence there, not the fact that it's possible to create fire by rubbing two sticks together and the technology existed at the time. However there isn't even that much here.
Re:Ridiculous by shaitand · 2003-12-30 20:46 · Score: 2, Insightful

"He has classic symptoms of numerous mental disorders"

Without intention of implying Nostradamus was or was not one of them. The same can be said of pretty much EVERY truely great mind in human history.

Those who believe being "normal" which is equivelent to "average" is a GOOD thing aren't likely to ever join their ranks ;)
Re:Ridiculous by Charles+E.+Hardwidge · 2003-12-31 02:24 · Score: 3, Insightful

The point of a hoax, in my opinion, would most likely have been financial gain.

The manuscript was produced in a time when alchemy was the only science in town. Knowledge of herbal cures would've been a goldmine during that period, and studies to discover how to turn base metals into gold were the arms race of its day. Given that alchemist commonly encrypted their notes, this manuscript would've made a tempting purchase.

One overlooked thought is the amount of effort that went into encryption and decryption at the time. It's possible the manuscript was designed to intrigue the political masters who would then throw all of their decryption resources at the manuscript, at the expense of apparantly more mundane, though more important, documents and cryptographic research being ignored.

Fake, maybe. Fake what? That's another question.

The pattern of nonsense by the+end+of+britain · 2003-12-30 19:42 · Score: 4, Interesting

The technique really is interesting. We have techniques that can identify patterns that are meaningful (all of cryptology, most of number theory, graph theory) but this application is neat because it is an effort to prove--rigorously--that a given set of data is just total noise.

--
"Oh, the tragedy of math gone wrong. I can't even talk about it." -Wil Wheaton http://www.wilwheaton.net

so obvious by segment · 2003-12-30 19:42 · Score: 4, Funny

Gordon Rugg has used the techniques of Elizabethan espionage to recreate the Voynich manuscript, which has stumped code-breakers and linguists for nearly a century

Had Mr Rugg just used rot13 he would've cracked the code long ago. Want Crypto?

--
MoFscker

It's one thing to say something is a hoax... by WIAKywbfatw · 2003-12-30 19:43 · Score: 3, Insightful

...but it's another thing to prove it.

Anyone can say anything is a hoax but it takes scientific evidence - actual empirical data - to prove such a claim.

For example, people once believed that the Earth was flat (some people still do) but the circumnavigation of the globe by explorers such as Magellan, lunar exclipses, etc provide evidence to the contrary.

Saying that just because something could be a hoax then it is a hoax is just plain stupid. Like Fermat's Last Theorem, it may be many years before Voynich is proved to be geniune or accurate, but the absence of proof of the former doesn't provide proof of the latter. Remember, even though TLF has been proved, we still don't have the "simple proof" that Fermat himself discovered.

Saying that the manuscript is more likely to be a hoax than not just because computer scientists have theorised that it could have been faked in the 16th century is like a 25th century scholar saying that the Wright Brothers flight, the atomic bomb and the Apollo missions are more likely to be hoaxes than not just because they could have been faked with 20th century technology.

--

"Accept that some days you are the pigeon, and some days you are the statue." - David Brent, Wernham Hogg

Re:It's one thing to say something is a hoax... by cpeikert · 2003-12-30 20:00 · Score: 4, Insightful

Remember, even though TLF has been proved, we still don't have the "simple proof" that Fermat himself discovered.

That's because he almost certainly didn't discover one.

Fermat was known for making some pretty bone-headed mistakes. Also, in his future writings he posed challenges to prove FLT for the case of n=3 or n=4, but never for general n>2. If he had found a truly elegant proof of the general case, and believed it was true, why not pose the general challenge?
Re:It's one thing to say something is a hoax... by jdbo · 2003-12-30 20:16 · Score: 2, Insightful

I think what you're trying to get at is the inherent difficulty of "proving a negative". It's always much easier to prove an affirmative (which inherently contradicts the position to be disproved). Hence, sailing around the world to proves that it's round, and therefore not flat.

However, the article offers speculation, not claims of proof/disproof.

I don't see anything unreasonable in the claim that the manuscript might be a hoax; reasonable observers will note that this is not actual proof.

In the meantime, speculation (within the realm of reason) that something could be hoax may suggest to these/other researchers paths of approaching the manuscript which eventually lead to its proof/disproof.

This is how the scentific method is supposed to work -exploration of multiple paths moving progressing (hopefully) towards deeper insight.
Re:It's one thing to say something is a hoax... by sakusha · 2003-12-30 20:26 · Score: 2, Interesting

RTFArticle. It is pretty clear that if the text can be produced by the algorithmic chart as described, it is meaningless gibberish.

You remind me of Stanislav Lem's classic book "Memoirs Found in a Bathtub." It's about a society that revolves around codebreaking. Lem makes huge plot points about short texts that are ambiguously decodable into dozens of other possible texts. They are never sure if the message really IS a code, or whether one of the decoded versions contains further codewords. But everyone is absolutely convinced that everything is encoded, nothing is what it seems.

And such is true of almost anything, leading to mental masturbation like The Bible Code. People WANT to believe it's real, but it's all a hoax.
Re:It's one thing to say something is a hoax... by Bagheera · 2003-12-30 20:28 · Score: 2, Interesting

Actually, the burden of proof would be on those who claim there is some meaning in it. Reading the article, and references to the manuscript, the "It's a hoax" proposition now has a plausible explanation as to how a hoax could be perpetrated. While not conclusive evidence.

Anyone can say anything is a hoax but it takes scientific evidence - actual empirical data - to prove such a claim.

Anyone can claim anything, but the more outrageous the claim the more evidence they need to support it. Someone could claim the book was the work of Aliens. That claim would take more conclusive evidence than "It was part of a clever scam." While this doesn't prove the hoax theory, it gives it more plausibility than simple supposition.

Honestly, do you think it's more likely to be an authentic encoded manuscript of alchemy? Occam's Razor favors the hoax. To challenge your analogy, it's much more like a 25th century scholar looking back and saying Roswell was a hoax than Kitty Hawk was.

--
Never attribute to malice what can as easily be the result of incompetence...
Re:It's one thing to say something is a hoax... by KjetilK · 2003-12-30 22:00 · Score: 4, Informative

Anyone can say anything is a hoax but it takes scientific evidence - actual empirical data - to prove such a claim.

No. It is the proponents of the idea that the book is genuine's job to prove that it is indeed that. One doesn't need to prove that something is a hoax if it is, Occam's Razor does that job. What explanation is contains the fewest ubstantiated assumptions: That something was written a language nobody knows, containing valuable information nobody has any idea about, or that it was produced using a simple encryption technique to fool somebody to pay loads of shiny ducats?

For example, people once believed that the Earth was flat (some people still do) but the circumnavigation of the globe by explorers such as Magellan, lunar exclipses, etc provide evidence to the contrary.

I find it amazing that some people still hold this myth as true! What kind of history education have you had!?!
Look, no scientist have never claimed the earth was flat. For one thing, in every other culture than the western, it has never been claimed otherwise ("they even knew the earth was spherical"), but some has got the weird notion that Columbus had to argue that the earth wasn't flat.
He didn't. The moron had the wrong numbers, and would have gotten killed if America didn't happen to be there.
Allready the pupils of Thales claimed their master knew the earth was round. Erastostenes, measured the circumference of the earth with an error of 3%! The true circumference of the earth was known to the greeks in antiquity! Plato and his pupil Aristotle himself knew many arguments for the spherical shape of the earth, and why is this important? Because though some Christian scholars around 300 AD didn't like the idea of a spherical earth, St. Augustin adopted much of Plato's philosophy and made it an important part of christianity in the same century, and they adopted the ideas of a spherical earth as well. Through Augustin, every leading authority accepted the idea of a spherical earth.
Eventually, Erastostenes numbers was also accepted , but Columbus didn't like them, because it meant that going the other way to India was infeasible. So, he used some other numbers, and he used Marco Polo's exaggerated estimates of the distance he had travelled, and so he made it quite feasible. But it wasn't, he was wrong.
Columbus thought the distance to Asia was 4000 km, his contemporary scientists 16000 km, the real distance is 23000 km, while Columbus eventually travelled 6500 km.
So, why is this important? Because people who hold this belief often have many other misunderstandings about science. Indeed, you can't prove that the book is a hoax, but for that reason, the burden of the proof rests with the proponents of the idea that it is genuine. Who, of course, might cling to the idea that it is, long after the world has moved on to greener pastures. That's how it usually works anyway.

--
Employee of Inrupt, Project Release Manager and Community Manager for Solid
Re:It's one thing to say something is a hoax... by notfancy · 2003-12-31 04:14 · Score: 3, Informative

Anyone can say anything is a hoax but it takes scientific evidence - actual empirical data - to prove such a claim.

No. It is the proponents of the idea that the book is genuine's job to prove that it is indeed that. One doesn't need to prove that something is a hoax if it is, Occam's Razor does that job. What explanation is contains the fewest ubstantiated assumptions: That something was written a language nobody knows, containing valuable information nobody has any idea about, or that it was produced using a simple encryption technique to fool somebody to pay loads of shiny ducats?

No, to you. Occam's Razor is a heuristic for selecting hypotheses to test. It doesn't relieve you of the burden of proof just because your burden is heavier. You definitely do need to prove that "X is false", if that is the hypothesis you selected based on whatever heuristics you choose.

Voynich is patently written in an unknown code (i.e., language): that's not an assumption, it's a given for both hypotheses. The first hypothesis (you used the non-synonim "unsubstantiated assumption") is that Voynich has high information content in the algorithmic sense. The second hypothesis is that Voynich has low information content, again in the algorithmic sense. Considerations of value, motive, etcetera are irrelevant to this analysis although they might be of heuristic value for selecting hypotheses, but not for application of Occam's Razor (which is another heuristic).

To sum it up, you still have the burden of proof, and you can't use heuristics for selecting heuristics.
Re:It's one thing to say something is a hoax... by poot_rootbeer · 2003-12-31 04:16 · Score: 2, Funny

If he had found a truly elegant proof of the general case, and believed it was true, why not pose the general challenge?

hmm... because the margins were too small to pose it in?

Looks like things haven't changed in 500 years! by a-aiyar · 2003-12-30 19:45 · Score: 2, Funny

Lets see - it turns out that the Voynich manuscript is likely a bunch of drivel that pictures of naked women. Looks like we haven't come that far since it was written, as this Filipino edition of FHM would suggest!

Google found me this by ElDuque · 2003-12-30 19:46 · Score: 5, Informative

In case you're wondering what it looks like

http://www.voynich.nu/

Cryptonomicom has this by puzzled · 2003-12-30 19:53 · Score: 3, Interesting

There is a portion of Cryptonomicom by Neal Stephenson where a real book of coded intercepts is replaced by random number strings encrypted with a fairly simple scheme.

Does anyone know if this book is a seed for Stephenson's story? He draws an awful lot of information from the history of computing for his stories.

--
I am very easy to get along with, but I don't have time to waste being nice to people who are being stupid. -Theo

Re:Cryptonomicom has this by Jonathan · 2003-12-30 23:51 · Score: 2, Interesting

Well, a more likely inspiration for the "Cryptonomicon" manuscript mentioned in Cryptonomicon and Quicksilver is the Steganographia of Trithemius. In the late 1990's the book was briefly in the news because a well known cryptographer, Jim Reeds, found and deciphered a hidden message from it.

Anyone else get the feeling... by carambola5 · 2003-12-30 19:54 · Score: 3, Funny

Does anyone else get the feeling that these people are just saying "It's too hard. We give up" ?

--
IWARS.
People, in general, disappoint me. Politicians even more so.

A Hoax? To What End? by WombatControl · 2003-12-30 20:03 · Score: 4, Informative

I've studied the Voynich manuscript before, and the possibility of a hoax seems just as unlikely as many of the theories that have been floating about. Yes, the language of the Voynich manuscript could be an elaborate hoax, but Rugg's analysis only proves what is already widely known.

The problem of creating such an elaborate hoax is that even Rugg's theory doesn't explain all the features of the Voynich manuscript. Furthermore, it seems unlikely that a sixteenth-century forger would go to the trouble of creating something that would have all the qualities of a real language and would include techniques that would deliberately resemble an actual document when viewed with analytical techniques that wouldn't be developed later. Occam's Razor makes it seem more likely that there some kind of language operating in the manuscript than a random system of patterns. Then again, there's no real way of knowing.

There are some images of the text of the Voynich Manuscript available here. Analysis of the text and the illustrations support the theory that the manuscript has defined sections on astrology, herbal medicine, and other subjects. There have been some serious and some rediculous theories about the manuscript from the intriguing notion that the Voynich text is mathematically similar to East Asian languages like Chinese or Vietnamese, or that the Voynich manuscript is written in an ancient form of Ukrainian. (I've read the supposed translation of it from the Ukrainian, and it hardly makes sense given that the manuscript's illustations don't match the text of the supposed translation.)

In the meantime, this site offers more information on modern translation efforts including a font for the Voynich script. (Which would make a lovely way of annoying co-workers by switching their default system font to Voynich text...)

Author's Page by mlc · 2003-12-30 20:19 · Score: 4, Informative

Prof. Rugg has a website about his methods and results, which may be of interest.

From the article... by aztektum · 2003-12-30 20:27 · Score: 2, Funny

To prove that the manuscript is a hoax, one would need to produce entire sections using this technique, says Pelling. Tweaking the grilles and tables should make this possible, reckons Rugg.

It's called a Xerox machine man.

--
:: aztek ::
No sig for you!!

Here is a better one. by AltGrendel · 2003-12-30 20:31 · Score: 3, Informative

More info is at http://www.voynichinfo.com/

It has a slow load due to java applets though.

--
The simple truth is that interstellar distances will not fit into the human imagination

- Douglas Adams

Missing a (cryptographic) clue ... by Professor+D · 2003-12-30 20:36 · Score: 5, Insightful

But, a volume of self consistent language (even a made up one) of over a hundred pages of text with accompanying pictures should fall to statistical and linguistic analysis.

Champolion cracked the Rosetta stone with much much less.

The 'true' examples of lost written languages/cyphers (do a google search) are mysteries because there exist few examples of brief length usually bereft of context (of grammar, history, linguistic evolution etc.).

The sheer volume of the Voynich manuscript, plus its origin in relatively modern Europe is what makes it so interesting to amateur cryptographers.

The Nature Paper is too brief to know how good Rugg's analysis is (and the Cryptologia site has been slashdotted), but if it holds up it is an interesting result, even if it is a conclusion that many "very smart cryptographers"(TM) have suspected for a long time

Re: Missing a (cryptographic) clue ... by Black+Parrot · 2003-12-30 22:55 · Score: 5, Insightful

> But, a volume of self consistent language (even a made up one) of over a hundred pages of text with accompanying pictures should fall to statistical and linguistic analysis.

I doubt it. How many possible mappings are there between strings of characters and meanings? And even with plausible interpretations of the pictures (e.g., a herbarium), the number of things that might be said in that context is for all purposes unbounded:
xyz =?= "this soothes the throbbing toe"
xyz =?= "this is very poisonous"
xyz =?= "this grows only in Ys"
xyz =?= "I learned this from my grandmother" ...
Surely it will never be deciphered if it is in an unknown language.

> Champolion cracked the Rosetta stone with much much less.

Actually, he had the benefit of a parallel text.

In the absence of a parallel text, this will only be decyphered the way Linear B was: after a rigorous analysis of the patterns in the text, and a much tighter context (essentially lists of <picture,name,number> tuples), it was noticed that some very obvious translations ("man" and "woman", or such) fit the inflectional pattern of a language historically spoken in the region where the texts were found, and that simple mapping could be extended to other obvious <picture,name> pairs without introducing inconsistencies.

I suppose it's possible that something similar could be done with the manuscript, but IMO only if there are some clearly labeled images that give tight enough a context to guess the specific word being used. And then some luck, because somebody has to recognize some language-specific patterns (such as the Greek masculine/feminine inflectional suffixes). And of course, more luck in what language it happens to be: Linear B might never have been deciphered if Greek didn't use gender-based patterns in its noun declensions.

If it happens to be written in some unknown language, IMO it will never be deciphered.

--
Sheesh, evil *and* a jerk. -- Jade
Re: Missing a (cryptographic) clue ... by Carewolf · 2003-12-31 00:25 · Score: 2, Interesting

No you underestimate the inherent limits of a structured language. The reasons you list are the reasons it might not be deciphered if it was a cryptographic language. If it is a natural language it would still fail.

Imagine attacking common words and phrases. If you read an english text, you would quickly notice words like "the" "a" "and", and it was a letter stuff like "you" and "me" Once you have a large set of common words and phrases you look at how they are placed and structured, and start making qualified guesses to their relationship.

Basically out cryptographica today, is so advanced that it now only can break most common encryptions, but it can infact break the differences between most langauges if guided by human sense.
Re: Missing a (cryptographic) clue ... by TygerFish · 2003-12-31 02:11 · Score: 2, Insightful

Basically out cryptographica today, is so advanced that it now only can break most common encryptions, but it can infact break the differences between most langauges if guided by human sense.

I think that this is an interesting curiosity but possibly a sad one for our age. It's hard to find people with heavy skills in dead languages nowadays.

On a more discouraging note, once you throw encryption into the picture and add it to an unknown(?) inflected language, you see that the problem will require the assembly of rare intellectual resources to even adequately define the problem. Talking about human sense is one thing, finding humans capable of applying the sense could well be problematic.

I've studied several modern languages and I don't want to even *think* about what's being discussed here. Good luck!

--
To mail me, remove the 'mailno' from my email addy.
"Yeah. It smells, too..."
Re: Missing a (cryptographic) clue ... by Black+Parrot · 2003-12-31 10:55 · Score: 2, Informative

> I can see how this could be done as a really big simultaneous equation where the coefficients are dummy values for verb/noun etc with a parallel set of equation based on the grammar rules (combination), then variance analysis to eliminate the typos. Pretty elementary.

If you think it's pretty elementary you should write it up and publish it, since doing so would make your name an instant household word in fields ranging from philology to computer science, and probably also harvest you a fine crop of honorary PhDs and cushy job offers.

The problem of inducing grammars from examples has been intensely studied, and about all we know about it is that it's hard. (For example, we have a theorem showing that it's impossible to learn an arbitrary Context Free Grammar from any finite number of example strings.) The way children learn their language's grammar is so baffling that intelligent people have seriously supposed that you're born with a grammar processor in your brain that already knows how to process any possible natural language grammar and a support module that helps you determine which grammar to use by inducing a small set of switch settings from the examples you hear.

For that matter, after several decades of research we are just now getting to the point where computers can reliably parse natural language sentences even when we already know the language, its grammar, and have a lookup dictionary for all the words in the language. Automatically determining the parse of a sentence where both the grammar and vocabulary are completely unknown is a phenomenally more difficult problem; I suspect it's impossible even in principle.

As to your suggestion, I'm curious how you're going to solve a system of simultaneous equations when the data you are working with doesn't actually express any equalities. (Stop for a minute and think how you eliminate variables in a system of simultaneous equations.) For that matter, I'm not even sure what your representations are supposed to be.

It almost sounds like you're wanting to try all possible combinations for the part of speech for each word, but the combinatorial explosion would eat your lunch (n^m solutions, for n parts of speech and m words, even assuming no words can play multiple roles). Perhaps worse, even if you could enumerate all the possibilities you wouldn't be able to tell which one was correct. Since you don't know the grammar in advance, and since natural language grammars can be remarkably different from each other, you simply wouldn't have any way of knowing which part-of-speech mappings resulted in grammatical sentences and which didn't.

And if by some chance you did guess the correct grammar, you still wouldn't have a clue what the words meant.

> then variance analysis to eliminate the typos

If that's possible for unknown languages, it should be easily applicable to known languages. Are you suggesting a methodology for automated proofreading, that would catch typos in manuscripts? Could it be embedded in the slashcode, to automagically correct the typos in our posts? This technology alone would make you rich, even without all the other stuff you would need for interpreting unknown languages.

--
Sheesh, evil *and* a jerk. -- Jade

repeats by 1u3hr · 2003-12-30 20:48 · Score: 5, Insightful

The Nature story says:

The text contains some features that are not seen in any language. The most common words are often repeated two or three times, for example - the equivalent of English using 'and and and' - giving weight to the hoax theory.

Indonesian pluralises words by duplicating them (anak = child, anak anak = children). And many languages, including English ("he was really, really stupid") intensify by repetition, so this point is not at all conclusive.

Re:repeats by PurpleFloyd · 2003-12-30 21:30 · Score: 3, Interesting

Of course, that particular point isn't much, cryptographically. Ever since frequency analysis came into use, historical cryptographers used "nulls" in their codes - random meaningless characters which would hopefully cause trouble to frequency analysts. It may be that the manuscript's code contains keywords that the decoder should ignore (all repitions of a word, for instance), or instruct the decoder to perform a certain action (say, 3 repititions means to skip the next three words).
On the other hand, this certainly could be a hoax. After all, the author was familiar with cryptographic methods and was paid an enormous amount of money for the manuscript. The real truth could certainly be either hoax or reality - there simply aren't enough facts available to decide right now, despite the huge amount of work put into the manuscript by many talented amateur cryptographers.

--

That's it. I'm no longer part of Team Sanity.
Re:repeats by 1u3hr · 2003-12-30 22:17 · Score: 2, Insightful

Neither "anak" nor "really" fit that.
Those were random examples. In Indonesian, EVERY noun is doubled to pluralise. So this is very common feature indeed. In English, no, we don't duplicate so much.
As far as the main article goes, though, I'd vote for it being a hoax.

Interesting problem. by Black+Parrot · 2003-12-30 22:22 · Score: 4, Interesting

Those who read the article can take note of an interesting challenge: though Rugg has shown that it is possible to generate a high quality hoax using a Cardan grille, proving it to be a hoax may require producing a character grid that will actually generate large portions of the text. My question is, could that be done with a genetic algorithm, and are any Slashdotters up to the task?

Also, a few comments about formal analysis. Notice that if you took some arbitrary text, typeset it in a fixed-width font to force the characters into columns, and then skimmed it with a grille in order to generate a new text, you would automatically preserve such basic statistics as character frequency, including spaces and also punctuation if you used them in your grid. (Depending on how you applied the grille, you could actually be generating a simple permutation of the original text.) However, you would disrupt all the within-word correlations.

For example, in compound words derived from Latin there is a familiar pattern where ad C* ==> aCC* (where C is some arbitrary consonant), but that pattern would be completely obscured if the characters were read off a diagonal grille as shown in the photograph. You would still get the increased frequency for C, but not the common aCC pattern.

More subtly, there are some well known universals of syllable structure in natural languages, but those would be scrambled just as the aCC would be. You would have the right proportions of consonants and vowels, but not a realistic distribution within words.

Likewise, prefixes and suffixes would be scrambled. If it is a hoax generated by a Cardan grille, it should not have prefix/suffix patterns that occur commonly in many languages. (Ditto for suffixal inflections.) In fact, the letters appearing at the beginnings and ends of words should be a random sampling from the frequency distribution of letters in the whole text; this may be the easiest metric to check.

Also, by using spaces as characters in your grid you'd get the right proportion of spaces, and therefore the right average word length, but you would obscure any patterns in word length. Someone has already linked to studies of the word lengths in the manuscripts, but those assumed that the distribution of Latin word lengths word lengths would be preserved. However, only the average would be preserved. I suspect the distribution would be converted to a gaussian. Anyone got time for the experiment? (Notice that you may generate extra spaces with the grille, depending on how you use it. For example, what do you do when your grille starts running off the bottom of the page in your source text? Or, if your grille has 10 windows, do you transcribe to the first space and then move the grille, or do you transcribe everything in the grille and insert a "virtual" space for position 11? It looks to me like you might be able to generate the document's actual "word" lengths from Latin, given only some very basic assumptions.)

--
Sheesh, evil *and* a jerk. -- Jade

Re: Interesting problem. by Black+Parrot · 2003-12-30 22:31 · Score: 3, Interesting

> In fact, the letters appearing at the beginnings and ends of words should be a random sampling from the frequency distribution of letters in the whole text; this may be the easiest metric to check.

Actually, the distribution of initial letters might be preserved, or at least mostly preserved. If the source text is written so that lines always begin with a new word, and the grille is always aligned with the start of a line, then what you read out of the grille will preserve the frequencies of word-initial letters. But if you read more than one "word" out of the grille before moving it, you will get a mixture of the true word-initial distribution plus the distribution of all the letters in the document. And if you don't always align the grille to the start of a line, all bets are off.

Off hand, I don't see any way that the distribution of word-final letters would be preserved. The first thing I would do to detect a hoax is compare that distribution to the distribution of all the letters in the document. If they are the same, then I would suspect the use of a grille or some other randomizer.

--
Sheesh, evil *and* a jerk. -- Jade

Repeats? by plumby · 2003-12-30 22:32 · Score: 2, Interesting

contains some features that are not seen in any language. The most common words are often repeated two or three times, for example - the equivalent of English using 'and and and'

What about Chines? From the little that I've learned, they often repeat a word for emphasis - e.g., Xie Xie meaning thank you.

Can you say "Kolmogorov complexity"? by dido · 2003-12-30 22:50 · Score: 5, Interesting

One definition of randomness, and one that seems quite reasonable is that a string is "random" if it cannot be compressed to smaller than it is, i.e. listing its characters itself is the most compact possible description. Formally, a string is random if there exists no algorithm generating the string whose description on some universal Turing machine is smaller than the string itself (this is the definition used in the field of Kolmogorov complexity). A string of a billion digits making up Pi, for example, is not random by this definition, as one can easily write a short program, whose length would certainly be less than one billion characters, whose output is the digits of Pi. Think of it this way: the most general form of pattern matching device that we know of is a Turing machine, and if the best device you can construct to match that pattern is as complex or more complex than the pattern itself, then well, you have total randomness. Unfortunately, rigorously proving that a particular string is random by this very strong definition is extremely difficult, as you run into undecidability everywhere you turn.

This is the sort of stuff that real theoretical computer science is made of. For a very good overview of the theory of Kolmogorov Complexity and algorithmic information theory, Gregory Chaitin's home page is a good starting point

To go back to the Voynich manuscript, if there is some sort of regularity that can be discerned from it, then perhaps a context-free or context-sensitive (or something in between) language may be found to characterize it. Once you have such a syntactic characterization, perhaps it might be possible to divine the semantics from context. The shape of the grammar that results may well prove whether the Manuscript is in fact a real language, a fabrication, an elaborate cipher, or just total gibberish.

--
Qu'on me donne six lignes écrites de la main du plus honnête homme, j'y trouverai de quoi le faire pendre.

Re:Can you say "Kolmogorov complexity"? by kasperd · 2003-12-30 23:38 · Score: 2, Interesting

That definition of randomness does make sense. Unfortunately it is undecidable, so you can never prove something is random according to the definition. You can prove something is not random, if you can find a program generating it. But if you cannot find such a program, you don't know if it is because it doesn't exist, or if you just didn't look on the right one.

As for finding a language given the string, it isn't hard to find a regular language containing the string, the hard part is to find the right language. It is trivial to define a regular language that contains all strings. But in this case it probably isn't the right one.

--

Do you care about the security of your wireless mouse?

Re:More information by actiondan · 2003-12-31 00:33 · Score: 3, Informative

oops. extraneous space in the link. Here's one you just need to click

Burden on proof ... by aepervius · 2003-12-31 00:34 · Score: 3, Insightful

If you and other purport it having a meaning then you should come forward with it. Unless you have compelling evidence, thern what you are presenting is no more no less than wishfull thinking or belief.

Indeed right now it isn't prooved at all that this manuscript has any meaning (encrypted or not) and a researcher prooved that you can reproduce most of the feature of the manuscript by using an encryption technic born a few year earlier. Furthermore the person selling it to the first known possessor was a forger. Yes not all feature are repdroduced. But this is a step forward.

The burden of proof is with you and "Then again, there's no real way of knowing." isn't an answer. At least none a scientific and a person interresed into knowing moer hold for enough. And, yes "Voynich manuscript. Furthermore, it seems unlikely that a sixteenth-century forger would go to the trouble of creating something that would have all the qualities of a real language and would include techniques that would deliberately resemble" Well I have news for you. 3.5 Kilogram gold (a prince wealth for the time) make it more likely than you wish to hold it.

You might have included a lot of link making people see your post as informative, but frankly it isn't especially your dubious use of Occam's Razor (The explanation needing the LESS number of new entity is the most probable). Sorry but to purport that the manuscript hold meaning is having one unknown new entity (from where that language come ?) more than purporting that using the clever trick aforementionned (available at that time) which hold no unknown new entity.

My final point is, Occam's razor only say you what is the most likely explanation. NOT WHAT IS THE CORRECT ONE.

--
C. Sagan : A demon haunted world:
http://www.amazon.com/gp/product/0345409469/
visit randi.org

Codebreak this! by crazyhorse44 · 2003-12-31 00:41 · Score: 2, Funny

Gue sepak biji lu!

--
. SLASHDOT: Home of the vicious nerd.

It's ancient and indecipherable! by buckeyeguy · 2003-12-31 01:22 · Score: 2, Interesting

Therefore it must be important! Eh, no. (See the Urantia Book for one example of why some old nonsense is better left aside.)

Years ago I had a coworker who would blather on about the Urantia book and its 'answers'... but then he was an old stoner too.

--
I'd have a personalized plate on my car, but "toxic bachelor" won't fit into 7 letters.

Arlet and the rec.puzzles archive by Mikey-San · 2003-12-31 01:26 · Score: 2, Interesting

Here's a great little bit of information regarding Voynich:

http://rec-puzzles.org/new/sol.pl/cryptology/Voy ni ch

Mmm, strangeness.

--
Mikey-San
Karma: +Eleventy billion (mostly affected by watching Celebrity Jeopardy)

Your analogy is incomplete by DeadVulcan · 2003-12-31 01:26 · Score: 3, Informative

Example, my house catches fire. Firefighters are unable to determine the source. The insurance company denies my claim on the grounds that the technology existed to rub two sticks together to generate heat and produce fire.

Of course, this is ridiculous. But there have been many who claimed that producing a hoax as convincing as the Voynich papers was virtually impossible. Rugg has shown that, at the earliest known date of "discovery," it was possible, and perhaps well worth doing for the price it fetched.

So, your analogy is incomplete. The insurance company's argument would have some relevance if you had previously been claiming that it was technologically impossible for you to light the fire. They just produced a counter-argument.

Coming back to the Voynich manuscript, it just means that the possibility of a hoax cannot be ruled out because of the effort required to produce it. Turns out it's not as hard as people thought.

--
Accountability on the heads of the powerful.
Power in the hands of the accountable.

Re:Your analogy is incomplete by Matthew+Austern · 2003-12-31 07:44 · Score: 4, Informative

There's no direct evidence that the document is forged. There's also no direct evidence that it's genuine, or even what "genuine" would mean. There are stories vaguely associating it with various interesting people, such as John Dee and Roger Bacon, but they're all pretty vague.

People have been studying this document for the better part of a century, because it's fascinating, enigmatic, and beautiful. (You can find some pictures of it at www.voynichinfo.com) We know a bit more than we did about what kinds of hypotheses are plausible and what kinds are not. For example: we can be pretty sure that it is not written in any natural language. We can also be pretty sure that it isn't just a simple substitution cipher. Finally, we can be pretty sure that it isn't a 20th century forgery: it has been given a rough date, it really does look like a manuscript from the 15th or 16th century, and it probably was once owned by Rudolf II. The Roger Bacon rumors are almost certainly false, because the manuscript doesn't appear to be that old. The John Dee rumors may be true.

At present the two most plausible guesses are that it is a real 15th or 16th century treatise on an occult subject, written in a code that has yet to be broken, or that it's a good imitation of an encoded occult text. If the latter, it was probably written specifically for the purpose of fooling Rudolf. It is known that he was fascinated by the occult (there's even an opera where that's a crucial plot point), and it is known that many of the astrologers and alchemists he patronized were quacks and that many of the texts he bought were forgeries.
What's interesting about this research isn't that it's a new argument against the possibility that the manuscript is genuine, but that it's a good counterargument. Until now, many people argued that the manuscript wasn't likely to be a forgery because the text followed a certain statistical property of natural languages (Zipf's law) that weren't known until the 20th century. Thus, the argument goes, it's unlikely to be a 16th century fake because a 16th century forger, inventing a fake code or a fake language, wouldn't have known to match this statistical distribution.

The reason this work is interesting is that it shows that this argument is invalid: there is a plausible method that a 16th century forger might have used that might have produced such a document. This doesn't show that it really is a 16th century forgery, it only shows that there's one fewer argument against that possibility than we once believed.

In the end, of course, we're unlikely to ever have decisive evidence that the manuscript is fake. Either someone will come up with a believable decryption (several people claim to have done it already; none of their claims have stood up), or people will keep trying and failing. The longer scholars bang their heads against the wall trying to get a translation, the less likely people will think it is that there really is one. Messy, but that's the way the world works. Sometimes you don't get to learn for sure whose guess is right.

Bible Code? by gillbates · 2003-12-31 02:18 · Score: 3, Interesting

I do believe that there are "codes" in the Bible, but the reason is different than what the fanatics describe. My belief is that the Bible codes exist for only one reason: to ensure accuracy. Consider the following:

The cat in the hat caught a rat and that was the end of that.

Notice the rhyming. Now translated into spanish (courtesy babelfish):

El gato en el sombrero cogio una rata y ese era el final de eso.

Now translated back into english:

The cat in the hat took a rat and that one was the end of that.

Okay, so notice in the original that the rhyming words appeared in positions 1, 4, 7, 9, and 14 (zero based). In the retranslation, the rhyming words appear in positions 1, 4, 7, 9 and 15. This disparity alone is enough to determine that the retranslation is not accurate.

Supposing that one writes in such a manner that there is a definitive pattern to their sentences and word choices, it is easy to determine the accuracy of a text after having gone through many translations. For a book such as the Bible, this was of paramount importance. I believe the original purpose of the "Bible codes" was to ensure that the meaning of scripture was not lost as it was passed from one generation to the next.

Consider for example, the poem. If a poem is incorrectly copied, it no longer rhymes, or the meter is disrupted. This simple mechanism not only ensures easy memorization, but provides a security against unintended alteration. In much the same manner, the "Bible codes" have provided scholars a way of discerning the accuracy of a copy of scripture. In fact, some of scripture is indeed poetic, further reinforcing the confidence in the original scriptures.

I find it somewhat interesting that lossless copying was available long before digital electronics were invented.

--
The society for a thought-free internet welcomes you.

Re:Bible Code? by Ami+Ganguli · 2003-12-31 02:52 · Score: 3, Insightful

An interesting conjecture, but you'd have to provide some sort of evidence to back it up. The "famous" bible codes are clearly nonsense - you can tweak the algorithm to extract just about anything from any text (see here for an example). Do you have some alternative code that stands up better to scrutiny?

Also, at the time the books in the bible were written, accurate transcription wasn't considered nearly as important as it is today. The stories were part of an oral tradition anyway, and would have evolved in the telling before ever being committed to paper. Early scribes were aware of this and would not have thought twice about "correcting" parts of the story that didn't, to them, seem to be right.

--
It is tempting, if the only tool you have is a hammer, to treat everything as if it were a nail. - Abraham Maslow
Re:Bible Code? by Samrobb · 2003-12-31 08:16 · Score: 2, Interesting
Also, at the time the books in the bible were written, accurate transcription wasn't considered nearly as important as it is today.

Sorry - you're wrong, particularly in terms of the writings that make up the Old Testament. The requirements for copying these texts were pretty stringent. Requirements 4, 6, and 7 are particularly interesting:
The Talmud lists the following rules for copying the Old Testament:
1. The parchment had to be made from the skin of a clean animal, prepared by a Jew only, and was to be fastened by strings from clean animals.
2. Each column must have no less than forty-eight or more than sixty lines.
3. The ink must be of no other colour than black, and had to be prepared according to a special recipe.
4. No word nor letter could be written from memory; the scribe must have an authentic copy before him, and he had to read and pronounce aloud each word before writing it.
5. He had to reverently wipe his pen each time before writing the Word of God, and had to wash his whole body before writing the sacred name Jehovah.
6. One mistake on a sheet condemned the sheet; if three mistakes were found on any page, the entire manuscript was condemned.
7. Every word and every letter was counted, and if a letter were omitted, an extra letter inserted, or if one letter touched another, the manuscript was condemned and destroyed.
Can't recall the reference at the moment, but I have come across mention that over the course of nearly a thousand years of transcription, there is a staggering lack of transcription errors in the Hebrew texts.
--
"Great men are not always wise: neither do the aged understand judgement." Job 32:9

Wikipedia by headqtrs · 2003-12-31 03:03 · Score: 2, Informative

Use wikipedia for some background information here

The Voynich manuscript by t0ny · 2003-12-31 03:11 · Score: 4, Funny

Have they tried casting "Read Magic" on it?

--

Manipulate the moderator system! Mod someone as "overrated" today.

You can't permenantly do that by porkchop_d_clown · 2003-12-31 04:12 · Score: 2, Funny

you see, new suckers are being born every minute.

--
Clear, Dark Skies

Research project in progress... by oneiron · 2003-12-31 04:14 · Score: 2, Interesting

There is a serious research project in progress which trying to get to the bottom of this mystery. If you can look past the occasional conspiracy-theorist-kook, there are actually quite a few thoughtful and intelligent folks participating. Here is the discussion thread for the project:

Voynich Manuscript Research Project @ AboveTopSecret.com

Note: Some of the other research projects are pretty interesting, also. In particular, the Yellowstone Super-Caldera Research Project.

Repetitions in the Text by wintermute1974 · 2003-12-31 05:35 · Score: 3, Funny

Its text contains features found in no known language: for instance, its commonest words may be repeated two or three times in succession.
Source: http://www.keele.ac.uk/depts/cs/staff/g.rugg/voyni ch/index.html

It is very, very, very unlikely that common words would be repeated again and again and again unless someone really, really, really wanted to.

More information about Voynich by Elonka · 2003-12-31 06:31 · Score: 4, Informative

On my own list of Famous Unsolved Codes, the Voynich Manuscript is right up there at #2, just under the Beale Ciphers (which also have some pretty compelling arguments that they're a hoax).

Some other good links for Voynich information:

An excellent viewer which lets you quickly see thumbnails of all of the pages at once.
A good overview page
The Voynich Mailing List - a site maintained by Jim Gillogly (famous for cracking the first few parts of Kryptos).

Elonka :)

Was the Cryptonomicom based on the Necronomicon ? by mbone · 2003-12-31 07:01 · Score: 2, Interesting

The "Cryptonomicom" has an obvious liguisitic similarity to the "Necronomicon" of H.P. Lovecraft. Colin Wilson later wrote sci-fi / horror stories that included Lovecraft and which stated that the Voynich Manuscript was actually one copy of the Necronomicon.

I have no idea if Stephanson knew this, but given the similarity of names, I would suspect so.

More details can be found here .

I've cracked it! by RealRav · 2003-12-31 08:12 · Score: 2, Funny

I've cracked it and will gladly give over the translation for 600 ducats.

Dreams are better as dreams than reality.

Surprised this didn't come up by GMFTatsujin · 2003-12-31 12:05 · Score: 2, Interesting

The Solution of the Voynich Manuscript by Leo Levitov was published by the Aegean Press in 1987. Links to Amazon.com are left as an exercise to the Slashdot readership.

Levitov provides methodology for extracting the linguistic model that the book encodes. Many examples and translations are provided, and there is plenty of work for the reader to do if he wants to prove the system to himself.

Levitov proposes that his solution reveals a manual of heretical text regarding the ease and assistance of the mortally ill into death -- euthenasia, basically. To my knowledge, his work has not been discredited, only ignored.

For the definitive hoax-type artificial reality book, check out the amazing Codex Seraphinianus.

Slashdot Mirror

Making The Case That Voynich Is A Hoax

73 of 382 comments (clear)