Slashdot Mirror


ICFP Contest Releases Codex

howie writes "The ICFP Programming Contest begins on Friday and organizers have released the first part of the contest, the "codex". Contest organizers haven't (yet) given any details, but there's been lots of speculation on the discussion list -- the file contains some suggestive strings like "i love bees" and "MKULTRA". The organizers promise more information by noon (ET) on Friday."

38 comments

  1. My Speculation is They're All Blogs by eldavojohn · · Score: 3, Interesting
    Ok ok, so I've only looked at this thing for five minutes in Textpad but there are some very interesting clips of readable text in the file. Early on they say:
    ignoti et quasi occulti
    Which I believe is a latin phrase for something like "unknown and partly hidden." It's also the motto for Societas Eruditorum in Neal Stephenson's Cryptonomicon book.

    Next:
    Evan Chan was murdered
    It's a reference to a game centered around the film A.I. and refers to a society called the Cloudmakers.

    Next:
    evalso dark the con of man
    The phrase "so dark the con of man" is actually a religious blog.

    A funny thing about the first three phrases I picked out is that they are all blogs online if you google them. Is this a coincidence or are all of the phrases here blogs? They seem to refer to direct blog entries, perhaps the html or text from these blog entries can be used as a hash encryption for the "junk" binary loaded in between each of the entries.

    Again, this is just speculation. MKULTRA refers to the CIA's mind control program in response to rumors of the Soviet, Chinese and North Korean programs of the smae nature ... not sure how that would tie in.

    Perhaps the purpose is to develop a blog scanning program that will accurately identify blogs and retrieve information and try to figure out a way to crack this document? It's very large so I am guessing automation would be necessary as opposed to human googling by hand.
    --
    My work here is dung.
    1. Re:My Speculation is They're All Blogs by Anonymous Coward · · Score: 1, Informative

      "So dark the con of man" is an anagram used in the Da Vinci Code, which in itself has a codex as a main plot device.

    2. Re:My Speculation is They're All Blogs by Anonymous Coward · · Score: 1, Interesting

      There's also an image in there. To view it:
      - Open the file in your favourite text editor (that doesn't mess up binaries), like emacs
      - Search for GIF and delete everything that appears before it
      - Save with a .gif extension

      Voila! instant mysterious picture!

    3. Re:My Speculation is They're All Blogs by Anonymous Coward · · Score: 0

      The .umz file also contains an embedded gif image

    4. Re:My Speculation is They're All Blogs by Amouth · · Score: 1

      Intresting.. one has to wonder what it all means.. or we can just wait till tomarow

      (by the way AC is right there is an image.)

      --
      '...if only "Jumping to a Conclusion" was an event in the Olympics.'
    5. Re:My Speculation is They're All Blogs by AKAImBatman · · Score: 1

      BTW, "i love bees" is a reference to another online reality game of the same name. It's like they took all the technology-related pop culture and spread references to it throughout the file.

    6. Re:My Speculation is They're All Blogs by Maru+Dubshinki · · Score: 1

      And of course, "eval" (which precedes "so dark the con of man") is our old friend and command from various functional and not-so-functional languages.

      --
      Enquiring minds want to know!
    7. Re:My Speculation is They're All Blogs by tropo3050 · · Score: 1
      Your post piqued my curiosity and I decided to investigate further. I ran an analysis looking for groups of alphanumerics and/or spaces and/or underscores and discovered a number of "interesting" lines. One I found interesting was
      welldonedaed si luap
      which, when backwards is
      paul is deadenodllew
      . You can get a longer list at http://rapidshare.de/files/26473525/codex.tar.bz2. html. Here are the highlights:
      inuz, 1lambda, 5Evan Chan was murdered, 5raimundus, abracadabra, abulafiabad wolf, anbaric, area51, CGwei, dlbizu, dustsolomon, evalso dark the con of man, fnord, frogz, HatwLQ, Hitmmw, i love bees, Idahc, ignoti et quasi occulti, K_Iea, Kid0u, lambda, lfPaw, LnusH, Lovne, lullus, nehmy, novus ordo seclorum, ntpmt, oceaa, PCtak, PDewn, PDmax, plbndetibh u ou rvcofalt ea, pviqh, pyshxtc, Q0JIyfPHOq, rakoczi, rkude, roswell, societas_eruditorum, surmount, telluric, templar, tsarogy, tycon mismatch, welldonedaed si luap, yosod, zbofHa, zhn_rs
    8. Re:My Speculation is They're All Blogs by Anonymous Coward · · Score: 0

      Not to be a karma whore, but there is an image there. You can see it at http://imageigloo.com/images/5647codex2.gif. Wonder what CBV or GBV means?

    9. Re:My Speculation is They're All Blogs by tropo3050 · · Score: 1

      My bad, I didn't read at a low enough threshold. Ignore this post.

  2. dasddasa by Anonymous Coward · · Score: 0

    first post se ma se poes

  3. programming with brain by 192939495969798999 · · Score: 2, Funny

    So if I am able to crack this from looking at it, they'll have to declare my brain the "programmers tool of choice?" Or would it go to notepad?

    --
    stuff |
  4. I can't wait by Anonymous Coward · · Score: 2, Funny

    come on, QBASIC don't fail me now!

    FP?

  5. Wow. by Anonymous Coward · · Score: 0

    How tantalizing.

  6. Interesting things I've found by Anonymous Coward · · Score: 1, Interesting

    using "strings"... I've only found interesting strings at the beginning and end of the file.

    Beginning of file:

    ignoti et quasi occulti
    welldonedaed si luap
    5Evan Chan was murdered
    fnord
    tsarogy
    apply
    MKULTRA
    abracadabra
    evalso dark the con of man
    __VFD__
    h__CBV__
    dustsolomon
    GIF89a@
    |%equo
    rakoczi
    novus ordo seclorum
    telluric
    o,5:>n
    plbndetibh u ou rvcofalt ea
    templar
    tycon mismatch
    apply
    5raimundus
    societas_eruditorum

    End of file:

    1lambda
    anbaric
    Tga'G
    abulafiabad wolf
    lambda
    roswell
    area51
    i love bees
    __42__
    lullus
    surmount
    currents

    1. Re:Interesting things I've found by AKAImBatman · · Score: 1

      Interestingly, there seems to be a theme of ficticious conspiracy theory running through the file. References to DaVinci Code, Backward Record Messages, CRYPTONOMICON, aliens, "The Beast" reality game, Hitchhiker's Guide to the Galaxy, and Doctor Who all appear in that output.

    2. Re:Interesting things I've found by LincolnQ · · Score: 1

      Interesting patterns in a hex editor:

      After "abracadabra" and "eval", it counts from 0-256 in big-endian 32-bit words.

      Right before "abracadabra", there's 4096 bytes of weird looking patterned data. I opened it as "raw" 64x64 indexed in Gimp and got the same mysterious "GBV"/"CBV" image as can be extracted from the GIF a little later in the file!

    3. Re:Interesting things I've found by Sweeman · · Score: 1

      The big endian words represent all the possible ASCII characters.

    4. Re:Interesting things I've found by SQLGuru · · Score: 2, Interesting

      Dude, check this out.....

      After MKULTRA and before abracadabra......using Notepad (the best programming tool ever?)

      Basically, take that chunk of text and put carriage returns after every 64th character. A pattern that matches the GIF image appears using text. [I tried including it, but of course filters killed that idea]

      BTW, the text looks more like a GBV than a CBV.

      Layne

    5. Re:Interesting things I've found by Fahrenheit+450 · · Score: 1

      No, I'm pretty sure it's a CBV (though a shout out to Guided By Voices would be nice). If you look at the very end of the file there is the sequence "roswell area51 CBV i love bees 42 __42__ lullus surmount Äí currents".

      --
      -30-
    6. Re:Interesting things I've found by mypalmike · · Score: 4, Funny

      It turns out "plbndetibh u ou rvcofalt ea" is an anagram of "unfavorable botched tulip". I think I'm onto something... ;)

      --
      There are 0x40000000 types of people: those who understand 32-bit IEEE 754 floating point, and those who don't.
    7. Re:Interesting things I've found by SQLGuru · · Score: 1

      Looking at it more, you are right, it's CBV. It's the decorations on the letters that confuse things.

      So, my thoughts:
      - the file contains many dividing tokens (i.e. MKULTRA and abracadabra)
      - each block between the tokens represents the same image in some format (GIF, BMP, Text, etc.)
      - the goal will be to decode as many different versions of the same image as possible
      - I'm sure there is some link between the dividing tokens and the encoding method (this is purely guessing)

      (If I'm right, then the portion of the file that looks like this: evalso dark the con of man __VFD__ *õh__CBV__ dustsolomon includes the __CBV__ because it is part of the encoding)

      Approach:
      - figure out all of the different tokens and what they imply
      - find the binary file format headers for various file formats
      - decode, decode, decode

      Layne

    8. Re:Interesting things I've found by illuminatedwax · · Score: 1

      The problem with this is that it has very very very little to do with programming. 95% of the work lies in the first step.

      More likely, the codex implies that contestants will be asked to do some kind of ciphering/deciphering of languages and conspiracy theory.

      --
      Did you ever notice that *nix doesn't even cover Linux?
    9. Re:Interesting things I've found by SEWilco · · Score: 1
      It turns out "plbndetibh u ou rvcofalt ea" is an anagram of "unfavorable botched tulip". I think I'm onto something... ;)
      OK, you grok the file. You win the contest.
    10. Re:Interesting things I've found by Anonymous Coward · · Score: 0

      Actually, it's "Hubbard copulative teflon", which tells you that it's a scientologist plot.

    11. Re:Interesting things I've found by SQLGuru · · Score: 1

      The Cult's scholarly publications are of particular interest to us. Because the Cult's journals were circulated on sandstone tablets, editors imposed very strict length limitations. Consequently, authors aggressively compressed their articles. A typical publication would have the following form:

      PUZZL.TSK=100@1001|14370747643c6d2db0a40ecb4b0bb65
      Should you encounter any such publications, we humbly request that you submit them to us via our web site. Our server will track all submitted publications, ensuring that every participant is given appropriate credit for advancing our understanding of the codex. Publications are of varying value; some will represent a greater contribution than others, and we will take this into account when assigning credit.


      Looks like my premise wasn't that far off......

      Layne
  7. Is Doctor Who scheduled to make an appearance? by AKAImBatman · · Score: 2, Funny

    If you look at the end of the file, it seems that Rose managed to plant "Bad Wolf" in there. So I wonder when Rose and the Doctor are going to show up and stop the alien monsters from crashing the contest? :P

  8. Oblig Beatles ref??? by Anonymous Coward · · Score: 0

    Did anyone notice the "paul is dead" backwards in the codex?

  9. Image Processing Perhaps? by FreshMeat-BWG · · Score: 1

    Perhaps image processing? Starting at offset 0x1a74: GIF89a Starting at offset 0x0620: Looks like some sort of bitmap?

    1. Re:Image Processing Perhaps? by prizog · · Score: 1

      Yep, it's a bitmap of the same image as the GIF.

    2. Re:Image Processing Perhaps? by mypalmike · · Score: 1

      xxd -s 0x628 -c 64 -l 4096 codex.umz

      --
      There are 0x40000000 types of people: those who understand 32-bit IEEE 754 floating point, and those who don't.
    3. Re:Image Processing Perhaps? by uab21 · · Score: 1

      Hmm - bitmap appears to include the letters CBV ???

    4. Re:Image Processing Perhaps? by Fahrenheit+450 · · Score: 1

      Call by value? Are they hinting that Haskell will not be the language of discriminating hacker this year?

      --
      -30-
  10. Damn by Fahrenheit+450 · · Score: 1

    Damn. Too bad I'm going to be traveling all weekend. This looks like a fun contest this year.
    I might even noodle around after the fact and just not enter it in the contest.

    --
    -30-
  11. Here's the gif. by sleepingsquirrel · · Score: 2, Informative

    Here's the gif.

  12. Codex book ? by dolmen.fr · · Score: 1

    Have a look to this blog post dated 2005-08-08: it tells about ICFP, codex and Slashdot.
    The WebArchive does have a recent archive of the site to check is this is was recently created.

    This indicate there may be a relation with the Codex book reviewed last year on Slashdot by the post author himself. The alternate reality game is just starting...

    1. Re:Codex book ? by dolmen.fr · · Score: 1

      s/does have a recent archive/does not have a recent archive/

  13. Rules released by uab21 · · Score: 1
    Here is the text:

    Dear Colleague:

    In 1967, during excavation for the construction of a new shopping center in Monroeville, Pennsylvania, workers uncovered a vault containing a cache of ancient scrolls. Most were severely damaged, but those that could be recovered confirmed the existence of a secret society long suspected to have been active in the region around the year 200 BC.

    Based on a translation of these documents, we now know that the society, the Cult of the Bound Variable, was devoted to the careful study of computation, over two millennia before the invention of the digital computer.

    While the Monroeville scrolls make reference to computing machines made of sandstone, most researchers believed this to be a poetic metaphor and that the "computers" were in fact the initiates themselves, carrying out the unimaginably tedious steps of their computations with reed pens on parchment. A few have conjectured a city-sized machine powered by falling sand, but no physical evidence of such a device has been discovered.

    Among the documents found intact in the Monroeville collection was a lengthy codex, written in no known language and inscribed with superhuman precision. It is believed to be the masterwork of the Cult's scholarship, and as such it carries immense potential to advance our understanding of historyand possibly of computing as well. Unfortunately, the codex eluded interpretation, and over the decades, study of the Monroeville scrolls has slipped into obscurity. Since 1978, the codex has been stored in the basement of the Carnegie Museum of Natural History.

    Two weeks ago, during a visit to the excavation site for a new computer science building at CMU, workers discovered a set of inscribed tablets that proved to be the Rosetta Stone for interpreting the Monroeville codex. The tablets precisely specify the Cult's computing device, known to initiates as the "Universal Machine." Although there is still no evidence that the cult succeeded in constructing their machine, it is a reasonably simple task to emulate it on modern hardware.

    We can now say with certainty that the codex is in fact a program, intended for execution on the Universal Machine. Our initial exploration of the codex suggests that the Cult's ideas about programming were very sophisticated, if somewhat peculiar to the modern eye. One cannot help but wonder what the Cult might have achieved had they had access to modern electronics and type theory.

    I have enlisted the help of the CMU Principles of Programming group in creating a venue for study of the codex. We invite you to participate in this investigation. The codex and a translation of the Universal Machine (UM) specification are available for download from our web site. We encourage you to implement the UM and begin your own exploration of the codex.

    The Cult's scholarly publications are of particular interest to us. Because the Cult's journals were circulated on sandstone tablets, editors imposed very strict length limitations. Consequently, authors aggressively compressed their articles. A typical publication would have the following form:

    PUZZL.TSK=100@1001|14370747643c6d2db0a40ecb4b0bb65

    Should you encounter any such publications, we humbly request that you submit them to us via our web site. Our server will track all submitted publications, ensuring that every participant is given appropriate credit for advancing our understanding of the codex. Publications are of varying value; some will represent a greater contribution than others, and we will take this into account when assigning credit.

    On a personal note, being inspired by the scholarship of the Cult, I have decided to dedicate the remainder of my days to a solitary study of computation and programming languages. However, before embarking on my monastic transformation, I wish to see that the world is well on its way to uncovering the secrets of the Codex.

    Therefore, I ask that you submit as many publications as you can by