Slashdot Mirror


GZipping Life Forms: Deflate Reveals Bare-Bones

An anonymous reader writes "To distinguish images derived from living vs. non-living sources, USC and NASA JPL researchers report today using the standard gzip compression utility. As a measure of overall pattern complexity, they find that the inherent pixel content of biologically generated fossils produces higher image compression ratios [more data redundancy], compared to their non-biological counterparts. The more the file shrinks, the more likely it is that a living process was involved. A test is live online here. This extends the simple, but powerful, uses of gzip to biogenic fossil detectors, in addition to spam cop filters, DNA sequence comparisons, digital camera image crunchers, etc. In nine months, the two Mars rovers will send back the first microscopic-scale images of Mars rocks, which should be amenable to some of these same techniques: thus gzipping is apparently pretty zippy."

243 comments

  1. Makes sense... by Anonymous Coward · · Score: 4, Insightful

    Lifeforms seem to be built on patterns afterall. Patterns are easily compressible.

    1. Re:Makes sense... by spot35 · · Score: 1

      Perhaps this should be another entry to this poll then?

    2. Re:Makes sense... by jolyonr · · Score: 5, Interesting

      Unfortunately it's not that simple, inorganic systems can have as much visual complexity as organic things. For example.. um.. (looks out of window here in Toronto).. a snowflake! Fractal complexity, such as that seen in the branches of a tree, is frequently mirrored in the inorganic world - the snowflake is one example, another less well known example are manganese dendrites, they look just like fossil plants, but are totally inorganic such as these [Victoria Museum]. The patterns of frost on a frozen windscreen are another example. I can't see how a computer program can distinguish whether such complex patterns are signs of life or not. Still, if it helps NASA get more funding, then who am I to argue! Jolyon

      --


      Please read my Canon EOS tech blog at http://www.everyothershot.com
    3. Re:Makes sense... by Ted_Green · · Score: 2, Insightful

      Of course, so do a lot of crystalized structures. Lots of things are built on patterns.

      Anyways as far as this technique is concered this (organic images being more compressable) only holds true for organicly created stromatolite structures vs. chemcialy created stromatolite-like structures.

      They've only done 20 images or so, I'd like to know the comparitive compression ratios.

    4. Re:Makes sense... by Kadagan+AU · · Score: 1

      I was disappointed... I thought you linked to a poll about "What's your favorite use for gzip?". Maybe we should have one of them!

      ~Jon~

      --
      This space for rent, inquire within.
    5. Re:Makes sense... by Angus+Prune · · Score: 1

      I think the point is that it is hard to see the difference. It is the tiny variations or elements of pattern that affect the compression. Even if something looks organic there are elements that are more random than organic matter, they are indistinguishable by the naked eye which is why a computer is used. This isn't a fail safe method but a way of checking how likely it is to be organic.

    6. Re:Makes sense... by morgajel · · Score: 1

      WHAT?! no!

      SNOWFLAKES AREN'T ALIVE?!?!

      *sniff* say it ain't so...

      --
      Looking for Book Reviews? Check out Literary Escapism.
    7. Re:Makes sense... by DShard · · Score: 1

      Actually, I believe that the fact that pics of organic material is less complex. Organisms have more order and therfore are more regular. The point is that non-organic material is of higher entropy, therefore more "complex" or fractal. Humans have bilateral sysmetry, and virii and bacteria would be bilateral or other types.

      To achieve this more ordered state energy is used to stave off entropy. Directed use of energy to produce high order is a hallmark of life.

    8. Re:Makes sense... by DavidBrown · · Score: 1

      It's never that simple - but this method appears to be an easy way to separate the wheat from the chaff. It's not a life-detector, it's more of an indicator that life may be present, pointing out higher-potential samples for further review. False positives are OK - they'll be discovered when a person examines the sample. As long as this weeds out some of the non-live samples, it's of benefit.

      --
      144l. ph34r my 133t l3g4l 5k1lz!
    9. Re:Makes sense... by shpoffo · · Score: 1

      "WHAT?! no! SNOWFLAKES AREN'T ALIVE?!?! *sniff* say it ain't so..."

      Actually, both this post and it's parent are a bit ridiculous. Water is the most basic component to life. It's what is searched for on other world's as we look for life elsewhere in the universe.

      Why would anyone classify the most basic component of life as 'inorganic' (not alive) ??

      -shpoffo

    10. Re:Makes sense... by Anonymous Coward · · Score: 0

      Any reason why they didn't use bzip2?? (No, I didn't read the article... Farscape's coming on in 6 minutes so bite me.)

    11. Re:Makes sense... by morgajel · · Score: 1

      the same can be said for "carbon" based life forms.

      but last time I checked, a diamond was still not alive.
      in other words
      flour != cake

      --
      Looking for Book Reviews? Check out Literary Escapism.
  2. Cool by fudgefactor7 · · Score: 1

    Bad pun at the end of the original post not withstanding, this is pretty cool stuff. Wonder why nobody thought of using comression in this manner before? This has all sorts of potential uses.

    1. Re:Cool by tijnbraun · · Score: 3, Interesting

      A similiar technique has been used by italian mathematicians to differentiate pages from various authors by using zip. A nature article can be found here. After a request from a dutch newspaper they were able to identify one author (Marek van der Jagt, which made his first debut) to be the same as an already well-known author (Arnon Grunberg).

  3. I compress.. by mr.+methane · · Score: 4, Funny

    ... therefore I am.

    I'm not sure I should be flattered that the best way to tell a picture of me from a picture of a rock is that I have more redundant image data. :-)

    1. Re:I compress.. by DShard · · Score: 5, Funny

      That actually should flatter you. You have less entropy so you are of a higher order than the rock. You can brag to all your non-rock friends that those stupid rocks have high entropy.

    2. Re:I compress.. by JordanH · · Score: 1

      I shrink, therefore I am.

    3. Re: I compress.. by Black+Parrot · · Score: 1


      > You can brag to all your non-rock friends that those stupid rocks have high entropy.

      Mama always said Rock Music was just noise.

      --
      Sheesh, evil *and* a jerk. -- Jade
    4. Re:I compress.. by 4of12 · · Score: 2, Interesting

      Not only are you, but are uniquely Mr Methane, because each individual author has unique and identifying characteristics that can be measured using - guess what - compression algorithms.

      Given enough samples, individual authors can be identified and graphs of language relationships, too.

      I think it's interesting because it raises the bar on preserving anonymity if you publish widely.

      Add some entropy to your life; write drunk.

      --
      "Provided by the management for your protection."
    5. Re:I compress.. by TKinias · · Score: 1

      scripsit mr. methane:

      I compress therefore I am.

      Comprimar ergo sum?

      --
      In principio creauit Linus Linucem.
    6. Re:I compress.. by Anonymous Coward · · Score: 0

      That actually should flatter you.

      Actually, shouldn't compression flatten you?

    7. Re:I compress.. by OneEyedApe · · Score: 1

      I would think that LSD or a similar psychotropic substance would be better for the introduction of chaos.

      --
      Life sucks, but death doesn't put out at all....
      --Thomas J. Kopp
    8. Re:I compress.. by 4of12 · · Score: 1

      Acid will generate too much chaos.

      I can see it now...

      Step 1: Pick up pen.

      Step 2: Marvel at the wonder of the universe.

      Step 3: Huh? What?

      --
      "Provided by the management for your protection."
    9. Re:I compress.. by darkonc · · Score: 1
      I'm not sure I should be flattered that the best way to tell a picture of me from a picture of a rock is that I have more redundant image data. :-)

      Look at it this way: If your (potential) lover told you: "Your face looks like a random clump of flesh.", would you feel more or less flattered?

      Although a forest may seem random, if you take a look, you'll see a lot of repetetive patters. Thousands of leaves, all alike. Symetry to the leaves. All the trees of the same species look much the same (that's part of how we define them as species), and the various branches of a single tree tend to have the same basic pattern repeated. All of that repetition makes life easier on gzip.

      If you want your face to be hard to compress, you can make your skin randomly mottled, set your eyes at random angles (and shapes). remove the circular symetry of your eyes and beat your nose in with a small hammer.
      If you then blow your lips (and teeth) off with a ballpeen hammer so that they have a really random shape, you should have a pleasantly non-compressable picture. (just don't ask me for a date).

      --
      Sometimes boldness is in fashion. Sometimes only the brave will be bold.
  4. A-ha! by grub · · Score: 4, Funny


    So when we compress the ultimate, super-duper intelligent life form we get a two byte file containing "42"

    --
    Trolling is a art,
    1. Re:A-ha! by spot35 · · Score: 1

      Ironically, when compressing a file containing just '42' (using winzip, sorry) the resultant file is 112 bytes compared to the 'uncompressed' 2 bytes

    2. Re:A-ha! by C+A+S+S+I+E+L · · Score: 1

      Well, indeed; in terms of applying image compression, the highest form of life is indeed the super-intelligent shade of the colour blue, just as Adams predicted.

    3. Re:A-ha! by cymen · · Score: 1

      With gzip 1.2.4 the file containing "42" (3 bytes) results in a compressed size of 25 bytes.

      So did you make a pkzip compatible file or a gzip file? I tried Winzip 8.1 SR-1 (5266) here on a Win2k system and the file with "42" (4 bytes, made with edit.com) became a 104 byte zip file.

    4. Re:A-ha! by spot35 · · Score: 1

      Don't know. I just thought that trying to compress the ultimate answer to life the universe and everything causing the size of the file to become larger than the original was quite ironic. ;)

    5. Re:A-ha! by fredrikj · · Score: 1

      Excellent! Now that we've made that discovery, all we need to figure out is how to decompress it.

      Hmm, thoughts anyone?

    6. Re:A-ha! by Anonymous Coward · · Score: 0
      For a single file, named "1" and containing the string "42" (ASCII, 2 bytes, no CRLF)
      1.zip - 102 bytes
      1.zip.zip - 210 bytes
      1.zip.zip.zip - 326 bytes
      1.zip.zip.zip.zip - 450 bytes
      1.zip.zip.zip.zip.zip - 582 bytes
      The winzip header etc. is therefore 100 bytes. The difference in later iterations are explained by the increase in the length of the filename, which is an extra 4 bytes (Actually this is on Windows 2000, so I'm thinking it is actually 8 bytes because of the Unicode filenames) for every iteration.

      Completely useless. I have no idea why I bothered.
    7. Re:A-ha! by Anonymous Coward · · Score: 0

      I just thought that trying to compress the ultimate answer to life the universe and everything causing the size of the file to become larger than the original was quite ironic.

      Incontrovertible proof that we are still in the inflationary stage...

    8. Re:A-ha! by Anonymous Coward · · Score: 0

      Test it: what happens when you gunzip '42'? :)

    9. Re:A-ha! by Wraithlyn · · Score: 1

      It will be... tricky.

      --
      "Mind, as manifested by the capacity to make choices, is to some extent present in every electron." -Freeman Dyson
    10. Re:A-ha! by holt · · Score: 1
      [localhost:~] bholt% gzip -d 42.gz

      gzip: 42.gz: not in gzip format
      (42.gz is simply the file with only "42" in it.)
  5. I'd assume by Omkar · · Score: 2, Interesting

    that this has something to due with patterns and image continuity. If so (enlighten me!), then it would be a decent filtering tool, but reliability would be a major problem. Geological (or whatever) patterns could fool the algorithm. Finally, the most compressible image consists of monochrome - is it alive?

    (Mods: the last line was a joke, intended to point out a particularly simple example of a problem - not a troll)

    1. Re:I'd assume by Anonymous Coward · · Score: 0

      The question isn't 'is it alive' but has it been created/influenced by life. I would submit that any monochrome image would have to be created by a life form, either by selection or creation.

      For instance, a black piece of plastic. Created by humans. It all comes down to entropy. Life is one of the few forces that can cause a localized decrease in entropy, and therefore a non-random pattern. Non-random patterns compress. Random ones do not. (or not well)

    2. Re:I'd assume by xpl_the_myst · · Score: 1

      The question that they have to answer is not that of reliability, imho. What they have to say is why non-living chemical substrates do not develop in an ordered way. In fact, it is pretty logical that non-living stuff develop in simple patterns because they are ultimately the product of certain simple reactions at heart. Something like these fractal stuff - simple reactions composed to form a large structure should be pretty patterned and easily compressible. Life forms, on the other hand are formed by centuries of evolution (ok these are pretty basic life forms but still centuries of evolution must've been involved) and you would naturally expect some deviation from pattern.

      Just because someone puts a 'This is pretty intuitive' post at the top shouldnt mean this issue isn't debated. It is most definitely not logical / intuitive.

      --
      This sig is empty.
  6. horsefeathers. by Anonymous Coward · · Score: 1, Interesting

    It is true that many pictures of life forms compress to better or worse than than in-antimate objects. Just beause a picture of something compresses similarly to a life form doesn't mean it is a life form. This is simply coincidence.

    1. Re:horsefeathers. by Lussarn · · Score: 1

      What the article is trying to say is that intelligent lifeforms likes patterns. Look at a car. It's one color, and should compress fairly well. Look at a tree. It's many colors and shouldn't compress that well.

      I don't think a picture of a human compresses better than a tree though.

    2. Re:horsefeathers. by bsharitt · · Score: 1

      Well compressing things is at least a start. Perhaps with another technique or two a coputer could be able to almost be certain whether or not something is alive.

    3. Re:horsefeathers. by etcpasswd · · Score: 1

      The article talks about one specific case (stromatolites) where it is normally difficult to figure out if the rock formation is biotic or abiotic. This program is used only in cases where the images look alike. But you're right that one cannot generalize this idea to all kinds of images.

  7. So, when's the transporter going to work? by Anonymous Coward · · Score: 0

    Just zip me up and email me to Scotty. Too bad I'll probably have to do it naked to save a few bytes.

  8. Excellent... by Anonymous Coward · · Score: 5, Funny

    No more sniffing when i'm checking items in the refrigerator - is it 'alive' ? gzip is the answer!

    1. Re:Excellent... by alchemist68 · · Score: 1

      No more sniffing when i'm checking items in the refrigerator - is it 'alive' ? gzip is the answer!

      Now, we all know that the more familiar name of the 'refridgerator' among us geeks is the 'Polish Genesis Machine', which was shown in great detail in "Star Trek: The Search For Spock". I don't know about you, but my Polish Genesis Machine works rather well, ALL THE TIME. While it may not turn a dead planet into a thriving world replete with vegetation and animals, it sure does a great job on turning dead food into colorful sedentary lifeforms. And I don't need gzip to determine if 'it's alive', I can monitor the growth of my precious lifeforms on a daily basis with a simple ruler or meter stick.

  9. uhhh.. huh? by SamBeckett · · Score: 2, Interesting

    Doesn't gzip only look for patterns in one dimension? Assuming they are using these for pictures, they are missing the boat on at least one more area of complexity!

  10. gzip gates by greenalbatros · · Score: 2, Funny

    then we will find out if he truly is the borg!

    --
    this sig steers like a cow. and i can prove it
  11. Be Humble by hugesmile · · Score: 4, Funny

    OK, so if I have this right: Life is less random, and more predictible (more compressable)than non-life.

    So that tells me that life contains less data then non-life.

    Perhaps sophisticated life (human life?) contains even less data than non-sophisticated life. So the smarter we get, the more predictable we get, and the less data we contain.

    Perhaps we will someday get smart enough to be totally compressed to one bit. In the time I thought about this concept, I think my gzip file got even more compressed. Hmm....

    1. Re:Be Humble by Anonymous Coward · · Score: 0

      Yes, in the future, we shall all be hydrogren atoms!

    2. Re:Be Humble by javatips · · Score: 5, Insightful

      > So that tells me that life contains less data then non-life.

      No, it means that life contain less noise than non-life.

    3. Re:Be Humble by jdavidb · · Score: 1

      Thank you. You just completely cleared this up for me. I was sitting, puzzling, thinking, "Life contains huge amounts of information, and should therefore not compress as well as non-life." Then I realized information is less entropy, so it actually will compress better. (What a relief. I was almost on the verge of taking this personally. ;)

      A useful analogy: an ASCII file, containing information, compresses better than a file of truly random binary data. The truly random (think cryptographic) data has no useful information and has an even probability distribution of bytes, so there is no possible compression. When you add information you start altering the probability distribution of those bytes, and make compression possible. (Or think letter frequency analysis for breaking cyphers.)

    4. Re:Be Humble by Pflipp · · Score: 1

      > No, it means that life contain less noise than non-life.

      Apparently you don't have children.

      --
      "We can confirm that Debian does *not* ship the version with the trojan horse. Our version predates it." [CA-2002-28]
    5. Re:Be Humble by ThePhin · · Score: 1

      Borrowing from Sci Fi Lit, Larry Niven's Protectors were a lot more intelligent than us 'breeders', but between their biological imperatives, and their extreme ability to extrapolate chains of events from current data, they were said to have little free will.

      So at least to another Protector with the same data, they would be very predictable indeed. Just a thought experiment, but it makes sense to me...

    6. Re:Be Humble by p3d0 · · Score: 2

      No, you still have it wrong. Information is entropy. More information is more entropy. However, imagine the amount of information in a JPEG of your face, compared with a JPEG of bits from /dev/random. The latter will have more information and thus more entropy. That shouldn't give you an inferiority complex. :-)

      --
      Patrick Doyle
      I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....
    7. Re:Be Humble by NonSequor · · Score: 1

      I think this point needs a little clarification. If you were to ask me to try to draw a picture of a person's face without my knowing anything about the person I can at least get reasonably close. My picture will have lips, eyes, and a nose and in all likelihood so will the person's actual face. But if you ask me to predict what string of bytes will be given by a random number generator (assuming I know nothing about how it works which might give me some insight), I cannot come up with any guess which will be any more accurate than any other guess. Finding out that a person has a nose is not particularly informative but finding out that the first byte of the string is 183 is. The easier it is to predict what something will be like, the less information it will contain.

      --
      My only political goal is to see to it that no political party achieves its goals.
    8. Re:Be Humble by p3d0 · · Score: 1

      Yeah, you're right. In hindsight, my post was pretty confusing as it was. :-)

      --
      Patrick Doyle
      I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....
    9. Re:Be Humble by MegaFur · · Score: 1

      I'm not sure I agree with your definition of `information'.

      I think, for something to count as infomration, it has to be meaningful in some way. This means /dev/random doesn't count as information--it's just arbitrary numbers. No true patterns.

      --
      Furry cows moo and decompress.
    10. Re:Be Humble by p3d0 · · Score: 1

      Well, I disagree, but gine. You can define "information" and "meaningful" and "patterns" however you want, but your definition has no bearing on data compression, which is what we were discussing.

      --
      Patrick Doyle
      I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....
  12. My question by AEC216 · · Score: 1

    So does this go to show that life is anti-enthrophy?

    --
    May I please have my frontal lobotomy if I bring back the ashtrays?
  13. I told you so! by twoslice · · Score: 2, Funny

    The Magic School Bus is true!

    --

    From excellent karma to terible karma with a single +5 funny post...
  14. I was wondering... by mingthemerciless · · Score: 2, Funny

    ... if it could find life forms in my doom wad's?

    1. Re:I was wondering... by Anonymous Coward · · Score: 0


      Sure, why not? I created a life with my wad...

  15. The same image... by Anonymous Coward · · Score: 0

    ....as image 1 and image 2 seems to have different complexity........???

    1. Re:The same image... by maxwell+demon · · Score: 2, Informative
      Hmmm.... really as "image1" and "image2", and not as "img1" and "image_2_with_an_incredibly_long_file_name"?

      BTW, if you want to be file name independent, you can use
      cat file | gzip -c9 | wc -c
      This way, gzip doesn't see the file name, and therefore doesn't include it into the .gz file.
      --
      The Tao of math: The numbers you can count are not the real numbers.
  16. Does this work for anyone? by iainl · · Score: 1

    I tried it with two jpgs off my little home site, and the poor thing died with a div by 0 error.

    Either I'm doing something wrong with my jpg compression, or this is slightly flakey - a successful pair of test pics would be most helpful.

    --
    "I Know You Are But What Am I?"
    1. Re:Does this work for anyone? by iainl · · Score: 1

      OK, I answered my own question - despite being a web-based app, it wants to be passed files off my local harddrive, not ones off a website. Odd.

      --
      "I Know You Are But What Am I?"
    2. Re:Does this work for anyone? by Anonymous Coward · · Score: 0

      Try not to use naked pictures of yourself next time.

  17. bzip2? by maxwell+demon · · Score: 2, Interesting

    Has anyone checked if bzip2 is better or worse in detecting biological products?

    After all, they have quite different compression characteristics (on one hand, compression of a megabyte of zeroes is much better in bzip2, OTOH adding the same file on top of itself and then compressing gives much less additional compressed size with gzip than with bzip2 - tested with /usr/src/linux/kernel/sys.c, 24957 bytes uncompressed).

    --
    The Tao of math: The numbers you can count are not the real numbers.
    1. Re:bzip2? by Kiwi · · Score: 1

      I'm noticed that bzip2 compresses better (and sometimes, much better) for most tarballs of software and other data. However, in the case of a list of prime numbers, gzip actually compresses better than bzip2.

      - Sam

      --

      The secret to enjoying Slashdot is to realize that it should not be taken too seriously.

    2. Re:bzip2? by meowsqueak · · Score: 1

      The model part of bzip2 uses an implementation of the Burroughs-Wheeler transform. The BWT involves sorting a block of data based on 'context' (for text, this is usually the first few letters or so of a word or sentence) a special reversible way. This results in a block of data with similar contexts grouped together, thereby better fitting a predictive model and therefore compressing better than the original when passed through a probability-based encoder. Duplicating the file contents is unlikely to 'assist' compression because the block size restricts the model's 'view' (and the BWT isn't a perfect sort either).

      The bzip2 program incorporates this transform with a Huffman encoder.

      Gzip on the other hand uses a form of LZ78 compression that looks for previous matches and back-references them. So, duplicating the contents of a file as you suggest would suit gzip quite well, provided that the 'context buffer' length incorporates most, if not all of the first half of the file. The 'level of compression' parameter for the gzip algorithm (gzip -#) simply changes the size of the look-back buffer (i.e. the context length). Making it larger increases processing time because of the increased amount of context comparisons involved, but it usually results in better compression because matches are more likely to be found.

  18. I'm jealous... by PKFC · · Score: 1

    ...that I am not a unix user.

    Where is .sit? I thought you people became mac users! :P

    1. Re:I'm jealous... by javatips · · Score: 1

      There is no place for dark lord here!

    2. Re:I'm jealous... by Anonymous Coward · · Score: 0

      Oddly enough, Apple are doing some kind of evil compression thing themselves in new machines with OS X.

      Someone told me to try this, and I did, and it's TRUE! Try it yourself!
      Forwarded message:
      1) Double click on the hard disk icon on your desktop
      2) Double click on the Applications folder
      3) Double click on the utilities folder
      4) Double click on the Terminal.app application

      Then try typing - there's an actual unix computer in there!

      I tried it myself - I not only have access to things like gzip and gnutar, but whole range of unix things.

      Is this why Macs are so expensive now? Is it because they have a second computer INSIDE the mac? And how do they fit it all in my iBook?

      Plz advise.

    3. Re:I'm jealous... by usotsuki · · Score: 1

      Duh, MacOS X is FreeBSD-based, you nimnul *g*

      -uso.

      --
      Dreams, dreams, don't doubt dreams, dreaming children's dreaming dreams. Sailor Moon SS
  19. The fractal geometry of nature? by RNG · · Score: 4, Interesting

    Although I'm certainly no compression expert, I think this makes sense. Many (most?) natural systems have fractal structures on some level so it only makes sense for them to compress better (ie: have more self-similar features) than systems which don't have this feature.

    Then again, what do I know? Maybe something more immersed in this field can tell us whether there's a seed of truth to my ramblings ...

    Greetings
    --> R

    1. Re:The fractal geometry of nature? by battjt · · Score: 1

      Could you give some examples of fractal structures in a human?

      GZip doesn't do fractal compression. It will compress repeating patterns though. (My two arms will be compressed because they are similar, not because that look like little humans.)

      I don't think there are many fractal structures in nature. Rocks are different than sand. Humans are different than cells. A field is different than grass, which is different than cells, which is different than molecules.

      Joe

      --
      Joe Batt Solid Design
    2. Re:The fractal geometry of nature? by jeff_bond · · Score: 2, Insightful
      Could you give some examples of fractal structures in a human?

      For starters, how about the branching structure of the airways in your lungs?

      Jeff

      --
      stty erase ^H
    3. Re:The fractal geometry of nature? by Dr.+Manhattan · · Score: 1
      Could you give some examples of fractal structures in a human?
      For starters, how about the branching structure of the airways in your lungs?

      Or the branching of blood vessels. Or bone microstructure. Or nerve cells.

      --
      PHEM - party like it's 1997-2003!
    4. Re:The fractal geometry of nature? by Xzzy · · Score: 1

      Or just look down at your body? :P

      torso breaks into limbs. limbs break into digits.. looks like a crude fractal to me.

  20. Genome project... by Anonymous Coward · · Score: 0


    It would be nice if someone would play with gzip and the various genomes available.

    That would probably have more relevance (being in a single dimension) than images, IMHO.

  21. Thought this would be somewhat obvious... by ignoramus · · Score: 2, Insightful

    Every one of us is incredibly redundant, and I don't just mean in our posts on slashdot!

    Simply consider that you can have a reasonably good duplicate of yourself, with only the DNA contained in a single cell!

    You may need most of your parts to be functional but, information-wise, it all comes down to 1 germ cell (say, a spermatozoid) and the aparatus needed to move it into proximity of another compatible germ cell ;)

    1. Re:Thought this would be somewhat obvious... by peerogue · · Score: 1

      But the DNA is not visible in the pictures of you, is it? However, we're mostly symetrical, which is a form of redundancy visible from the outside.

      By the way, the spermatozoids are not issued from a mytosis but from a meyosis, and therefore do not contain the full DNA, but only half of it (half the chromosoms, in fact).

    2. Re:Thought this would be somewhat obvious... by ignoramus · · Score: 1

      But the DNA is not visible in the pictures of you, is it?

      I am certain you will agree that phenotype has something to do with genotype. Let's say, for instance, that your skin cells - although there are very many - are pretty much all the same (since they are coded in a small part of that DNA). This will cause different areas of your skin to be similar (same color, etc.) and easier to compress than some random static.

      The fact your body is a redundant interpretation of your DNA does impact the compressibility of your image.

      On the other hand, I very much doubt that gzip will be able to note such large scale symmetries/patterns as "he has two arms and two legs"

      By the way, the spermatozoids are not issued from a mytosis but from a meyosis, and therefore do not contain the full DNA, but only half of it.

      Yeah, I'm actually aware of that. And although the mechanisms related to dominant-recessive genes and sexual characteristics are influenced by these two pairs, the fact is that you get 22 autosomes from your parents (11 from each) and these are basically duplicates (the 2 extra sex chromos are a little different). Doubling those 11 from the sperm won't result in you but will result in a functional human being nonetheless.

      All you need is 1 germ cell to produce a perfectly normal (although statistically anomalous) female (XX sex chromos).

    3. Re:Thought this would be somewhat obvious... by AugustMoon · · Score: 2, Interesting

      Your DNA is only sufficient to create another state machine with the same rules you had at birth.

      It will not re-create your complexity because our dna-state machines are designed to create brains which are 'genetically-memoryless', capable of self modification, and have incredible data collection and storage capacity.

      Think of your DNA as the graphics engine for Quake. It is relatively small (space-wise) compared to the textures and levels. Add different data, and you have still have a first-person game, but a completely different one.

    4. Re:Thought this would be somewhat obvious... by ignoramus · · Score: 1

      Your DNA is only sufficient to create another state machine with the same rules you had at birth.

      Agreed. Of course, a clone is the like a twin - nothing more. It isn't the entire you (an exact replica of everything up to the superimposed quantum wave functions or whatever) but to a camera it's pretty close, and that's what we're discussing.

    5. Re:Thought this would be somewhat obvious... by AugustMoon · · Score: 1

      Ah, yes to a camera, but consider while DNA is physically small, it contains around 3,000,000,000 base pairs (in a human).

      Conservatively assuming only 1 bit per base pair, that is still 375 MB. That's a pretty big picture. ;)

    6. Re:Thought this would be somewhat obvious... by ignoramus · · Score: 1

      Two points here need clarification.

      DNA [...] contains around 3,000,000,000 base pairs

      Ok, that looks like a big number but a very gross estimate of the number of atoms in my body would be : ((myweight in grams)/14) * 6.022 x 10E23 (I chose 14 because it's somewhere between the atomic mass of carbon and water, but it's only the order of magnitude I'm interested in)

      The point here is that I've got on the order of 10E27 atoms being "organized" by (possibly only a portion of) those base pairs. So I say again, we have lots of redundancy - this would seem obvious when you consider that a decent clone could be made with the DNA from a single cell.

      The second point involves the comment that 375 MB. That's a pretty big picture. Yeah, I agree, but until we use holography a little better you only take pictures of the outside of my body and not even the entire outside at that (only a single side at a time). So you can certainly shave of a few megs from stuff like heart ventricles, bone marrow and alveoli.

    7. Re:Thought this would be somewhat obvious... by meowsqueak · · Score: 1

      It's finding that one tiny cell that's the hard part

    8. Re:Thought this would be somewhat obvious... by OneEyedApe · · Score: 1

      Holography? Or Hyperdimensional photography? A four dimensional eye would see both the inside and outside of a three dimensional object at once.

      --
      Life sucks, but death doesn't put out at all....
      --Thomas J. Kopp
    9. Re:Thought this would be somewhat obvious... by ignoramus · · Score: 1

      Actually, this is just theory of course, some have proposed the Holographic principle which basically says that everything within a volume can be described by whats happening on the bounding surface and that there is at most 1 degree of freedom/Plank area of that surface.

      If this was correct, then a complete "image" of the surface area would describe everything that happens within.

  22. Fractal Compression by Mr+Pippin · · Score: 1

    Hmmm, since life forms are generaly based on geometric patterns, I would think fractal compression would be even conclusive in terms of detecting life.

  23. Not GnuZip any more by Anonymous Coward · · Score: 0

    Now gzip stands for GenomeZip.

  24. Application by Anonymous Coward · · Score: 0

    So an RFC1437 bodypart could fit on a floppy?

  25. Some companies are using model based mathmatical t by Anonymous Coward · · Score: 1, Interesting

    Companies like Image Metrics use a mathematical translation into n-dimensional space similar to a compression algorithm to perform some interesting kinds of image recognition and processing. Examples are medical diagnosis, facial recognition, crystal growth monitoring and the like.

    http://www.image-metrics.com/pages/technology.as p

  26. this might have a few glitches by jj_johny · · Score: 4, Funny
    When I compressed the transcript of the Osbornes, it got increadibily high compression but I don't think they are intelligent life forms. Or maybe I am really wrong.

    This post can't be compressed.

    1. Re:this might have a few glitches by nounderscores · · Score: 1

      When I (compressed|1) the transcript of the Osbornes, it got increadibily high #1 but I don't think they are intelligent life forms. Or maybe I am really wrong.

      This post can't be #1.

    2. Re:this might have a few glitches by entrager · · Score: 1

      gzip got it from 199 bytes to 170...

    3. Re:this might have a few glitches by entrager · · Score: 1

      Correction... when I told gzip to remove original filename and timestamp data I got it from 199 bytes to 161.

    4. Re:this might have a few glitches by PickaBooga · · Score: 1



      I compressed all of Slashdot's front page articles, and got 50% compression just from the duplicates alone!

  27. The Mars fossil IS made by life; my wife is not. by Saint+Aardvark · · Score: 5, Funny
    In a true first for extraterrestrial biotic research, I decided to compare two pictures:

    at the comparison page attached to the article that lets you run the same test on images that the researchers tried. In a startling discovery that is sure to earn me a Nobel Prize for Physics, Chemistry, Biology and Marital Relations, I was told the following:

    "Answer: Image 1 [the Mars image](1.43702451394759 % compression) has a higher complexity measure than image 2[the image of my wife] (0.773501341151519 % compression), and thus image 1 is more probably biogenic."

    Not only does this prove that there was once life on Mars, but it also proves that my wife is some sort of robot. Further research will be undertaken pending receipt of my prize money.

  28. The.. by saqmaster · · Score: 2, Funny

    .. thought of being gzipped is quite disturbing.

    Mad Scientist: "Fire up the GZip Continueum Transfunctioner!"
    Operator: "Okay, Boss"

    *Bizzzttt*

    --
    "Never let the truth get in the way of a good story..."
  29. am i reading this right? by Anonymous Coward · · Score: 0

    i'm dropping pictures in from stileprojects webcam site versus pictures of various cars. i consistently get results showing that the anime is more biogenic than a car.

  30. new pr0n detectors by neverpsyked · · Score: 1

    Great, I think I just figured out a new method for pr0n detection. Unless we're talking anime, of course.

    --
    What if this weren't a hypothetical question?
    1. Re:new pr0n detectors by wkos · · Score: 1

      Talking about pr0n, this program thinks cars are sexier than nude women...

      tried it with a pic of a posing pr0nmodel & a TVR & Porsche 911.

      The latter two were the most organic ;-)

    2. Re:new pr0n detectors by Angus+Prune · · Score: 1

      pr0n stars are mostly silicon anyway.

  31. In other news... by MoeMoe · · Score: 1

    The creators of WinZip filed suit stating that they have a better assembled compression utility and will use it not only to distinguish between living and non-living, but make the living incased in a tiny plastic cell on a keychain that kids can take with them, feed, and keep healthy.

    --
    Business \Busi"ness\, n.;
    A scam in which all people involved perceive as beneficial...
  32. Re:The Mars fossil IS made by life; my wife is not by Anonymous Coward · · Score: 0

    Hubba Hubba!

  33. The Live test that doesn't work... by Omicron32 · · Score: 0

    Using that live test, I gave it one image of my face, and another of some rocks.

    Apparently, the rocks are "more probably biogenic" than I am. Bastards.

  34. Information vs. Meaning by 16977 · · Score: 2, Interesting

    One of the posters brings up an interesting point. Although meaningful data has more information than pure noise, it also has less than a blank signal. When you download pictures, regardless of the "meaning" they have to you, their compression can vary a considerable amount. And you've probably heard the statistic that the english language is 50 percent redundant. That figure may vary a bit too, but the point is that english's meaning to us is independent of its information content. And the probability that an image of a life form with more information will also have more "meaning" is probably just as uncertain.

  35. Kolmogorov Complexity by MarkWatson · · Score: 4, Interesting
    This seems like a "sort of" restatement of Kolmogorov Complexity.

    Roughly, Kolmogorov Complexity is a measure of randomness - the measure is how long a computer program needs to be to reproduce data (pardon an oversimplification).

    -Mark

  36. Re:The Mars fossil IS made by life; my wife is not by Anonymous Coward · · Score: 5, Funny


    The problem here is that your wife is wearing clothes. Clothes are man made.

    If you send me a picture of your unclothed wife, I'll be happy to, uhm, test this theory.

  37. Filtering Images by CommieBozo · · Score: 2
    While slightly different, this reminds me of the way I filtered a bunch of images from a video camera. I was taking many frames per second of a thunderstorm and I wanted to find which frames out of thousands contained lightning strikes.

    It was pretty simple... Images over a certain size contained lightning, the others were mostly black, therefore smaller. Once I filtered it that way, manually filtering out the better images was easy.

  38. Doesn't seem to work... by Kadagan+AU · · Score: 1

    Ok, I might very well be confused, and not using the tool right, but I plopped two of the only pictures I could find into this thing; an areal shot of a bunch of houses, and a picture of a really good looking woman. The thing told me that the houses were much more likely biological. Strange. Unless it can see implants?

    ~Jon~

    --
    This space for rent, inquire within.
    1. Re:Doesn't seem to work... by Anonymous Coward · · Score: 0

      Its a matter of scale. Comparing a naked woman and a rock the size of a naked woman may work, comparing an aerial shot of houses and an aerial shot of desert may work. I guess this thing is designed to be ran on images of cross-sections of rocks.

  39. Operating Principle? Kolmogorov Complexity by fygment · · Score: 3, Informative

    Read about it in _the_ book (http://www.cwi.nl/~paulv/kolmogorov.html) or check out the web site here (http://www.hutter1.de/kolmo.htm). For a more succint idea of the approach, these articles by one of the gurus on the topic (http://www.cs.ucsb.edu/~mli/focs.ps and http://www.cwi.nl/~paulv/papers/ecml97.ps).

    --
    "Consensus" in science is _always_ a political construct.
  40. Can we use this to defeat spam? by tetrode · · Score: 1

    Couldn't we use a similar system to see if an e-mail is spam or not?

    Mark

    1. Re:Can we use this to defeat spam? by spot35 · · Score: 1
      Did you not read the summary of the report -
      • This extends the simple, but powerful, uses of gzip to biogenic fossil detectors, in addition to spam cop filters...
    2. Re:Can we use this to defeat spam? by Alcohol+Fueled · · Score: 1

      Hi spot35, you must be new here. This is Slashdot, where nobody reads anything, but posts comments anyways. Please fasten your seatbelt and enjoy your stay here. :)

      --
      Ah am not a crook! (\(-__-)/)
    3. Re:Can we use this to defeat spam? by spot35 · · Score: 1

      Yes it is a giraffe! I hope that answers your question.

  41. Re:The Mars fossil IS made by life; my wife is not by (startx) · · Score: 3, Interesting

    ahh, but the picture of your wife contains a lot of inanimate objects. I'm sure if you cropped the picture down to just her (or reasonably close) she would fare better in this comparison.

  42. I am not by Karpe · · Score: 4, Funny

    I compress to binary 0, therefore I am not.. :(

    1. Re:I am not by mr.+methane · · Score: 1

      No, no, you misunderstand. If you compress all the way down to zero, you must be incredibly intelligent, because you are more compressible than I am!

  43. Biological clocks in unicorns... by dpbsmith · · Score: 4, Interesting

    zip is a fine thing, but it's not a pattern-recognition program!

    This is the loopiest thing I've heard of since Rosenblatt reported that his Perceptrons could distinguish between music composed by Bach and music composed in imitation of Bach.

    Good heavens, any picture that's slightly out of focus will now be declared to be evidence of "biological processes."

    I'm guessing that the researchers are not as nutty as they sound and that they've done more than is being reported, but still...

    Reminds me of the researchers in the sixties who were publishing analyses of data that supposedly showed "biological clocks." It turned out that they were using smoothing algorithms that, basically, were filters that had a 24-hour peak in the frequency domain--so their analysis was creating the patterns they claimed to be detecting. A debunking article was published in Science in which another research used data from a random number table (the "unicorn" data) and showed that the same analysis techniques showed that the unicorn had a biological clock.

    1. Re:Biological clocks in unicorns... by archeopterix · · Score: 2, Insightful
      Similar thoughts here. From the article:
      So how does one separate the wheat from the chaff, the true stromatolites from the fakes?
      One method is to examine the suspect rock with a microscope, looking for visual evidence of microorganisms. But as researchers who study ancient terrestrial rocks- and one notorious Martian meteorite - have discovered, it isn't all that easy to tell, just by looking at shapes, whether or not a microscopic blob in a rock was once alive.
      So, what do they verify the gzip method against? Their guesses about the image origins? Does not look great from the methodology standpoint, eh?
    2. Re:Biological clocks in unicorns... by nounderscores · · Score: 1

      IANAXB (xenobiologist) but you could try looking at terrestrial bacteria, freshly killed, which leave behind calcium deposits.

      gzip that and see if you get a positive.

      then look at porous rock where the little circles are from air bubbles or somesuch and see if you get negative results.

    3. Re:Biological clocks in unicorns... by neybar · · Score: 1

      I think that this is a pretty cool idea, but obviously has some flaws. Out of curiousity I compared an ultrasound of my son, and a random wallpaper that I had. The wallpaper turns out to have a better change of being biogenic then my son has. Of course that my just confirms my suspisions that my wife is having an alien or something... neybar

  44. lameness filter? by Speare · · Score: 1

    Isn't that conclusion the opposite of CmdrTaco's use of compression to weed out "lame" postings? More noise is apparently more valuable discussion, while less noise is somehow considered likely spam? How many good postings have you seen with a line "this has been added to get past the lameness filter"?

    --
    [ .sig file not found ]
  45. GZip solves the Fly! by paulhar · · Score: 1

    This must have been the solution in second Fly film. The researchers kept using lame when they should have used gzip!

  46. and language detection. by Anonymous Coward · · Score: 0

    gzip seems to be good for every sort of pattern detection. There was an article, but I forget where I read it, a couple of months ago, on how gzip was used to detect the language used in a few written words. I know it's OT, but could somebody who remembers please answer me with a link ?

    1. Re:and language detection. by spot35 · · Score: 3, Informative

      Could this be what you're after?

  47. Re:The Mars fossil IS made by life; my wife is not by paulhar · · Score: 1

    Erm, it's a photograph. It's all inanimate. Just like my ex-wife, though she was inanimate in real life too...

  48. gzip - the swiss army knife utility by kinnell · · Score: 5, Funny

    I myself have successfully used gzip for factoring large prime numbers, sorting the men from the boys, unblocking the kitchen sink and cracking safes. I'm currently trying to locate Osama Bin Laden by compressing Al Jazeera footage, but all I come up with are reports of Elvis sightings.

    --
    If I seem short sighted, it is because I stand on the shoulders of midgets
    1. Re:gzip - the swiss army knife utility by Jon+Abbott · · Score: 1

      So is that why I keep seeing the answer to the JFK conspiracy in my gzipped files?

  49. I get the feeling... by Snork+Asaurus · · Score: 1

    ...that this item was posted a day early.

    --
    Sigs are bad for your health.
    1. Re:I get the feeling... by mindriot · · Score: 1

      Hmm yes, I thought so for a second too...

      And thinking about that, I'd like to say: Editors, please do not overact tomorrow! Last year was funny for the first two 4-1 stories, but after that it just got annoying. Please limit yourself to maybe two or three April Fool stories, but make 'em good instead...

      Thank you.

  50. Slightly Dodgy by jolyonr · · Score: 5, Interesting

    This whole thing is slightly dodgy, and I begin to wonder whether it was released a day early by mistake.

    The big problem is the use of JPEG source images. Unless you've stuck it up to the maximum size on quality, then the jpeg artifacting (which is in effect repeating blocks of image data after transitions) will probably mask any hidden level of complexity in the images - the human brain is a much better tool at pattern recognition than most computer algorithms (especially those algorithms not designed for the task!).

    Throw high-resolution bitmap files at it, and I'd be more persuaded that there is a genuine effect. Until then, I suspect it's more of a happy coincidence that the files they've thrown at it give results they are excited about.

    Jolyon

    --


    Please read my Canon EOS tech blog at http://www.everyothershot.com
    1. Re:Slightly Dodgy by kris_lang · · Score: 2, Interesting
      I've seen similar errors made by vision science (note that I did not say "image processing") researchers trying to analyze natural scene statistics and come up with interesting patterns. They created "basis functions" and did principal component analysis on sets of images and came up with a basis set that looks curiously like the base images of the DCT (discrete cosine transform), the underlying calculations of the JPEG image format. This is to be expected when you start with a set of images that are JPEG compressed.

      This was actually published in a (barely) peer-reviewed journal, Vision Research. I didn't say "image processing" above because a lot of these vision scientists seem to be psycologists doing visual psychophysics without having a strong background in math, or optics, or (it seems at time) the fundamentals of science.

      The other thing to take into consideration is that gzip is "pseudolinear". It does not take into account the 2-dimensional correlations that exist in image data. Even fax compression takes advantage of it. (and yes, I do realize that gzip can account for runs from previous regions regardless of length or location, but I am trying to point out that there is a specific 2-dimensional set of correlations extant in 2-d image data).

      In these cases being cited that use GZIP, the major function of GZIP seems to be as an indicator of the presence or absence of high-frequency components in the signal stream. Lots of irregular high frequency -> Low compressibility, very little irregular high frequency --> High compressibility factors.

  51. Re:why no bzip2 ? by bill_mcgonigle · · Score: 5, Interesting

    doesnt bzip2 outperforms gzip ?

    gzip might be preferable because it works more locally. It only keeps track of the last n bytes of data and does substitutions based on patterns seen in those n bytes.

    bzip2 uses a markov predictor and the chain length is typically much longer than gzip uses, so the compression is less local. That's great if you're going for compression but for this work, it might be misleading.

    That said, gzip doesn't know about image formats, so I wonder if these guys are getting some false positives on scanline wraps and other non-image data.

    --
    My God, it's Full of Source!
    OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
  52. Was this the April issue? by Anonymous Coward · · Score: 0

    April Fools isn't until tomorrow.
    Did someone perhaps get caught a day early?

  53. Re:The Mars fossil IS made by life; my wife is not by gatesh8r · · Score: 2, Funny

    Wait wait wait you have a wife? Dude, this is Slashdot; are you sure you're not a misdirected user???

    --
    Karma whorin' since 1999
  54. 42 by snarkh · · Score: 2, Insightful


    42 is one byte.

    1. Re:42 by etcpasswd · · Score: 1

      Nope. Two bytes in ASCII.

    2. Re:42 by hazem · · Score: 1

      Well, using binary, you only need 7 bits to encode the integer value 42. That fits into one byte, with a bit to spare.

    3. Re:42 by Wraithlyn · · Score: 1

      6 bits, actually: 101010.

      *double take*

      WOW! Lookit that regular pattern! Who'd have guessed? It really IS the ultimate answer. Everything makes sense now.

      --
      "Mind, as manifested by the capacity to make choices, is to some extent present in every electron." -Freeman Dyson
    4. Re:42 by MegaFur · · Score: 1

      depends on how you choose to store it.

      --
      Furry cows moo and decompress.
  55. I know what you all need to do by Anonymous Coward · · Score: 0

    BZZZIIIPPP2 it, haha.

    This isn't funny.

  56. Simphile Seems to do something similar by spot35 · · Score: 1

    But not quite. It detects patterns but it does use gzip in a similar manner.

    Simphile uses the gzip program to detect patterns in two files. Used to determine things from whether two sonnets where written by shakespeare or whether certain sounds files came from the same source.

    1. Re:Simphile Seems to do something similar by Anonymous Coward · · Score: 0

      You should be modded up! Simphile is exactly the methodology the researchers have been using.

    2. Re:Simphile Seems to do something similar by 42forty-two42 · · Score: 1

      Wouldn't it be better to hack mp3 or ogg? These filter out noise (to an extent) and are designed for detectnig similarities *in audio files*.

  57. gzip == measure of information content by firecode · · Score: 2, Informative

    This is not surprising at all really. Gzip and other compression utilities can be used to get upper bound for real/nonredundant information content.



    <p>I'm not sure if above is public knowledge, but I have used it as a one additional feature for certain pattern recognition tasks for a while.</p>
  58. Compression to measure semantic content by KingRamsis · · Score: 3, Interesting

    It was an interesting coffee break discussion with one of my professors, we were arguing if there is neat way to estimate the semantic content of a neural network after training it, I recall suggesting to compress the value of the weights of all layers and the less compressible the more this neural network is trained.

    1. Re:Compression to measure semantic content by Anonymous Coward · · Score: 0

      I'm curious as to the conclusion of your discussion. I have had the same idea but unfortunately noone to talk to.

      Mike

      bourassa-m@rmc.ca

    2. Re:Compression to measure semantic content by snarkh · · Score: 1

      That does not seem to make much sense - all weights in the neural networks big or small will compress roughly to the same amount. Surely you are not implying that if the network is trained, the weights will have some sort of magical easily compressible patter.

      You could have some sort of Bayesian framework with a prior on your weights to try to see when your network is trained, but the the prior should probably not come from gzip.

    3. Re:Compression to measure semantic content by KingRamsis · · Score: 1

      Read my post carefully...
      the less compressible the more this neural network is trained
      let me give you an example, a neural network is trained to re-produce three distinct tones, the input is time in milliseconds, before training the weights are more compressable, after training the weights are less compressable because there is more "useful" content...
      i hope i made it clear...

  59. Re:The Mars fossil IS made by life; my wife is not by Anonymous Coward · · Score: 0
    I think I speak for all of /. when I say
    • Your wife is hot.
    • Can I have her number?
    Godspeed!
  60. mmm... by Anonymous Coward · · Score: 0

    now imagine a bewoulf cluster of these....

  61. bzzt by Anonymous Coward · · Score: 0

    '42', EOF

    1. Re:bzzt by Anonymous Coward · · Score: 0

      No, the original poster is quite right. 42 is a single integer that can be expressed in one byte. "42" is an ASCII string which must be expressed in a minimum of two bytes. Anyway, who uses an EOF marker these days? (Actually you can get away with only 7 bits for 42 hex, and just 6 bits for 42 decimal.)

  62. All else being equal.... literally. by Captain_Stupendous · · Score: 1

    In order for this to work, the tester would need to eliminate a whole spectrum of other variables that would affect the outcome of the test. Image format (JPEG compresses less than BMP), image size, JPEG "resolution" (pixels per inch), color depth, etc. Of course, I assume they have some way of standardizing their input images, but it's unlikely to become an automated process....

    --


    I am alone, yet I also surf the universal backwash of undifferentiated Being, which is LOVE.
  63. Pattern Recognition by cyber_rigger · · Score: 2, Interesting


    I envision a whole array of compression algorithms.

    Each algorithm could be fine tuned for a paticular type of pattern.

    Is that an elephant or a giraffe?
    Does it compress better with the elephant algorithm or the giraffe algorithm?

  64. GZipping and the Slashdot Spam Filter by Demiah · · Score: 1

    The Slashdot spam filter was the first thing that came to mind on reading this. Are we seeing excuses? ;)

    --
    Have fun. Or failing that, be miserable with style.
  65. Re:The Mars fossil IS made by life; my wife is not by mpweasel · · Score: 1


    Reread the story:

    ...they find that the inherent pixel content of biologically generated fossils produces higher image compression ratios...

    Now, there are several questions you should ask yourself:
    (1) Are you calling your wife a fossil?
    (2) Does she read Slashdot?
    (3) If she could mod, would she mark it +1, Funny, or -1, Won't_get_some_for_a_while?

  66. DNA is the life-form compression medium of choice by ojQj · · Score: 1
    Hmm... I would find the result that DNA is more compressable than random data to be a possibly more interesting result. After all, all (as of yet discovered, earth-based) life is based on some kind of DNA sequence which could be considered to be the compression of that life-form.

    This is of course ignoring the effect of the environment which almost certainly reduces the symmetry (and thus the redundancy) produced by the genetics of a life-form.

    But then again, maybe DNA would also have a high degree of repetition, at least in sexually reproducing life-forms, since DNA also has to be sensably combinable.

  67. Re:The Mars fossil IS made by life; my wife is not by paulhar · · Score: 1

    And the wife wasn't man made? Wow!

  68. good times by Anonymous Coward · · Score: 0

    out of the tar and into the gzip, eh?

  69. Not particularly ironic by delphi125 · · Score: 2, Funny

    It is a well-known fact that any compression algorithm will cause some files to increase in size when 'packed'. If this were not the case, then '42' would be the compressed version of some other file, say 'Wdugiu*6x9', which in turn would be the compressed version of DNA's DNA, which in turn might be the compressed version of the answer to life, the universe, and everything. Furthermore, everybodies DNA would compress down to the same file '42' (since we all contain the answer within ourselves, presumable mice would compress down to something else), which would mean we were all clones, which means that I am the Pope and you are CowboyNeal (and vice versa). QED.

  70. Re:The Mars fossil IS made by life; my wife is not by Anonymous Coward · · Score: 0
    I just looked over your site, and I now reply to myself:
    • Nevermind.
    • She kinda looks like Dick Nixon.
    Retreat!
  71. Seperate the chaff by Anonymous+Struct · · Score: 2, Interesting

    I doubt this is very accurate for marking photos as hits or misses directly. This kind of thing may be useful more for detecting the lack of life rather than the presence of it. If compression rates are low, maybe you don't have to look at this photo so much. If they're high, maybe you want to examine it more closely. If you're dealing with truck loads of data and you're looking for a needle in a haystack, a mechanism for ruling out uninteresting data is invaluable.

    That having been said, it sounds good in theory that 'organisms are highly patterned and therefore compress better', but then why would you use gzip? Why not take that theory and build something a little more adept at locating particular types of patterns you're interested in, or ruling out the ones you know are going to create false positives?

    So, THAT having been said, I'm forced to wonder if somebody forgot that March has 31 days. Lord knows I can never keep track.

  72. hidden markov models by nounderscores · · Score: 2, Interesting

    Interesting. For genome analysis Hidden Markov Models have been used in a lot of software.

    Maybe if you could have an image recognition system do the Hard Machine Vision probelm of generating a schematic of the picture, and then fed the "leg bone is connected to the hip bone" kinda data into a HMM you could work out which fossils are ancient Cambrian crustations and which ones are Trogdor the Burninator.

  73. Sprint... by Anonymous Coward · · Score: 0

    Can you hear me now?

  74. Compressed transmission by Newtonian_p · · Score: 1
    But what if the aliens are compressing all their communications? An already compressed signal wouldn't be re-compressible.

    Actually, it would suprise me if an advanced civilization would send all their signals uncompressed and waste radio bandwidth.

    --

    There are 2 kinds of people in this world: Those who write in decimal and those who don't

  75. viruses? by Mentally_Overclocked · · Score: 2, Interesting

    I wonder if viruses (sorry - didn't RTFA) would compress like living life forms or if they would be more similar to nonliving.

    Just a thought.

    --

    Mathematician, n.:
    Someone who believes imaginary things appear right before your i's.
    1. Re:viruses? by lovebyte · · Score: 1

      I wonder if viruses (sorry - didn't RTFA) would compress like living life forms or if they would be more similar to nonliving.

      If you mean living viruses as oppose to computer viruses, they are living life forms. That's it. I don't understand your question.

      --

      I'll do it for cheesy poofs.

  76. Irony? No... by Anonymous Coward · · Score: 1, Informative

    A pkzip file (aka Winzip default) is not equivalent to a gzipped file, but more analogous to a gzipped tar archive! Pkzip stores all that wonderful file information - full path, permissions, owner, and so on with the compressed data. Gzip by contrast only compresses, and doesn't store archival information. Gzip leaves the archival information in the filesystem. If you tar.gz'd the file; the filesize of the .tgz would be similar to the pkzip.

    The difference between your filesize and his is likely the difference in the lengths of the pathnames to the respective text files and not a difference in the size of the compressed data. Remember pkzip files store the full pathname in the file uncompressed; gzip doesn't store the filename at all.

  77. New Meaning by raiyu · · Score: 1

    This gives new meaning to the phrase:

    "Honey do I look fat in this?" Put on Gzip glasses. "Of course not dear."

  78. Pretty sloppy, you mean... by TheSHAD0W · · Score: 2, Insightful

    There are other techniques for measuring the level of chaos in a set of data, and they'd probably yield more consistent results than running the data through an algorithm meant for an entirely different purpose.

  79. Bzip2? Bah , new fangled rubbish! by Viol8 · · Score: 3, Funny

    What about compress? Or even good old "compact". Ah I remember the days when we had 20% compression
    and were glad of it and some of the old timers could have been confused with non living processes
    even without the help of gzip anyway!

  80. That's NOT what was said... by Anonymous Coward · · Score: 0

    What was said was that lifeforms tend to organize their environment, reducing entropy and the reduced entropy yields more compressible images. This is a very logical progression since all compression is entropy-based in inverse proportion. The greater the entropy (disorganization), the less compressible a file is.

    To say that compressibility implies life is not the case. To say that life implies compressibility is the case, and can certainly reduce the the number of images through which one must cull. Better to look at the interesting image files that might contain life signs, than looking at the less compressible areas that definitely do not.

    Short form of above: I agree.

  81. not a pun by Anonymous Coward · · Score: 0

    The comment is implying that gzip removes most of the surplus information. So it's quite a good zip.

  82. Highly ordered structures by Anonymous Coward · · Score: 0

    compress well, and only structures which contain a great deal of order. So it's probably interesting, even if it's not life.

  83. this can also detect PHB's by IDigUNIX · · Score: 4, Funny
    As alternative to this hypothesis consider:
    feed a business technology proposal through gzip
    • A very high compression ratio indicates that the proposal was likely to be written by consultants. As supported by the fact that they usually re-use the same buzz phrases over and over.
    • A moderate compression ratio indicates that the proposal was written by engineers. Typically they use large words, and unique phrases that are already compressed. I.E. SNMP, J2EE, WWW, and so on.
    • A zero to negative compression ratio indicates that the proposal was likely to be written by a PHB, and hence void of all indications of intelligent life. As evidenced by most PHB's having a hard time using buzz phrases and keywords in context, so they won't recycle enough words to form a good compression dictionary.
    1. Re:this can also detect PHB's by 42forty-two42 · · Score: 1

      Good theory, but flawed. All ASCII text can be compressed to 7/8ths of its original size of smaller (remove the high-order bit). Note that the engineers might use unicode ;P

  84. goatse.jpg and tubgirl.jpg? ;) by caluml · · Score: 1
    Come on, admit it, who's tried to upload the GoatSe man, and TubGirl? ;) Are they organic? Just, I'd say.

    But seriously, I wonder what weird pics people have uploaded :)

  85. Fractal = better compression? by dave_mcmillen · · Score: 1

    I don't understand . . . Does an image having a fractal structure really compress better than one without? I can see that it might compress really well if you could detect the underlying algorithm: "Hey, that's region X of the Mandelbrot set", so its Kolmogorov complexity would be pretty low. But does gzip really detect this? As an image, that bit of the Mandelbrot set might be pretty hard to compress.

    I just find it strange that I keep reading comments nodding at the assumption that being fractalish means easy compression . . .

    1. Re:Fractal = better compression? by battjt · · Score: 1

      Right. I agree with you.

      1. GZip doesn't do fractal compression.

      2. There just aren't that many visibly fractal structures in life. There are some structures that are obviously fractals life (as mentioned in some other posts), but that is also true in rocks.

      Joe

      --
      Joe Batt Solid Design
    2. Re:Fractal = better compression? by NonSequor · · Score: 1

      My information theory text book does in fact say that the Kolmogorov complexity of the Mandelbrot set is "nearly zero." However the Mandelbrot set doesn't seem like the ideal case for something like gzip to me either. I think most of the people here are thinking of fractals such as the Koch snowflake in which the repetition is more immediately apparent.

      --
      My only political goal is to see to it that no political party achieves its goals.
  86. Re:The Mars fossil IS made by life; my wife is not by gurps_npc · · Score: 1

    In addition to the artificial object problem mentioned above, there is also the far more likely problem of resolution and size. A 12' by 12' picture using 600 dpi resolution is FAR more complex than a 4' by 4' using 200 dpi.

    --
    excitingthingstodo.blogspot.com
  87. Hmmmm... kermit lives? by Anonymous Coward · · Score: 0

    I tried out the link and uploaded a pic of kermit, and another of a tree frog. Guess who's biogenic?

  88. zip as language identifier by jmacgill · · Score: 1

    This is not the first time zip (or other compression algorithms) have been used as an identification tool.

    You can do a form of crude language identification using it too, its a bit long winded and you can do far better with other metrics but...

    1) Create a load of zip files wich contain large samples of language - one zip file per language

    2) Add the file to be identified to each zip file in turn and see which one grows in size by the smallest amount.

    Because compresion is based on patterns, the new file will compress the most in the archive which contains the most similar text.

    So all we need to do to spot life on other planets is to fill zip files with images from good scifi films (mars attacks seems appropriate) and away we go :)

    --
    Spell checker (c) creative spelling inc. (aka my dyslexic brain)
  89. Ferengi by Latent+Heat · · Score: 1

    What are you, a Ferengi? "UUUman FEEmales wear clothes!"

  90. Re:The Mars fossil IS made by life; my wife is not by fudgefactor7 · · Score: 1

    Dude, you got a fembot!

  91. Since You Ask... by duck_prime · · Score: 1
    Wonder why nobody thought of using comression in this manner before? This has all sorts of potential uses.
    Actually, there is a precedent for using compression on organics.

    The linked article points out some problems with this approach.
  92. From the "Who Gives A Rat's Ass" Department by Anonymous Coward · · Score: 0

    Really, why is this news?

  93. Wow by JediTrainer · · Score: 1

    Sounds like someone at NASA got a little carried away with their new toy

    --

    You can accomplish anything you set your mind to. The impossible just takes a little longer.
  94. What happens if they use.. by balbord · · Score: 1

    this?!?!?

    --
    "If I have been able to see so far, It is because I went out and bought a damn binoculars" - Ze da Esquina
    1. Re:What happens if they use.. by 42forty-two42 · · Score: 1

      That seems to me to be a joke project. It may even be against sourceforge's TOS. Also, could someone tell me what the lzip jpeg image is? I'm afraid to look :P

  95. Does SETI@home use this approach? by Cy+Guy · · Score: 1

    It would seem that the same approach could be used to distinguish potential intelligent radio signals from those of random or astronomical origin. Though perhaps you would want a pattern to be present resulting in a more compressible file? I think it would depend whether the signal that is picked up is a deliberate simple pattern meant to be a "hello, are you out there?" broadcast by an E.T, or if it is normal communications between E.T.'s not realizing (or not concerned) that they are being overheard.

  96. Chicken Before the Egg by LPetrazickis · · Score: 1

    Because of the mutation factor inherent in meiosis, her genes are not fully determined by other humans. Ergo, she can make humans but humans did not make here. QED.:)

    --
    Is this a sigs-optional kind of place? 'Cause I am totally down with that if you know what I mean.
  97. Beam Me Up by Anonymous Coward · · Score: 0

    Oh great... wait until the cost-cutting management gets ahold of this. I can just hear it, "We can save money and data storage space if you'll just stop using full-data compression and use a lossy compression algorithm instead! The savings will be huge, and the end-users won't know the difference."

    No wonder Dr. McCoy didn't want to use the transporters.

  98. Makes sense by pclminion · · Score: 1
    In the most general form possible: life decreases physical entropy, which leads to ordered images, which means a decrease in informational entropy. Therefore, life processes produce images with less information than nonbiologically produced images. Less information takes less space, so the biological images should compress better. This is all in the abstract; in reality your results will depend on what algorithms you use.

    Someone pointed out that using JPEGs as source images is tainting the results. That's definitely true, but the basic concept here is valid. Anyone who's studied both physical entropy and information theory can understand that the two are highly related. I'd like to see them do this study again with the *original* images -- I suspect the result will still come out. I'm not sure gzip is entirely the most appropriate algorithm to use here, but it could work.

    Very cool...

  99. Pr0n Model? Ha! by LPetrazickis · · Score: 1

    tried it with a pic of a posing pr0nmodel & a TVR & Porsche 911.

    Try it with a pic of an amateur.:)

    --
    Is this a sigs-optional kind of place? 'Cause I am totally down with that if you know what I mean.
  100. GZIP: Divining Rod for Life by Anonymous Coward · · Score: 0

    Hey, maybe GZIP will help Stallmann find his home planet!

    :-)
    { just kidding, Richard--you da man. }

  101. Re:The Mars fossil IS made by life; my wife is not by Anonymous Coward · · Score: 0

    I believe the first step is to test pictures of yourself against her, so that you can rule out who's really the cyborg.

  102. Did something like this years ago by rasper99 · · Score: 2, Informative
    I used a technique like this to do a web cam way back in 1997 before web cams were an easy thing to do. I was supporting Silicon Graphics workstations at the time. One of the models came with a digital camera. The cameras did not have automatic exposure.

    Using CGI as the user hit the web page it took pictures at different shutter speeds. Working up from the slowest shutter speed the first JPG over 20K bytes was the right exposure and was shown on the page.

  103. Hmm.... by Bvardi · · Score: 1

    So I guess if you squeeze hard enough you literally CAN get blood from a stone!

    (And I deserve "most obscure joke" points for that one ;)

  104. Gzip doesn't preserve well... by Anonvmous+Coward · · Score: 2, Funny

    "This extends the simple, but powerful, uses of gzip to biogenic fossil detectors..."

    The problem with gzip is that doesn't preserve data very well. Now tar, it preserves fossil data quite well.

  105. Let's try this out.... by skookum · · Score: 1
    Well, let's see who is more likely to be biogenic, CmdrTaco or CowboyNeal...

    And the jury says:
    Image 2 (0.21048962526417 % compression) has a higher complexity measure than image 1 (0.121237058862429 % compression), and thus image 2 is more probably biogenic.
    There you have it, CowboyNeal is more biogenic than Taco.

  106. Re:The Mars fossil IS made by life; my wife is not by Anonymous Coward · · Score: 0

    Cheers. Lucky for you, I am man-made and I can be produced on demand. You didn't think Saint Aardvark actually wooed me in a conventional fashion? Nay, he had me crafted from bits left over from his home made beowulf cluster.
    - The posting-from Sunday-apparently Wife

  107. One more use for zip-files in ... by makkverk · · Score: 1

    ..."a fascinating new paper on the use of data compression algorithms for allowing a machine to quickly determe aspects of a document like language and authorship." For those less mathematically inclined see report from the Economist.com.

  108. Here you go by Gothmolly · · Score: 1
    --
    I want to delete my account but Slashdot doesn't allow it.
  109. Featurelessness by K-Man · · Score: 1

    In some ways this technique is meant to defeat systematic biases like the ones you mention. Compression tools make few assumptions about the data they process, so they serve as a check against more tailored filters which may introduce artifacts, or be defeated in some way. This problem may occur because they look for pre-selected "features" in the data rather than looking at the distribution of the data as a whole.

    gzip isn't perfect, but it will find repetitive byte sequences of any kind, regardless of the type of data. It's more of a sanity check than a knowledge extraction method.

    --
    ---- "If we have to go on with these damned quantum jumps, then I'm sorry that I ever got involved" - Erwin Schrodinger
  110. This method... by cr0sh · · Score: 1
    ...and what seemed like an exhausting list of others was covered in lengthy detail in Stephen Wolfram's "A New Kind Of Science" (IIRC, in the section on perception and analysis - but referred to in myriad ways throughout the volume).

    Now, I know I will either be flamed or derided for bringing up the mention of this text. I don't claim to be an expert on it (in fact, the scope and breadth of the reading convinced me that one time through is no where near enough - I will probably re-read it several more times in my life). I also know that his work both extrapolates and builds upon previous work - he mentions this repeatedly throughout the book.

    If you are at all interested in this sort of thing (and let me tell you this, his book covers much more than just using compression algorithms to determine patterns created by biological processes), you owe it to yourself to read, in full, the book at least once.

    --
    Reason is the Path to God - Anon
  111. uh? by h2k1 · · Score: 1

    well, i don't know if this is a result of iteration or merelly a fact of nature, but i tried the "biogenic image detector" and when comparing the same image, the file 1 was allways more compressed than the second, despiste being the same... Can anyone explain me this?

  112. GZIP IS THE ANSWER by Anonymous Coward · · Score: 0

    Wesley Willis writes:

    I told you about the time
    that life was differentiated with GZip.
    I was very greatful to hear about that stuff.
    Take it to me. It'll be the best test to the life.
    Tell it all to Jean-loup Gailly. Tell it to Mark Adler with all your mite. I'm saying this again to let you know:

    GZIP IS THE ANSWER
    GZIP IS THE ANSWER
    GZIP IS THE ANSWER
    GZIP IS THE ANSWER!
    Tell it like it is, Jack

    It was so alike to be compressed.
    I told it the same way.
    I have to compress it the same way so it is to be alive.
    Make it great for the grant money.
    It should be for the grant money.
    I'm saying this again to let you know.

    GZIP IS THE ANSWER
    GZIP IS THE ANSWER
    GZIP IS THE ANSWER
    GZIP IS THE ANSWER!
    Tell it like it is, Jack

    Everywhere you go, GZip is helping you.
    GZip that file. It'll sound the trumpet for you like it'll sound the trumpet for me.
    GZip it well for you and I'll gunzip it well for mell.

    I'm saying this again to let you know.

    GZIP IS THE ANSWER
    GZIP IS THE ANSWER
    GZIP IS THE ANSWER
    GZIP IS THE ANSWER!

    I'm letting you know that GZip is the answer.

    Rock on Jean-loup and roll on Mark.

  113. Just remember to ftp gzip files in binary mode by iamacat · · Score: 1

    Otherwise, you'll just get a worthless puddle of protoplasm when you uncompress people on the other end of the teleport. Also, don't compress humans and insects together, altough you might get a better ratio that way.

  114. They used TIFFs not JPEGs by Gogo+Dodo · · Score: 1

    You didn't read the article carefully enough. The seventh paragraph of the article clearly states they used TIFF images, not JPEG.

  115. Re:The Mars fossil IS made by life; my wife is not by laejoh · · Score: 0

    tried it on goatse.sx yet? he has no clothes, so...

  116. Winzip a better soultion? by Anonymous Coward · · Score: 0

    Surely winzip would be a better solution after all everyone knows that there are more windows users then any other platform.

    This would give a much better chance of finding the existance of life through shear numbers. ;^>

    Please tell me this story is an early April Fool!

  117. check it out... by Anonymous Coward · · Score: 0

    I uploaded one image of war casualties (incinerated bodies from 1991 gulf war) and an image of a friend of mine and. The result:

    Answer: Image 2 (24.1070556640625 % compression) has a higher complexity measure than image 1 (0.861397377968705 % compression), and thus image 2 is more probably biogenic.

    I guess it's doing what it's supposed to...

  118. Windows XP is alive! by laard · · Score: 2, Funny

    Did a test run with some default images in windows xp. Windows XP's "Purple Flower.jpg" is apparently more "alive" than Windows Xp's "Tulips.jpg" but "Windows XP.jpg" is more alive than both of them!

    --
    --- If we knew half the things we shouldn't we'd stop wishing we knew it all
  119. GNU -- the ``g'' is gzip... by mi · · Score: 1

    What's so special about (g)zip? Would not any good archiver (or, rather, any good archiving algorithm) do?

    Rhetorical questions, of course -- what good is an article if it does not mention GNU and/or Linux...

    --
    In Soviet Washington the swamp drains you.
  120. huh ? by Anonymous Coward · · Score: 0

    "reigons of similar, predictable data while preserving aspects of dissimilar, unpredictable detail.. " the article posted doesn't (if you did any reading) claim to do anything other than that, and that there might be good correlations to that, and life forms.

    what does what you do with gzip have anything to do with the article ?

  121. Mathematics for this by SparafucileMan · · Score: 0
    You can find the mathematics for this whole deal in an outstanding book, Introduction to Kolmogorov Complexity and its Applications.

    The basic idea is that the probability of highly compressible data resulting from a algorithmic process is much higher than one that would result from a random process.

  122. Re:The Mars fossil IS made by life; my wife is not by Anonymous Coward · · Score: 0
  123. Moderators... by mivok · · Score: 1

    will somebody please moderate that fossil -1 redundant?

  124. Information theory by pclminion · · Score: 1
    Information theory says that random noise is the most "concentrated" form of information possible. Roughly, information == entropy (actually, entropy times number of symbols).

    Random data may not be meaningful but they are full of information by definition.

    Consider the sequence 123123123123. The sequence is highly ordered, and therefore is probably meaningful (at least to someone, somewhere), but it contains very little information. In contrast the random sequence 196390244187 is highly disordered, totally meaningless, yet contains more information than the 123123123123 sequence.

    The technical definition of "information" is counterintuitive, not as simple as "noise vs. signal."

  125. Redundancy == Survival by Sayjack · · Score: 1

    Long ago engineers learned that nature evolves life to adapt to it's environment. We often look to nature for inspiration.

    What drives nature? Survival for one. Survival often depends on having a backup so it's no surprise that nature tends to adapt redundant systems.

    I have 2 hands in which I can hold a spear or club. I have 2 eyes to identify my potential predators. I have 2 ears to hear them coming. I have two...well you get the picture.

    All of this is externally visible. As such, the a jpeg or gif image is likely to capture some quantifiable amount of this externally visible redundancy.

    GZip which is being used here to measure visual redundancy in an ingenious manner. It's not entirely surprising that it's working.

    --

    -- Good judgement comes with experience. -- Experience comes with bad judgement.

  126. histogram by Anonymous Coward · · Score: 0

    This same information could be achieved by doing a frequency histogram on the data.

  127. Seems to be just a complicated way by RDPIII · · Score: 1

    to measure entropy or redundancy. Why not do that directly? A program to measure 8bit entropy is not more than a few dozen lines of C, or one could simply "apg-get install ent".

    --
    Marklar: marklar
  128. fuckin' hilarious by Anonymous Coward · · Score: 0

    too bad not many will get the joke :-)

  129. Yes, but ... by guybarr · · Score: 1

    Add some entropy to your life; write drunk

    But would writing drunk increase or decrease entropy overall ?

    --
    Working for necessity's mother.
  130. Re:Uhhhhh... No.....Does It Detect Red Herring Too by Raiford · · Score: 1
    There is a class of wavelet transforms that allow for data compression in the form of time-series or images. The amount of compressiblity here is also directly proportional to the amount of determinism in the data. An interesting place to look is here.

    --
    "player 4 hit player 1 with 0 stroms"
  131. Remember people by jasonditz · · Score: 1

    Don't Gzip the living, that's a felony.

  132. Another way to stop spam by Bilb0 · · Score: 1

    With some users complaining about receiving up to 1,000 unsolicited e-mails a day there is no shortage of innovative solutions to stop the spam. Blacklists of known spammers and keyword filtering have been tried with mixed results. Another promising approach, pioneered by companies like MailFrontier, MailBlocks and others is called challenge-response. It works as follows. When a customer receives a new message from an unknown correspondent, the system will intercept the message and automatically return to the sender a form to fill out. The typical form contains graphics images, simple pictures, geometric shapes, colored check-boxes or other objects that are easily recognizable by humans but hard for computers to make sense of. Once a human being views those images and types the response into the form - demonstrating that she is a person and not an automated mass-mailing machine - the system will forward the e-mail to the intended recipient. DyedBlond, a secretive Silicon Valley artificial intelligence startup, is rumored to have been working on an advanced version of a challenge-response spam blocker. Whereas existing challenge-response spam blockers discriminate between mass-mailing machines and humans, DyedBlond discriminates between intelligent and not so intelligent humans. "Counting daisies and bunnies is too simple," says Alex Brodenschmuck, a renowned AI expert, "Sooner or later the machines would learn how to do it. You need more sophisticated tests for human intelligence. An ability to maintain a conversation has always been considered a test for true intelligence (so-called Turing test)." But DyedBlond goes beyond small talk. Want to send an e-mail to a physics professor? Be ready to take an integral or solve a differential equation. Sending your resume to Wall Street? Prepare to price an exotic option. Not only does DyedBlond solution eliminate spam, it prioritizes mail by sorting the correspondents by their intelligence. "If the guy cannot solve Schroedinger equation for hydrogen atom you probably don't want to hear from him," says Tiev Resle, a Cornell physics professor. "I am looking forward to the day when American Physical Society makes DyedBlond mandatory filter for all e-mail sent to its members. This will kill spam once and for all, and improve students' performance."

  133. A few questions. by shadowpuppy · · Score: 1

    1. This seems like it might be a special case. The samples in question are layerd and thus have a greater chance of being compressable. I wonder how well a tree would do.

    2. As far as I can tell they are not pictures of life but rather the effects of life. The results may come out differently when the pictures were of actuall living things.

    3. If for some reason an image gets compressed by a lossy compression scheme then the data has to be thrown out.