Slashdot Mirror


Using gzip As A Spam Filter

captainclever writes "Kuro5hin have an interesting article on detecting spam using gzip." Here's a sample: "Loosely speaking, the LZ (Zip) and the related gzip compression algorithms look for repeated strings within a text, and replace each repeat with a reference to the first occurrence. The compression ratio achieved therefore measures how many repeated fragments, words or phrases occur in the text."
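The idea in the quoted sample can be tried in a few lines. The following is only a minimal sketch, not the method from the Kuro5hin article: it assumes Python's standard `zlib` module (which uses the same DEFLATE compression as gzip, with a different header) and a hypothetical helper named `compression_ratio`. Both sample strings are made up for illustration.

```python
import zlib


def compression_ratio(text: str) -> float:
    """Return compressed size / original size (hypothetical helper).

    Lower values mean the text contains more repeated fragments.
    """
    raw = text.encode("utf-8")
    if not raw:
        # Nothing to compress; treat empty input as "no repetition".
        return 1.0
    return len(zlib.compress(raw, 9)) / len(raw)


# Made-up sample inputs, for illustration only.
repetitive = "BUY NOW!!! Click here. " * 50
varied = (
    "Loosely speaking, the LZ (Zip) and the related gzip compression "
    "algorithms look for repeated strings within a text, and replace "
    "each repeat with a reference to the first occurrence."
)

if __name__ == "__main__":
    print(f"repetitive: {compression_ratio(repetitive):.3f}")
    print(f"varied:     {compression_ratio(varied):.3f}")
```

The repetitive string should come out with a much smaller ratio than the ordinary prose. Whether any particular cutoff separates spam from legitimate mail is not something this sketch shows; a threshold would have to be chosen from real data.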

3 of 268 comments

  1. But... but... but... by Anonymous Coward · · Score: -1, Troll

    Ostrich!

  2. Thanks Timothy by Talez · · Score: -1, Troll

    K5 has been having troubles with speed over the past couple of weeks. I'm sure this will make it much better.

    It's good to know that you're using your power for good and not for evil. Oh wait, last time I checked blindly flooding a community run site into oblivion by sending 250,000 people our way is evil.

    Thank you very much Timothy.

  3. Lamer filter ph3ars me! by Anonymous Coward · · Score: -1, Troll

    It is official; HP confirms: Algerbraic is dying!

    One more crippling bombshell hit the already beleaguered Algerbraic community when HP confirmed that Algerbraic calculator usage has dropped yet again, now down to less than a fraction of 1 percent of all professionals. Coming on the heels of a recent hpcalc.org survey which plainly states that algerbraic notation has lost more market share, this news serves to reinforce what we've known all along. Algerbraic is collapsing in complete disarray, as fittingly exemplified by failing dead last [hpcalc.org] in the recent HPcalc.org speed trials.

    You don't need to be a Kreskin [amdest.com] to predict algerbraic's future. The handwriting is on the wall: Algerbraic faces a bleak future. In fact there won't be any future at all for algerbraic because it is dying. Things are looking very bad for algerbraic. As many of us are already aware, it continues to lose market share. Red ink flows like a river of blood.

    TI's algerbraic calculator development team is the most endangered of them all, having lost 93% of its core engineers. The sudden and unpleasant departures of long time algerbraic's developers Casio and Sharp only serve to underscore the point more clearly. There can no longer be any doubt: Algerbraic is dying.

    Let's keep to the facts and look at the numbers.

    RPN supporter Jean-Yves Avenard states that there are 70000 professional users of calculators. How many users of algerbraic are there? Let's see. The number of RPN versus algerbraic posts on comp.sys.hp48 is roughly in ratio of 500 to 1. Therefore there are about 70000/500 = 14 algerbraic users. Sharp DAL (Direct Algerbraic logic) posts on Usenet are about half of the volume of plain algerbraic posts. Therefore there are about 7 users of DAL. A recent article put DAL at about 50 percent of the algerbraic market. This is consistent with the number of DAL Usenet posts.

    Due to the troubles of mismatched brackets, excessive keystrokes and so on, algerbraic went out of favor with TI and was taken over by Casio who sell another troubled calculator. Now Casio is also dead, its corpse turned over to cheap Chinese calculator manufacturers.

    All major surveys show that alg has steadily declined in market share. Algerbraic is very sick and its long term survival prospects are very dim. If Algerbraic is to survive at all it will be among vintage calculator collectors. Algerbraic continues to decay. Nothing short of a miracle could save it at this point in time. For all practical purposes, Algerbraic is dead.

    Fact: Algerbraic is dying

    If the lamer filter can't stop these posts, what can it do?!