Slashdot Mirror


User: Glog

Glog's activity in the archive.

Stories
0
Comments
260
First seen
Last seen
Profile
(view on slashdot.org)

Comments · 260

  1. Umm yah on Rewrites Considered Harmful? · · Score: 1

    Win 95 sure did work flawlessly. It was a bummer when 98 came out. My BSOD went through the roof from 5/day to 30/day.

  2. Re:Pricing on A Look Inside Virginia Tech's New Super Computer · · Score: 1

    The Japanese Earth Simulator went live in May 2002 - see http://www.es.jamstec.go.jp/esc/eng/ES/birth.html.

  3. try xCache on Reverse/Server-Side Proxy Caching for Windows? · · Score: 3, Informative

    Try xCache - I've used it before and it's quite good: http://www.xcache.com/home/default.asp

    Many Fortune 500 companies use it.

  4. put alarm clock out of hand's reach on Alarm Clocks for Heavy Sleepers? · · Score: 1

    I have a similar problem - when I go to bed late I tend to just smack the alarm clock in the morning and continue sleeping. I don't even remember a thing.

    A very quick and dirty solution for me has been to put the alarm clock across the room so that I have to physically get up to turn it off. That forces my body to waken up so that I won't go right back to bed. HTH

  5. Soo that means on Marriage May Tame Genius · · Score: 1

    ... most Slashdot readers are safe. Smart forever, mooohaha ...

  6. Re:Buddy, you don't know poor! on Sun Sued Over H1-B Workers · · Score: 1

    Get your facts straight - H1-B workers work in THIS country and PAY TAXES JUST LIKE YOU DO. And yes there is a lower limit on how much they can be paid - it's called the prevailing wage. And yes companies are supposed to make sure that an American citizen cannot fill the position.

  7. Re:Standard US pattern on What Fruits Will Reduced R&D Bear For The U.S.? · · Score: 1

    For your info the leading Chinese manufacturer Legend is one of the biggest in the world. And it's only going to be bigger. Check out the population of China vs that of the US sometimes.

  8. Text of the full article on Using gzip As A Spam Filter · · Score: -1, Redundant

    Originally posted on kuro5hin.org

    By KWillets
    Sun Jan 26th, 2003 at 07:03:35 AM EST

    While many people see gzip as a compression tool, it also makes a credible spam filter. Here's how.

    I was reading through a bioinformatics book the other day, and was reminded of a useful shortcut for comparing a text against various corpora. A number of researchers have simply fed DNA sequence data into the popular Ziv-Lempel compression algorithm, to see how much redundancy it contains.
    Loosely speaking, the LZ (Zip) and the related gzip compression algorithms look for repeated strings within a text, and replace each repeat with a reference to the first occurrence. The compression ratio achieved therefore measures how many repeated fragments, words or phrases occur in the text.

    A related technique allows us to measure how much a given, "test" text has in common with a corpus of possibly similar documents. If we concatenate the corpus and the test text, and gzip them together, the test text will get a better compression ratio if it has more fragments, words, or phrases in common with the corpus, and a worse ratio if it is dissimilar. Since the LZ algorithm scans the entire input for repetitions, it tends to map pieces of the test text to previous occurrences in the corpus, thereby achieving a high "appended compression ratio" if the test text is similar to what it's appended to.

    In this case, we wish to compare an incoming email message against two possible corpora: spam and non-spam (ham). If we maintain archives of both, we can compare the appended compression ratios relative to each, to judge how similar a new message is to spam or ham.

    As a simple test, I downloaded some sample spam and ham from the Spamassassin archive. I removed headers from the messages (to focus on message text only), and created spam and ham "training sets" 1-2 megabytes in size. I then tested spam and ham messages not in the training sets for for their compressed sizes when appended.

    Compression was measured as follows:

    $ cat spam.txt new-message-body.txt |gzip - |wc -c
    $ cat ham.txt new-message-body.txt |gzip - |wc -c

    The file sizes output were compared to the compressed sizes of spam.txt and ham.txt without new-message-body.txt appended, to see how many bytes were consumed by the new-message-body.

    The results for "ham" messages were the most dramatic. The average compressed size of a ham message appended to spam was 38% higher than when appended to other ham. For spam messages, the same comparison yielded a compressed size 6% smaller when appended to spam vs. ham, so in both cases, compressing a message with others of its kind yielded a smaller file, on average.

    Individual results were also quite clear: while some spam messages compressed slightly better when mixed with ham, ham messages still maintained a margin of 15% or more between the most spamlike ham, and the most hamlike spam. I would put the threshold somewhere around 110%; if a message's size when gzipped with spam is less than 110% of its size when compressed with ham, it's probably spam.

    In conclusion, gzip is a fairly blunt instrument for spam detection, but the effectiveness of its relatively blind repetition-finding is worth noting. The current fad among spam filters is word-counting, with various statistical heuristics applied to the results. Algorithms like LZ and gzip go beyond word matching, finding entire phrases and paragraphs of repetition, but do not attempt to measure their statistical significance. More sophisticated approaches, which combine phrase matching with statistical analysis, may be more effective.

  9. Re:Monopoly in action. on Microsoft to Buy Rational and/or Borland? · · Score: 1

    Dork - I use Kylix in a live site. And no I am not going to give you the URL.

  10. Imagine ... on Tornado in a Can · · Score: 1

    ... a Beowulf cluster of those. :) Just hadda do it.

  11. Re:the disturbing part of all this is the source on Abrupt Climatic Change Coming Soon? · · Score: 1

    There have been a number of people recently in academia and prestigiuous labs that have fabricated evidence and people have believed them for years until they were discovered. Now I am not saying that this guy did ... Also, what if YOU are the poor and infirm, huh?

  12. Re:$1 million on Million-Dollar Donation To Fight Abusive Copyrights · · Score: 1

    I disagree - others have posted a number of reasons why a million bucks actually mean something. And then there is the precedent that this sets. Once a movement gains momentum other donations are sure to follow.

  13. Re:Why this matters, and why it's mostly good on Million-Dollar Donation To Fight Abusive Copyrights · · Score: 1

    1. Guilt
    2. Altruism
    3. Pure idealism

  14. Why the hell do you give ideas to the RIAA? [n/t] on Can Poisoning Peer to Peer Networks Work? · · Score: 1

    Aagh!

  15. Re:Most normals don't want broadband on Why You Don't Have a Broadband Connection · · Score: 1

    Your arguments don't hold water. If broadband was so commonplace that pc manufacturers started selling computers with nic-cards rather than modems the "normal" user would be more than happy to use all of that excess badnwidth they "don't need or want". And for all of that to happen the prices need to fall down. And they won't until telecoms are in the driving seat.

  16. gone postal on NASA Plan to Read Brainwaves at Airports · · Score: 1

    Isn't this going to be discrimination against mental and heart patients?

  17. lack of content on Digital Restrictions Management for P2P Systems · · Score: 1
    DRM or not, the fact of life is that paid "content" providers will not take off until they can actually offer something better than the mainstream pimpled-teen crapola that is out there. Yah I have checked Pressplay and they have precisely 2% of the over 100 bands that I like to listen to. Yes, I am willing to pay to download MP3's but I want unlimited download for a set monthly fee and no, I don't want the MP3's to expire after 3 months so I can't play them any more, thank you very much.

    Must be tough to be a RIAA/MPAA/Label Exec - loosen those ties, dudes, they're restricting the oxygen supply to your brains!

    I am even willing to pay the full price of CDs if I knew the money was going in the hands of the artist and not in some phat exec's pockets. Cut out the middle-man, yeah - cry me a river because technology is making you obsolete.

  18. huh? on RIAA Smacked by DoS · · Score: 1

    Since when is CDNow a peer-to-peer network?

  19. huh, bandwidth caused them to go bankrupt? on Internet Giants Prepare for WorldCom 'Storm' · · Score: 1

    For your info WorldCom did not go bankrupt because their customers demanded more bandwidth. This is the stupidest economic statement I have heard in a while. They went bankrupt because of sloppy management, corrupted accounting practices, and greed. Can't wait to see which one is next?!

  20. Boooooring on PDA and Subnotebook Killer? · · Score: 1

    Bulky, grotesque piece of overhyped fad electronics. Not to mention the atrocious flash animation.

  21. You are spending $30B? on NASA Panel Says ISS Cuts Hurt Science · · Score: 1

    You must be frickin rich, big daddy to be spending $30B of *your* tax money.

  22. Re:Depends. on Video Games Found To Decrease Brain Activity · · Score: 1

    yeah and

    4) Buy a dictionary and a grammar. Learn to spell.

  23. i beg to differ on Winning the E.T. Lottery · · Score: 1

    >>Are you ready for the answer? ... 50%

    50% NOT - because probability is based on the facts - how a certain entity has behaved in the past and not on how *you* think it should behave.

  24. Re:How about nothing. Here's why... on Subversive Gifts for New College Students? · · Score: 1

    What do you work as to be able to afford hi-fi in college? Gigollo? Looks like mommy and daddy send a check after all ...

  25. Jonathan Swift predicted Phobos and Deimos on Science Fiction into Science Fact? · · Score: 1

    It is widely acknowledged that Jonathan Swift predicted the existance of the two Mars satellites: Phobos and Deimos. Although the two moons were not discovered until 1877, Jonathan Swift had written in 1720 (in Gulliver's Travels, chapter 3) that the inhabitants of Laputa had made important astronomical observations of 10 000 fixed stars and of the two satellites of Mars, one orbiting with a period 10 hours and the other with a period of 21 hours. This is pretty spooky for a prediction!

    Also, Sir Arthur Clarke is considered the inventor of the geostationary satellites.