Slashdot Mirror


Using AI for Spam Filtering (w/ Source Code)

jarhead4067 writes "Article snippet: "Up until recently, most researchers in the fight against spam have failed to classify it as an artificial living organism, hindering the development of effective tools and techniques to kill it. While this classification may sound strange, consider the following..." A novel approach to filtering spam, and hey, there's free source included."

15 of 197 comments (clear)

  1. I'll Read the Article... by UberOogie · · Score: 4, Funny

    ... after we get an AI to counter the Slashdot effect.

    --
    "Enough of this wretched, whining monkey life." -- Marcus Aurelius, _Meditations_, Book 9, 37
  2. Artificial living organism by Anonymous Coward · · Score: 4, Funny

    I won't believe spam is a living organism till I see Marty Stouffer do a special, complete with comedy 'boing' noises and 'aint that cute' music as we watch a mother Spam care for her young.

    1. Re:Artificial living organism by ThisIsFred · · Score: 2, Funny

      Those aren't my type of nature specials. I'd rather see a spam run down by a cheetah as it tries to escape through my router.

      --
      Fred

      "A fool and his freedom are soon parted"
      -RMS
  3. I can't let you read this Dave. by Anonymous Coward · · Score: 0, Funny

    How about a nice game of chess?

    1. Re:I can't let you read this Dave. by mog007 · · Score: 4, Funny

      Worse than being killed by the AI.. what if the AI decides to not filter spam anymore?

      "I'm sorry Dave, but your wife thinks you SHOULD try this V@GR!A substance."

      or

      "This Nigerian seems very nice, and if it pays off you can get me more delicious RAM."

  4. The great and powerful Oz has spoken! by carpe_noctem · · Score: 3, Funny

    And the AI says....

    The page cannot be displayed
    There are too many people accessing the Web site at this time.
    Please try the following:
    Click the Refresh button, or try again later.
    Open the www.generation5.org home page, and then look for links to the information you want.
    HTTP 403.9 - Access Forbidden: Too many users are connected
    Internet Information Services

    --
    "Quoting famous computer scientists out of context is the root of all evil (or at least most of it) in programming." - K
  5. Animal Rights Activists by toetagger1 · · Score: 5, Funny

    "living organism ... and techniques to kill it"

    Next thing we know, we will have Animal Rights Activists in Washington, D.C. protesting our "spam traps"

    --
    who | grep -i blond | date cd ~; unzip; touch; strip; finger; mount; gasp; yes; uptime; umount; sleep
  6. Who would have thought by mst76 · · Score: 2, Funny

    > most researchers in the fight against spam have failed to classify it as an artificial living organism

    Who would have thought Skynet has its origins in spam?

  7. Re:already slashdotted :( by Anonymous Coward · · Score: 1, Funny

    wooooooosh.

  8. Really? by Nestafo · · Score: 2, Funny

    Your web server can also be classified as an artificial living organism. But I ain't so sure about that living part anymore...

  9. Don't bother reading this article... by Monkelectric · · Score: 4, Funny
    It is *terrible*. Briefly: the author invented a rule based method for classifying email, and then added a few paramaters so he could call it a "learning algorithm". As if adjusting the ratio of links to words will allow you to detect spam, then he seems to throw in a Neural Network for no reason.

    I think about the only good thing I can say about this article is, at least he's not out killing puppies.

    --

    Religion is a gateway psychosis. -- Dave Foley

  10. Re:Bayesian filtering by Anonymous Coward · · Score: 1, Funny

    I wouldn't. It's straightforward, dumb and effective. AI would be the opposite.

  11. Re:The Article by ArbitraryConstant · · Score: 4, Funny

    This article advocates a

    ( ) technical ( ) legislative ( ) market-based ( ) vigilante

    approach to fighting spam. Your idea will not work. Here is why it won't work. (One or more of the following may apply to your particular idea, and it may have other flaws which used to vary from state to state before a bad federal law was passed.)

    ( ) Spammers can easily use it to harvest email addresses
    (x) Mailing lists and other legitimate email uses would be affected
    ( ) No one will be able to find the guy or collect the money
    (x) It is defenseless against brute force attacks
    ( ) It will stop spam for two weeks and then we'll be stuck with it
    ( ) Users of email will not put up with it
    ( ) Microsoft will not put up with it
    ( ) The police will not put up with it
    (x) Requires too much cooperation from spammers
    ( ) Requires immediate total cooperation from everybody at once
    (x) Many email users cannot afford to lose business or alienate potential employers
    ( ) Spammers don't care about invalid addresses in their lists
    ( ) Anyone could anonymously destroy anyone else's career or business

    Specifically, your plan fails to account for

    ( ) Laws expressly prohibiting it
    ( ) Lack of centrally controlling authority for email
    ( ) Open relays in foreign countries
    ( ) Ease of searching tiny alphanumeric address space of all email addresses
    ( ) Asshats
    ( ) Jurisdictional problems
    ( ) Unpopularity of weird new taxes
    ( ) Public reluctance to accept weird new forms of money
    ( ) Huge existing software investment in SMTP
    ( ) Susceptibility of protocols other than SMTP to attack
    ( ) Willingness of users to install OS patches received by email
    (x) Armies of worm riddled broadband-connected Windows boxes
    (x) Eternal arms race involved in all filtering approaches
    ( ) Extreme profitability of spam
    ( ) Joe jobs and/or identity theft
    ( ) Technically illiterate politicians
    ( ) Extreme stupidity on the part of people who do business with spammers
    ( ) Dishonesty on the part of spammers themselves
    (x) Bandwidth costs that are unaffected by client filtering
    ( ) Outlook

    and the following philosophical objections may also apply:

    (x) Ideas similar to yours are easy to come up with, yet none have ever been shown practical
    ( ) Any scheme based on opt-out is unacceptable
    ( ) SMTP headers should not be the subject of legislation
    ( ) Blacklists suck
    ( ) Whitelists suck
    ( ) We should be able to talk about Viagra without being censored
    ( ) Countermeasures should not involve wire fraud or credit card fraud
    ( ) Countermeasures should not involve sabotage of public networks
    (x) Countermeasures must work if phased in gradually
    ( ) Sending email should be free
    (x) Why should we have to trust you and your servers?
    ( ) Incompatiblity with open source or open source licenses
    ( ) Feel-good measures do nothing to solve the problem
    ( ) Temporary/one-time email addresses are cumbersome
    ( ) I don't want the government reading my email
    ( ) Killing them that way is not slow and painful enough

    Furthermore, this is what I think about you:

    (x) Sorry dude, but I don't think it would work.
    ( ) This is a stupid idea, and you're a stupid person for suggesting it.

    --
    I rarely criticize things I don't care about.
  12. Re:Not new, not genetic, not A.I. -- it's Bayesian by Anonymous Coward · · Score: 1, Funny
    Cmdr Taco, Slashdot was a brilliant idea of yours, and I love your site -- but that's because I have reasonably high expectations for it. (Italics mine)

    Really? Why?

  13. Who posts this crud? Who submits it? by jaghatarjankare · · Score: 2, Funny

    NOTE: The sample code for this application is in C#. C# was chosen over C++ so beginners could better see the structures of the process, and C# was chosen over Java because of the inherent performance advantages of .NET.

    What morons. what total losers.