SpamAssassin Gets a Promotion
darthcamaro writes "The folks at internetnews.com are reporting that the Spam Assassin project has been promoted to a full top level Apache Software Foundation project..the project has been in incubation for a while and it's finally made it through...the article also reveals that Apache is now using Spam Assassin themselves: 'I think spam filtering is now a critical part of the network infrastructure and Spam Assassin is a leader in the area,' said Daniel Quinlan, chairman of the Apache Spam Assassin Project Management Committee."
I didn't see that one coming. I have been using SA for about three years, I think... well, since whenever I heard about it anyway :)
A well configured installation of SA got me employee of the month way back when. Sadly, UCE/UBE is/has ruined the Internet. Finally.
"To deny our own impulses, is to deny the very thing that makes us human." - Mouse
Perhaps Slashdot editors might want to take an extra 20 seconds to check the spelling of the URLs they put in their stories.
spamassassin.org, not spamassasin.org
Brielle
This is great news! I have been running SpamAssassin on my box for quite a while, just to filter my own mail. I recently installed it on my mother's Windows 98 box to filter her mail when she checks it with Outlook Express, and she hasn't complained about Spam since. With a bit of tweaking, its been catching 95% with no false positives. Hopefully the SpamAssassin project will keep on getting better :)
For those looking for the official spam assasin site here it is
The link in the text goes to some search page
"Alcohol, cause of, and solution to, all of life's problems" -Homer Simpson
Anyone know when Spamassassin 3.0 is going to be released? Some spammers seem to have outsmarted 2.63. I'm really excited to see what changes they have made to up the ante in teh war on spam
"Avoid employing unlucky people - throw half of the pile of CVs in the bin without reading them." -- David Brent
"I think spam filtering is now a critical part of the network infrastructure"
After using SpamAssassin for quite a while, it just wasn't cutting it - 75%-80% accuracy is still a lot of spam to go through and delete. I added DSpam to my mail server and my spam catching rate is now better than 99%.
DSpam also came with much better directions for integrating with Exim than did SpamAssassin. As fond as I was of SpamAssassin, they have some catching up to do.
No, Thursday's out. How about never - is never good for you?
There was no clear way to integrate dspam with my postfix gateway as with spamassassin. Lots of instruction on setting it up with a local mail handler but not so much for a relay host.
Maybe there's a way to do it but I couldn't take the time to figure out a good way to get it done.
- It's not the Macs I hate. It's Digg users. -
If only it truly assassinated spamers.
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
See, i'm not interested in Assassinating Spam. Now if there was a SpammerAssassin, then I'd be all over using that.
What do you do with mail SA has flagged?
I like SA, and find it is very good for identifying around 95% of my incoming spam. However, I also have around 0.1% false positive rate, which means at some point I have to look through all the filtered spam messages and make sure none of them were legit.
I need a better tool for handling mail SA has identified as spam, either server-side or client-side. I'd like to delete anything with a score > 15, simply store anything with a score > 5, and send an auto-reply for scores between 5 and 10 indicating that the message was marked as spam and I'll probably never look at it.
A good set of procmail and formail rules will accomplish this, but my hosting company has a weird procmail setup and I'd prefer something easier to implement.
Any ideas?
as soon as it learns to speak french
s as sin.devel/26614
http://article.gmane.org/gmane.mail.spam.spamas
Everyone on Slashdot always seems to be complaining about spam. I don't see what the big deal is. I enjoy receiving e-mail from people and companies I don't know. Each morning when I run my e-mail program, it starts downloading, and the unexpected e-mail is a pleasant surprise that brightens my day. Well, a few hundred pleasant surprises that is, and they brighten my day in the same way that stepping in a pile of dogshit brightens my day. A few hundred times. So what the fuck? Why are all you whiny bitches on Slashdot always complaining about spam? Don't waste your time writing or deploying spam blockers. Enjoy life. And relax. Assholes.
Shame on Google.
Novell NetMail even supports SpamAssassin now.
http://netmail.sourceforge.net/
I don't employ Spamassassin or any other spam blocker. As a result, I now have a penis that will make her scream, hot lesbian schoolgirls lusting after my every move, a wide range of generic drugs, 2 PhDs and a completely clean credit record
A step up from living in your parent's basement and whacking off to an inflatable doll, right?
I'd stay and chat, but I have to get back to a Nigerian man about a bank transfer
call this guy
I can't speak for auto-replies, but you can do the sorting part client-side. The key is that spamassassin adds a line like "X-Spam-Level: *****" where the number of *'s is the score of the email. Almost any email client can filter mail to different folders based on headers. The unary representation of the spam score ensures that even a primitive filter can work.
For example, one popular client is Microsoft Outlook, and there are several web pages in google (such as this one) that explain how to reroute mail to specific folders depending on the spamassassin score.
they'll get it when they post the story again.
Ben
Work Safe Porn
I intended that to mean "understand," but really, I don't get any. Just have a regular (Well, recently upgraded to ad-free) mail.com address, and I get exactly 0 spam emails. Don't even know specifically of any spam-blocking software.
I hate grammar Nazi's.
YOU
not even DSpam will block these things
i keep getting messages from some company called CERT that say to upgrade windows stuff
i don't even have windows
nothing blocks these BASTARDS
If you integrate it with your mailer, you can reject the mail during the SMTP session rather than generating a separate bounce email, which would have the problems you mentioned (going to a forged from: address). As an added bonus, when you reject it during the SMTP session, you'll get taken off a lot of spam lists, since your address will look like it had delivery problems. And you still get the advantage of bounces, that legitimate mail that got rejected will end up with a bounce back to the sender informing them of it.
10 PRINT CHR$(205.5+RND(1)); : GOTO 10
Spam Assassin, while a very clever program, is as misdirected as the "Canned Spam" legislation. It has no effect on the real economics of spam: who pays for it.
Somebody is paying for the spamming, and we know exactly who it is. The URL of that organization is prominently displayed in every item of spamail. It is the advertiser.
The advertiser is right there out in the open, easy to locate. If they're not, the spam isn't doing its job, and wouldn't have been sent. And easy to locate means easy to go after, easy to sue, to fine, DoS or whatever.
Dinging the advertisers, and dinging them hard, will instantly put the spammers out of business.
Spamming can be eliminated without blocking, white lists, or anti-spoofing RFC's. Just go to where it's pointing.
To draw an [ugly, graphic] picture: a dog comes and poops on sidewalk in front of my house, and I step in it. Yelling at the dog is going to be only moderately successful, building a poop filter is difficult, messy, and leaky (as Spam Assassin demonstrates) . Following the dog's leash and fining the owner is what works.
The owner doesn't bring the dog back since s/he doesn't want to pay another fine.
No owner, no dog, no spam.
Get the owner.
Kill the spam.
Does this mean they finally have to sort out that god awfull tip of a web site then?
Sounds Good...
Webmaster of Infoweb
3.0.0pre1 was made available last week.
i ld/3.0.0_change_summary
It will apparently take another month or so to finalize the weighting of the rules.
I've put 3.0.0pre1 on a production system that filters ~350k messages per day. With some tweaking of the RBL, bayes, and AWL rules, it is much (~10%) more efficient at tagging spam than 2.63, which I'm running on a parallel server that also sees ~350k messages/day (load balancing is your friend).
More info: http://www.au.spamassassin.org/full/3.0.x/dist/bu
I'm with you 99 %.
Filtering spam generates way too many false positives. Challenge/Response schemes are IMHO much more effective. TMDA and similar programs can be configured with whitelists for your regular mail partners, auto-whitelists for everyone who confirms their e-mail identity, and, if necessary, with blacklists too.
cpghost at Cordula's Web.
Not perfect, then again, spam prevention methods never are.
What I do
The Internet's nature is peer to peer - 20050301_cs_profs.pdf
The people I trust with my "hidden" personal email address are also people I trust to run a patch MTA/MUA. Of course, they also have my personal phone number, if they need a different way of contacting me, when email doesn't work.
If I don't trust them to run a patched MTA/MUA, then they'll get one of my "special" ie. sacrificial subdomain email addresses, rather than my actual personal one.
A key point, which I didn't realise when I wrote the original text at the URL, is that nothing is permanent, including my physical address, my phone number or even my name. Once you accept that all these things have different levels of temporalness, you can then, in the specific case of spam, start creating and using email addresses that have differing levels of life expectency. My (current) personal email address isn't ever going to be permanent, as I probably won't have nosense.org as a domain for the rest of my life.
Once you get over the idea of a permanent, rest-of-life email address, you can then start to be a bit more creative with spam mitigation techiques.
The Internet's nature is peer to peer - 20050301_cs_profs.pdf
not all bulk mail is spam. spam assassin gives 2.4 points if it finds anything that looks like a unique identifier for X-Sender, and another 1.4 points for anything that looks like a tracking image or tracked link.
that plus the points for any non-safe html colors or any html at all, SA effectively tags ANY bulk mail as spam!
For an end user to setup on their client (as a "junk mail" folder) thats great.. I like to have bulk mail seperated from my personal mail, but for an ISP to throw it away before it even gets to the intended recipient is fucking rediculous and should be illegal.
The only email an ISP should be allowed to discard are the ones with attached viruses or some known email worm. The only reason your customers are happy with you throwing away their email is because you don't fucking tell them.
bite my glorious golden ass.
Well, I still use SpamAssassin, some 30 spams a day, no false positive, and 1 false negative in a month or so. Why is it so successful? I have spent a lot of time training it's Bayesian filter on both ham and spam. That's the key to success for any spam filter nowadays..
Many people use spamassassin on unix boxes, or if they have Exchange they use SA on a unix gateway between the net and the Exchange system.
But if you are a smaller shop and don't have the resources for that, then you can run sa right on Exchange.
Here is a write up on how to do it (that particular write up is for Exchange 2003 and SA 3.0, but it will work for SA 2.x as well, and for Exchange 2000 - or any combination thereof - but it won't work on Exchange 5.5 that I know of).
There are some odd things afoot now, in the Villa Straylight.
SpamAssassin has been a life saver. A few months back my server started taking a spam pounding. My personal spam alone went from just a couple a day, to at least 15 a day (I know thats not much for some of you, but it is for me, I've always been smart about where I give my email address. Unfortunately, I have to give an address that shows on domain whois databases and thats mostly where they get me). So I installed MailScanner, SpamAssassin, and ClamAV. I have it set to tag email only, but that works fine, I just filter spam to a spam folder in OE and occasionally take a quick glance to see if it caught any non-spam emails before I empty it. So far it has only caught one email I wanted to received, and it looked so much like one big ad that I understood why.
See, I've realized that spammers get my address because of my own behavior. I changed my address and started protecting my it responsibly, and have not received spam in two years. Count 'em. Two years. Ya'll are whores playing fast and loose with your personal information and then crying when you get herpes.
think of some monster inside a giant larval sac when they said incubating?
I'm no *nix expert, but I finally figured out how to do DSpam 2.10 on a Postfix/Amavisd-new gateway. Email me com dot akghetto at gdl (letters not backwards, just the words) and I might be able to help.
Over about a year, my spam levels had been gradually reducing from 250/day (for a single email address!) down to six or seven, as I had been bouncing spam diligently - I know it's a pain for faked senders, but it seemed to effectively discourage the spammers better than just dropping it.
Recently I renewed my domain name at Network Solutions. I made the mistake of not paying the extra $9/year to keep the record private. Within a couple of days my spam levels went to about 50-100 per day. I'm a cheapskate, and I didn't feel like I should have to spend money for this, but now I wish I had.
Is there a good spam filter I can install on my personal computer? I guess it has to involve setting up a mail server on my machine just for my incoming mail, but I'd be ready to look into doing that if it would help.
My ISP is using SpamAssasin, but I have no influence over the setting they use, and it lets far too much through.
I assume a server solution is better than a filtering client. Besides, it would make it possible to filter for both computers in the household.
I'm on Mac OS X.
It can also be run with godforsaken GroupWise with the addition of a piece of software known as "Guinevere" and a lot of elbow grease; trust me, I've had the pleasure of doing it.
As it seems now obligatory to mention anti-spam systems whenever a /. story mentions spam, I thought I'd add the following:
Please have a look at Spam Cannibal
It's an interesting concept that if correctly deployed (big "if") by even a relatively few admins around the world, could really make a difference to the amount of spam on the net. It can also protect hosts against DoS attacks of various kinds.
Don't get me wrong, I'm not astroturfing this (much...). It has flaws - there are those who think blacklisting is a bad idea, and I can see their point of view on that - but I just think Spam Cannibal needs more visibility as an approach.
"And the meaning of words; when they cease to function; when will it start worrying you?"
Nice finally. Makes me want to run web mail now.
Joe Jobs
I love C++
How do you configure SpamAssassin for a mail relay of about 200 users?
I have SA running on a Linux server (sendmail) as a mail relay receiving mail from the Internet, and forwards the mail to multiple Exchange servers of about 100 people each. The internal users don't have accounts on the sendmail server.
What's the best way to implement SA in this situation? A common Baysian database doesn't work very well since each user has different definition of what "spam" is, and loading everyone's Inbox into the database isn't a very good option.
Try turning on the Bayes filter and be amazed :-) Spamassassin + SARE Rulesets + a well trained Bayes filter = 99+% effectiveness for me.
I blacklist *@*.* and it's quite effective at getting rid of spam. I have a 100% spam filter. It also filters my e-bills as well as a added feature.
My one email account uses SA, but does not auto delete.
Evolution just files those with high rankings into another folder.
I don't think auto deleting emails is a good idea.
Yahoo does a similar thing, throws suspect spam into a bulk folder. Quite often I find stuff (website registrations) in there.
There are 3 main costs to spam
Server - CPU/Storage
Bandwidth
User time to read/sort it
User time is expensive.
Bandwidth and servers are relatively cheap
Email filtering saves the most valuable resource, this helps limit the damage of it. Nobody is saying this is the ideal case, but it is an effective tool.
I was so impressed ever since I heard about SA.