Comparison of Bayesian POP3 Spam Filters

Re:great by Eric+Ass+Raymond · 2003-08-10 20:05 · Score: 3, Insightful

Spam is not advertising.

It's harrasment.

Nitpick... by 1029 · 2003-08-10 20:07 · Score: 2, Insightful

I just sure as hell hope he meant "latest, best hope", because anyone who thinks bayesian is the LAST best hope doesn't understand CS technology at all. And such a person sure as all hell shouldn't be given an audience on /.

--
- I love animals. I try to eat at least one a day.

Re:Nitpick... by pmhudepo · 2003-08-10 20:11 · Score: 1

You may consider the fact that for non-native English speakers the difference between things like 'latest' and 'last' is not intuitively clear. Give Kristian, apparently from Norway, a break!
Re:Nitpick... by RatFink100 · 2003-08-10 20:40 · Score: 2, Informative

It's a Babylon 5 reference.
Re:Nitpick... by Anonymous Coward · 2003-08-10 21:27 · Score: 1, Informative

What the hell is wrong with using "last" in that context? What did you do last week? Whatever it was, you sure as hell didn't make it to this week, given your narrow definition. Summary: the adjective "last" is perfectly acceptable as "most recent", see a dictionary. "such a person sure as all hell shouldn't be given an audience on /." ... stfu. (... and "CS technology"?? wtf.)
Re:Nitpick... by Anonymous Coward · 2003-08-10 22:03 · Score: 0

No, it's a WW2 reference.
Re:Nitpick... by spongman · 2003-08-10 22:22 · Score: 4, Interesting

Actually SpamBayes isn't bayesian at all. It uses a chi^2-based algorithm which was shown in (the extensive spambayes team's) tests to be superior to regular bayesian filtering.
Re:Nitpick... by AndroidCat · 2003-08-10 22:40 · Score: 3, Informative

The "last, best hope" was used by Lincoln in the American civil war, "We shall nobly save, or meanly lose, the last best hope of earth."
It's quite possible that it goes back further to a version of the Bible or Shakespeare. (Always the two to bet on when finding the source of a phrase in one fell swoop.)

--
One line blog. I hear that they're called Twitters now.
Re:Nitpick... by spongman · 2003-08-10 23:01 · Score: 4, Informative

Here's a bit from the excellent SpamBayes background page:
A remarkable property of chi-combining is that people have generally been sympathetic to its "Unsure" ratings: people usually agree that messages classed Unsure really are hard to categorize. For example, commercial HTML email from a company you do business with is quite likely to score as Unsure the first time the system sees such a message from a particular company. Spam and commercial email both use the language and devices of advertising heavily, so it's hard to tell them apart. Training quickly teaches the system all sorts of things about the commercial email you want, though, ranging from which company sent it and how they addressed you, to the kinds of products and services it's offering.
Re:Nitpick... by cosmo7 · 2003-08-11 00:33 · Score: 1

What! all my pretty chickens and their dam?
At one fell swoop?
(Macbeth, act IV, scene 3)
Re:Nitpick... by tootlemonde · 2003-08-11 01:54 · Score: 1

Fellow-citizens, we cannot escape history. We of this Congress and this administration, will be remembered in spite of ourselves. No personal significance, or insignificance, can spare one or another of us. The fiery trial through which we pass, will light us down, in honor or dishonor, to the latest generation. We say we are for the Union. The world will not forget that we say this. We know how to save the Union. The world knows we do know how to save it. We -- even we here -- hold the power, and bear the responsibility. In giving freedom to the slave, we assure freedom to the free -- honorable alike in what we give, and what we preserve. We shall nobly save, or meanly lose, the last best hope of earth. Other means may succeed; this could not fail. The way is plain, peaceful, generous, just -- a way which, if followed, the world will forever applaud, and God must forever bless."
--Lincoln, State of the Union address, 1862
Re:Nitpick... by 21mhz · 2003-08-11 03:04 · Score: 1

It uses a chi^2-based algorithm

The developers of Bogofilter have gone this route too. It's the filter of my choice: written in C (fassst), mail parser coded in flex (fassst as well), stores the word database in a Berkeley DB (you guessed it... fassst again).

--
My exception safety is -fno-exceptions.
Re:Nitpick... by tim_one · 2003-08-11 12:37 · Score: 2, Informative

The way spambayes estimates the probability that a msg is spam given that it contains a specific word is thoroughly Bayesian, as described on Gary Robinson's web page, and in his March "Linux Journal" article.

The way spambayes combines probabilities ("chi-squared combining") is indeed not Bayesian at all. The probability combining scheme Paul Graham suggested isn't correctly Bayesian either, unless you assume the universe consists of equal numbers of ham and spam messages (so that the prior probability of spam is 0.5).
Re:Nitpick... by SlugLord · 2003-08-11 16:32 · Score: 1

I'm confused...

Does not a bayesian approach simply mean that it maximizes the posterior probability that an item belongs to a class given prior probabilities? Couldn't a chi-squared test be bayesian?

Admittedly, it's clearly not the "classical" bayesian method, but it seems to me a chi-squared test just takes into account the possibility of multiple dependent variables. Maybe I'm missing something...
Re:Nitpick... by Anonymous Coward · 2003-08-13 05:23 · Score: 0

He was paraphrasing the B5 intro " Our last, best hope for..."

Re:great by mirko · 2003-08-10 20:09 · Score: 3, Insightful

I think spam is overhyped : it is not convenient to get some but with properly adjusted filters, very few of these will land elsewhere than in you trash can.

Personally, I get around 100 of these a day, but only 3 get in my inbox instead of one of my specific mail directories, this is not *that* disturbing.

I just wish these spams were better targetted : getting some penis-enlargement, ultra-fast-diet, university-diploma or cheap-herbal alternative to viagra is somehow repetitive and boring.

--
Trolling using another account since 2005.

Bayesian filters are useful, but... by fr0z · 2003-08-10 20:10 · Score: 5, Funny

I still believe that we should have a hunting season for spammers, just like we do for ducks...

--
Never underestimate the predictability of human stupidity...

Re:Bayesian filters are useful, but... by frovingslosh · 2003-08-10 20:16 · Score: 4, Insightful

I still believe that we should have a hunting season for spammers, just like we do for ducks...
No, it should be longer, if not all year long.

--
I'm an American. I love this country and the freedoms that we used to have.
Re:Bayesian filters are useful, but... by dtfinch · 2003-08-10 20:28 · Score: 5, Insightful

You know, computer crimes are considered terrorism under the USA PATRIOT Act. Until that silly law gets repealed, lets hunt down those terrorists for their, umm, denial of service attacks against innocent email users, bandwidth theft, failure to provide real opt-out links, sending email advertisements with fake return addresses, presenting obscene material to minors, etc...
Re:Bayesian filters are useful, but... by ctr2sprt · 2003-08-10 20:29 · Score: 4, Funny

Spammer: Duck season!
You: Spammer season!
Spammer: Duck season!
You: Duck season!
Spammer: Spammer season! Fire!
*bang*
Re:Bayesian filters are useful, but... by frovingslosh · 2003-08-10 20:51 · Score: 4, Funny

I like your way of thinking. It's much like my approach of defending myself with deadly force when I'm attacked with the deadly weapon of second hand smoke.

--
I'm an American. I love this country and the freedoms that we used to have.
Re:Bayesian filters are useful, but... by AndroidCat · 2003-08-10 22:53 · Score: 2, Interesting

Spammers love to use open proxies to hide, and are now engaged not only in scans to find then, but also in campaigns to create them. Trojans and worms like SoBig. While each offense is small, it's on a scale large enough to have them behind bars for quite a while.

--
One line blog. I hear that they're called Twitters now.
Re:Bayesian filters are useful, but... by PhxBlue · 2003-08-11 00:13 · Score: 2, Insightful

You know, computer crimes are considered terrorism under the USA PATRIOT Act. Until that silly law gets repealed, lets hunt down those terrorists for their, umm, denial of service ...

An immoral law is no less immoral just because you can find a practical use for it. If you don't like the PATRIOT Act, don't support it, period.

--
!#@%*)anks for hanging up the phone, dear.
Re:Bayesian filters are useful, but... by sahala · 2003-08-11 02:51 · Score: 1

I like your way of thinking. It's much like my approach of defending myself with deadly force when I'm attacked with the deadly weapon of second hand smoke.
Apologies for being offtopic...
Right, because second-hand smoke CAUSES cancer/death. All the hype that ended up bashing smokers was based on a seemingly flawed study. Well, not flawed, but researchers at the time were apparently quite eager to conclude that second-hand smoke == death.
http://bmj.com/cgi/content/abstract/326/7398/1057
Also, just as a tip, if you are going to use deadly force on a smoker, don't do it on their way out for a smoke break. At that point their nicotine craving is so strong that they'll do anything, including kicking your deadly-force-emitting ass to get their long-awaited drag on their Camel Light.
Ok back on topic. I use SpamBayes and it works pretty well.

You just don't get it by frovingslosh · 2003-08-10 20:12 · Score: 5, Insightful

None of these spam filters will have any effect on spam at all if they are just installed on the systems of people who hate spam and would never buy from a spammer anyway. Hell, they might even have the opposite effect; I will never buy something if I get spam for it. But if I personally filter my spam and don't even see subject lines, I might end up buying the product without knowing they also marketed it by spam.

Spam is effective because it reaches millions of people who are not installing these filters on their systems. Until ISP's start applying these filters to all spam by default, then the spam filters will have no effect at all, exactly the same number of marks will be reached and respond no matter if the people who know better than to respond to spam go ahead and filter their e-mail or not!

--
I'm an American. I love this country and the freedoms that we used to have.

Re:You just don't get it by joonasl · 2003-08-10 20:17 · Score: 1, Funny

Hell, they might even have the opposite effect; I will never buy something if I get spam for it. But if I personally filter my spam and don't even see subject lines, I might end up buying the product without knowing they also marketed it by spam.
Just stay of herbal Viagra and penis enlargement pills, man! :)

--
"There is a terrorist behind every bush"
Re:You just don't get it by Plug · 2003-08-10 20:20 · Score: 5, Insightful

Realistically, I don't give a damn how much spam _you_ get, I care that _I_ don't get any.

You cannot automatically filter spam. Bayesian filtering works because it works on your own personal items only, and you have a method of manually removing false positives. There is nothing worse than the possibility that an ISP will filter out a real email in their spam system. That simple fact makes server side spam filtering impossible for most situations. You can filter spam into /dev/null (unacceptable), you can filter into a spam box (How many POP users would that rule out, who only have one POP box?), or you can keep it bundled in email with a flag, and expect people to update their clients, in which case you have the exact scenario you have now - the client has to do something themselves.

Until Hotmail et al starts offering bayesian filtering with a separate 'spam' mailbox, consider server side filtering worthless.

I am smart and don't get any spam. A lot of people I see in my line of work, aren't. These people are going to get something like Outclass (an Outlook plugin for POPfile), and then they are going to see the problem go away, and they're not going to lose any email in the process.

I'd rather use SpamBayes, but the Outlook plugin has an annoying bug that renders autocompleting addresses in Outlook useless.
Re:You just don't get it by gfody · 2003-08-10 20:20 · Score: 1

bleh

--

bite my glorious golden ass.
Re:You just don't get it by hankwang · 2003-08-10 20:28 · Score: 2, Insightful

>None of these spam filters will have any effect on spam at all if they are just installed on the systems of people who hate spam and would never buy from a spammer anyway.
Still, there are plenty of people who hate spam but don't know how to handle it. At our department, quite a few people receive over 30 spams per day and hate it, but no one has installed a spam filter better than the subject/sender filter built-in in their (Windows) mail clients. One has stopped reading e-mail from his university account and asks us to send mail to a private address, because he isn't allowed to change his email address.
I mentioned Bayesian filters, but it turns out that not every computer user enjoys downloading and trying out five different programs to see whether they filter effectively and work together with their existing mail software. On top of that is the fear of false positives. (I am one of the few Linux users and on top of that I don't receive so much spam that I should worry, so I can't advise them.)

--
Avantslash: low-bandwidth mobile slashdot.
Re: You just don't get it by FoeNyx · 2003-08-10 20:42 · Score: 1
I might end up buying the product without knowing they also marketed it by spam.

Will you buy penis (or breast) enlargement pills ? (not approved by the FDA). About 'losing weight methods' it was already a plague before the spam era. The less people are warned about such 'crappy rubish stuff', the less they will buy them. Advertising installs the need.

Spam is effective because it reaches millions of people who are not installing these filters on their systems.

It was the same situation against virii in the first ages of Computer Science.

Now more and more 'lambda' users are aware that a computer needs some tools to counter the drawbacks of internet :
- Antivirus,
- Firewall,
- Antispam
It's just a question of time, because the mentalities are evolving slowly.
Re:You just don't get it by Anonynmous+Cow · 2003-08-10 20:46 · Score: 4, Interesting

Speaking of filtering for others... I don't - but I do run my own little mail server.

Even after implementing all the postfix uce rules and adding in the RBL's - and using spamassassin... I still saw some spam slipping in...

So I hacked together a tiny little perl script that monitors my mail log... after any IP address gets more than 3 "554" messages (generated by the RBL's) the source IP gets a lovely little teergrube.

I waste their resources and prevent them from trying to deliver any other shit that might get through spamassassin...

Script can be found at here but is only good for postfix/linux/iptables peoples.

--
e3 :: blogging the wireless freenet
Re:You just don't get it by cheekyboy · 2003-08-10 21:06 · Score: 1

Yes, but DO i really want to see adds for amsterdam porno web cams 120 times a month?

enough is enough

--
Liberty freedom are no1, not dicks in suits.
Re:You just don't get it by cheekyboy · 2003-08-10 21:08 · Score: 1

How many people you know that email you 12 gifs/jpegs in one message with LARGE red text. ????

We need some rules on if you have > x images and other details besides text grep matching.

--
Liberty freedom are no1, not dicks in suits.
Re:You just don't get it by Plug · 2003-08-10 21:12 · Score: 5, Funny

How many people you know that email you 12 gifs/jpegs in one message with LARGE red text. ????

Lots of them. They're called 'girls' and Slashdot should encourage communication with them wherever possible.
Re:You just don't get it by Anonymous Coward · 2003-08-10 21:38 · Score: 3, Funny

Well actually got the Viagra and the penis enlargement pills... they work perfectly.

The problem is there were no instructions on how to find a partner.
Re:You just don't get it by NathanBFH · 2003-08-10 22:08 · Score: 1

"Until Hotmail et al starts offering bayesian filtering with a separate 'spam' mailbox, consider server side filtering worthless."

Actually, Yahoo offers something very similiar to bayesian filtering (or it may even BE bayesian filtering). All messages marked spam get thrown into a junk mailbox. Yahoo even gives you the option to report any spam that still gets through. It works wonders. I get about 20 spam(s?) a day, Yahoo catches 18-19 of them.
Re:You just don't get it by drix · 2003-08-10 22:23 · Score: 1, Interesting

You just don't get the whole concept of Bayesian spam filtering. It works on a personal basis; don't forget that, statistically speaking, one man's spam is another man's legitimate personal e-mail. For example, if you send and receive a disproportionately large amount of messages containing cock jokes and talking about tits and sex (which, being a 20-year-old male, I can tell you is about 80% of my friends), under a "typical" or system-wide Bayesian filter that might be installed by some ISP, you're almost certainly going to lose a lot of messages that weren't spam. Which is the worst-case scenario for a spam filter. What's worse, the ISP would have to employ some sort of "spam czar" to monitor (people's private) incoming e-mail and make judgement calls as to what is and is not spam. That's a call I want to make, not one that I want made for me.

The best way to eliminate spam, to me, is a two-part system whereby the ISP (via procmail, etc.) eliminates all mail that is definitely spam, and then passes along anything questionable to the user. Bayesian filter should be implemented in the client, which, thankfully, is becoming more and more common. ISPs should think about bundling clients that already support Bayesian sampling, enabling it by default, explaining in very clear terms how to use it, etc., but that's about all they can do.

--

I think there is a world market for maybe five personal web logs.
Re:You just don't get it by Anonymous Coward · 2003-08-10 22:26 · Score: 0

Don't tempt me to post a goatse link...
Re:You just don't get it by MuParadigm · 2003-08-10 22:31 · Score: 3, Funny

I do not know how many times I have to tell people this.

They do not work. They just make your hand smaller.
Re:You just don't get it by AndroidCat · 2003-08-10 22:58 · Score: 1

Didn't you get the email for the hog-sweat pheromone to attract females?
Okay, no problem. We'll send you a few hundred copies of it to make sure you get it.

--
One line blog. I hear that they're called Twitters now.
Re:You just don't get it by CrazyWingman · 2003-08-10 23:35 · Score: 1

There is nothing worse than the possibility that an ISP will filter out a real email in their spam system.

Absolutely! I'm working at a company in a foreign city this summer. For the first week I was here, I was unable to check my home e-mail (due to the company's firewall). So, left with only my corporate account, I attempted to write my family/girlfriend from that. The entire week, I got nothing back from my girlfriend. Luckily, being the tech-conscious person I am, I naturaly assumed that it was the coporate mail server blocking everything from Hotmail. I was later proven correct, but for the week before I regained access to my home e-mail I assumed that I was in the dog house for something. ;P
Re:You just don't get it by schnozzy · 2003-08-11 00:07 · Score: 1

An interesting alternative that was mentioned on slashdot a while back was graylisting. Having implemented a modified version of it on several mail servers, my experience has been massively positive. In a company of about 70 people, graylisting has allowed through maybe 10 pieces of spam in nearly a month (where before 70% of all e-mail traffic was spam) and the only false positives are from large mailing list clusters like groups.yahoo and the like, which are easily whitelisted. While this sort of maintenance on the server side may seem cumbersome to some administrators, it's definitely worth it, costs the spammer storage/time, and wastes only the bandwidth/storage of the token on the receiving side.

Also, unlike many spam filters (Paul Graham's new active method), it doesn't need to be used by everybody to work it's magic.
Re:You just don't get it by topham · 2003-08-11 01:25 · Score: 1

It's never an assumption you in the dog house.
It's a given.
Re:You just don't get it by SCHecklerX · 2003-08-11 01:43 · Score: 1

You cannot automatically filter spam. Bayesian filtering works because it works on your own personal items only, and you have a method of manually removing false positives. There is nothing worse than the possibility that an ISP will filter out a real email in their spam system. That simple fact makes server side spam filtering impossible for most situations.
Not impossible at all. Just don't DELETE the stuff at the server, and flag it in the subject line as spam. Then the END USER can EASILY filter it as he/she pleases. Or you could make a web interface for them to filter it before they even have to pop for it.
Re:You just don't get it by letxa2000 · 2003-08-11 01:46 · Score: 1

Bayesian filter should be implemented in the client, which, thankfully, is becoming more and more common.
Nah, all spam filters (including Bayesian) should be implemented at the *server* before the user has to download spam... but even though the filters are on the server they should be configurable and applicable to each individual user. Then you get the benefit of user-specific spam filtering but it happens at the server where it belongs.
Re:You just don't get it by SheepHead · 2003-08-11 01:58 · Score: 1

Thanks for the pointer to Outclass, that program seems to be exactly what I was looking for. I need something that will help classify e-mail for my users, but none of them are skilled enough to be trusted to even install a program (unless it's Hotbar... sigh.)
So, I've searched and searched on my own but Outclass seems to be pretty much perfect... I'm testing it out now. I thank you, kind sir or madam.

--
7d9e63e9501751ff4bf9307989d5623d *SheepHead
Re:You just don't get it by sean.peters · 2003-08-11 02:50 · Score: 1

There is nothing worse than the possibility that an ISP will filter out a real email in their spam system.

Yes there is. What's worse is when your box is so overwhelmed by spam that you miss legitimate mails that HAVEN'T been filtered. Now you've missed the potentially important mail AND wasted a lot of time.

Until Hotmail et al starts offering bayesian filtering with a separate 'spam' mailbox, consider server side filtering worthless.

You do realize that Hotmail has been doing this for over a year, right?

Sean
Re:You just don't get it by psycht · 2003-08-11 03:27 · Score: 1

Either way.. You get results!

--
thelikesofwhich.com
Re:You just don't get it by frovingslosh · 2003-08-11 06:11 · Score: 1

You just don't get the whole concept of Bayesian spam filtering.
No, I completely understand that. While I said that Bayesian filtering is completely ineffective when installed on your system rather than on the ISP system, I didn't advocate installing it at the ISP, I don't think an ISP based filter will be very effective, and I think there are better ways to fix the problem by moving away from the current version of SMTP and by passing laws that let the spammers face very high fines (or worse).
You clearly miss my point that when you filter only your mail you still let the spam reach the people who will buy the product. But you also seem to think the filter has any effect on you at all. Yet most filter advocates claim they don't want an ISP filtering their mail because they fear false positives, and say they send all of their filtered mail to a special spam inbox so they can checj for any false positives. These people still end up examining every piece of spam they get in just as much detail as they would if they were killing it in their regular e-mail, the only real change to the system is that they put that work off until later, and in doing so they also may delay receiving anything that was a false positive until later also. There is n net gain at all with such filtering.

--
I'm an American. I love this country and the freedoms that we used to have.
Re: You just don't get it by jo42 · 2003-08-11 09:28 · Score: 1

You forgot:
4) Antispyware
- Uncle Bill
Re:You just don't get it by icebike · 2003-08-12 19:59 · Score: 1

There is nothing worse than the possibility that an ISP will filter out a real email in their spam system.
Yes there IS something worse than that, its getting all the spam!
Most people are willing to accept an occasional lost mail to be free of spam. We have no idea how many mails are lost in the system anyway (without filtering). Its far from perfect.
And the ligitimate mail lost or unseen in all the spam in the inbox is just as lost as the one filtered by your ISP.
(About here is where some cluck chirps up and says "what about when you are notified you won the lottery and the email is filtered?". I'll take my chances.

--
Sig Battery depleted. Reverting to safe mode.

Other filters by dtfinch · 2003-08-10 20:13 · Score: 4, Informative

I would have liked to see how my favorite bayesian spam filter, K9, would have faired in your comparison, but it failed to meet your first requirement of being cross platform. It's freeware written in C, is about a 60kb-100kb download, depending on if you get it with the self installer, is easy to use, and has a very small memory footprint. Before today it had sorted my email with over 99.8% accuracy, excluding the first couple days of training, and after only a couple weeks of use, though now it's down to 99.7%.

I have used PopFile in the past on both Windows and Linux, but found K9 to be better suited for environments where Windows is an option. It's very easy to use, having a windowed interface, and it seemed to learn much faster than PopFile did.

I haven't used SpamBayes. I'll have to give it a shot.

Spamprobe by 1029 · 2003-08-10 20:14 · Score: 5, Informative

The article didn't mention SpamProbe. It is what I use, and it has worked quite well for the past month or so that I've been using it. Perhaps this is just because the author didn't test this spam filter yet, but I like it quite a lot with my current mutt/procmail setup. Take this for what it's worth.

--
- I love animals. I try to eat at least one a day.

Re:Spamprobe by opk · 2003-08-10 20:29 · Score: 3, Interesting

I'll second this. Have been using spamprobe since December. It took longer than a month before it was fully trained. These days it's very good. And the best thing (except once when someone quoted the full body of a spam when complaining about spams on a mailing list): It has never given me a false positive.
Re:Spamprobe by HermanAB · 2003-08-11 02:42 · Score: 3, Insightful

Yes, SpamProbe is the best one I tested and I tested most of them. The reason being that it not only counts single words, but also word pairs. It is about 99.5% accurate for me and never gives false positives. My wife uses it in her law office, where I run it on the server - one database for everybody. It works like a charm and doesn't get tripped up with matrimonial fighting mail, which can resemble sleaze mail in many respects...

--
Oh well, what the hell...

Re:great by Anonymous Coward · 2003-08-10 20:15 · Score: 0

Somebody needs to move to the UK for a while...

Only useful to a point by KU_Fletch · 2003-08-10 20:16 · Score: 4, Interesting

I love spam protection programs. I've been using them for years, but have to switch every couple of months because of the friggen spammers. The people that make the spamming software don't just sit around cackling about how evil they are. They reverse engineer every anti-spam protection out there in an attempt to get around it. While this seems like a good idea (and I will be playing around with these two programs for a while), it's unfortunately only good up to the point when spammers figure a way around it.

I wish the government would somehow make the practice illegal, but I doubt they'll ever get anything to stick. The far better option at this point is to have a class action suit of server owners (who provide mail accounts) against developers of spamming software and spammers. I've gotten enough warnings from my university to know that bandwidth costs money. By sending millions of spams a year into any one e-mail server, that can account for a serious chunk of bandwidth used at significant cost to the provider. It won't stop spam all together, but it will bankrupt anybody that has been doing it.

--
It's not stupid. It's advanced.

Re:Only useful to a point by Cato · 2003-08-10 20:21 · Score: 1

I use SpamAssassin (which includes Bayesian filtering, though I don't use it) and it works fine - no need to switch away since it's very flexible and lets me write my own rules for new types of spam, or just tinker with the scores.

http://spamassassin.org/
Re:Only useful to a point by Hieronymus+Howard · 2003-08-10 22:32 · Score: 1

I love spam protection programs. I've been using them for years, but have to switch every couple of months because of the friggen spammers. The people that make the spamming software don't just sit around cackling about how evil they are. They reverse engineer every anti-spam protection out there in an attempt to get around it.

Do they really believe that people who install anti-spam software on their PC's are going to buy penis-enlargement pills from spammers that defeat the software? How fscking stupid are can they get? People who install anti-spam software are not potential customers.

HH
Re:Only useful to a point by spongman · 2003-08-10 22:38 · Score: 3, Informative

I've been using SpamBayes for about 9 months now and I've never had any problem with this 'new kind of spam' you mention. I just don't see it. I don't have to do anything, write any rules, configure anything, it just gets junked. I've never once had any false positives either I get about 30 spams/day, and out of the 8,200+ spams I have in my spambox, less than 100 of those spams are categorized as having less than 90% probability of being spam.
Re:Only useful to a point by Steve+B · 2003-08-10 23:10 · Score: 2, Interesting

They reverse engineer every anti-spam protection out there in an attempt to get around it.
This is why a real anti-spam legal reform would clearly equate circumvention of an anti-spam filter with circumvention of a password prompt. Both are attempts to crack into someone else's computer without permission -- indeed, against an express prohibition -- and the former ought to carry the same penalties as the latter.

--
/. If the government wants us to respect the law, it should set a better example.
Re:Only useful to a point by pen · 2003-08-11 04:55 · Score: 1

Do [spammers] really believe that people who install anti-spam software on their PC's are going to buy penis-enlargement pills from spammers that defeat the software?
No, they believe that customers of ISPs that install anti-spam software on their networks might buy penis-enlargement pills from spammers.

Filtering by rf0 · 2003-08-10 20:16 · Score: 3, Interesting

Taking I get 100+ spams a day I've found that its a goo idea to at least use tagging. For example posting on usernet I use usenet@domain.com with something in my sig saying actualy email is me at domain dot com. Anything sent to usenet is automatically deleted. Doesn't stop the flow by any means but at least I can track where the spam came from.

If you are feeling clever you can even use addresses that expire after a week. So something like epochseconds@domain.com

Just my 0.02p

Rus

--
Cheap UK and US VPS

Re:Filtering by gfody · 2003-08-10 21:21 · Score: 2, Informative

you might find this sight particularly useful. it will let you set up a temporary address based on a naming convention that forwards to your real address but expires after a few emails. you can setup something like rusxxxxx@asdf.com where xxxx is whatever you want and it will fwd to your real address so if the badguys get your email its no big deal the temp addy will just stop working.

--

bite my glorious golden ass.
Re:Filtering by pqdave · 2003-08-11 01:36 · Score: 1

I use tagging, but there are still a couple of problems with it. One is that if you don't preemptively expire tags, you wind up with more spam, as spammers will send to somebody+tagFOO@example.org, somebody+tagBAR@example.org, etc. It also adds to the undeliverables for your ISP--My ISP gets lots of mail to tagFOO@example.org.

The more serious problem is that I want to get responses to Usenet, and not just from technially adept people. I was recently shovelling out an old account that I've essentially abandoned, and found a response to a years-old usenet post. Based on that I had a good conversation with the responder, and wound up selling him stuff. Wouldn't have happened if it had been to an expired tagged address.

Missing the point? by aquishix · 2003-08-10 20:17 · Score: 5, Insightful

As someone who recently acquired a B.S. in mathematics several days ago, I understand how these filters work. They are an excellent way to fight spam over the older methods.

However, I think that ultimately this sort of thing misses the point. Spam needs to be fought in the courts, not in the battlefield. I'm afraid that the success of these filters will cause spam NOT to become illegal, and thus lead to a world where we have a constant trickle of spam, albeit in small amounts.

I think we all agree that we want spam to be gone entirely, as is evidence by the first post being labeled as "troll" ;)

--
- I am a viral sig. Please copy me and help me spread. [strain #2] Thank you

Re:Missing the point? by Ingolfke · 2003-08-10 22:00 · Score: 2, Interesting

Bulk emailing, like any business is a numbers game. By significantly decreasing the # of successful responses to a set of SPAM (through filters) the business costs remain the same w/ the returns dropping. Eventually the business is no longer feasible.

[INCREASE TONE]
SPAM absolutely does not need to be fought in the courts when the markets can work this out on their own (as we see w/ these filters). In the end we'll have better technology for sorting and filtering emails which can be applied to other applications and the spammers will be gone or significantly reduced.

[BREATHE... BREATHE...]
Legislation would only be valid in the country in which the legislation was enacted so spammers could simply move their operations to a SPAM friendly country.

[GRADUALLY INCREASE TONE]
Also, what constitues spam? What if I only send 10,000 emails out? What if I change the email each time I send it so it's unique to you? What if I'm not selling anything? What if someone comrpomised my system and sent all the emails from my PC? Why shouldn' ISPs be liable too... yeah, why are they letting people send those SPAMs... let's sue them too... somebody get a rope!!

[BEGIN ALL OUT RANT!]
So the moral of the story is... everyone remain calm... keep working on your filters and other new technologies... and soon we'll have fewer spammers and better tech and some intelligent hacker out there will have a whole heap load of cash for coming up w/ the solution.

Of course w/ all of the existing hideous legislation we have today... SCO may announce that they are diversifying into bulk emailing and that they have a patent on any spam filtering algorythms and therefore if you ever remove any of their emails you must send them a $699 licensing fee for the use of their IP.
Re:Missing the point? by schon · 2003-08-11 02:51 · Score: 2, Insightful

SPAM absolutely does not need to be fought in the courts when the markets can work this out on their own (as we see w/ these filters)

Yes, absolutely does - just like any other sociopathic behaviour. We need clearly defined rules of what is and is not acceptable. Perhaps you haven't noticed, but "the market" is not working anything out - spam is getting worse, not better, and things such as filters make it worse, by hiding the problem (hint: even though your filters hide your spam from you, you're still paying for it.)

In the end we'll have better technology for sorting and filtering emails

This is the fundamental flaw in your reasoning - you can't solve a social problem with technology.

Legislation would only be valid in the country in which the legislation was enacted so spammers could simply move their operations to a SPAM friendly country.

This argument is fundamentally flawed. "Moving operations" won't do anything - they could still be prosecuted if they stay in the country... and so the question becomes: how many spammers would physically move to another country - permanently - just so they could spam? No, it's more likely they'd just go back to whatever scam they had before they began spamming.

Also, what constitues spam?

The definition of spam is "Unsolicited bulk email". That's pretty simple.

What if I only send 10,000 emails out?

Then it's bulk. If it's unsolicited, then it's spam.

What if I change the email each time I send it so it's unique to you?

Is it unsolicited bulk email? If so, then it's spam.

What if I'm not selling anything?

So? IF IT'S BULK, UNSOLICITED EMAIL THEN IT'S SPAM

What if someone comrpomised my system and sent all the emails from my PC?

Then you're not the one spamming, are you?

Why shouldn' ISPs be liable too...

If the ISPs are condoning the spam, then they probably should be liable. If that's the case, then there will be a paper trail.

why are they letting people send those SPAMs... let's sue them too... somebody get a rope!!

If you feel you can't win an argument except by inciting a (hysterical) straw man, then you've already lost.

Spam is a social problem - it doesn't matter what technologies you come up with, spammers will find a way around them. We need to start social remedies to the spam problem.
Re:Missing the point? by Steve+B · 2003-08-11 03:28 · Score: 1

What if I change the email each time I send it so it's unique to you?
If you are referring to the filter-evasion technique of inserting some trivial difference into each message in a spam run, then you should go to jail under the existing laws against cracking a security system in an attempt to gain unauthorized access to the target computer.

--
/. If the government wants us to respect the law, it should set a better example.
Re:Missing the point? by sketerpot · 2003-08-11 05:19 · Score: 1

I agree. Do you remember the article on slashdot a while back about an incremental bayesian spam classifier that would throttle SMTP connections that were transmitting spam? That would, if widely deployed, make things rather painful for spammers. You would use the same amount of bandwidth if the spam software didn't just give up on you, but it would make it impractical for spammers to send spam in bulk. AND THAT WOULD BE THAT! TAKE THAT, YOU SPAMMING SCUMBAGS! HAHA! DIE DIE DIE DIE DIE!!!!!
Ho ho, sorry about that. I'm talking about TarProxy, which I think should work as an SMTP proxy sitting in front of an existing mail server. Share and enjoy!
Re:Missing the point? by hankaholic · 2003-08-11 05:50 · Score: 1

Good points.

Spam cannot be controlled purely through software, because artificially restricting what one can do with a system limits its utility, and will cause people to cry foul when they can no longer do things quite so easily.

Legislation cannot solve everything, since jurisdictions are limited.

The best solution may very well be a mix of the two -- legislation which isn't excessively broad, but does make illegal (for example) forged headers, has the potential to do much to slow spam originating from the U.S.

Software must not impose excessive restrictions upon the user, and should be written in such a way that it is likely to only penalize spammers while not imposing more than minimal burden upon innocent bystanders (Paul Graham's suggestions and techniques fit nicely here). The idea is to impose a great burden upon the spammer, to impose minimal burden upon the user, and to differentiate between the two (spammers vs. users) as well as possible.

Interestingly enough, many people speak of Outlook [Express] plugins for handling Bayesian filtering. I'm really not sure why MS hasn't stepped up to the plate and incorporated some such technology into their desktop products.

Not to spend too much time trying to guess the intentions of MS here, but given that Hotmail does implement filtering, as I see it there are a few possibilies --

Conspiracy Theory -- Microsoft is trying to give Hotmail advantages over "outside" email services by giving the user additional value, possibly as a push towards the general use of "Passport-enabled" services.

Product Testing -- Microsoft is using Hotmail as a testbed for filtering techniques before integrating anything into a product. As Paul Graham has written, web applications provide a great way to make changes to software and observe the results immediately.

"Choose Your Battles Wisely" Theory -- Microsoft has no desire to compete with companies which produce filtering software for Windows, as the market is not yet large enough to worry about.

Guesses aside, the overall point of this post is that neither legislation nor technology is sufficient to eliminate spamming, although I do believe that if filters were installed by default and mail clients made it obvious to the user that product offers and sales pitches received were likely to be illegitimate then the average rube who would otherwise consider the purchase of a "spamvertised" product would be slightly better informed, and thus less likely to pay for the spammers' wares.

To summarize the previous summary of the intent of this post, I'm a longwinded bastard, and I agree that legislation would be helpful.

Basically, I'm saying, "me too".

In short, AOL. :-)

--
Somebody get that guy an ambulance!
Re:Missing the point? by Ingolfke · 2003-08-11 06:42 · Score: 1
As my straw man has been burned to the ground... I'll step back and take another shot at this.

Let me acknowledge a few of points that I consider to be excellent.
- The definition of SPAM as unsolicited bulk mail... um yeah I should have known that... and that answered most of my questions (rants).
- hankaholic's response to this post about moderation in legislation was on target. I agree that legislation banning forging one's identity would be of value and doesn't at first glance appear to be overly broad. Also, legislation against fraud should be applied or expanded to take care of those who use email to engage in fraudalent activities.
Now to clarify and expand of my original points, with what is hopefully a more controlled tone.

SPAMing is a business based on the idea that by sending out 10s of thousands of emails you'll end up w/ a few people who actually buy the product. The ratio of takers to receivers is something like .1% or less. By adding filters you don't immediatly resolve the underlying problem (receiving the spam), but the costs for the spammer are driven up. As the ratio of suckers to recievers drops spammers make less money and eventually the whole enterprise becomes less profitable than some other alternative means of "marketing".

If legislation were to be used it would need to sufficiently clear and limited as to avoid penalizing individuals who SPAM in ignorance, not for harmful or business purposes. For example someone who forwards a lame joke onto their acquaintances should not be subject to legal action (annoying as it may be). Also, unsolicited should be qualified to ensure that a business that received permission to send email to an email address is not penalized if that email address changes hands and the new owner never authorized the receipt of email (how is the business to know the email address changed hands?).

Legislation that mandated opt. out options for SPAM and that mandated that the user's privacy was protected during the opt out policy might be acceptable, with very harsh penalties for violation of one's privacy rights.

So the end solution may in fact look like a combination of technology and legislation that delicately extends existing law into the realm of the Internet.

Thanks for burning the straw man, and by the way... watch your tone ;)

Fighting spam requires drastic measures by Anonymous Coward · 2003-08-10 20:18 · Score: 1, Insightful

Fighting spam as an individual will never work no matter how great filter algorithms you develop. Hell, even the blacklists won't work until the ISPs are forced, by guerilla action if necessary, to crack down on spammers and hard.

What about features other than text? by wheany · 2003-08-10 20:19 · Score: 1

I have been using POPFile since January, and I know it uses pseudowords for all kinds of features spammers use, like comments, remote images etc. (html:comment, html:imgremotesrc).

Does SpamBayes do anything similar?

Re:What about features other than text? by Gaza · 2003-08-10 20:28 · Score: 3, Interesting

Yes it does, the developers have created a test suite and a very extensive tokenizer. Any additional pseudowords, or new ideas to tokenize a message are tested very throughly before they are added (as most tend to actually lower accuracy instead of raise it). There have even been tests using SpamBayes on just headers and just message bodies and both have worked very well.

Re:great by Goldberg's+Pants · 2003-08-10 20:19 · Score: 3, Insightful

But that's still 3 pieces of shit you have to deal with. Sure, it's a simple click to delete, but the fact is WE SHOULD NOT FUCKING HAVE TOO.

Some wanker spammer got my email address and within two days my spam volume went from zero (seriously) to 30+ a day. All for the same fucking thing. These shits should be legal to hunt and kill.

In respose to the original troll, it's a bogus analogy. We PAY for our internet access. We get bombarded with ads on damn near every site... The revenue generated from these scumbags does NOT go towards funding your internet access, or the production of new content. It goes to their wallets. Ergo, you're an idiot.

Side note: "Last, best hope"... I can't be alone in expecting "for peace" to come after that.

popfile accuracy by Comsn · 2003-08-10 20:20 · Score: 1

i have been using popfile for a while, but its accuracy still isint good enough to skip over just yet...

maybe i should restart it, mightve let one or two spam emails go into home, thats the problem with bayes filtering is if you make a mistake you have to restart ;\

Classification Accuracy
Emails classified: 206
Classification errors: 45
Accuracy: 78.15%

Re:popfile accuracy by dtfinch · 2003-08-10 20:37 · Score: 1

Some of them can keep a record of all your emails (if you don't mind it doing so) to give you the option of reclassifying old email or rebuilding the database if it gets corrupted. I know K9 does this, and I used that feature when I realized that a small few spamish looking good emails made it into the spam folder around when I started, causing any similar good emails to also be classified as spam. It's not cross platform though. 78% doesn't sound right at all. Over 98% is very common after several hundred emails.
Re:popfile accuracy by DukeyToo · 2003-08-11 02:08 · Score: 1

206 emails is not enough for it to be very accurate yet. Also, it is not necessary to restart with Popfile if you have made mistakes...just continue to classify spam as spam and it will all balance out.

I use POPFile for my hotmail address at home and have found it most useful and accurate when creating many categories. Besides "work", "bills", "newsletters", and some others I have 2 "spam" buckets. The first is for general spam, and the other is for offensive spam. That works very well for me, because I never have to look at the offensive spam due to the high accuracy of that bucket. I occasionally manually scan the general "spam" bucket just in case.

--
Most writers regard truth as their most valuable possession, and therefore are most economical in its use - Mark Twain
Re:popfile accuracy by HermanAB · 2003-08-11 02:48 · Score: 1

That is terrible accuracy. I can get >90% with a handful of generic procmail recipes. What you need to do, is recreate your good and bad corpus and retrain the thing. You probably have good messages in your bad list and bad messages in your good list.

--
Oh well, what the hell...
Re:popfile accuracy by guardian-ct · 2003-08-11 04:54 · Score: 1

I've got around 20 buckets in popfile, 10 email address, and classification accuracy is around 98%. The misclassification of good mail as spam has only happened for those messages that are:

1) new sender, sending something that looks like it might be spam.

2) old sender who hasn't sent me much mail, forwarding something that looks like it's definitely spam.

It's more likely to mistake spam for good mail than otherwise.

Suggested training for Popfile: Reclassify EVERY good message you get as "ham" for the first few weeks. Reclassify as "spam" only messages erroneously marked "ham" or "unclassified".

Don't bother with training a large corpus of spam as "spam", since this is more likely to cause ham to be marked "spam".

Filters do not stop spam... by Tehrasha · 2003-08-10 20:23 · Score: 5, Insightful

...they only prevent you from seeing it.

Your server and its harddrives still end up being a storage bin for it, and the spammers will continue to send as long as your machine allows it to be recieved. Always remember that spam differs from postal junk mail, in that the -receiver- pays for it. Unsolicited postage due mail.

Spam must be -blocked- and the ISPs that allow/encourage its continued spread must re-educated, or be put out of business. Only when spam becomes costly to send with it diminish.

The current proposed laws concerning the subject are currently focusing on content rather than consent. They dont mind if you get spammed with hundreds of ads, provided what is being advertised isnt fraudulent. They overlook the fact that the claim of you having 'opt in' for the spam is in itself the lie and fraud.

--Teh

Re:Filters do not stop spam... by Anonymous Coward · 2003-08-10 20:26 · Score: 0

in that the -receiver- pays for it.
Get with the program, man.
That might have been true when you still read your e-mail using a modem and paid for every minute you spent downloading. Nowadays with DSL/Cable you don't have to pay for the traffic - if you do, you've been royally screwed by your ISP.
Re:Filters do not stop spam... by Tehrasha · 2003-08-10 20:34 · Score: 2, Insightful

If you think that your ISP does not incur cost by having to deal with the traffic load and disk storage caused by spam, you are the one in need of a reality check. And if you think that your DSL/Cable traffic is free, then gimmie some of the stuff you're smoking, it must be good.
Re:Filters do not stop spam... by Anonymous Coward · 2003-08-10 20:40 · Score: 0

Ok.
I run KaZaA 24/7/365. The amount of data I move must be in a terabyte range. For the last three years, the cost of my feed has actually dropped. How's that possible?
Re:Filters do not stop spam... by wheany · 2003-08-10 20:40 · Score: 1

Both filtering and blocking help. Blocking helps by not even letting the mail enter your server, and filtering helps by not showing you the mail even if you receive it. If you can't see it, you can't buy anything from the spammer (well, at least directly because of the spam). And I'd say the biggest cost of spam is wasted hours spent looking at the spam when you're checking your mail.
Re:Filters do not stop spam... by mce · 2003-08-10 21:09 · Score: 1

For one, I'm still using dail-up like quite a few others.

DSL/Cable bandwidth costs money as well, even if your ISP doesn't bill by the second. Your subscription could cost a lot less of only they needed to maintain less infrastructure to provide the same service level.

--
Linux user since early January 1992.
Re:Filters do not stop spam... by Tehrasha · 2003-08-10 21:16 · Score: 1

Any number of things may have caused a drop in cost. Decreased customer service, competition from other providers, pink contracts with spammers, initial startup costs for the ISP at the beginning, who knows, maybe you were just getting raped by your ISP the first couple years.
Your little file-sharing operation doesnt do squat to their harddrives or servers (other than create nice logfiles). It adds to their total bandwidth costs of course, but thats minor compared to the cost of hardware, upkeep and storage to deal with spam email. And if you think, 'heck, i can get a new 120GB drive for $100 if i need more storage', heres a little clue for you. People who maintain servers do not buy their harddrives from BestBuy. (not any serious ISP anyway)
Re:Filters do not stop spam... by srn_test · 2003-08-10 22:52 · Score: 0

Bah *waves paw*.

The worst case of spam I've seen was a user getting 6MB/day of spam.

I get probably 40 pieces per day, at maybe 10k each. I think I can probably afford the 400k/day disk space, given that the mail server has a few GB of disk space.

I guess it's a problem for people who don't run their own mail server, but that serves them right.
Re:Filters do not stop spam... by afidel · 2003-08-10 23:19 · Score: 1

The cost of network bandwidth, storage space, and CPU cycles that fighting spam costs is NOTHING compared to the opportunity cost on the time lost to manually filtering it. Laws won't work because the spamers will just move the servers and the shell companies offshore, in most cases their products are already illegal (like Viagra without a prescription) so they can already be prosecuted under existing laws, so why write new ones just because it starts on the internet???

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
Re:Filters do not stop spam... by digitalsushi · 2003-08-10 23:51 · Score: 1

I guess it's a problem for people who don't run their own mail server, but that serves them right.

i'd hand the Internet down to my little brother if everyone had their own mail server.

be like my dad and his voice mail- "i have voice mail?" "yeah, for two years you've had it. you should check your messages." "...wow, i had 430 new messages! they started getting cranky so i just deleted all of them at once!"

narh having your own mail server is actually really sweet. it becomes very annoying to do so when your ISP blocks the SMTP port, too. i dont care if they filter my outgoing SMTP, but incoming, they should never really block that. its not fair :D

--
slashdot: where everyone yells sarcastic metaphors to themselves to understand the issue
Re:Filters do not stop spam... by Anonymous Coward · 2003-08-11 02:57 · Score: 0

Laws won't work because the spamers will just move the servers and the shell companies offshore

yes, because all of other criminals who move their operations offshore are immune from prosecution, right?

That's why the head of DeBeer's likes to vacaion in Miami, right?
Re:Filters do not stop spam... by Moryath · 2003-08-11 05:16 · Score: 1

You moron.

Consider all the webmail or non-"I run my own" people out there.

Our university allots 10MB. Hotmail allots 2MB.

I went on vacation last February, and hit my quota in three days, not because I did anything, but because the spammers were mass-blasting the accounts.

We pay for it in the fees we pay for access, even if it's just our $10.95/month to the local dialup.
We pay for it in LOST MESSAGES when the fucking spammers fill our disk quotas.
We pay for it in the fees our ISP incurs on ever-larger disks and on customer support calls to try to deal with it.
We pay for it in the lost integrity of our communications system when admins like me decide "fuck it, it's not worth it" and block everything from Asia, period.
We pay for it in the time we spend deleting it, or installing filters, teaching the filters to recognize it, deleting it again when they forge addresses.
Random people pay for it when someone spoofs the return address and they get a bunch of "fuck you spammer" emails from people.

The spammers should pay for it with something very, very painful. I'd say their lives, but I'd rather they suffer first.

I changed my mind. Simpler is better. by Peter+Cooper · 2003-08-10 20:24 · Score: 5, Interesting

I have long been an advocate of Bayesian or keyword based spam filters, but have recently been forced to change my outlook, and to argue that MULTIPLE SIMULTANEOUS solutions are the answer.

I encountered a very simple but unique spam system which works entirely on the sender's address. Simply, you create a small database with the domains/addresses you want to whitelist. Then, a program screens your mail, and if the sender is not in your whitelist, it sends an e-mail BACK to the sender with a simple URL (or even an actual link for HTML e-mail clients) which states that they REALLY want to send the e-mail to its destination. When this is done, they are added to the whitelist. Therefore, mails from forged remote addresses are no longer a problem, and neither are mails from trusted sources. And, better than SPEWS or similar blacklists, the sender gets a SECOND CHANCE to send their mail to you.

There's a commercial solution using this system right now, although the URL escapes me.

Of course, one could encounter problems when ordering online, say. Droids at Amazon will not be clicking your links to make sure your order receipt got through. One could argue that you'd put things like Amazon.com in the whitelist, but what if someone used amazon.com as a spoofed e-mail domain/address? Ay, there's the rub. But if this system were tied in with a Bayesian system, it'd be pretty unbeatable. What's more the Bayesian system would have extra data for negative matches, in the form of e-mails that were never 'approved', and positive data in the form of those that were.

So, I'd be more interested in producing a homebrew system that used MULTIPLE weaker systems, than one supposed 'sure fire' method.. as I feel no one method is perfect, whereas multiple systems can approach this nirvana.

Re:I changed my mind. Simpler is better. by ctr2sprt · 2003-08-10 20:38 · Score: 4, Interesting

Any approach that triggers an automatic action on your behalf is bad, because it can be turned against you. It's not likely that email would make a terribly good DDoS service, but a system like the one you describe would certainly be vulnerable to it. And I think it would only last a week, at most, before spammers figured out a way around it. They can already handle "NOSPAM" being inserted in email addresses, and recently added the ability to reverse and combine email addresses until they get something plausible.
I do agree with you that we need multiple layers of safeguards in order to solve spam - or at least to hide it away so nobody has to look at it - but I don't think your specific example is very good.
Re:I changed my mind. Simpler is better. by kevin+lyda · 2003-08-10 21:11 · Score: 1

on the linux side you can just use tmda.net for challenge response.

--
US Citizen living abroad? Register to vote!
Re:I changed my mind. Simpler is better. by The+Grassy+Knoll · 2003-08-10 22:38 · Score: 2, Informative

> There's a commercial solution using this system right now, although the URL escapes me

Spam Arrest?

--
They will never know the simple pleasure of a monkey knife fight
Re:I changed my mind. Simpler is better. by scj · 2003-08-10 22:38 · Score: 5, Interesting
I had thought of something similar for fighting spam. Here's how I'd handle each email:
1. If the email is from someone in my whitelist, allow the mail to go through and feed it as 'ham' to the Bayesian filter.
2. If the email is not in my whitelist, run it through spam filtering software (Spamassassin works well) to determine if it is likely to be spam.
3. If it seems like spam, then use a challenge-response system (like TMDA) to find out if a human sent the email.
4. If the mail doesn't seem like spam, just deliver it. If I get 3 non-spammy messages from the same person (separated by a day or more) then add them to my whitelist automatically.
5. If someone responds to the TMDA challenge, put them in the whitelist and deliver the original email.
6. If no one responds to the TMDA challenge after a week, feed the mail as 'spam' the the Bayesian filter.
In addition, I'd use a system like Sneakemail to generate random email addresses to give out to businesses I want to do business with and use to sign up to mailing lists. These email addresses would be added to my whitelist so they could send me mail without going through the challenge-response system. If they start spamming me, I put the random email I gave them on my blacklist.

This system has the following benefits:
- Business mail I want (like receipts and newsletters from companies I do business with) get through always since the Sneakemail-type address is whitelisted. This solves the problem of businesses not responding to TMDA challenges.
- My real email address is protected from businesses who are likely to sell it and from people farming addresses from mailing lists.
- Personal email that the spam filter sees as non-spam gets delivered without bothering the sender with a challenge-response system.
- Personal email that does seem spammy by the filter still has a second chance to make it through the system with the challenge-response system. This should reduce false-positives to include only spammy emails from people who don't respond the the challenge.
- The Bayesian filter is automatically trained based on mails from people in my whitelist and mails from people who never respond to the challenge-response.
You would still get spam with this system (personal email that your filter thinks is non-spam), but hopefully your false-positive rate would be zero. Also, you don't annoy other people much by only sending challenge-response messages to spam-like emails. Finally, this would be easy for end users to use. They don't have to train the spam filter, since it should train itself. The only complicated part would be generating and using the random emails that you give to businesses and mailing lists.
Re:I changed my mind. Simpler is better. by PhilHibbs · 2003-08-11 00:33 · Score: 2, Insightful

...it sends an e-mail BACK to the sender with a simple URL...
And, not being on their whitelist, their email filter sends you an email back with a simple URL...
Re:I changed my mind. Simpler is better. by mrroach · 2003-08-11 07:56 · Score: 1

4. If the mail doesn't seem like spam, just deliver it. If I get 3 non-spammy messages from the same person (separated by a day or more) then add them to my whitelist automatically.

So I (evil spammer) just completely trash your filtering by sending three non-spammy messages, after which point I am added to your whitelist, then bombard you with spam which will automatically be fed in as ham.

All of these challenge-response systems have the additional flaw of "Which address do I send the challenge to?" The reply-to address? The Mail-Followup-To address, the From address, Enevlope-from, etc. Should you trust email claiming to be from yourself?

Really, the problem that is being addressed by C/R systems is better solved by configuring a network of trusted mailers (using digital certificates). Even that seems less than ideal to me though.

-Mark

Re:great by 1029 · 2003-08-10 20:25 · Score: 1, Troll

Which is an intersting statement.... Just think about this for a second (I do not claim this to be an end all statement about spam): You pay for cable TV, and yet have to sit through over 12 minutes of advertisements when watching a 1 hour program. Now why is that advertising, yet getting spam email is not? You pay for both media per month, yet one is generally allowed "spam" but the other is not. Please, go right on ahead and point out why spam is not the same as a commercial. I simply wanted to bring this topic up for discussion.

--
- I love animals. I try to eat at least one a day.

Re:great by Tirel · 2003-08-10 20:27 · Score: 2, Interesting

ideally, i think the client should take care of the filtering. Pour your resources into improving context based filtering and let the individual clients do the dumping. Widespread usage of this kind of filtering could make spam even further unprofitable. Since spam is entirely business related, it would likely reduce the numbers of it passing through the network.

From a sysadmin's POV, this doesn't halt the issue of spam eating bandwidth or disk space. I'll address that next.

Disk space depends on what kind of e-mail your organization uses. For POP3, most people delete e-mail on the server after its downloaded, so while the disk space may be consumed with spam, it would be temporary. That is unless you have alot of dead or rarely used accounts. In that case, you should have policies in place for when to wipe user's accounts out after a set period of time. Or set up some kind of forwarding policy. If you're using something like IMAP, then using a server-wide content filtering system as mentioned above would be effective.

For bandwidth, the only way to halt spam from consuming your bandwidth is by blocking packets at the router. If you use SPEWS to dump the e-mail by your e-mail server, its still consumed your bandwidth. So you'd have to block the packets directly. I think this is draconian and should be avoided, for the net's sake. Unfortunately there really is no good solution to this, for as long as spam flows, it flows and consumes bandwidth. The only way to halt it is to halt the initial spamming to begin with. As mentioned above, when your spammer's audience never exists as a result of good content filtering, the spam will be unprofitable and lessen somewhat.

Attacking users and their ISP's won't do much good, aside from causing spammers to jump from isp to isp, something they're readily willing to do. Attacking regular users just makes you a big jerk.

Re:great by mirko · 2003-08-10 20:30 · Score: 3, Insightful

I have more than enough things to worry, including my shopping list, my housekeeping tasks, my garden... to just lose time and nerves other that few junk : when I get an unexpected commercial in my snail-mailbox, this *is* annoying as, here, in Switzerland, we pay for each garbage bag we throw away.
So, spam is junk, indeed, but i dispose of it almost instantaneously.

I won't make spamfighting my Holy War...
I have more interesting and valuable things to deal with IRL and I am naturally optimistic.

Let the spammers waste their time sending their hectobytes of off-topic (mostly american-centric) mail to my ever-improving filter.

--
Trolling using another account since 2005.

Re:great by devnulljapan · 2003-08-10 20:31 · Score: 5, Insightful

Just remember though, we would never have television without commercials. Sometimes advertising is necessary.

NEVER?....Try the BBC?
No ads, quality programming, small fee.

Spam is not the same as commercial by Eric+Ass+Raymond · 2003-08-10 20:35 · Score: 4, Insightful

Please, go right on ahead and point out why spam is not the same as a commercial.

I'd be happy to.

I don't know about you but for me e-mail is an important part of my work - not something comparable to watching cable TV.

Spam clogs my mailbox and I have lost several important e-mails from clients when deleting the spam which, by the way, is often disguised as legitimate non-commercial mail and comes with forged headers. In addition to pushing fraudulent products, these facts make spam a completely different beast from the cable TV and its legitimate, controlled ads which eat up only my free time - not my emails or work efficiency.

--
BOO! TERRO

Re:Spam is not the same as commercial by Eric+Ass+Raymond · 2003-08-10 20:41 · Score: 1

I just want to add that if you're a spammer reading this: I'LL KICK YOUR ASS IF I EVER MEET YOU!

--
BOO! TERRO
Re:Spam is not the same as commercial by Anonymous Coward · 2003-08-10 20:51 · Score: 1, Funny

eat up only my free time - not my emails or work efficiency.
Uhhuh? And why are you reading /.? To improve your work efficiency??
Re:Spam is not the same as commercial by momus_radar · 2003-08-11 02:49 · Score: 1

I think it's more like a game of Three Card Montey. Everyone with any amount of intelligence knows it's fixed to make you loose money. But there are those who play it because they think they can actually get something out of it; these are the folk who keep Three Card and SPAM alive and well. Getting these people to stop playing Three Card or answereing SPAM will help make it go away. The problem is that there are just too many out there to stop.

"Bayesian" by RDPIII · 2003-08-10 20:40 · Score: 4, Insightful

I don't mean to troll, but I hope it's not too late to put an end to the unfortunate term "Bayesian spam filtering". This is perhaps the worst abuse of the adjective "Bayesian" I've seen, because nothing crucially depends on the application of Bayes' Theorem and/or on the use of Bayesian methods (informative priors, model selection, etc.). Why not simply call it "data driven spam classification" (as opposed to "rule based") or "empirical spam filtering"?

If the spam disaster had struck fifteen years ago, we'd all be talking about "neural spam filtering" (using artificial neural networks, ANNs) and basking in the warm fuzzy feeling imparted by the term "neural". But ANNs and Bayesian classifiers have the same interface: both are trained on labeled data and can be used to classify unlabeled data. The implementation details are not of primary importance, and if you think they are, I'd encourage you to look into large margin classifiers instead of Naive Bayes or ANNs.

--
Marklar: marklar

Re:"Bayesian" by file-exists-p · 2003-08-10 21:58 · Score: 5, Informative

As far as I know, many of those filters are based on a decision rule of the form

P(mail is spam | words X, Y, Z, ... are in it) > 1-epsilon

The computation is then done using Bayse's rule (P(A|B)=P(B|A)*P(A)/P(B)) under certain independance assumption which makes it tractable.

So this is actually bayesian filtering ...

My favorite filter is spamoracle
Re:"Bayesian" by pfafrich · 2003-08-10 23:22 · Score: 1

Huh, p(its a spam) = p(its a spam|contains word viagra) * p(contains word viagra) + ... a bit of rewriting gives Bayes throrem. The priors are p(its a spam|contains word viagra).
Haveing been in statistics departments which host confrences on basian image analysis which uses entirely the same techniques as these.

--
There are four sorts of people in the world: fools, lunatics, idiots and morons. - Umberto Eco, Foucaut's pendulum.
Re:"Bayesian" by po8 · 2003-08-11 04:11 · Score: 1

Many learning filters are Bayesian in character, but by no means all. Indeed, it is arguable that other approaches have advantages. For a good discussion and comparison of approaches, see the paper at my spam-filtering site
.

A new poll is required by mirko · 2003-08-10 20:42 · Score: 4, Interesting

How should spammers be dealt with ?

Ban their original networks
Throw them in jail
Kill them
Fine them 0.01$/email and improve third world infrastructures with the money.
Filter/Ignore them.

I'd personally go for the last option... Maybe the next-to-last if their suit takes place in a really democratic place (there are 278 millions American citizens and 2,2 of them are in jail, this is a *lot*).

--
Trolling using another account since 2005.

Re:A new poll is required by frovingslosh · 2003-08-10 20:55 · Score: 1

You make a good argument for killing them rather than throwing them in already overcrowded jails. I'm sure if we killed just a few thousand that most of the rest would get the message and the spam problem would be reduced greatly (something that doesn't happen at all with number 5).

--
I'm an American. I love this country and the freedoms that we used to have.
Re:A new poll is required by Anonymous Coward · 2003-08-10 21:04 · Score: 0

Oh yeah -- thanks to prison, crime is virtually non-existent in the U.S. Not.
Re:A new poll is required by mirko · 2003-08-10 21:04 · Score: 1

I suggested this option because the original post mentioned their hunt.
I'd be surprised to see how many /.ers are sufficiently intellectually-impaired to just argue they should simply be killed (possibly a cruel way).

--
Trolling using another account since 2005.
Re:A new poll is required by anubi · 2003-08-10 21:48 · Score: 2, Interesting

I like your last option best, too. I hate to suppress anyone's right to say whatever they want to, but then I want to reserve my right to what I choose to pay attention to.
Under the existing technology, a spammer is like the royal pest on a city bus which takes advantage of the captive audience. The analogy here is that we have to download our POP box, we have no way of arranging our affairs to where the signals exist, but we deliberately choose not to tap into them.
I believe the technology must change. I am loathe to try to settle what I consider a technological issue by passing some sort of law... doing this just makes immense profits for litigators, but does little to solve the underlying problems.
If the technology could change to where ISP's could provide individual bayesian-type filters at the server level so that messages fitting criteria that each individual screens for, this could let the ISP off the hook for dropping messages, as well as having to supply any long-term storage for them... Somehow I get the idea that spammed messages are going to be very similar and should show a very marked correlation to the same spam sent to other accounts in that ISP. The ISP, upon determining a significant number of accounts filters have flagged a particular mailing as a spam may provide the ISP with the opportunity to only store ONE copy of the spam, while possibly putting only pointers to it to the subscribers.
So, what I would think would solve this is if the internet became more like radio transmissions. I support the idea that anybody can transmit whatever they want to the public, and if anyone wants to listen in, fine. But, like RF, it has to make it through the filters before it gets to the listener. The damn-near infinite advantage to the net-based paradigm is we have an almost infinite bandwidth in the notion that anyone can set up his transmitter and not step on someone else's signal. ( i.e, there's only so many "channels" in the AM, FM, or TV broadcast bands, whereas the internet does not have this limitation. ).
Anyway, thats my two cents worth.

--
"Prove all things; hold fast that which is good." [KJV: I Thessalonians 5:21]
Re:A new poll is required by Lars+T. · 2003-08-10 23:54 · Score: 1, Troll

Yeah, the death penalty also worked for reduceing violent crime - hey, wait....

--
Lars T.
To the guy who modded me down from perfect to terrible Karma - Apple haters still suck
Re:A new poll is required by Cato · 2003-08-11 00:32 · Score: 3, Informative

See http://death2spam.net - this is a commercial mailbox service that appears to have really good bayesian-style spam filtering (referenced by Paul Graham in a recent article) - they even fetch URLs in some messages to filter based on website content. They don't require individuals to train on their own messages, which may be controversial but also makes it feasible to deploy this at large scale in ISPs.

Without major ISP deployments, the response rates to spam will not go down, since the clued-up individuals who deploy filtering themselves would never have responded to spam anyway.

Your RF analogy is interesting but it breaks down for people with wireless mobile phone links, dialup when travelling, and so on. The best thing is to make spam unprofitable so it goes away.
Re:A new poll is required by SCHecklerX · 2003-08-11 01:40 · Score: 1

You obviously don't run a mail server, for your company or at home.
Re:A new poll is required by mirko · 2003-08-11 02:01 · Score: 1

I do :)
But I don't care about a few bug bites.
What I mean is that it does not affect me IRL and most /.ers should get a life instead of whinning about such trivialities.

--
Trolling using another account since 2005.
Re:A new poll is required by Anonymous Coward · 2003-08-11 02:25 · Score: 0

not to mention the fact it costs the taxpayer more to kill a person than keep them in jail for life...
Re:A new poll is required by haystor · 2003-08-11 02:50 · Score: 1

There have been no repeat offenders among those that have been put to death.

--
t
Re:A new poll is required by pope+nihil · 2003-08-11 03:35 · Score: 1

The vast majority of that money is because of the "due process" part where we pay lawyers lots of money to fight legal battles over whether or not this person should really be put to death... Toss that out and it would be much cheaper to just execute them...
Re:A new poll is required by Anonymous Coward · 2003-08-11 04:33 · Score: 0

Yeah, the death penalty also worked for reduceing violent crime - hey, wait...
The death penalty isn't imposed for violent crime only under limited instances of murder and only in some states. States with the death penalty also have generally LOWER crime than those that don't, just like states where law abiding people can own guns have far less violent crime than states that ban such ownership.
Re:A new poll is required by mortuusangelus · 2003-08-11 05:44 · Score: 0

Crime isn't lower because very few acting politicians have the cojones to let more then the most publicized and definitely guilty (McVeigh for one) get put to death because they're afraid some nutball will call them murderers. *gasp*

If we imposed it more often, then crime WOULD drop.. assuming we limited the attempts to get out, by.. oh say.. one shot. You fail appeal, you die the next day. Better make damn sure your lawyer knows wtf he's talking about before he gets in front of that judge.

--
Oh god... not again.
Re:A new poll is required by OMEGA+Power · 2003-08-11 08:43 · Score: 1

How should spammers be dealt with ?
Personally, I've always favored "take them out and shoot them" but that is just a matter of personal opinion (for example, if you want to shoot spamers indoors don't let me stop you)
Re:A new poll is required by Sorthum · 2003-08-11 08:56 · Score: 1

Am I the only one who misread that as a step-by-step process?

"Sounds good to me" was my reaction.
Re:A new poll is required by Lars+T. · 2003-08-11 09:08 · Score: 1

Yeah, the streets would be so much safer if you executed even more innocent people.

--
Lars T.
To the guy who modded me down from perfect to terrible Karma - Apple haters still suck
Re:A new poll is required by Lars+T. · 2003-08-11 09:10 · Score: 1

They still have a far higher crime rate than countries that don't have the death penalty and without so much guns in private hands.

--
Lars T.
To the guy who modded me down from perfect to terrible Karma - Apple haters still suck
Re:A new poll is required by Lars+T. · 2003-08-11 09:12 · Score: 1

Yeah, maybe executing all Americans would solve the problem.

--
Lars T.
To the guy who modded me down from perfect to terrible Karma - Apple haters still suck
Re:A new poll is required by Lost+Race · 2003-08-11 14:20 · Score: 1

Same here, I thought: That's a good start.
Re:A new poll is required by tgrigsby · 2003-08-11 16:26 · Score: 1

Fine them 0.01$/email and improve third world infrastructures with the money.

Look ya tree-huggin' hippy, if spammers get fined, that money better dang well go to the people receiving the emails. It's my time those (expletive deleted) are wasting, not some goat-milking dipstick who can't quite figure out why his crops won't grow in the desert. To quote the late Sam Kinison, "We've got deserts too, but we don't LIVE IN THEM!!"

No, spammers should have to pay into a fund that is applied to the communications bills of those that have been subjected to the spams. I got 976 emails in a 36 hour period over the weekend, and there 4 -- count'em, FOUR! -- emails that weren't spam. It's my time and my bandwidth these a******* are using.

--
*** *** You're just jealous 'cause the voices talk to me... ***
Re:A new poll is required by mirko · 2003-08-11 18:53 · Score: 1

I have not read such a stupid answer for a very long time.
You're either a clumsy troll or genetically impaired ...
If you cannot filter that *few* spams, then you should reconvert and become a peasant.

--
Trolling using another account since 2005.

You really just don't get it by frovingslosh · 2003-08-10 20:42 · Score: 5, Insightful

Realistically, I don't give a damn how much spam _you_ get, I care that _I_ don't get any.

But you still do get spam. Exactly as much of not more because you use Bayesian filtering. Spam still wastes your bandwidth to download that spam before it can be filtered. Spam still wastes any inbox size limits your ISP might impose. Spam cuts into any quota a forwarding service might now or in the future impose on your account, or it could take you to a higher charge level if you pay for a forwarding service. It costs your ISP money, costs that one way or another are eventually paid by you. Even the processing power for that Bayesian filtering costs you CPU cycles, while having no negative effect on the spammers whatsoever.

While you might not think you care how much spam I get, you might care if dozens, hundreds or thousands of other users at your work also get tons of spam, particularly when all of that spam significantly cuts into your bandwidth. And you will care when overload from spam on your mail server is so bad that it causes failures, effectively causing a D.O.S. situation.

And as long as geeks happly play with their little Bayesian filters, they stop seeing spam and so stop complaining to the providers that are letting spam get through. They stop doing other things that might make spammer's life difficult. Heck, I fully expect some spam haters with an additude like yours to say within earshot of a congressman or Senator something like "Oh, I never get any Spam. Spam can be filtered easily and nothing should be done about it". The spammers should love Bayesian filtering, it takes the presure off them while allowing them to reach exactly the same number of marks with a mailing.

--
I'm an American. I love this country and the freedoms that we used to have.

Re:You really just don't get it by Plug · 2003-08-10 20:52 · Score: 4, Informative

I don't disagree. I think that eventually we should move to a better email model - something like TMDA perhaps, where there is no guarantee that spammers can reach mailboxes. Or better legislation to make spamming punishable, controls on mail routers on million message mailouts, etc. Or djb's Internet Mail 2000, which moves the onus onto the senders network to store all 1m messages at a time, until people pick them up.

The other thing you can do is impose a microcost for mailing - at 1c/mail, spamming isn't economical any more. But then that is going to penalise the people who have legitimate reasons to send a million emails at a time - you'd have to have a very good micropayment system working on the Internet to do this.

However, those things need widespread change, and they need people in positions of power. Joe User at home can push for it, but they still get spam and they still want a short term solution. I suggest that even if they're filtering, the action of having to check their spam filter will make them irate enough. I see it as being like IPV6 - everyone would really have to change at once for the system to be most effective. (I use Freenet6, do you?)

Now that viruses are public, caught quickly, and Microsoft are being a lot less lax with security (I am in no way commending their effort, but they at least mostly fixed the Outlooks), you don't see people writing them nearly as often. I feel spam will get the same.
Re:You really just don't get it by ADOT+Troll · 2003-08-10 21:45 · Score: 0

E-mail contains HTML tags of any sort, except for
E-mail contains attachments (unless solicited; whitelist)
With all non-alphanumeric characters removed, certain case-insensitive keyword matches can detect spam
E-mail is a forward or looks like chainmail / Nigerian scam
E-mail contains junk strings in subject or sender
E-mail comes from you, but header doesn't match your send name
E-mail is excessively large (>20K) and unsolicited (whitelist)
E-mail headers and/or text contain Mojibake, if unsolicited (whitelist) - this will block anything in Chinese or Russian, for example
Badly formed headers
Address doesn't match reverse lookup
If ANY of these apply, then, IMO, YOU FAIL IT!!

I think, this would be a perfect filter system, if it could be coded. I have a homemade POP3 client that I could stand to add some of this to, I guess...
Re:You really just don't get it by horace · 2003-08-10 22:01 · Score: 1

But email advertising is not intrinsically bad and where a real product is sold, spammer and spammee both benefit, just as with junk snail mail.

Rather than looking at one overall solution there are little things that could be done to make things better. A few ideas of mine (not necessarily original)
1 Use Spam filters to automate complaining/unsubscribing/recording abuses
Some spammers will actually unsubscribe you but usually it is too much of a pain to do more than just delete. Other cases of abuse will be easier to document

2 Persuade ISPs to install user configurable spam filters at th server level. I think a general scheme is useless for the reasons described above but if the responsibility for potentially lost emails is the recipients it could be much more effective

3 Traceability of emails is really the only thing necessary to make opt outs enfoceable
Re:You really just don't get it by Mr.+Ayo · 2003-08-10 22:45 · Score: 1

you might care if dozens, hundreds or thousands of other users at your work also get tons of spam, particularly when all of that spam significantly cuts into your bandwidth

Indeed, but do you think XO, either now or ever, gives a flying fuck if spam reaches my email server? Either way (client side or at my email server), my bandwidth and processing power is still used.

What you missed in the parent post is that false-positives are unacceptable in a corporate environment. The cost of delivering any arbitrary number of spam messages is less than the cost of blocking one single legitimate message.
Re:You really just don't get it by CrazyWingman · 2003-08-10 23:47 · Score: 1

I have only one point of contention with your statements. Are you sure that 1c/mail is enough to make spamming non-economical? I mean, before the advent of e-mail (and even since), I received several spam snail mail messages per day at (I assume) the rate of 32c/mail (well, it was probably 20-some-odd cents then, but it continues in this 34?...37?...I don't know environment). I would like to agree that charging for spam would render it useless, but when compared to physical spam, I question the idea.

If you would argue that I am wrong, why? Is it simply that even 1c/mail would cut down _enough_ on spamming (since it wouldn't be free, there would have to be some limit on the number of mails a spammer could send) that the target audience is not large enough to ensure a high enough probability that someone will buy the product?

Perhaps there is a hidden idea here, in that if a spammer had to pay, then that would be an actual link back to them by which they could be punished for their violations of spam codes. I think there are different problems with this entirely, but it does seem like the case.
Re:You really just don't get it by Azghoul · 2003-08-11 00:26 · Score: 1

I'm just curious... I'm trying to imagine the case where someone would legitimately have to send out a million emails at a time, and I can't think of one. :)
Re:You really just don't get it by schon · 2003-08-11 02:25 · Score: 5, Interesting

spammers should love Bayesian filtering, it takes the presure off them while allowing them to reach exactly the same number of marks with a mailing.

I'm afraid you've made the cardinal mistake of thinking that spammers follow logic.

First question: Why do people install filters on their mailboxes?

Answer: To stop spam.

Now, take a look at any interview with any spammer.. you'll note that when they're asked, the spammer will say "I don't send it to people who don't want it."

They'll also say "we're always coming up with ways to bypass filters."

Now, you'd think that with the two statements, that one of them is false - however (besides the fact that spammers lie), any sociologist will tell you that the spammer actually believes he's telling the truth in each of these statements..

How he justifies it in his mind is that he believes that even though someone has installed a spam filter, that this person only wants to filter spam from other spammers - that his spam is somehow "special".

Spammers are sociopaths, and like all sociopaths, they believe the rules do not apply to them.

If spammers weren't sociopaths, and were capable of applied logic, then they'd realize that any filter (not just Bayseian) would benefit them.. but then, if they weren't sociopaths, they wouldn't be spammers in the first place.
Re:You really just don't get it by theCoder · 2003-08-11 03:17 · Score: 1

Though they're different, I have one of my credit cards send me an email whenever my current statement is ready and whenever they receive my payment. I'm sure they have millions of customers, though I'd imagine only a small percentage use that service. In any case, it's a nice thing, and I'd hate to see the anti-spammers take it away.

Oh, and yes, I hate anti-spammers (the radical ones, not people who just think spam is bad) way more than spammers because while spammers make email annoying, anti-spammers try to make email NOT WORK AT ALL.

(now watch me get modded down for not pulling the anti-spam party line)

--
"Save the whales, feed the hungry, free the mallocs" -- author unknown
Re:You really just don't get it by Teach · 2003-08-11 03:28 · Score: 1

I think a hybrid system would be interesting: use TDMA as the base, and allow through non-whitelisted messages for, say, half a cent per message. This would let "wanted" mail be free, and would stick spammers for $5000 per million unsolicited emails, which would greatly increase their cost of doing business.

Whereas I sent out about 2000 messages last year, so I'd only have had to pay $4 for the whole year, even if 100% of my messages were "unwanted". I'd pay $4 a year to make SPAM go away.

You would need micropayments for this to work, as well as a way of tracing back to the original sender for billing. Which is perhaps infeasible at this point.

You might also have some folks that intentionally keep people off their whitelist just to earn money from getting emails. Imagine setting your TDMA to "whitelist NULL" and then signing up for 400 mailing lists just to sit back and watch the money roll in!

Of course, maybe the money would go to a third party instead of the user, and mailing lists would probably auto-drop subscribers who didn't whitelist their mailings (after the first hit, anyway).

It'd all work out: the subscriber to the mailing list would incur a single charge for the initial 'subscribe' message since they wouldn't be on the mailing list's whitelist. Then if the user doesn't put the mailing list address on its own whitelist, the first message from the list to the user would incur a single charge for that message, and then the user would be automatically dropped from the list (for violation of terms, or what-have-you), and they'd be even. No extra costs for either the mailing list users or maintainers except a trivial one for signing up to the list.

...interesting to daydream about, at least.

--
Graham "Teach" Mitchell, computer science teacher, Leander HS
Re:You really just don't get it by edstromp · 2003-08-11 04:59 · Score: 1

And as long as geeks happly play with their little Bayesian filters, they stop seeing spam and so stop complaining to the providers that are letting spam get through. They stop doing other things that might make spammer's life difficult.
Actually, I get filtering AND I still complain. I use SpamCop. I do of course despise getting spam, but I quite enjoy reporting 100% of my spam to their sysadmins at a click of the button. It is easy to report, and I am quite glad to be doing my part.
Re:You really just don't get it by ATMAvatar · 2003-08-11 06:20 · Score: 1

The cost of delivering any arbitrary number of spam messages is less than the cost of blocking one single legitimate message.

Given the current rate of spam increase we've seen thus far, how long do you think it will be before the above statement is no longer true? It seems to me that within a matter of a few years, spam will make email useless to anyone who doesn't use a good filter.

--
"They that can give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety."
Re:You really just don't get it by Anonymous Coward · 2003-08-11 14:23 · Score: 0

Large companies that maintain mailing lists can quite often legitimately send out over a million emails in a hit - for example the latest "super special offer" from Coke for all people on their mailing lists - or even some notification of a new release of a popular software package from sourceforge. Especially in this latter example, a microfee would hit these non-profit organisations quite heavily
Re:You really just don't get it by TekPolitik · 2003-08-12 13:53 · Score: 1

If spammers weren't sociopaths...
And if 1 were not equal to 1...
You can't become a spammer without being a sociopath. Literally. Spammers lack the capacity to control their own actions by reference to the consequences to others. This is exactly the same flaw that afflicts manifesting sociopaths.

wtf by timerider · 2003-08-10 20:43 · Score: 2, Insightful

When will 'the net community' finally get it?
filtering is no solution as long as there's no way to stop the spammers!

Or would you say that ignoring the corpses in the gutters would be a solution to the problem of violence on the streets?

bye
[L]

Re:wtf by pe1chl · 2003-08-10 21:52 · Score: 1

When you could get a fence setup in the streets, with the violent people on one side and the remainder on the other, that would be a solution.

This can be compared to filtering.

Of course it is better to get rid of the problem, but just as with violence this is not realistic.
No matter how many laws, there will always be people or countries who just don't care.
Re:wtf by timerider · 2003-08-10 23:00 · Score: 1

No.

your 'fence in the street' example is more like a supplement for SMTP that verifies the sender before accepting the mail; spammers are kept outside.

bye,
[L]
Re:wtf by Anonymous Coward · 2003-08-10 23:46 · Score: 0

I think it's more like ignoring the crack dealers.
Re:wtf by PhxBlue · 2003-08-11 00:19 · Score: 1

Or would you say that ignoring the corpses in the gutters would be a solution to the problem of violence on the streets?

Ignore? P'shaw, I'd do no such thing! Mmmmm, soylent green. . .

--
!#@%*)anks for hanging up the phone, dear.
Re:wtf by Chokma · 2003-08-11 00:21 · Score: 3, Funny
filtering is no solution as long as there's no way to stop the spammers!
Or would you say that ignoring the corpses in the gutters would be a solution to the problem of violence on the streets?

Your analogy is slightly flawed. In the case of spam, it would be correct if:
- I would have to examine every corpse closely to determine if it is sill alive
- I would have to manually remove the corpse from the street
On my system, SpamAssassin kills 99% of the Spam, carries it outside, buries the remains in the spam folder and cleans away the bloodstains on the floor. The less I get in touch with spam, the better.
In the perfect world, there would be a "nuke obnoxious netizen" button on my keyboard. But alas, we have to settle for slightly less efficient methods.
Re:wtf by twelveinchbrain · 2003-08-11 01:46 · Score: 1

You don't get it. If all the major email clients and/or all the major ISP's had effective spam filters, there would be little response from spam. Therefore, there would be little reason to send spam.

Your analogy is a terribly poor one. A closer analogy would be if shops stopped keeping supplies of cash on hand, then people would stop robbing them.

--
Not Found
The requested URL /signature.html was not found on this server.
Re:wtf by Anonymous Coward · 2003-08-11 03:01 · Score: 0

Your analogy is slightly flawed

Not with regard to the point he's trying to make (which I guess you missed.)

The point is that even if you don't see it, it still happens, and still has costs,

Filtering spam by examining the content, and claiming this is a 'success' is like solving the problem of violent crime by ignoring the bodies on the street because it's still happening, and you're still being affected by it.

On my system, SpamAssassin kills 99% of the Spam, carries it outside, buries the remains in the spam folder and cleans away the bloodstains on the floor

Yes, but you're still getting the spam, and you're still paying for it.

And the winner is... by Night0wl · 2003-08-10 20:44 · Score: 1

By this article, SpamBayes.

Which only works out of the box with Outlook 2000/Express. Woopy doo.

Are there any recommendations for those of us who aren't forced to use outlook? I use Eudora my self, have been for years, thus I'm not looking for a new email client recommendation. ;)

--
Computational Madness in a round package.

Re:And the winner is... by Gaza · 2003-08-10 20:52 · Score: 3, Informative

SpamBayes has a very well done pop3 proxy that will work with ANY pop3 mail client, including Eudora. There is also an IMAP filter for those that like IMAP and for those procmail fans it also has an app called hammiefilter which is a command line version of the SpamBayes tools.

SpamBayes also has a very well done and integrated Outlook plugin which leads to the common misconception that SpamBayes will only work with Outlook.

Also note the review mentioned that both SpamBayes and POPFile work on multiple platforms and he is reviewing the pop3 proxy on both them, not their counter part outlook plugins.
Re:And the winner is... by puddytat · 2003-08-10 21:34 · Score: 1

Eudora 6.0 which is currently in beta has bayesian filtering build in. I tried it and it looked quite nice. for more details see www.eudora.com (under beta software)
Re:And the winner is... by spongman · 2003-08-10 22:44 · Score: 1

RTFA!!!!
Re:And the winner is... by Anonymous Coward · 2003-08-10 23:04 · Score: 0

RTFA, asshole
Re:And the winner is... by afidel · 2003-08-10 23:28 · Score: 1

Mozilla Mail/Thunderbird has great Bayesian spam filtering, accuracy for me after training on a training set of about 400 spam message was about 99.7% for the first couple months, then it started to creap towards 97%, so I went through and retrained it, brought it back up to 99.7% again. It also has a simple switch to not run filtering on anything from people in your personal address book (basically whitelist ppl in your address book before filtering even starts). I have only had one false positive out of over 3K legit messages recieved (not counting commercial email that wasn't technically spam but that I really didn't mind losing =)

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
Re:And the winner is... by HermanAB · 2003-08-11 02:51 · Score: 1

Yes, the Mozilla filter works, but SpamProbe is better - I tried both.

--
Oh well, what the hell...
Re:And the winner is... by joedohc · 2003-08-11 05:27 · Score: 1

I love Thunderbird. I started about 2 weeks ago and it filters 90% of the spam. Not perfect, but when you get a 100 a day, it helps. 2 false positives, but I chalk that up to the learning. I run it on WinXP Pro and RH9.0 at home. Surf the net with Firebird at home and work.

Re:great by impluvian · 2003-08-10 20:47 · Score: 4, Insightful

I think there's a very simple distinction that can be made between spam and television advertising, and it has to do with the amount of control that your service provider exercises over the advertising content.

When you watch cable TV, you know that for an hour of content, you are going to see up to 12 minutes of advertising. The advertising is controlled by the cable company, and no-one can advertise on the channel without going through that 'filter'.

Spam, on the other hand, is not restricted. If I receive 100 e-mails a day, anywhere from 0 to 100 of them could be spam. None of those spams are sanctioned (or controlled) by my service-provider, and they were not part of the package I signed up for.

I was getting 99.87% by Moderation+abuser · 2003-08-10 20:47 · Score: 1

I get hundreds of mails per day and it's pretty good a picking out the spam.

It's no good at more subtle classification though, but spam/not spam is highly useful.

If you make a mistake filtering you don't have to restart, you just keep training it, eventually your mistake will be drowned out as statistical noise.

I've since been moved to Notes so no more spam filtering.

--
Government of the people, by corporate executives, for corporate profits.

Re:I was getting 99.87% by Chatterton · 2003-08-10 21:48 · Score: 1

In the case of SpamBays, if you make a mistake, you can correct it by putting back the mail in the correct folder. It will correct is db and negate the error.

Re:Surely not another first post! by Anonymous Coward · 2003-08-10 20:47 · Score: 0

Putas chicas! Muy caliente!!

This system is crap by Anonymous Coward · 2003-08-10 20:48 · Score: 0

First of all, I'll never reply to such a mail. You filter your spam on your own. Don't make me filter your spam. I sent you a mail because I thought it countained valuable info. If you don't want to receive it, bad for you, I don't care.

What about order confirmations and the like? Mailing lists? They won't reply either, and you won't get their mail.

What about if two people have this system? Will there be an infinite loop of confirmation requests?

Spammers will just just HTML with images.. by Anonymous Coward · 2003-08-10 20:49 · Score: 1, Interesting

How does bayesian filters solve the problem of pure-image spams? -I.e. HTML mails that contain nothing else than an IMG tag. I only see collaborative filters solving this problem - SPAMfighter would be an example of this.

Re:Spammers will just just HTML with images.. by Gaza · 2003-08-10 20:54 · Score: 1

There is usually enough information in the url and headers to classify it as spam. I rarely get these pure image spams in my unsure folder anymore, SpamBayes nails them dead on. SpamBayes has even been tested to filter only only header information with some success.
Re:Spammers will just just HTML with images.. by wheany · 2003-08-10 21:23 · Score: 1

The headers still offer a lot of information to filter by, as do the urls of the images and the fact that they are remote.

My POPFile says this:
Lookup result for html:imgremotesrc
good 0.2185471262
spam 0.7814528738
html:imgremotesrc is most likely to appear in spam
Re:Spammers will just just HTML with images.. by Anonymous Coward · 2003-08-10 21:29 · Score: 0

Add an OCR Reader plugin to Bayesian filters ?
Re:Spammers will just just HTML with images.. by CrazyWingman · 2003-08-11 00:04 · Score: 1

Who do you ever get an e-mail from that is nothing more than an IMG tag? That's definately my number one spam-blocking rule: if it has an attachment, or is extremely small, and I don't recognize the address, delete it. If someone has something important to tell me, they send it in text, not as a picture with no other message content.
Re:Spammers will just just HTML with images.. by Anonymous Coward · 2003-08-11 00:29 · Score: 0

Your filter will learn to give more spam weight to html img tag. In the absence of any redeeming non-spammy words, it will get classified as spam.

YFI list by usotsuki · 2003-08-10 20:51 · Score: 1, Informative

E-mail contains HTML tags of any sort, except for <A>
E-mail contains attachments (unless solicited; whitelist)
With all non-alphanumeric characters removed, certain case-insensitive keyword matches can detect spam
E-mail is a forward or looks like chainmail / Nigerian scam
E-mail contains junk strings in subject or sender
E-mail comes from you, but header doesn't match your send name
E-mail is excessively large (>20K) and unsolicited (whitelist)
E-mail headers and/or text contain Mojibake, if unsolicited (whitelist) - this will block anything in Chinese or Russian, for example
Badly formed headers
Address doesn't match reverse lookup

If ANY of these apply, then, IMO, YOU FAIL IT!!

I think, this would be a perfect filter system, if it could be coded. I have a homemade POP3 client that I could stand to add some of this to, I guess...

-uso.

--
Dreams, dreams, don't doubt dreams, dreaming children's dreaming dreams. Sailor Moon SS

Re:YFI list by Oddly_Drac · 2003-08-10 20:57 · Score: 2, Informative

"Address doesn't match reverse lookup"

You'd be surprised how many DNS servers are completely misconfigured for this, but I think that a simple ping to the address given could actually show if it _existed_.

Personally I've found that I can reduce my spam by a huge amount by never viewing HTML...which brings a thought about tracking and tracing the webbugs in any given piece of HTML email...

--
Oddly Draconis
Too cynical to live, too stubborn to die.
Re:YFI list by aduxorth · 2003-08-10 21:46 · Score: 2, Interesting

another goodone is if the domain from the envelope sender doesn't have a MX record. bam guarenteed spam. The other one is to verify the sender not just the domain. This kills all those spams from lkiqprejbn@yahoo.com which are obviously bulldust.

That alone kills off about 70% (IMO) of the spam that comes through servers that I administer, and as far as I know, only 2 emails(over the last 4 years or so) that wern't ment to be rejected were rejected because they had invalid sender envelopes.

HTH
cya
Andrew
Re:YFI list by BiggerIsBetter · 2003-08-10 22:05 · Score: 1

Nice try, but don't even think about running 1, 2, 7, or 8 on an address for your business dealing with international clients, or on your mom's PC. Most punters actually *like* HTML mail and pretty inlined graphics, and blocking based on character set is incredibly closed-minded of you.

While it may be your perfect filter system, it's no use for a business or people who deal with non-geek friends.

--
Forget thrust, drag, lift and weight. Airplanes fly because of money.
Re:YFI list by devnullify · 2003-08-10 22:12 · Score: 1

And how many people run personal servers and don't have control of the reverse dns...
Re:YFI list by Anonymous Coward · 2003-08-10 22:30 · Score: 0

"People who deal with non-geek friends" is a pretty long euphemism for "dumb user".
Re:YFI list by usotsuki · 2003-08-10 22:32 · Score: 1

I do get the occasional HTML message, and I usually say "Look, please send TEXT next time". Unfortunately these n00bs don't know or care how to do this, so I'm stuck getting HTML messages. It is fortunately rare. I'd rather send an auto-reply on all HTML messages:
Subject: You Fail It!

This e-mail address accepts only TEXT mail. All other mail will be rejected. If you wish for the recipient, myname@mydomain.com, to read your message, please RESEND it in TEXT mode. Instructions for Outlook and Outlook Express follow...
Note: I don't know how to do it in Outlook and OE but those are the major HTML spewers. The person in question uses OE6. As for the international client thing...it's only Mojibake if you can't read it, and I'd only block it if it is Mojibake to me (note: I can read French and Spanish just fine)

HTML trix0ring is useless in uPOP though - if I were to get HTML at my new address, it wouldn't do any harm, since my e-mail client displays only as text. (It's also incredibly st00pid, but what can I say - I wrote it in VB6 in a couple hours, and it's quite unpolished.)

--
Dreams, dreams, don't doubt dreams, dreaming children's dreaming dreams. Sailor Moon SS
Re:YFI list by CausticWindow · 2003-08-10 23:21 · Score: 1

That doesn't work as intended. Many malconfigured servers send with envelope from user@blah.somedomain.com, and blah.somedomain.com doesn't have an MX record. But the from: field in the actual header says user@somedomain.com, and will work.

--
How small a thought it takes to fill a whole life
Re:YFI list by aduxorth · 2003-08-11 01:50 · Score: 1

hmm so this means if a bounce message is generated they don't get a message back. Sorry but thats their tuff luck. As I said I know of 2 rejections because of no A or MX record and they now send with correct envelope sender.

This is a major part of the problem of spam.
Misconfigured servers.

btw if the domain doesn't have a MX record, but do have an A record then the MTA I use will accept it.
Re:YFI list by aduxorth · 2003-08-11 01:53 · Score: 1

btw thats a A record for the domain in the envelope sender.

Re:great by advocate_one · 2003-08-10 20:55 · Score: 2, Informative

No ads, quality programming, small fee.

No Adds??? no, it's stuffed to the brim with promos for their own stuff though... (Gardening magazine, History magazine, Nature magazine, Radio times, TellyTubby toys, Fimbles stuff, trailers for upcoming programmes and series)

Quality programming??? it's gone really downmarket in the last few years..

Small fee??? That fee is your license for receiving _all_ television programs, even cable and satellite... not just the BBC. Although that license money goes to the BBC, really a goodly share of it should go to the other service providers as well.

--
Donald 'Duck' Dunn: We had a band powerful enough to turn goat piss into gasoline.

Authentication of senders by flakac · 2003-08-10 20:57 · Score: 2, Insightful

Sorry, but filters are not the final answer. Even when the filters can "learn", the user still has to expend a certain amount of effort to "teach" the software. And quite frankly, spammers (or the people who write automated spamming software) just need to study the filters and learn to get around them. And worse, you can never be sure that the filter is not deleting email that you actually want, unless you set it to never delete suspect mail, allowing you to examine and delete it manually. But at this point, you've gained absolutely nothing -- simply setting your email client to put all email that's from addresses not in your address book, or that doesn't contain your exact address in the "To:" line will achieve exactly the same effect.

The only thing that can truly save email is to switch to a service that requires authentication of senders.

Re:Authentication of senders by frovingslosh · 2003-08-10 21:33 · Score: 3, Interesting

The only thing that can truly save email is to switch to a service that requires authentication of senders.
I agree with everything that you said about filters being ineffective. But I strongly disagree with your "only thing" statement. Particularly if you mean it as any of the systems I've ever heard about, such as "If it's not in the address book, the sender must acknowledge a challange message" type of approaches. The problem with such systems is that many of us get quite a bit of e-mail each day from people who are not in our regular address books, some of it quite important to us. We do not want that mail lost because the system at the other end was not in out address book and did not waste their time responding to a challange and response type system. For example, say I purchased something on-line from a vendor I had never dealt with before. Their e-mail system may automatically kick out an e-mail that informs me the product was shipped and give me an important Fed-ex or UPS tracking number. I'm glad they do such things with their shipping systems, and I don't expect them to manually respond to every challange they get back; realistically they will send any such challanges to the bit bucket and people who want e-mail that is important to them will end up never getting it.
So I do not believe that Authentication of senders , at least in any of the traditionally suggested ways, is the correct approach. Much of the spam problem we have is due to what I consider flaws in SMTP. I would very much like to see a replacement for SMTP that considered the spam problems (as well as other problems inherent in SMTP). As an example, another post here mentioned a system where the mail is held, not on your ISP or upstream provider's system until you download it, but rather is held on the sender's or sender's ISP's system. The recipent would presumably receive only a very short indicator of where they have mail waiting, and would fetch it themselves when they are ready to receive it. The puts the burden of storage on the sender or the service provider for the sender, and avoids considerable bandwidth wasted by senders who supposedly send out e-mail with addresses generated to match all combinations of up to x characters (the excuse Mindspring gave to me when addresses that I created but never gave out or used started getting spam, not that I believe them). In addition to putting this burden on the sender, it would insure that there was a good address in the e-mail to fetch the mail from, so spammers would have a much harder time injecting their spam into the system and would be much more traceable. And while I'm not foolish enough to think that laws could completely stop spam, we've seen how laws did drastically curtail fax spam, and some fax spammers have recently been made to pay serious fines. I do think laws would have a big effect on spammers; ther are a lot of spammers who just don't want to have to move out of the country to keep up spamming, and those of us who hate spam will track the spam back to US sources if we have a law with teeth in it to impose fines (or worse) on them when we do.
Of course, and change to or replacement of SMTP must be phased in over time. It's not a short term solution to spam. But I expect SMTP would quickly go the way of gopher or archie or the rest if a viable new protocol was presented that addressed these problems effectively, and this is where I think out greatest chances for sucess are.

--
I'm an American. I love this country and the freedoms that we used to have.
Re:Authentication of senders by FuzzBucket · 2003-08-10 23:40 · Score: 1

There's all this talk lately about redoing SMTP. There's no reason for that. Authentication of senders is indeed the right way to go -- not with a challenge/response mechanism, but with the tool designed for the job: digital signatures. Mail clients (and eventually servers) can be easily configured to bounce unsigned mail (unless on a whitelist), and signed mail can be traced to senders and sorted reliably.

Ubiquitously signed mail would lead to senders with public reputations, and lists of spammers whose mail you might reliably choose not to accept. You would only treat as valid signatures associated with a trusted certificate-authority validated identity, so spammers couldn't just make up new identities every day.

In general, "cold-call" mail -- mail from people you didn't expect to receive anything from and therefore could not have whitelisted -- should cost some small amount to get through. Again, there's no no need to replace SMTP. A standard challenge-response protocol could easily be implemented over SMTP: (1) receive mail; (2) reply to unknown sender with request for payment via PayPal et al, referencing a unique ID; (3) when payment is received send confirmation that mail has been delivered to user-visible inbox. Users can set their own rates.

Spam is not really a hard problem or a technical problem. Content-based filtering is a dead-end, it just encourages people to pretend and dissemble more cleverly. The problem is social -- changing end users' habits (embedded in their software) and senders' expectations from an assumption that all mail sent will be accepted and read to a more realistic understanding that most mail sent will be returned unread unless the sender can persuade the recipient the letter merits even two seconds of her attention.
Re:Authentication of senders by Anonymous Coward · 2003-08-10 23:41 · Score: 0

d00d! mail stored on the senders serv0r implies that it can be easily tracked when you recieved the mail! this is no good!
Re:Authentication of senders by tazan · 2003-08-11 01:46 · Score: 1

If I order something from a new online vendor and am expecting a shipping notice I simply look in the spam folder periodically to see if they sent one, then add the address to the whitelist. It doesn't happen that often for me to be a problem.
Re:Authentication of senders by Anonymous Coward · 2003-08-11 02:14 · Score: 0

"The only thing that can truly save email is to switch to a service that requires authentication of senders. "

A much easier answer is to filter IN the mail you want to keep and then default all remaining email to trash or /dev/null. Does require maybe 10 minutes of training per user. Does nothing for the bandwidth problems. Doesn't require a rewrite of a million versions of SMTP. Doesn't involve waiting for changes to laws or other pie-in-the-sky plans.
Re:Authentication of senders by pongo000 · 2003-08-11 02:26 · Score: 2, Interesting

say I purchased something on-line from a vendor I had never dealt with before. Their e-mail system may automatically kick out an e-mail

Using TMDA, you would generate a "keyword" address: A unique addressed, identified by a keyword embedded in the address, which would allow your vendor to bypass the C/R system. If they keyword address starts being abused then (1) you can easily disable it, and (2) you know not to do business with that vendor again.

As an example, another post here mentioned a system where the mail is held, not on your ISP or upstream provider's system until you download it, but rather is held on the sender's or sender's ISP's system.

This system quickly breaks down, though, as delays are introduced by having to wait to fetch each piece of mail. People bothered by such delays will write/obtain software that automatically fetches the mail at a predetermined time, which would then shift the bandwidth problem (part of it, anyways) back to the recipient.

The other problem with sender authentication is who, exactly, determines whether a sender is authenticated? I run my own e-mail server. Will I have to pay out bucks for an "authority" to confirm that my sending address is valid? Right now, some ISP's (notably Time-Warner offshoots) are denying access to their SMTP servers under the guise of reducing spam. If your IP happens to fall within a certain range, they simply don't allow you access. We will end up in the same morass RBL has put us in: Who plays God in determining whether a sender is truly "authentic" or "worthy"?

Re:great by opensourcehosting.co · 2003-08-10 20:58 · Score: 0, Flamebait

No adverts on the BBC? They advertise all the time - selling the radio and TV listings magazine (Radio Times) for example. They also cross-sell the different channels, especially now we've got BBC 3 and 4 in the UK. Just as annoying as "normal" advertising, IMHO.

The quality of the programming is also going downhill fast - perhaps viewers outside of the UK are spared the dross we have to wade through here?

Why not stop the sellers? by Anonymous Coward · 2003-08-10 21:03 · Score: 5, Insightful

I know this is slightly off topic, but can someone answer me a reasonably simple question thats been bugging me for a while?

Why not instead of hunting down the spammers do we not hunt down the people who are selling and advertising their junk via the spammers?

The spammers purposly make themselves difficult to find, but it must be easier to track down a company that is collecting money and sending out products? Why not make the using of spammers services illegal and fine and punish those doing so?

I think Im correct in saying and please tell me if Im wrong, but here in the UK a similar situation is people "fly-posting". In these cases, if advertising posters are put somewhere illegal or unwanted, it is not the person who put the poster up that is fined, but the club, record label, whoever is beign advertised that takes the rap.

Just my 0.02p

Re:Why not stop the sellers? by CrazyWingman · 2003-08-11 00:09 · Score: 1

I think part of this may come from spammers being resellers. You never see a spam from Pfizer, but you see plenty of spam selling Viagra. So, you end up back in the same place of trying to find the person who sent the spam, because they are the ones doing the selling. You can't even call up Pfizer and say "stop selling Viagra to spammer@hotmail.com", because they don't even know they're selling Viagra to him.
Re:Why not stop the sellers? by Quill_28 · 2003-08-11 00:35 · Score: 1

I agree with you but here is the problem:

You receive spam selling viagra from company X.

Here's how it works:

Company C hires SpammerX to send spam selling Viagra.

You go after Company C. Easy.

But what if SpammerY is ticked off by not getting hired and sends spam from Company C.
Or CompanyD(for whatever reason), if you go after CompanyD how do you prove they paid someone to send it. You have to find the spammer?

Not an easy problem.
Re:Why not stop the sellers? by Zaiff+Urgulbunger · 2003-08-11 01:40 · Score: 1

Or hunt down the people who actually "buy" the products we're all being spammed with?

I've heard recently that there are a lot of sales being made through spam. Prior to that, I just thought that no one could possibly be stupid enough to buy any of the "products" being pushed.... sadly, its seems there is!

Re:great by Goldberg's+Pants · 2003-08-10 21:05 · Score: 4, Insightful

You probably ARE a scumbag spammer.

For people who have to pay for their online time (England for example), these scumbags are essentially stealing money from people. Filtering only works once you've downloaded the mail. You still have to download their worthless drivel. Sure, it may be pennies a week in costs for a user, but you tally that up over a year or two of dealing with these idiots, and you've got a sizeable chunk of change. Certainly enough for a nice pizza.

Let's not forget the TIME these shits waste as well. All this work invested in stopping spam. Who know's what cool stuff may have come from the minds who instead are working on ways of dealing with the email cancer.

As I said, these scumbags should be legal to hunt and kill.

No, I don't use freenet6 by Chep · 2003-08-10 21:05 · Score: 1

My ISP provides me ipv6 natively. Yep, a full /48 for me. And it's on a plain vanilla home DSL line.

Aug 11 03:19:02 traminer pppoe[19276]: Sent PADT
Aug 11 03:19:02 traminer pppd[12690]: Serial connection established.
Aug 11 03:19:02 traminer pppd[12690]: Using interface ppp0
Aug 11 03:19:02 traminer pppd[12690]: Connect: ppp0 <--> /dev/ttyp1
Aug 11 03:19:08 traminer pppoe[12694]: PADS: Service-Name: ''
Aug 11 03:19:08 traminer pppoe[12694]: PPP session is 4029
Aug 11 03:19:12 traminer pppd[12690]: local LL address fe80::c959:a698:ed94:0ec
3
Aug 11 03:19:12 traminer pppd[12690]: remote LL address fe80::0208:e2ff:fe0a:d80
8
Aug 11 03:19:12 traminer pppd[12690]: Cannot determine ethernet address for prox
y ARP
Aug 11 03:19:12 traminer pppd[12690]: local IP address 62.212.101.212
Aug 11 03:19:12 traminer pppd[12690]: remote IP address 62.4.16.244
Aug 11 03:19:12 traminer pppd[12690]: primary DNS address 62.4.16.70
Aug 11 03:19:12 traminer pppd[12690]: secondary DNS address 62.4.17.69

Re:No, I don't use freenet6 by devnullify · 2003-08-10 21:57 · Score: 1

Your ISP provides IPv6, yet forces you to use PPPoE. How lame.
Re:No, I don't use freenet6 by Chep · 2003-08-10 23:09 · Score: 1

No it doesn't. I'm using it with PPPoE, because I use one of the first-batch ECI HiFocus "Merkava" Ethernet bridges, and the firmware I'm stuck with won't do anything else than PPPoE with the kind of DSLAM the still-monopolistic local-loop provider is providing.

Many folks are using the alternatives, but till my modem breaks, I won't bother to fix it, especially since I don't see what I would gain from switching from PPPoE.

Re:great by fr0z · 2003-08-10 21:07 · Score: 0

Because advertisers pay good money for airtime, which in turn provides the cable people with funds to give you more tv shows. But then again...I guess you think those wares peddled by spammers work. In which case you are part of the problem.

--
Never underestimate the predictability of human stupidity...

Re:great by Zog+The+Undeniable · 2003-08-10 21:11 · Score: 1, Informative

Well, I rarely find anything watchable on BBC1 or BBC2 these days (too many soaps, trashy sitcoms and repeats), and the licence fee, while it's cheap compared to Murdoch's Sky subscription, is a tax on watching TV, not an optional payment. Even if you only watch satellite or cable channels you have to pay the BBC.

You used to get a free satellite viewing card for your licence fee giving access to all the "terrestrial" public channels on satellite, which was great if you had a spare decoder and crappy terrestrial reception like where I live. To save a few quid, the BBC no longer fund these cards and have gone unencrypted, which means I've lost the other terrestrial channels upstairs. Thanks guys.

--
When I am king, you will be first against the wall.

Spam may won the fight against Bayesian filtering by Picon · 2003-08-10 21:12 · Score: 1

Spam are intented to people who are in need of a miracle products or enough credule to purchase what spammers have to offer. And i don't think that those people will take the pain to run a spam filter and moreover to learn it to recognize spam. Because bayesian spam filters need to learn and it can take a week or two before it is efficient. This way spammers will always reach their goal, but you will have a way to filter the spam from your mailbox. The only solution that would really annoy spammers will be Bayesian filtering on server side. But ISPs will probably never do that, because the bayesian algorithm is not reliable at 100 percent and can filter the so-called "false positive". IMHO, bayesian filtering will never be the solution to *reduce* the spam flow. It could be useful for people who will classify the spam in a junk directory. Or for business who want to get rid of spam at the risk to loose some good emails. Oh by the way, Spam is a trademark! ;p Regards

Mozilla - filters on client not server by Zog+The+Undeniable · 2003-08-10 21:15 · Score: 3, Interesting

Moz's Bayesian filtering works well, but its Achilles heel is that it doesn't work on the POP3 server, so you still have to download everything. As POP3 allows the header and the first part of the message body to be read without downloading it, surely there could be an option - once Moz has been trained and you're fairly sure the false positive rate is negligible - for filters to operate on the server and delete spam from there?

--
When I am king, you will be first against the wall.

Re:Mozilla - filters on client not server by cameleon · 2003-08-10 21:34 · Score: 1

As POP3 allows the header and the first part of the message body to be read without downloading it
How do you read a message without downloading it? The data has to get to your computer before you can read it, right?
Re:Mozilla - filters on client not server by pe1chl · 2003-08-10 21:46 · Score: 3, Informative

It would be nice if there was filtering done on the server. Then you would not need the packages that are reviewed here.

However, that means a change to the server, and a change to the POP3 protocol. The ISP would have to install a filtering plugin or a modified version of the server, and the client would subscribe to this service and train it (every client would have his own dictionary). With the first few messages there would be some special POP3 report back to the server indicating that you consider it spam, and from then on the server would filter on its own.

However, that would be difficult/impractical to roll out, so you will have to live with clientside filtering like in Mozilla.
Re:Mozilla - filters on client not server by Zog+The+Undeniable · 2003-08-10 21:52 · Score: 1

You use the TOP command on a POP3 server to get just the first part of the message. That should be enough for a good Bayesian filter. Fire up Telnet and try talking directly to the server - it's quite enlightening (and essential for when some "friend" sends you a 15MB movie file and you're on dial-up). Some instructions here.

--
When I am king, you will be first against the wall.
Re:Mozilla - filters on client not server by Peter+Harris · 2003-08-11 01:12 · Score: 1

My current solution (since I'm in the shell as often as the desktop) is
to run popcheck before downloading my mail.
I just look at the subject lines and mark all the obvious spams (detected by
the neural net between my ears) for deletion, and if there are any left I run
Mozilla to read them.
Most of the junk doesn't even get downloaded from the server...

--

-- What do you need?
-- Gnus. Lots of Gnus.
Re:Mozilla - filters on client not server by SCHecklerX · 2003-08-11 01:52 · Score: 1

If it is a *nix box, and the end user has some form of shell access, then it is already possible. They simply need to configure their ~/.procmailrc. There is nothing at all that needs to be changed in pop3. In fact, I pop for my mail while at home, use pine to the mail server while on the road, and for my own account and my mailing lists, this is exactly how I currently call spamassassin. Using MimeDefang to catch it, along with rejecting on dumb things like broken helos and no reverse resolving, without needing real accounts is my next project.
Re:Mozilla - filters on client not server by letxa2000 · 2003-08-11 02:11 · Score: 2, Interesting

You have pretty much described PrismEmail. It, among other things, does Bayesian filtering. It's server-based so you don't have to download the spam. It's user-specific so you have your own Bayesian corpus that applies only to you, not server-wide. You can inspect blocked email on the server at any time or wait for a single spam report each night to see a list of all email blocked--a quick click will then release any message that was misclassified. And you can just click on a link in the headers of a message if it was spam and it got through.
Really, all the people that think that server-side Bayesian filtering is impossible are confused. No, you can't have a single corpus that applies to everyone on the server--that defeats the purpose of Bayesian. But you definitely can do the user-specific filtering on the server. Let the server do the work, you only download the good stuff, and there's nothing to install locally.
Re:Mozilla - filters on client not server by HermanAB · 2003-08-11 02:56 · Score: 2, Informative

I run SpamProbe on the server. For any given business, everybody will receive pretty much the same sort of mail. So a single database works like a charm, with atypically 99.5% accuracy and zero false positives. This works because Spamprobe also counts word pairs, something that no other word counting filter does. To compensate for the enormous increase in computational load, it uses BerkleyDB as a backend. For corrections, i create a user called spam. Corrections can then be forwarded to this user, to reverse the database entry for that message.

--
Oh well, what the hell...
Re:Mozilla - filters on client not server by pe1chl · 2003-08-11 05:52 · Score: 1

What you would need a modification in POP3 for, is to classify the mail you are reading as spam/not spam.
The solution your propose assumes that the knowledge about spam is already on the server. This could be done, but it probably would not be bayesian filtering.
Re:Mozilla - filters on client not server by pe1chl · 2003-08-11 05:59 · Score: 1

This looks good.
Indeed it would be possible to use links instead of POP3 modifications, assuming your mail client is capable of following links (Mozilla certainly is).

Better would be if the software would be running at your ISP instead of some other company that will fetch all your mail and filter it.

Maybe they should sell the software to ISPs... I'm sure more people would be comfortable with an extra-charge spamfilter from their ISP than with some external party filtering all their mail.

My ISP offers spam filtering but it is based on DNS blacklists that filter on sending or relaying SMTP server address. That sucks.
Re:Mozilla - filters on client not server by letxa2000 · 2003-08-11 08:16 · Score: 1

Maybe they should sell the software to ISPs...
"They" is "me." Or rather "us." But I was one of the primary developers.
Better would be if the software would be running at your ISP instead of some other company that will fetch all your mail and filter it. I'm sure more people would be comfortable with an extra-charge spamfilter from their ISP than with some external party filtering all their mail.
A valid point, and we do plan on making it available to ISPs. One step at a time, though. :)
My ISP offers spam filtering but it is based on DNS blacklists that filter on sending or relaying SMTP server address. That sucks.
True, that's always been a rather ineffective way at blocking spam--at least when you consider the potential false positives. We are thinking, though, of making DNS-based blacklists available--i.e., that whether or not a given IP is currently in a DNS blacklist can be used as part of the decision-making process in rating a message as spam or not. A DNS blacklist can be useful at helping making a decision regarding whether a specific message is spam, but I wouldn't certainly make it the ONLY factor in the decision.

Re:great by ntmuffin · 2003-08-10 21:20 · Score: 2, Insightful

Yup, I was also thinking "for peace" ;) Long live B5

But on the other side ... I've had the same problem you had - going from 0 to 25-30 a day, sometimes even more. I don't think we'll ever be able to stop the spammers, but I think that some of the blame has to be put on those people offering free mail services like Hotmail, Yahoo.com (and .ca) and AOL. 95% of my spam originates from accounts on their domain, and when I'll try to send bounce messages with Mailwasher, the accounts used to spam me doesn't exist anymore ... so if these mailservices had made a system couldn't be used to create accounts automatically with a script, we might se a little more spam out on the net, as I doubt that the spammers would bother using lots of time creating accounts themselves ...

I like the thought of an all year huntinglicense for spammers though ;)

Re:re : ducks deserve life tbh by giantq · 2003-08-10 21:27 · Score: 1, Offtopic

Since you use the slang wankers, I'm assuming you're UK based. However, it's a good thing you didn't do that here in the States. I would have pulled the trigger and the good-ol-boy coroner would have ruled it a hunting accent. Offical cause of death: misadventure.

shouldn't that be anti-virii by DrSkwid · 2003-08-10 21:28 · Score: 1

it is viruses you clot

--
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter

A tad off-topic but... by Channard · 2003-08-10 21:33 · Score: 1

Although that license money goes to the BBC, really a goodly share of it should go to the other service providers as well.

Except that it doesn't. With any other service, an ISP, etc, you can take your business elsewhere if you don't like the service being - but the BBC still get paid the same if you watch ITV all the time. And the licence payers have no say in the programmes that are shown - the BBC have a pretty easy ride.

hmm, if you really are so clever by DrSkwid · 2003-08-10 21:33 · Score: 1

If you are going to delete everything that comes to via the Usenet address why do you include a valid email in as your return address?

you could reduce the flow to 0 by putting

From: not_real@naimod.moc

and to be honest if I was an email harvester I might have noticed "user at domain dot com" and be harvesting those too

--
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter

Re:hmm, if you really are so clever by Anonymous Coward · 2003-08-10 22:20 · Score: 5, Interesting

Very good.
Speaking from experience, I know for a fact that many of the harvesting programs (written in perl, running on linux, written by geeks) are very robust at deciphering most email obfuscation methods. You all sit and shake your fists, and the spamware writers are laughing their asses off.
You have the easy answer: don't obfuscate your email, don't even bother putting it on your posts.
Re:hmm, if you really are so clever by Wilk4 · 2003-08-11 04:14 · Score: 2, Interesting

According to Why Am I Getting All This Spam? Unsolicited Commercial E-mail Research Six Month Report, most harvesters really *aren't* that smart, so even simple email address obfuscation and removal from websites can have a dramatic impact on how much spam you get.
The other good news from that study is that they show that spam does decrease after you remove your email address from websites... in other words, they don't keep the addresses as much as we generally believe. You aren't on every spammers list forever just because they get your address once.
Re:hmm, if you really are so clever by Lost+Race · 2003-08-11 14:47 · Score: 1

I'm not shaking my fists, I'm grinding my teeth. Then I sit back, close my eyes, and have a nice little fantasy about what I'd do if the spammer were right here in front of me.... Ahhhhhh......
In the immortal words of Vincent Vega:
I just wish I caught 'em doin' it, ya know? Oh man, I'd give anything to catch 'em doin' it. It'a been worth his doin' it, if I coulda just caught 'em, you know what I mean?

In related news by heli0 · 2003-08-10 21:44 · Score: 3, Informative

If you have ever signed up with the Direct Marketing Association's Mail Preference Service (list of people not to send junk mail to), but continue to receive stacks of crap every day, here is what you can do about it: Prohibitory Order

Links to pdf's you need to print and mail in included.

"A little-known Federal law allows individuals to send a Prohibitory Order against companies that are sending unsolicited sexually provocative or erotically arousing mail. The Supreme Court went one step further, allowing individuals to decide what constitutes "erotically arousing" mail. The law makes it illegal for a company to send mail to an individual within thirty days of receiving the Order."

"Postmasters may not refuse to accept a Form 1500 because the advertisment in question does not appear to be sexually oriented. Only the addressee may make that determination."

--
Whenever the offence inspires less horror than the punishment, the rigour of penal law is obliged to give way...

Everyone? by Jon+Peterson · 2003-08-10 21:46 · Score: 2, Insightful

"Support both Windows and Linux " ...
"The first requirement is because I wanted the results to be applicable to everyone"

My how the definition of everyone has changed. So it's bad luck Mac, Solaris, *BSD, HP-UX, VMS users...

--
----- .sig: file not found

Re:Everyone? by Peter+Harris · 2003-08-11 01:15 · Score: 1

Anyone running a system that can't handle Python (SpamBayes) or perl (POPfile)
doesn't really count.

--

-- What do you need?
-- Gnus. Lots of Gnus.
Re:Everyone? by netringer · 2003-08-11 03:04 · Score: 1

Support both Windows and Linux " ... "The first requirement is because I wanted the results to be applicable to everyone" My how the definition of everyone has changed. So it's bad luck Mac, Solaris, *BSD, HP-UX, VMS users...
I was on the team that discussed a corporate-wide email standard at a Fortune 50 behemouth. I was thinking towards using Internet standards with a choice of POP email clients and a *nix backbone. Silly me.
One of the requirements was that it had to be "Multi-platform. - It must support Windows 95, Windows 98, and Windows NT" To quote Dave Barry, I am not making this up.
So of course, Outlook / Exchange was the perfect answer.

--
Ever dream you could fly? Get up from the Flight Sim. I Fly

Something he misses about popfile. by CGP314 · 2003-08-10 21:49 · Score: 4, Interesting

One of the things I love about popfile is it is not a Spam filter. It is a general mail filter. I have about ten categories of mail that it sorts out for me. This also helps cut out false positives. 'Work', 'Personal', 'Friends' and all much more similar to eacth other than 'Spam'.

Re:great by halowolf · 2003-08-10 21:50 · Score: 1

And advertising is regulated, at least in my country.

How many adds do you see on television that are pornagrphic and targeted to children? How many penis enlargement adds do you see on television every day?

Thats right, in television there are standards to be upheld and when violated, there is usually a backlash against the individuals concerned, well at least in my country :)

Eh... by hendrix69 · 2003-08-10 22:02 · Score: 2, Interesting

POPfile really got shortchanged by this review. It serves as much more that a spam filter. I thought I'll give SpamBayes a try anyway but the Outlook plugin won't install on my XP machine. Some problem with an unresolved dependency in shlwapi.dll... boring. The point is, the SpamBayes site doesn't have a tech support forum where I can ask for help with these kind of problems.

--
The power of Christ compiles you!

Re:Eh... by spongman · 2003-08-10 23:12 · Score: 1

Try joining the mailing list. There's lots of very helpful people on there, and it's very low spam ;-)
Re:Eh... by hendrix69 · 2003-08-11 01:40 · Score: 1

Turns out that copying the shlwapi.dll file from the system32 folder to the application folder solves the problem of the plugin not being able to register it self. Just FYI.

--
The power of Christ compiles you!
Re:Eh... by mooman · 2003-08-11 08:44 · Score: 1

I agree.. the entire shortchange seems due to the fact that the author got some false postiives (ham ranked as spam). I find this really surprising, since in the year or so that I've been using POPFile, on hundreds of incoming emails a day, at about 85% spam/ham ratio (yes, that's about 6/7 are spam) and I think I got maybe *one* false positive in that whole year.. And even that one took me a couple readings to decide that it really wasn't spam. In pre-filter days, that one message would have stood a pretty decent chance of me just deleting it on a first read, so I'd assert the filter is no worse than human judgement.

So maybe the author just needs some tutorials about training or something.. Or maybe he mis-classified something (these *will* throw off your false positives and negatives, and need to be manually fixed generally).

And I even "cheated" on training my POPFile install. I have been saving spam in my eudora folders for a while and I just dumped them into a flat-file and bulk-loaded it into POPFile. I was getting 98% filtering success on day 1, and it only got better as I did some real training over the coming weeks.

So my note to anyone who read the article and is still waffling, don't just take one person's "experiment" as your benchmark.. find some people who have been using the tools for a while and solicit their opinions. POPFile radically changed my view of email, and made several spam-overwhelmed addresses useable again.

--
In the Portland, Ore area and like card games? Check out: http://groups.yahoo.com/group/portlandgames/
Re:Eh... by hendrix69 · 2003-08-11 11:24 · Score: 1

I agree.
Both programs seem to do the job almost equally well and each has it's own unique advantages that, if I'm not mistaken, were not mentioned in the review.
POPFile offers much more than spam filtering, it classifies email according to almost any criterion very successfuly. It's UI offers much more information than SpamBayes too.
SpamBayes is much more spam dedicated and makes it easier for people only interested in spam filtering to get the job done. It also has an Outlook (not express) plug-in which saves a lot of time.

I'm a long time POPFile user and today I switched to SpamBayes in order to give it a try (since I use Outlook). I advise everyone else to do the same thing and give both products a try.

--
The power of Christ compiles you!

Stopping it by Anonymous Coward · 2003-08-10 22:12 · Score: 0

The real solution would be to stop Spam from being sent, instead of stopping Spam from being received.
Now, that's not very easy, I guess. So to stop it from being sent, we stop it from being received so that the spam serves no purpose. Now this wont work until the vast majority starts using good spam filters. What we need is Bayesian filters in Outlook Express, so normal users uses it.

Why filtering isn't the solution by nuwayser · 2003-08-10 22:16 · Score: 4, Insightful

An analysis of filtering methods against spam is kind of like a comparison of bullet-proof vests in that there's no incentive to stop someone from pointing a gun at you and firing it. In the past, spammers have been grossly affected by more sweeping changes, and I'm afraid filtering methods are only creating the mindset of, "Give up, use this software, it will do the deleting for you." It takes the attitude of, "just delete the stuff" and makes it automatic; sure it's convenient for a time, but in a year you're still going to get spam and your ISP will likely have fewer resources to deal with the complaints.

I'm saying, why not focus instead on technology which puts a bigger dent in spammers' ability to operate, like how to secure against proxy hijacking.

--
"The cup... the drop... it's a YES!"

Spam is stealing. by jay-be-em · 2003-08-10 22:23 · Score: 0, Flamebait

Where do you think most of the spam out there comes from? Small business owners. Stop being unpatriotic and show your support for American small business owners.

--
"Orthodoxy means not thinking--not needing to think. Orthodoxy is unconsciousness." --Eric Blair

Re:great by Zog+The+Undeniable · 2003-08-10 22:28 · Score: 2, Informative

Yahoo uses captchas to prevent scripted sign-ups, so if you get anything from a Yahoo mail account, there was once a human (OK, a subhuman) at the other end.

--
When I am king, you will be first against the wall.

Re:Spam may won the fight against Bayesian filteri by Lumpy · 2003-08-10 22:31 · Score: 1

Yes the filters can safely be put on the server side.

simply let all the email through, but everything that is tagged as spam is prepended in the subject line as [spam] and now your users have to sort it out.

and by simply adding a harvesting email address pool to snag spam spam and only spam you can automate the addition of new spam rules.

works great.

--
Do not look at laser with remaining good eye.

cripples... maybe call it BORG-net! by Anonymous Coward · 2003-08-10 22:33 · Score: 0

cripples all cripples.
like adding electronic fuel injection/managment to a shitty engine.

nobody has the cash do fix the problem from the bottom up so just add crutches ... idiots.

i don't want to end up a borg because some much hyped about gen-manipulation went wrong and human-kind couldn't undo it!

how much processing time is it going to cost to filter all the emails sent around the world in 24 hours please? alot!

that this SPAM filtering MIGHT be covert operation to monitor email traffic doesn't seem to bother anyone (distributed CARNIVORE system).

oh hell, if carnivore is really monitoring all the email traffic why not give it the job of cleaning out SPAM?
but this is all nonsense anyway because fbi.gov can't even manage their own domain (hosted by a33.g.akamai.net [202.181.171.73]). talk about home land security!
i think the data on the fbi.gov website isn't even stored in america...

%|

Re:cripples... maybe call it BORG-net! by Anonymous Coward · 2003-08-10 22:41 · Score: 0

no kidding. i went to an internet shop/cafe and people acctually open outlook and sent emails from the internet-cafes domain (using the domain of the internet shop/cafe as return email address.)
i was really tempted to just SPAM these dummies for not knowing what they are acctually doing!

nothing i can do, it's sure. let's wait and see ...

simplest solution... by Lumpy · 2003-08-10 22:36 · Score: 1, Interesting

$0.04US charge for every Email SENT. Collage accounts can get refunded costs by delivering a sent mail list.

This will stop spamming quick... or at least make it slow way down.
1,000,000 spams = $40,000.00US more than the entier net worth of the most sucessful spammer.

--
Do not look at laser with remaining good eye.

Re:simplest solution... by jay-be-em · 2003-08-10 22:58 · Score: 1

Can you explain to me how you plan to enforce that scheme?
What keeps me from creating a new mail protocol that bypasses it all?
And do you honestly believe that we should be charged for data based on it's nature? ie should I be charged more for using bandwidth over a port used for email than over port 80?

--
"Orthodoxy means not thinking--not needing to think. Orthodoxy is unconsciousness." --Eric Blair
Re:simplest solution... by afidel · 2003-08-10 23:38 · Score: 1

Woohoo, and now when your PC gets trojaned and sends a couple million emails while you are at work YOU can be bankrupted! Just what I want, not. What is likely to happen if spam gets bad enough is that the 6-10 largest ISP's will agree to only accept incoming email from each others SMTP servers, if you aren't a customer of one of them then get a free account of maybe a cheap one time cost account. Users found to be abusing the trusted system with spam will be banned permenantly.

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
Re:simplest solution... by CrazyWingman · 2003-08-11 00:17 · Score: 1

Can I ask where you found the "entire net worth of the most successful spammer" statistic?

Re:great by Anonymous Coward · 2003-08-10 22:48 · Score: 1, Informative

hey - it pays for the radio too guys.. Just listen to radois 4 and 3 if you want a bit of quality.

Re:Spam may won the fight against Bayesian filteri by spongman · 2003-08-10 22:52 · Score: 1

SpamBayes only takes a day or two to get up to speed. After a week it's about as good as it's going to get.

My problem with spam by nuggz · 2003-08-10 23:02 · Score: 1

My problem with spam isn't that it exists.
My complaint is it shows up in my inbox.

My problem with violence is it could happen to me, not that it happens to others.

I think this is a typical and selfish attitude.

Challenge / response by Mandrias · 2003-08-10 23:16 · Score: 1

I believe the URL that you are looking for (there was just a story on it on the public radio station the other day) is:

http://www.mailblocks.com

Have a nice day :)

--
Use the Z-modem protocol between Information Superhighway routers to compress the plaintext. ~LordOfYourPants

POPFile is more than just a spam tool by rediguana · 2003-08-10 23:20 · Score: 4, Interesting

POPFiles utility does not lie just in managing the spam menace. To me, the real utility in POPFile is the ability to create x number of buckets and train it to sort your mail. SpamBayes looks great for spam but has no further utility. I like having POPFile sort my work from personal emails, and file all my mailing lists in another, and even jokes. Of course there is the spam folder that I check every now and then. I look forward to it being able to support IMAP servers as well.

Re:POPFile is more than just a spam tool by BradleyUffner · 2003-08-11 01:16 · Score: 2, Informative

I agree, I just discovered POPFile last week when it was shown on BBSpot. I use an exchange plugin called Outcast that allows POPFile to work over exchange also. I have several buckets setup to help sort incomming email into the correct folder for different projects and it works fantasticly. I've only been training it for about 3 days and it already sorts with almost perfect accuracy.

POPFile, and Outcast rock.
Re:POPFile is more than just a spam tool by topham · 2003-08-11 01:20 · Score: 2, Informative

I installed POPFile on my parents computers; I was worried because I thought the interface (web interface) would be confusing to them; since you couldn't do everything within the email client itself.

Works great. My father, who gets far more spam than the average person (why I don't know) has virtually 100% success rate.
Re:POPFile is more than just a spam tool by Anonymous Coward · 2003-08-11 02:25 · Score: 0

My father, who gets far more spam than the average person (why I don't know) has virtually 100% success rate.

Probably because he has pr0n subscriptions.

Ask Slashdot by imbaczek · 2003-08-10 23:31 · Score: 0, Troll

how to deal with lqhwquczzuk lqhwquniounxqs lqhwqusgwthsgn lqhwqumkzhtd lqhwquxuwdmvgxr lqhwqucslcyqxki lqhwquzytbktnxhqlqhwqurstlyagn lqhwquzaloqzq lqhwqumlkohoxfq lqhwqurjsnbjvagp lqhwquyjnfo lqhwquwxaqgnvlox lqhwqudxnht lqhwqurqrlqhwquzspjarube lqhwquuvryuc lqhwqukisuinib lqhwqurqkxans lqhwqufbjxrcgbrl lqhwqugqagax lqhwquhf lqhwqucluiinadcylqhwquhr.

Can some US people sue guys who send real junk like this? Or maybe ask them for a license for sending themselves email ($699 seems a reasonable price for a single inbox?)

Re:great by Afty0r · 2003-08-10 23:55 · Score: 2, Insightful

Actually, the rapid growth of endorsements, product placements, "documentaries" about products etc. means that you're really seeing far more than just 12 minutes of advertising, the only restriction is that you're limited to 12 minutes of OBVIOUS advertising.

SpamAssassin catches 99.99% of the SPAM i get. by thor · 2003-08-11 00:16 · Score: 1

SpamAssassin catches 99.99% of the SPAM i get.

i aggregate about a dozen email addresses through one Linux user (postmaster) and then filter and distribute the mail using procmail and SpamAssassin.

after tweaking it for the first month or so i have not had to mess with SpamAssassin's filtering.

i get the occasional false positive but my setup rarely lets a false negative slip through.

when i do get a false positive, i put the offender email address in a temporary SpamAssassin whitelist and send the message to a dummy email address (deSPAM) which de-spams the message before SpamAssassin passes it to the intended user.

email me if you want to learn more...

thor

Re:SpamAssassin catches 99.99% of the SPAM i get. by dnadig · 2003-08-11 01:24 · Score: 1

You just don't get the kind of spam I get. I was a dilligent user of spamassassin this last year. I spent plenty of time tweaking it. I was sailing along just like you, with spam assassin about 95% effective. Note that with my spam load on three mailboxes, this meant I still got a dozen or so spams a day, and a disturbing number of false positives. THen about three weeks ago, the floodgates openned, and I started getting 50 a day through spamassassin, and no matter how I tweaked, it just worked.

I switched to mailblocks. Spent 5 minutes uploading my address book. Haven't had a SINGLE spam through yet.

Challenge response is the answer. Not the LONG TERM answer, as posters have pointed out, but a perfect short term answer.
Re:SpamAssassin catches 99.99% of the SPAM i get. by stick-boy · 2003-08-11 01:34 · Score: 1

I use SpamAssassin too, and it works great. The one thing I'm worried about is some spam I've been getting lately that gets through the filtering. It has a bunch of random words, or even passages from textbooks among the spam. I'm guessing this is to fool the filters into thinking it's real. The part that concerns me is, what if I make the Bayesian filter learn that this is spam, will it degrade the quality of the filter? Since it's learning that all these ordinary words and phrases are spam, might it start missing more and more spam, and possibly creating more false positives?

~jason

pop3, bahhh by AssFace · 2003-08-11 00:50 · Score: 1

I have finally had to resort to something that scans post download - so I finally have to resort to the pop3 scanner of sorts.

I currently have at least 5 or so e-mail addresses, all of which just funnel down into a single address at this point.
But I am starting up an online company and need to add at least 10 more addresses (info@companyname.com, sales@companyname.com, etc).
I currently get just over 100 spams a day, and I am fine with that - I set the filters to be pretty restrictive and if I miss mail, no big deal. I have a small enough list of people that contact me that I add them to the whitelist and then *most* new people contacting me get through assuming what they are talking about is sufficiently non-spammy.

I am using SpamAssassin 2.60 and it is working well for me. I have tweaked the settings for my uses.

But since my company will have these web facing e-mails, and I really can't miss any of them since they are existing or potential new clients, I have to lessen the strictness of my spam filter.
As a result, the 1-5 e-mails that sneak through each week is going to increase just with the less strict settings, as well as with the increase of new addresses available and coming in.

What I like so much about SpamAssassin is that it runs on the server and therefore it yanks out the spam and I don't need to download it over my connection. It was fine while I was in the States and had a cheap and fast connection to the net.
But now that I am on a variety of connections and speeds, having to download 100 messages that are spam and THEN have them filtered out to find out that I just downloaded something, taking up bandwidth and time for naught, is really annoying.
I would say that well over 90% of my mail right now is spam - so getting rid of that before I download it is key.

That said, I know now that things are going to get through, so I need a client side pop3 filter.

I liked the idea of Cloudmark's SpamNet and so I've been giving that a shot. It is free for a month and it is easy to install.
I have been using it now for a few days - maybe a week at most.

I can't say that I have been particularly impressed with it. Of the 10 or so e-mails that get through each week (the filter is less strict now), it grabs 5-8 of them.
I of course would love for it to pick up on all of them.

That said, it is integrated well with Outlook, is easy to use, and the service is cheap once I have to pay for it ($4 a month I think).

I know there are totally free options out there, and I will very likely look into them at some point soon before committing to paying for SpamNet, but the ease of installation and usage is key to me.

I used to love the "fun" of toying with something and getting it to work. I liked it if it was annoying or challenging - I had time to do it and it made me feel like part of a group that knew what they were doing, and we were better than the slobs that couldn't get it working.
But now I'm very busy and actually do things with my time that make money - and my free time is getting increasingly sparse.
As a result, I just want things that work straight out of the box and always work.

--

There are some odd things afoot now, in the Villa Straylight.

Re:pop3, bahhh by dentar · 2003-08-11 01:37 · Score: 1

If you're starting a business then missing an email IS a big deal. If a client emails you expecting service and you don't see it you could lose them. Better put in a white list with ALL your client domains in it!

--
-- I am. Therefore, I think!
Re:pop3, bahhh by AssFace · 2003-08-11 23:25 · Score: 1

1) I said that missing an e-mail now is a big deal

2) future clients are unknowns - existing clients are on the whitelist

--

There are some odd things afoot now, in the Villa Straylight.

SpamPal by UpnAtom · 2003-08-11 01:04 · Score: 3, Informative

I did my own investigation of spam filters about a week ago. I didn't test the actual algorithms, just the features.
SpamPal with the add-on Bayesian filter (search Google for it) came out top. It works as a proxy and also provides blacklist/whitelist/known Spammer list checking.

Re:re : ducks deserve life tbh by Anonymous Coward · 2003-08-11 01:05 · Score: 0

What a twat.

Re:great by lone_marauder · 2003-08-11 01:11 · Score: 2, Insightful

OK, I'll bite on this troll just because it's still at zero, and the moderators need a reason to finish it off, placing it firmly in -1 hell where it belongs.

In the days before user-paid television service, it is true that advertising was the business impetus to put up huge powerful TV transmitters and undertake the other investmentss necessary to support land-based TV broadcasting. You are correct, therefore, in pointing out that TV content from 1977 derives from the business need to advertise.

But to suggest that the meager investments in bandwidth and hardware the average spammer makes is somehow otherwise useful to the world is absurd. When one considers that most of the infrastructure costs of spam are borne by the recipient rather than the sender, the idea of spammers contributing to the public good is assinine.

--
who are those slashdot people? they swept over like Mongol-Tartars.

It's virtually impossible to not get spam? by setien · 2003-08-11 01:24 · Score: 5, Informative

No it's not.
I get spam at the rate of 1 spam mail per 6 months or so. Or maybe even less. I can't remember getting a single spam email on my actual email address for about a year.

If you have an account on a crapless domain (i.e. not hotmail.com, msn.com, aol.com and the likes),
it all comes down to this very simple rule:
Do not, under any circumstance, have your email address posted publicly accessible ANYWHERE on the web.
It WILL get trawled. And then it will be spammed relentlessly.

If you have an existing address you don't want to give up, or an address at hotmail.com or a similar place, dump it.
Then exercise a bit of common sense about where you use your actual address.

I have a domain which catches email to unknown addresses and put them in my regular mailbox.
Whenever I have to give an email address to some place on the web, I use *domain-i-am-currently-visiting*@mydomain.com. So if I am visiting foobar.com, I would put in foorbar.com@mydomain.com.
I have been doing this for years. It enables me to see what was the source of the leak when I get spam on one of the addresses.
It has taught me one thing: I have never, ever, ever, in all my years of online shopping, forum posting etc, come across a single website that have ignored their own privacy statement. Ever. Even the slightly sketchy sites (like divx subtitle sites) don't leak addresses.
I was surprised to realize this.

The only addresses I ever get spam on are the ones I know to be publicly displayed on the web.

So it's that easy to avoid spam.

--
Give me liberty or give me kill -s 9

Re:It's virtually impossible to not get spam? by Aidtopia · 2003-08-11 03:31 · Score: 2, Insightful

There's one more ingredient to your recipe: get lucky.

It doesn't help when the spammers use a dictionary attack against your domain (aaron@domain.com, abigail@domain.com, adam@domain.com, ...). I guess your domain has never caught the attention of such spammers. Lucky you. They troll my domain on a regular basis.

Some of the published experiments that try to track the harvesters have found that short names near the beginning of the alphabet (like mine) are far more likely to get tons of spam. Other problems are needing to support addresses like "webmaster".
Re:It's virtually impossible to not get spam? by Control-Z · 2003-08-11 05:13 · Score: 1

You must not send e-mail to many novice computer users. I have a special alias address that I give to people over the phone because my real e-mail address is a little complicated to spell out. That alias address has never been posted online or used for anything else, but it still gets spam. My conclusion? Someone I sent it to got a virus/trojan program that harvested their address book. You just can't win.
Re:It's virtually impossible to not get spam? by Anonymous Coward · 2003-08-11 13:06 · Score: 0

That's reassuring to hear, but has anyone ever had their address harvested off one of those "Hi, I'm Bill Gates...giving out $1000 to everyone you send this to" chain forwards?
Re:It's virtually impossible to not get spam? by Lost+Race · 2003-08-11 14:08 · Score: 1

I defeat dictionary attacks by rejecting mail from any host that attempts to send to any common name at my domain. The common name list was generated empirically after the first few dictionary attacks hit me. E.g. bob@mydomain.com, joe@mydomain.com, tom@mydomain.com, etc. The dictionary spammers helpfully tend to blast a few dozen such names in a single SMTP session which makes it much easier to block them.
I've learned by this experience not to use a common first name as a permanent email address!

Re:great by John.Thompson · 2003-08-11 01:40 · Score: 1

I already pay for my email account. Why should I have to put up with unsolicited commercial email? Make email advertizing mandatory "opt-in" and I'll agree with you.

SpamBayes by Anonymous Coward · 2003-08-11 01:41 · Score: 0

I Just installed SpamBayes !

It looks great, but does any one have init.d startup scripts to start the pop3proxy ?

Blame the idiots that respond to SPAM. by momus_radar · 2003-08-11 02:13 · Score: 3, Insightful

This method of combating SPAM is amazing to me. Admitingly I'm a little behind the geek times so my interest in this method was peaked when Apple released Mail.app. But I still use Mac OS 9 and am in no rush to run X yet so I'm glad to see there are alternatives that I can use.

I think the only reasonable way to rid the world of SPAM is to get the foolish folk who respond to it to stop. The reason there is so much of it now is that it seems to work; there are people who actually respond to it. If these people stopped responding to it the use of SPAM would most likely diminish.

Sending SPAM costs money. No sence spending that money if no profit is made.

Pop filter plus webmail? by phildog · 2003-08-11 02:38 · Score: 1

I'm looking for a spam filtering solution that will work with my a)desktop client Eudora, b)webmail client, and with a c)Palm client--and maybe d)a cell phone down the road.

SpamBayes seems to do the trick for (a), where I can filter on the client, but how can I accomplish b) c) and d)??

Can you recommend a good webmail client for b)webmail? I played around with Squirrelmail and liked it, but have moved back to POP mail for the most flexible approach with good clients everywhere. That leaves me with Neomail, or some other you recommend...

And how about c)Palm and d)cellphone? The problem with most of the mark-message-with-new-header approach is that you are still downloading the Spam with the Ham and you are getting bandwidth charges for both. I'd like a pop filter that only returns the known good stuff (when I wish).

In the absence of any helpful responses, I will probably hack up pop3proxy.py from SpamBayes to make it do what I'm looking for.

Thanks in advance.

--
slashsearch.org - slashdot search. powered by google.

Re:Pop filter plus webmail? by Hurricane78 · 2003-08-11 03:40 · Score: 0

Could there not be filesharing for the regexes in eudora? That would be really useful. :)

--
Any sufficiently advanced intelligence is indistinguishable from stupidity.
Re:Pop filter plus webmail? by Anadelonbrin · 2003-08-11 17:22 · Score: 1

There was some discussion of (c) and (d) on the spambayes mailing list a couple of months back that you might want to read if you do this. I started work creating a "leave spam on server temporarily" option (based on header classification, which looks like it would work ok), but then got caught up with other things. You're welcome to that code if you want it (ask on the spambayes list).

The real reason SpamBayes wins... by Moryath · 2003-08-11 02:56 · Score: 4, Interesting

The "unsure" feature directly combats the latest Spammer technique -- filter poisoning.

You've all seen it work; the Spammers don't just send you the same spam once, they send you it 5 to 20 times, and they include a clipping from the headlines or something under their pitch.

They're not doing it to get that one mail past to you. They're actually HOPING that you classify all 20 mails as spam.

Why?

Because every time you classify that mail as spam, EVERY SINGLE WORD of that news clipping is "poisoned" inside the filter, and becomes an indicator of a spam. Then you turn around, and get an email from someone legitimate using those common words... and it gets wrongly classified too.

Enough false positives, and the spammers win, because they'll get you to turn the filter back off.

Enough is enough -- time to establish open hunting season on Spammers.

Fraud & deceptive sales are ALREADY illegal by swb · 2003-08-11 03:03 · Score: 1

Fraud and deceptive sales practices are already illegal, why not use those tools to diminish the spam problem?

Most spam that I get is for the sale of products that don't work (eg, penis enlargers), probably don't work (get rich quick), are part of an ongoing swindle operation (stock spam, which is likely pump-n-dump), or may violate other laws (cable descramblers, online pharmacies).

The people collecting the money for these products are the ones paying the spammers; if you can put a significant dent in these fraudulent enterprises, the spammers will lose business and some may move along to something else, and more power to you if you can implicate the spammers as accessories to the fraud; with a fraudulent businessman facing 5-10 in a Federal prison, they might easily roll over on their spammer friends for a reduction in jail time.

Focusing energy and legislative action to "ban" spam is fruitless if you don't eliminate the source of the spam. Deceptive selling is the problem, spam is just a tool for this.

The only problem with this that I can see is that deceptive selling is often considered a legitimate business practice in the US, and there's a lot of people that lie and cheat customers and only get rich. If we could have a little more stringent interpretation of fraud (ie, you have to tell the truth as the common person understands it), then we could easily go after these people and probably put a significant dent in fraud.

Re:great by Anonymous Coward · 2003-08-11 03:15 · Score: 0

Maybe in the good ol' crappy US of A. Here in Britain, we pay a LICENSE so that we can actually pay for programmes without being bombarded by advertising nonsense. Thank heavens for the BBC.

Short for Bayesian networks by gnalle · 2003-08-11 03:17 · Score: 1

The name refers to Bayesian networks.

A Bayesian approach to spam filtering

A simple mathematical introduction

SpamBayes Testimonial by Cytotoxic · 2003-08-11 03:37 · Score: 4, Interesting

As a network/web/computer manager, my email has been provided to dozens of companies and trade shows. I still remember the day (August, 3 years ago) when someone first sold my address to a spam list. I went from 2-3 spams per day to 15-20. This spring brought another explosion, this time into the 100+ range. I am currently receiving over 6,000 spam messages every month! Obviously my main email address was useless and needed to be burned on a pyre to purge the evil.
After a week or two of this, I installed SpamBayes in the form of it's outlook plugin. I showed it my email archive as my "good" messages, and a bunch of spam gleaned from my deleted folder as "bad". My mailbox is now perfectly clean. I have received at least 15,000 spam messages since installing SpamBayes, and I have probably had to hit the "Delete As Spam" button about 10 times for ones that it missed, most of those being variations on the Nigerian scheme. It has never grabbed a real message, and the "Unsure" feature localizes everything that I really need to look at in one place.
If you have a spam problem, get SpamBayes. It is that simple. There is no need to speculate about that better method that you thought up, or how it really won't work because of XYZ theory... it works almost perfectly, and it lets you know about anything that it is not sure about with the "Unsure" folder, so it never throws the baby out with the bathwater. In short, this is almost the perfect Spam filter. It even caught the emails that were using GIFs to avoid being filtered on content, placing them in unsure until I said "this is spam", after which I never saw another one. Pretty darned cool!
It is actually kind of fun to watch this thing work. I came in this morning to find 568 new messages in my spam folder, 3 in unsure, all of which were spam. No spam anywhere to be found in my inbox, just 15 unread messages that were correctly left alone by SpamBayes. Just imagine having to flip through 600 emails to find 15 real messages! Now I just hit "CTRL-A DEL" in my spam folder and it is all gone! 5 seconds a day to deal with spam, I can live with that....

Re:A new *law* is required by felis_panthera · 2003-08-11 03:41 · Score: 4, Insightful

Out of that 2.2 million people, somewhere near 700,000 are in jail from possession, use or distribution of marijuana. A law that was originally used to control migrant mexican workers has bogged down the american legal system to the breaking point. Imagine, 700,000 new cells open for child molesters, rapists, spammers, and SCO executives.

Wouldn't it be grand?

PS: Sorry about the OT, but things like this need to be said whenever the opportunity presents itself.

--

The chains are broken
Loki is free
Ragnarok is at hand...

Why Kill the Spammer? by Anonymous Coward · 2003-08-11 03:45 · Score: 1, Funny

Why kill the spammer, when you should be focusing on the idiot users that purchase shit from these guys. Kill 'em all! or was it Sue 'em all! I always forget these days!

Oops... by felis_panthera · 2003-08-11 03:47 · Score: 1

I got the link and figures wrong on that last post.

As of 2001, # of Americans (only americans, this says nothing sbout the rest of the world) arrested for marijuana related charges since 1965: Over 11 Million

--

The chains are broken
Loki is free
Ragnarok is at hand...

Knowspam by KermitAndLadyHoliday · 2003-08-11 03:52 · Score: 1, Informative

No one appears to have mentioned Knowspam yet. 100% spam blocking. No filters. Just a simple "prove you're human" auto-reply sent to the sender and a "friends" list. http://knowspam.net/

Re:Knowspam by dhaines · 2003-08-11 05:59 · Score: 1

No one appears to have mentioned Knowspam yet.

Maybe that's because the article is specifically about Bayesian filters.
Re:Knowspam by KermitAndLadyHoliday · 2003-08-11 07:10 · Score: 1

In its essence, the article concerns itself with fighting spam, not just Bayesian filters for the sake of Bayesian filters. Likewise, the resulting forum has evolved into a discussion about reducing or ridding ourselves of spam.

Knowspam is an excellent solution and a good alternative to filtering: 100% effectiveness as opposed to writing 98%-effective algorithms that spammers will eventually work around.

If I wrote an article about curing world hunger with dog food and used the article to compare several popular brands, would you be offended if someone piped in to suggest an alternate solution?

Would you say, "Wait, the article is about dog food!!!"?

Re:great by Chasqui · 2003-08-11 03:53 · Score: 1

Television model: advertisers via commercials (spam) pay for the programming I see, subsidizing my costs to watch. Snail Mail model: advertisers pay the post office to send their bulk mail (spam), subsidizing the cost for me to send and receive mail. Internet model: Spammers have a free ride. I pay to receive their crap. Want to get rid of spammers? Not likely - we have their equivalent in other media. Want to reduce it? Make them pay. Unfortunately there will always be some fool who thinks the herbal viagra will work, or that Munbumi from Nigeria is going to transfer him 20% of $22.5 from the Nigerian reserves. Spammers will always want to get to these people. Right now it costs them next to nothing.

--
my cube has a window...

MIMEDefang + SpamAssassin + Razor by wytcld · 2003-08-11 03:58 · Score: 3, Informative

SpamAssassin has Bayesian learning, which I have running but not for long enough to test. I recently set up MIMEDefang as a Sendmail milter calling SpamAssassin (which calls Razor). This setup allows Sendmail to reject e-mail beyond an arbitrary SpamAssassin score. The remote mail daemon is informed the mail cannot be delivered.

Setting that score at 8 has resulted in no false positives over a week (I log From and Subject information - it's all obvious spam). Then stuff that scores between 5 and 8 I divert to a separate mail box, which I comb through every day or two. There have been two false positives that ended up in that over the week. This is with hundreds of e-mails for a half-dozen users coming in a day. I also end up, with this setup, with 2-4 spams making it through to my own mailbox (the bussiest on the system). These are, because of the filtering, the least obnoxious, and easily enough report to Razor to spare others. Meanwhile, I like to keep a window open to the mail server running "tail -f mail.info | grep REJECT" and watch a dozen or so attempted spams an hour refused acceptance with a message like "554 5.7.1 SpamAssassin score of 15, rejected" back to the origin, which is enough that if it wasn't spam any good mail daemon will inform the sender, and they can find another way to get through.

Even if this gives spammers a clue about ducking SpamAssassin, the spams that can get by it are by far the least obnoxious. I look forward to seeing if the Bayesian feature helps (it feeds itself anything ti scores at over 15 by default). But it's a pretty good system short of that. If it became standard for ISPs to reject all mail with a SpamAssassin score of 8 or higher, the loss of legitimate communications would be exceedingly rare, and politeness standards would be encouraged.

--
"with their freedom lost all virtue lose" - Milton

Mail.app, remark on graphics by dr2chase · 2003-08-11 04:34 · Score: 2, Interesting

I was more than a little disappointed to see that Apple's Mail.app was not included in the comparison. It wouldn't surprise me in the least if it were already the most widely used Bayesian spam filter. Unsurprisingly, it is also very easy to use.

Mail.app also combines Bayesian filtering with the Address book -- any mail from a known correspondent won't be tagged as Junk. This reduces the risk of false positives. This is an integration cheat not available to stand-alone spam filters, because Apple supplies the Address book app and provides other integration between the two applications. But, (as a self-centered end-user) I don't care that it is a cheat, I am merely happy that it all works well. (And I cross my fingers and hope that somehow, Apple's C/C++/Objective-C programmers are less prone to leaving buffer overflow holes than Microsoft's programmers clearly are.)

The author needs to read Edward Tufte's books on presenting information (e.g., The Visual Display of Quantitative Information).

Re:A new *law* is required by maxume · 2003-08-11 04:37 · Score: 1

You are not really helping your argument by horribly misquoting the numbers from your own link. 700,000 is the number of marijuana related arrests in 1997. If you read further into the page, it concludes several different things, but it lists 15,400 as the number of people who are in jail/prison only for marijuana posession. The largest number they list as being related strictly to marijuana, which includes all marijuana related offenses, is 37,500. They leave themselves lots of wiggle room on all of their numbers, but I bet they would say that they are confident in them to +-30%. We are left with many less than the 700,000 cells that you would let us believe could be opened.

In short, we are left with lies, damn lies, and what you said.

--
Nerd rage is the funniest rage.

Whoops, missed one by SpyderFan · 2003-08-11 04:37 · Score: 1

In my humble opinion, the best one was missed. Spam Sleuth from Blue Squirrel makes Bayesian just a feature of a much larger anti-spam solution. If you want just Bayesian, you can shut off all the other capabilities. That seems to be to be sort of like taking the spark plug wires off of 5 of your 8 cylinders.

Spam Sleuth also does all the other things like Whitelists, Blacklists, RBL, Challenge-Response (Turing), etc. It combines the results to determine "spaminess" and takes action.

Another advantage of Spam Sleuth is that it begins working without Bayesian, until it can build up a set of messages it can use for training. It also lets you correct any mistakes before training so you don't get a bad statistical data set.

It is naive (no pun intended) to think that Bayesian will be able to perform better than a multi-view solution.

Re:great by bafu · 2003-08-11 05:19 · Score: 1

The difference is very straightforward (which is why you are getting modded as a troll). Advertising on the TV is supposed to be covering (or, at least, defraying) the cost of production of the programming. IOW, the more ads there are, the cheaper it is to provide access to the programming. Spam, on the other hand, dramatically increases the load on networks, mail servers, storage arrays, and user mailboxes and the spammers do not have to cover that cost. IOW, the more spam there is, the more expensive it is to provide Internet access.

This difference flows out of the other big difference: spam exists because of a loophole (the trusting design of SMTP) and not because someone in the supplier and consumer chain took some sort of extra step to allow it to be there. As a result, there is no option of charging the spammers to cover the cost of their spamming. So, until the email delivery infrastructure is made less implicitly trusting, spammers will have no incentive to keep stop abusing it.

You really really just don't get it by frovingslosh · 2003-08-11 06:00 · Score: 1

Whereas I sent out about 2000 messages last year, so I'd only have had to pay $4 for the whole year, even if 100% of my messages were "unwanted". I'd pay $4 a year to make SPAM go away.

Micropayments would in no way stop spam, but they would cause cost and problems for honest users.

Where would those micropayments go? To the ISP? The one that is providing a haven for the spammer? To the spammer itself when it is acting as it's own ISP. In either case the micropayment would be a farce, certainly any service provider who is working with a spammer could wave the micropayments if they are already letting a spammer on the Internet in the first place.

But more importantly, micropayments will ruin some current valid used of e-mail and prevent some future ones. Think of the many mailing lists that are run by very low budget groups to communicate with hundreds or thousands of members. These would be destroyed if each time they were to send out a message to all members they invoked a micropayment on every single e-mail. Heck, Slashdot sends me an e-mail when someone responds to one of my posts; how long do you think that nice service would last when Slashdot started to have to pay a micropayment for every single response that was posted??? I would certainly see that this would completely kill any new uses of e-mail as well. Imagine a school that was considering using e-mail to keep the parents more current on their child's ststus; perhaps weekly or even daily report cards rather than end of the quarter suprises. It would be a system welcome by many partent, and could be easily done if integrated into an electronic grading system, but would be killed instantly if there were a unit cost on sending e-mail, even a small micropayment. There are lots of other useful things that e-mail might do for us in the future; do you really think we should give them up an adopt a payment system that really isn't going to stop spam in the first place?

And I still maintain that it will not stop spam, even if the payments are not to the original ISP. Many spammers manage to inject their spam into the net through badly configured servers or other exploits. While I'm all in favor of making someone who improperly sets up a server pay for letting spammers have a doorway to the net, I'm not as likely to advocate a system that encourages more trojans and backdoors so that spammers can pass their costs on to unsuspecting in-duh-viduals. There are simply better ways to fix the problem (by changing the fundamental flaws in SMTP) than by approaches that will harm valid e-mail uses.

--
I'm an American. I love this country and the freedoms that we used to have.

Re:You really really just don't get it by Teach · 2003-08-11 08:32 · Score: 1

Think of the many mailing lists that are run by very low budget groups to communicate with hundreds or thousands of members. These would be destroyed if each time they were to send out a message to all members they invoked a micropayment on every single e-mail.

You missed a crucial part of my original post. I suggested that the microcharge would only be incurred if the recipient didn't have the sender on their TDMA-like whitelist. It's opt-in with teeth.

And give the micropayment to the recipient, for all I care. That way if some guy can convince tens of thousands of strangers to email him, he gets rich quick. But he doesn't pay a dime to email his friends, nor they him. And mailing lists wouldn't have to pay to send to as many millions as they want, as long as they've all opted-in by adding the mailing list to their own whitelists.

The idea has its problems, but this is not one of them. Now, spammers using a trojan to spoof the sender is a legitimate concern, but it's not a new concern; they're already doing that.

--
Graham "Teach" Mitchell, computer science teacher, Leander HS
Re:You really really just don't get it by frovingslosh · 2003-08-12 06:59 · Score: 1

I suggested that the microcharge would only be incurred if the recipient didn't have the sender on their TDMA-like whitelist. It's opt-in with teeth.
And give the micropayment to the recipient, for all I care.
Realistically, a legitimate mailer can not count on all (or even most) of the people who have requested that they be put on a mailing list (or otherwise supplied an e-mail address for a valid reason) will have added that sender to their white list. Heck, your "give the micropayment to the recipient" virtually gaurantees that, the effect of not adding them to your while list would be that you still get the mail you wanted and you get paid. But even if the payment went somewhere else, lots of whitelists wouldn't be updated and it would still have enough of a negative effect on legitimate uses of e-mail to kill some good valid uses. Why not just fix SMTP or replace it with something that is spam resistant? There are a lot of things that can be done to improve SMTP that cost the users nothing, and I believe most users would quickly adopt spam resistant mail systems.
Consider the following changes: Make the sender store the outgoing messages on their system until picked up, with only the header being sent as the initial message. Make sure the sender provides a real and valid reply to or other contact address (as the above system does), and maybe even tie that into geographic locations (preventing US spammers from sending through China or Argentina, or at least giving recipents an easy way to block such spam). Pass some laws against spam like we have against junk faxes (which have been very effective), don't just have those who hate spam stick their heads in the sand with filters and let the spammers continue to prey on the masses.

--
I'm an American. I love this country and the freedoms that we used to have.

Missing poll option by Pac · 2003-08-11 06:16 · Score: 1

Ban their original networks
Throw them in jail
Kill them
Fine them 0.01$/email and improve third world infrastructures with the money.
Filter/Ignore them.
All of the above

pop3proxy.pl by antwan · 2003-08-11 06:20 · Score: 1

another good solution is pop3proxy.pl
it works like pop3proxy.py from spambayes but uses spamassassin for checking.
so you benefit from all the spamassassin checks + its bayesian classifier.
works with windows too.

http://mcd.perlmonk.org/pop3proxy/

Web-based email service with Bayesian filtering by oldfellow · 2003-08-11 09:43 · Score: 1

As someone tired of receiving hundreds of pieces of Spam per day, I was overjoyed to find www.123mail.net . This is a great (albeit paid) alternative to Yahoo or Hotmail and for $24 per year you get a POP3/Webmail account with 15MB storage with no spam or advertising. Their filterng method is based on an incrementing point system which includes: Bayesian Classification , white-listing, Heuristics , Automatic Fingerprinting (DCC), Black-lists (known spammer, IP blocks, experiential), and user White-lists. A combination system is much better than a single method IMHO and I have seen the results first hand. I have yet to have a false positive and they have done all the Bayes training already. They have a handy web-based interface for reporting false-positives and re-delivering the mail if it makes any mistakes. 123Mail is actually owned by The Electric Mail Company www.electricmail.com and this same spam filtering solution is also deployed in Fortune 1000 firms. Ok this wasn't meant to sound like a plug, Bayes has just made me a believer.

Bayesian 5 third season opening credits by Dhraakellian · 2003-08-11 10:56 · Score: 2, Funny

The Bayesian Project was our last, best hope for peace.

It failed...

But in the year of the Spammer War, it became something greater: Our last, best hope for spam-free inboxes.

The year is 2003, the place: Bayesian 5.

--
I've read Grocklaw. BoycottNovell, you're no Grocklaw

I had to TROLL for spam! by pair-a-noyd · 2003-08-11 11:26 · Score: 1

I use a very bizzare email address, it's like a 24 character hex number@myisp.com

NO auto-spammer will *EVER* guess it or accidently stumble into it, ever. To this date I've never received a single spam message on my REAL email addy..

Many spammers just take dictionary names and add numbers to them, auto-incrementing the numbers.

Like bob333@aol.com then bob334@aol.com, etc..
Not to hard to figure that method out. If you use some dopey addy like that you are going to get spammed, sooner or later. It's just a matter of time before they hit your combo.

Now, being that I get ZERO spam, and that my dad and my friends get a virtual flood of crap everyday, I created a troll addy that is fully functional, see the addy I have listed on my post (it's real) and I splashed it all over usenet and forums everywhere. I've been training and tuning and learning to control and tweak the various spamassassin type filters so that I can be proficient in it.

Now that I've trolled up thousands and thousands of various crap spams, I can take care of my folks and my friends.

Once I have it down pat I'll delete the troll addy and enjoy a spam free life. With spamassassin installed and tweaked, and going back to using my hexidecimal addy I can live a spam free life.

If spam pisses you off, use bizzare addys and bayesian filters. That's the ticket, trust me..

Re:great by sootman · 2003-08-11 12:56 · Score: 1

Not when I'm *paying* for my ISP and email, you fucking idiot!

--
Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.

Re:great by sootman · 2003-08-11 13:03 · Score: 1

No sense mentioning I'm already *paying* for my email account in the first place! Fucking idiot. (Not you, dnj, the parent.)

--
Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.

Perhaps theres nothing wrong with spam by jechonias · 2003-08-11 13:26 · Score: 0

I know this sounds like a troll but consider the following:

1) Americans probably get more t.v. advertising than anybody else on the planet, yet slashdoters (apart from tivio style conversations) rarely complain about t.v. adds

2) If you saw 10 advertisments per ad-break and had an ad-break per 10 minutes of viewing time, thats 60 ads per hour.

3) I read recently that the average american watches between 4-8 hours of t.v. a day. (thats increadible, where do they get time to crap and eat and work etc?) which translates to 8*60 which is 480 adverts per day.

thepaytons and msnbc

4) very few of us get 480 spams per day.

The solution isn't to complain or to sue or to punish any advertisers (whilst there are purchasers willing there is always an incentive to spam)

In fact maybee like the bafta awards for commercials immediate future
where tv shows case the best advertisements (funny, sexiest, shockingest etc etc) perhaps spam will mature into stuff we tolerate and maybe even laugh at.

People say that the spammers should be made to pay, or that we should charge for email so that spammers would stop or slow down. Well last time i checked they do pay for email. Sure fat pipes with pre-paid gig bandwidth's is v cheap, but its still bought and paid for.

People say that everyone hates spammers, yet that simply isn't true. No matter how much geeks hate spammers, there are more customers out there willing to buy dick extenders, boob enhancers and the staying power of a donkey!

The correct thing to do with spam is to wait for the market to mature, and silently use technology to strip it out whilst we can.

Whilst we are on the subject of unsolicited advertising, consider junk-mail, bill boards, video previews, movie theater advertising, sports brand promotions, corporate sponsorships etc etc etc.

Advertising and spam is here to stay. Stop whining, and accept it. The vast majority of the world simply does not hate spam as much as the average slashdotter.

Jech.

Can I sue them by bluGill · 2003-08-11 14:32 · Score: 1

IANAL, but can't I sue some of the spammers? Not the mortgage ones, but the penis size ones should be no problem. Isn't there some sexual harrasement law I can apply? I get enough of it a day that I can't imangine anyone would not consider it harrasement. In fact the only juriers I can imangine are those without computers, and I have this idea that many are "little old ladies who only drive their car to church on sunday" and would want to throw the book at anyone who is "degrading socity" in that way, even if there is nothing illegal.

How can I track down who is sending me these things, and then where can I find a lawyer to take the case?

Re:You wanna start a Union? by jaymz666 · 2003-08-11 15:08 · Score: 1

No, if you miss a few it doesn't matter at all. It only matters if you misclassify them. Using POPFile it only updates the corpus on a deliberate classification.

A rather lame comparison by Rick+Zeman · 2003-08-11 15:29 · Score: 1

That's kind of a lame comparison with a lame set of requirements: has to run on both Linux and Windows.

My Bayesian of choice, SpamSieve, is directly hooked into my POP program, Mailsmith 2 on OS X. It sucks all the mail from all the accounts down, transparently adds an isSpam or isNotSpam property to the email and then Mailsmith dumps it into the spam folder if it is, or deals according with them if not.

After going through maybe 20k messages, I've had one false positive (good mail marked as spam and that was very early on its training) and running an overall rate of 97.5% including when it was being trained.

Bah to "Windows or Linux only." Even Apple Mail seems to be a better solution than what was previewed. A web interface? Give me a break.

yes it's anti-viruses sorry by FoeNyx · 2003-08-11 21:17 · Score: 1

It's a strange language :P

1 virus
2 virii
3 viriii
4 viriv ...
10 virx ...
1001 virmi

I prefer this flavor/flavour

Content Filters -- bah! by djlewis · 2003-08-11 22:53 · Score: 1

Any filter based strictly on message content is all but useless in the long run. Why? Three reasons: false positives, false positives and false positives!

If there is a reasonable chance of losing even one real message, then I have to comb the filtered messages anyway, no matter how they got segregated. So absolutely nothing is gained in the end.

For an example in the extreme, what if a good friend forwards you a particularly juicy piece of spam with a commentary to make some point? Bam! Any content-based filter will rate it high and filter it. Message lost... unless you comb the rejects anyway.

Is there a filtering method with no reasonable chance of false positives? Yes, actually, the bait account, distribution-based, signature filtering represented well by BrightMail (I have no affiliation with BrightMail). That approach actually uses the very definition of spam, namely ~unsolicited~ mail sent by strangers to large numbers of recipients, plus blacklisting.

BrightMail claims false positives are 1 in 100,000, but it's probably even smaller than that. Even 1/100,000 is small enough that I don't feel a need to scan the filtered messages for false positives.

And, if you are unhappy with the less-than-100% filtering of something like BrightMasil, then you can apply other methods as well. At least you'll have less purported junk to scan for false positives.

Actually, the best combo would be an automatic whitelist acceptance (anybody you've ever mailed to or accepted mail from) followed by BrightMail (or equivalent) followed by a good content-based filter. Why nobody's done this yet is beyond me!

I sure wish there were a consumer version of BrightMail.

--David

Re:A new *law* is required by bcwengerter · 2003-08-12 07:22 · Score: 1

I know that this will be buried since this story is already a day and a half old, but I figure this, too needs to be said.

Out of that 2.2 million people, somewhere near 700,000 are in jail from possession, use or distribution of marijuana.

That's just wrong, even according to the site which the above poster quotes. Here's the relevant lines:

There were more than 700,000 marijuana arrests in the United States in 1997...Calculations based on recent BJS reports suggest that, at any one time, 59,300 prisoners charged with or convicted of violating marijuana laws (3.3% of the total incarcerated population) are behind bars, at a total cost to taxpayers of some $1.2 billion per year.

Okay, so to review:

The 700,000 number is the number of arrests, not those incarcerated.
The actual number, based on the estimates, is much smaller.
Also keep in mind that this number is from a 1997study. I don't know what the current numbers are, but they may or may not be different.

That's all for now...

Use Spam Bait Addresses by billstewart · 2003-08-13 14:06 · Score: 1

Since you're already doing a bit of work, and running a teergrube, and you're probably also running a domain or subdomain for your mail, it's worth adding a bunch of spambait addresses and putting them out on the web to attract spiders, and also using them to catch dictionary attackers.

So if you don't have any users name Aaron and Alice and Viagra and Zebra, the only people trying to reach them are dictionary-attack spammers and people who found their email addresses by running harvesters on your web site (or alternatively, running harvesters on Google after searching for useful phrases, so make sure you've got a lot of attractive-nuisance words like bulk email and multi-level marketing and such and some meta comments that'll help attract the search engines.)

If you want to also hack your DNS so that it gives different answers depending on who's asking, you could set things so that any DNS requests coming from an address on the less aggressive RBLs get handed the address of your teergrube, or 127.0.0.1, or the address of some other open relay.

--

Bill Stewart
New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks

Slashdot Mirror

Comparison of Bayesian POP3 Spam Filters

326 comments