Researchers Create Highly Predictive Blacklists

Not really that "predictive". by khasim · 2008-07-23 14:40 · Score: 4, Informative

They take X firewall logs ...

Then they look for matches in attacking IP addresses between the logs ...

And if any IP addresses appear in log A (which is very similar to log B) ... then those IP addresses are "predicted" as being possible to attack the firewall from which log B was obtained.

Logical - yes.
Predictive - no.

Re:Not really that "predictive". by twatter · 2008-07-23 14:47 · Score: 4, Insightful

I agree, but the key here is to ensure that there are no false positives, which have been traditionally the biggest problem with blacklists.
If they figure that out, I don't care what kind of statistical approach they are using, as long as it works.
I think someone from MIT (maybe three or four years ago during the height of the problems with Spamhaus?) tried this before, but I don't remember if it got anywhere. Maybe this is an ofshoot from that.
In the meantime... SpamAssasin with whitelists, which is the best of worse worlds.
Re:Not really that "predictive". by Mad+Hughagi · 2008-07-23 15:24 · Score: 2, Insightful

It's pretty easy to get false positives depending on how you configure SpamAssasin.

--
UBU
Re:Not really that "predictive". by elnico · 2008-07-23 15:25 · Score: 5, Informative

Logical - yes.
Predictive - no.
So if this isn't predictive, what is? Would you rather they develop an algorithm that identifies blacklist-worthy addresses before they make their first attack?
The application of this algorithm actually seems pretty clever. It captures the fact that "true" attackers mostly attack "true" (that is, weak or high profile) targets, whereas those targets are mostly attacked by "true" attackers. Thus some isolated attack by a never-before-detected attacker on a never-before-attacked target has very little predictive potential in the eyes of the algorithm, whereas even just a few attacks by a never-before-seen attacker on several oft-attacked targets raises a huge red flag.
Re:Not really that "predictive". by Zadaz · 2008-07-23 15:33 · Score: 1

So if this isn't predictive, what is?
I don't know, but it isn't the service called Highly Predictive Blacklisting that the article is about.
Re:Not really that "predictive". by Joebert · 2008-07-23 15:33 · Score: 0, Redundant

So if this isn't predictive, what is?

Failure.

I like to play it safe and assume nothing works.

--
Wanna fight ? Bend over, stick your head up your ass, and fight for air.
Re:Not really that "predictive". by LostCluster · 2008-07-23 15:34 · Score: 4, Insightful

That worked back in the say when you could say "Syracuse Unversity's gotten hit with the latest worm. So, don't trust any mail that comes from 128.230.x.x." but these days mail comes from one address per organization or household. Most corperations expose only one mail server IP address to the world, and some smaller companies have hundred-user systems and only one IP to show for it. So, who you're next to doesn't hold much water in predicting whether the message is spam.
Re:Not really that "predictive". by kamochan · 2008-07-23 19:11 · Score: 2, Insightful

So if this isn't predictive, what is? Would you rather they develop an algorithm that identifies blacklist-worthy addresses before they make their first attack?
I invented just such a thing. I blocked the entire comcast network and a couple of big Chinese ISPs in my DSL firewall. Reduced ssh login attempts and spam significantly.
Predictive - very.
Collateral victims - nobody I'd care about.
Re:Not really that "predictive". by Anonymous Coward · 2008-07-23 21:19 · Score: 0

I see this putting another significant dent in the already dwindling number of proxies on the internet. Only an idiot attacks a remote server while exposing their personal IP, so really all the predictive nature of these logs can do is block proxies. If it is going to be preemptively blocking potential targets; that's another story, but we already have lots of server code that takes multiple concurrent requests from the same IP and kicks it for a predefined period. If one wanted to they could make it an outright ban and accomplish the same thing. Trying my best to avoid a Bush parallel; I see no point in predicting an attack if there isn't going to be some sort of automatic response. You going to just throw these IPs into Gitmo and forget about them?
Re:Not really that "predictive". by rocketman768 · 2008-07-24 00:44 · Score: 2, Interesting

What the heck does "highly" predictive mean?

"Honey, the weatherman is on and he is highly predicting some storms in the evening."

Maybe "highly effective" prediction?
Re:Not really that "predictive". by Anonymous Coward · 2008-07-24 04:13 · Score: 0

Therefore whitelists to counter... not ideal, I know, but what else to do?
Re:Not really that "predictive". by spazdor · 2008-07-24 05:01 · Score: 1

What I want to know is whether this HPB protocol itself is vulnerable to attack. Can I spoof a few packets at a few common targets and get some sucker blacklisted by half the Internet? Can I run 30,000 virtualized HPB nodes and use them to stack the deck with maliciously generated logs?

--
DRM: Terminator crops for your mind!
Re:Not really that "predictive". by Joebert · 2008-07-24 10:37 · Score: 1

Resistance is futile !

--
Wanna fight ? Bend over, stick your head up your ass, and fight for air.
Re:Not really that "predictive". by rtb61 · 2008-07-24 13:26 · Score: 1

You want effective predictive. How about ISP supplied, white lists of good addresses with IPv6 coming on stream attmetping to blacklist billions of addresses will be a pain. Of course when a ISP supplies contaminated white lists range block the ISP until they behave.

--
Chaos - everything, everywhere, everywhen
Re:Not really that "predictive". by Mad+Hughagi · 2008-07-24 17:38 · Score: 1

First thing is to upgrade the version of SA you're using, then configure it better (install good rule-sets), train bayes, in that order.
I have accounts on servers who have different policies/versions, and have experienced no (important!) false positives on one and had to whitelist on the other.
Convincing the people sending SPAM-ish looking mail to do otherwise could also help, rather than just accepting it:
http://wiki.apache.org/spamassassin/AvoidingFpsForSenders

--
UBU
Re:Not really that "predictive". by Anonymous Coward · 2008-07-25 01:55 · Score: 0

In the meantime... SpamAssasin with whitelists, which is the best of worse worlds.
Only accept encrypted email.
Let the spammers waste more CPU time if they want to stay in busines....

Hmm... by FlyingSquidStudios · 2008-07-23 14:50 · Score: 3, Insightful

This sounds ripe for abuse. For example, a heavy censorship nation like China could use this to block critical sites that they claim are 'attacking' them far more efficiently than their current human-based censoring.

--
http://twitter.com/OLDTELEGRAM

Re:Hmm... by elnico · 2008-07-23 15:05 · Score: 2, Interesting

Somehow, I doubt identifying "troubling" sites is the limiting factor in Chinese internet censorship. More likely, the things holding back the censors are international pressure/attention, circumvention by their people, and the censors' own sense of decency, if that exists.
Re:Hmm... by tukang · 2008-07-23 15:10 · Score: 2, Insightful

This sounds ripe for abuse. For example, a heavy censorship nation like China could use this to block critical sites that they claim are 'attacking' them far more efficiently than their current human-based censoring.
How is it more efficient for China to tell this software that a particular site is 'attacking' them than to block the site at their great firewall and be done with it?
Re:Hmm... by LostCluster · 2008-07-23 15:38 · Score: 1

Just as abusive as "We've noticed a spammer keeps registering with ServerFarm.net, let's block their entire network space!" but human blacklists do that already today. Sounds like this is just automating the process.
Re:Hmm... by Joebert · 2008-07-23 15:39 · Score: 1

They could poison the destination lists so they think China is attacking them, thus blocking China. Remember, Chinas' firewall works ass-backwards.

--
Wanna fight ? Bend over, stick your head up your ass, and fight for air.

Probably a bad idea. by Jane+Q.+Public · 2008-07-23 15:12 · Score: 4, Insightful

The problem with ANY "predictive" statistics (like racial profiling, for one glaring example) is that even when they become accurate enough to produce useful information, they tend to produce too many false positives.

And often (again using racial profiling as a good example), even a few false positives are too many.

Re:Probably a bad idea. by elnico · 2008-07-23 15:31 · Score: 1

And that's the point! This algorithm is clearly designed to avoid false positives. And it's obviously much more complicated than your standard racial profiling algorithms ("If he's black, he's got crack in the back!"). Just look at the page rank algorithm. It can "predict" that a page will be relevant with minimal input from a human, and it's often very accurate.
Re:Probably a bad idea. by Wildclaw · 2008-07-23 16:12 · Score: 1

insurance companies to pay equal retirement benefits to women as they do to men
I don't get what this has to do with the grandparent post. There are no such thing as false positives in insurance payouts.
Re:Probably a bad idea. by caffeinemessiah · 2008-07-23 16:22 · Score: 2, Insightful

The problem with ANY "predictive" statistics (like racial profiling, for one glaring example) is that even when they become accurate enough to produce useful information, they tend to produce too many false positives.

That is an overly general statement. If that were the case, we wouldn't have any reliable spam filters. There are many statistical methodologies (including ensembles of methodologies) applied carefully to different types of domains that produce excellent and usable false positive rates. Indiscriminate use of statistical learning, and the subsequent failure at a given task, does not invalidate the method or its applicability for the task.

--
An old-timer with old-timey ideas.
Re:Probably a bad idea. by silentcoder · 2008-07-24 00:36 · Score: 2

>There are no such thing as false positives in insurance payouts.
Fake your death - bag the life-insurance ?
And there could be false negatives too:
"We regret to inform you that we will not be paying out your husband's life insurance as people of 25 have a high likelihood of still being alive"
Joking aside: what about a man whose insurance payout is not made because the insurance company (incorrectly) believes he committed suicide ? Is that not a false negative ?

--
Unicode killed the ASCII-art *
Re:Probably a bad idea. by Wildclaw · 2008-07-24 01:38 · Score: 1

Is that not a false negative ?
True. The suicide example is a good one. I agree that my statement was too broad. Still, the original poster's statement that I was replying to was giving any even worse argument. Only a slight excuse though.
Thanks for coming up with a good example of why I was wrong. :)

Babies out with the bath water. by LostCluster · 2008-07-23 15:22 · Score: 5, Insightful

This isn't going to work in the real world. Too many users you want to hear from at an ISP won't like it when the virus-victim spammers gets their whole network preventatively banned.

Stop fixing the mail protocols we have today. It's time to replace with some form of sender authentication.

Re:Babies out with the bath water. by Jurily · 2008-07-23 16:19 · Score: 1

Stop fixing the mail protocols we have today. It's time to replace with some form of sender authentication.
That still doesn't fix the zombie issue.
Re:Babies out with the bath water. by arotenbe · 2008-07-23 16:28 · Score: 1

That still doesn't fix the zombie issue.
And neither does our current system. There is nothing stopping us from using multiple security solutions and heuristics to stop spam.

--
Tomato wedge sperm darts that are Republican.
Re:Babies out with the bath water. by RAMMS+EIN · 2008-07-23 17:18 · Score: 2

Rough sketch of what I have been working on:
(I hope this is going to be formatted correctly. It looks ok in w3m...)
Confidentiality
Only the intended recipient is able to read the message. No government
spying.
Use public key cryptography. Encrypt the message with the public key of
the recipient. Only the holder of the corresponding private key can
decrypt the message.
Integrity
The message you send is the message they receive. No monkeying in the
middle.
Use message authentication codes. Encrypt a digest of the message with
the sender's private key. Verifying the authentication code proves that
the message has been sent by the purported sender and has not been
tampered with.
Authenticity
The message will say it's from Alice if and only if it was sent by
Alice. No spoofed From: lines.
See previous solution.
Roaming Access
Access your messages from anywhere. No fragmentation over several
systems.
Allow users to keep messages on the server. Store the user's keys on key
servers. Encrypt the private key, so that only an authenticated user can
use it.
8-bit Cleanliness
All 256 byte values should be allowed. No more base64.
Reputation System
Senders, servers, and recipients have reputations. Messages may receive
special treatment based on the reputation of their sources.
Users should be able to choose among various reputation services, so
that various methods for establishing reputations can be tried.
----
Protocol
- Textual
- Layered on TLS
- Server and client are authenticated using keys
- Sending message:
* Fetch recipient key
* Specify destination
- Server contacts destination server
- Server verifies destination account
- Server acknowledges destination
* Send headers (pubkey encrypted)
- Server forwards headers to destination
- Server acknowledges headers
* Send body (symmetrically encrypted)
- Server saves body
- Server acknowledges message
- Receiving message:
* Fetch headers
* Decrypt headers
* Send headers to reputation server
- Reputation server acknowledges headers
* Request body
- Server contacts sending server
- Server fetches body
- Server sends body
* Decrypt body
* If message is spam, report to reputation server

--
Please correct me if I got my facts wrong.
Re:Babies out with the bath water. by initialE · 2008-07-23 21:34 · Score: 3, Interesting

Half of us here are for sender authentication, or at least verification. And half of us are for privacy and anonymity. These, to me, are conflicting goals. The sad thing is that there is overlap, that people want their privacy, not realizing that spam is exactly what that privacy brings. It surprises me that people can laugh at the implementations of DRM (But Bob and Eve are the same person! Hilarity ensues...) and not know that this is a very similar issue right here, (Bob wants his rights protected, but he doesn't want any riff raff Eve out there to contact him. But Bob and Eve are the same person! Not so funny now?) and it, like DRM, could very well be unsolvable.

--
Starbucks, Harbuckle of Breath.
Re:Babies out with the bath water. by totally+bogus+dude · 2008-07-23 21:47 · Score: 2, Insightful

I don't think it's privacy that needs to be sacrificed, but ease of access. All the popular instant messaging systems, forum and blog software etc. are subject to spam. If it was harder to obtain an address on these services, it would be much harder for spammers to abuse them.
On the other hand, ease of access is one of their primary benefits. An additional hurdle for SMTP is the lack of centralised controls, which is an important thing for any de facto standard communication tool to have.
Re:Babies out with the bath water. by LostCluster · 2008-07-24 01:35 · Score: 1

It does somewhat. Zombies spoof the From field. If that's not possible, then we know exactly who to shut down without any risk of a false positive.
Re:Babies out with the bath water. by initialE · 2008-07-24 05:32 · Score: 1

If it was harder to obtain an address on these services, it would be much harder for spammers to abuse them.
That's right, if we found a more complex form of DRM, surely the pirates won't be able to crack it!
The truth is, every time you raise the bar of entry, someone who is determined to cross that bar will be able to do so. A more complex captcha? A more secure forum? All we are doing is raising the ante in a game of one-upmanship.

--
Starbucks, Harbuckle of Breath.
Re:Babies out with the bath water. by totally+bogus+dude · 2008-07-24 14:17 · Score: 2

The difference between spam and DRM is that spam is received from people who you don't want any kind of contact with, and don't even want or need to have them on the network. DRM tries to prevent people from accessing an unencrypted bitstream in order to copy it or convert it to another format, while also requiring that they're able to access the unencrypted bitstream in order to view / run it.
DRM is therefore an unsolvable problem, and the best you can do is raise the bar enough that it becomes too difficult for 99% of the people. You can put all the crypto stuff in hardware so it's hard to get to, and then make the rest of the system require unmodified crypto hardware in order to run - that's what the Trusted Platform thing is all about. It still won't make it impossible for people to crack though, just hard enough that it'd stop casual piracy.
Spam is another matter though. If we required every single email to be signed by its sender, and every single sender's public key had to be verifiable back to a few trusted roots (i.e. how SSL certificate signing on the web works), and these roots actually went to great lengths to verify each sender's identity before signing their key, AND none of them were corrupt (or at least, an oversight committee wasn't and they had the power to revoke their trustpoint) -- then spam would effectively be stopped. Anyone sending spam using their own key would have their key revoked, so their messages would no longer verify and would be turfed before any user saw them; and they'd be banned from obtaining a new one for some time (and probably fines and maybe a prison sentence, too). This would also provide quite the incentive to keep one's certificate secure, as stealing legitimate certificates would become the primary avenue for spammers to be able to send messages.
Such a system would require central trust, so you could count the number of messages sent by each particular user and if it exceeds a threshold blacklist it. This way even stolen certificates could only be used to send a small amount of messages before being useless. If you can only send 100 messages for every key you steal,
However, a lot of people wouldn't go along with a scheme like this - especially for personal email accounts. This would greatly reduce the number of people using the system and therefore its value. It would also leave all users at the mercy of the trusted roots, which will almost certainly end up being a problem in the long term. There'd also be problems with people who do have their certificate stolen being branded as spammers, but if we really wanted to stop spam then that's a price one would be willing to pay. There would be a mechanism to get your "email license" back after that happened, but you'd probably have to prove you know how to keep your computer and certificate secure before being allowed back on to the net.
We might not want to take such drastic measures as this, but spam is fundamentally a different problem to DRM. It's not "cindy wants to prevent joe from seeing bob's content, where joe and bob are the same person" -- it's more like "cindy doesn't want to receive anything from joe". That's doable, if you're willing to make some sacrifices.

Not really. by khasim · 2008-07-23 15:40 · Score: 4, Interesting

So if this isn't predictive, what is? Would you rather they develop an algorithm that identifies blacklist-worthy addresses before they make their first attack?

Ummmm, yes. If you can identify them BEFORE they make their first attack then that would qualify as "predictive".

It captures the fact that "true" attackers mostly attack "true" (that is, weak or high profile) targets, whereas those targets are mostly attacked by "true" attackers.

Not in my experience. The attacks are usually automated scripts running on zombies that randomly scan address (or search their immediate networks) looking for known vulnerabilities.

Thus some isolated attack by a never-before-detected attacker on a never-before-attacked target has very little predictive potential in the eyes of the algorithm, whereas even just a few attacks by a never-before-seen attacker on several oft-attacked targets raises a huge red flag.

That is the opposite of how their system was described. They looked for matches amongst IP addresses and then "predicted" that if your example machine one firewall it should be blacklisted for the other firewalls that closely matched that list.

Now a real predictive system would look more factors.

#1. Who was attacking.

#2. How did the attacker(s) gain access to the machines used in the attack.

#3. What other machines are vulnerable to #2 that are available to #1.

Example - Spam zombies often appear in ranges of home addresses from the large ISP's. So machines in those ranges are given an increased score in SpamAssassin. Whether they have ever sent spam before or not. See #1 and #2 and #3.

Re:Not really. by Anonymous Coward · 2008-07-23 16:35 · Score: 0

Not in my experience.

This statement nullifies all of your arguments. Your experience is limited, hence the reason Dsheild gathers logs from *many* and compares them. What is not predictive to you will be to others and vice-versa.
Re:Not really. by mcrbids · 2008-07-23 18:17 · Score: 5, Interesting

Ummmm, yes. If you can identify them BEFORE they make their first attack then that would qualify as "predictive".
Stock analysts make daily predictions based on past behavior. This is not only predictive, but if it wasn't for this past analysis, the predictions would be largely meaningless and highly inaccurate. Or do you want a computer program that can predict what you'll think before you actually think it?
Not in my experience. The attacks are usually automated scripts running on zombies that randomly scan address (or search their immediate networks) looking for known vulnerabilities.
How many high profile hosts have you overseen? In my experience, the random attacks you mention are found everywhere. But high-profile hosts are their own deal. I've seen very carefully crafted spam attacks directed at one of my client ISPs that would last anywhere from 3-8 hours. (one of the largest regional ISPs in my area) A typical spam attack would entail perhaps 250,000 deliverable messages. It was a constant game of cat and mouse with firewall rules and automated responses.
I'd implement an anti-spam technology which would work for anywhere from a few days to a few months, while logging the repeated attempts to crack my solution. And then, the measure would be defeated and I'd be back to the drawing board while the mail cluster's load average spiked to 20.0 or so and users complained.
One of my more successful ideas I called "Double Dribble". I'd identify spam that had been sent to a non-deliverable address, then returned to sender, then bounced with an invalid return address. I'd calculate the success rate of the source IP address and within 5 minutes or so, I'd have a spam source identified and blocked with a dynamic DNS RBL.
That solution held off the spammer for almost a full year, until he/she/it began randomizing sending addresses so well that each IP address would send only maybe 10 emails every 24 hours, well below the threshold of Double Dribble. The address pool was insane - well over 100,000 unique IP addresses logged over a 24 hour period.
Then greylisting was implemented, which stopped the spam dead in its tracks, and completely nullified the spam that Double Dribble couldn't stop. That's when I turned over the account to another party. I still use greylisting personally with great success.
Now a real predictive system would look more factors.
#1. Who was attacking.
#2. How did the attacker(s) gain access to the machines used in the attack.
#3. What other machines are vulnerable to #2 that are available to #1.
No. A Real system would find out:
1) Who was attacking.
2) Send out the Russian Mafia after them to bust a few kneecaps.
3) What other machines are attacking that haven't been attacked by the Russian Mafia.
4) Send Chuck Norris after any attackers who are part of the Russian Mafia.
5) Scan for Natalie Portman donkey porn and send a copy to you.
6) ???
7) Profit!

--
I have no problem with your religion until you decide it's reason to deprive others of the truth.
Re:Not really. by Anonymous Coward · 2008-07-24 02:25 · Score: 0

You clearly don't grasp the concept of predictive.
I know that the Milwaukee Brewers have beaten the Houston Astros the last 8 times they've played.
I predict the Brewers will beat the Astros the next time they've played.
Guess what? Still predictive, despite the existence of prior data.
Amazing, isn't it, induction?
Re:Not really. by nabsltd · 2008-07-24 03:19 · Score: 2, Interesting
Then greylisting was implemented, which stopped the spam dead in its tracks, and completely nullified the spam that Double Dribble couldn't stop. That's when I turned over the account to another party. I still use greylisting personally with great success.
For me, between greylisting and requiring strict RFC compliance for the "HELO" parameter, pretty much no spam gets through to even be looked at by SpamAssassin.
For the "HELO" parameter, almost every spambot uses one of:
- something that isn't a fully qualified domain name ("laptop", "Notebook", and "PC-200806211153" are some recent examples)
- an IP address
Neither of these are acceptable (according to section 2.3.5 of the SMTP RFC) as the "HELO" parameter.
Then, I throw out a few more bogus things, like:
- my host/domain name
- my public IP address
- domain literals (i.e., an IP address surrounded by square brackets) that have an IP address in a bogon range
At this point, the e-mail gets to face greylisting, ClamAV, and SpamAssassin. About 1 in 100 "bad emails" get through to the end users.
Re:Not really. by kv9 · 2008-07-24 03:23 · Score: 1

Ummmm, yes. If you can identify them BEFORE they make their first attack then that would qualify as "predictive".
if I can identify them BEFORE they make their first attack ON MY SERVER then that would qualify as "predictive". I predicted their attack. drop all the fucking semantic hairsplitting please.

--
Stop Computers/Cars Analogies on S
Re:Not really. by h4ck7h3p14n37 · 2008-07-24 06:37 · Score: 1

Stock analysts make daily predictions based on past behavior. This is not only predictive, but if it wasn't for this past analysis, the predictions would be largely meaningless and highly inaccurate.
But I thought those analysts' predictions were largely meaningless and highly inaccurate? It's my understanding that the index funds (which I believe are managed by computers and not people) do much over the long term than any analyst.

Gonna need a new name.... by derfy · 2008-07-23 15:49 · Score: 0, Offtopic

Cause I come from the QuakeWorld days, and HPB means High Ping Bastard to me.

Yes, it does. by khasim · 2008-07-23 15:50 · Score: 1

So, who you're next to doesn't hold much water in predicting whether the message is spam.

Yes, it does. Look at the spam zombies on the major ISP networks.

Most corperations expose only one mail server IP address to the world, and some smaller companies have hundred-user systems and only one IP to show for it.

Now do the math about whether there are more home users on the big ISP networks or whether there are more companies running their own email servers.

If you're getting spam, 99.9%+ of the time it will be from a cracked machine on a home system easily identified as such.

Likewise, 99.9%+ of the legitimate email will not be coming from an ISP's home user block. If it is coming from that ISP's block, it will come from their mail servers.

Predictive goes both ways. Identifying what is probably good and identifying what is probably bad.

Re:Yes, it does. by Anonymous Coward · 2008-07-23 19:57 · Score: 0

Cite your sources, please; I'm curious about this but you've left me no information, only assertions.

Enumerating Badness by giminy · 2008-07-23 16:03 · Score: 5, Interesting

Every time I read some new whiz-bang security tool, I look back to Marcus Ranum's terrific The Six Dumbest Ideas in Computer Security article.

This idea meets three of the 'dumb' criteria:

1) Default Permit. Use of firewalls (even 'intelligent' firewalls) allows all traffic through, except that traffic that looks somehow bad.
2) Enumerating Badness. Kind of like #1, you're blacklisting the bad stuff. There's a helpful chart in the article to show why this is dumb.
6) Action is Better than Inaction. 'Nuff said.

Reid

--
The Right Reverend K. Reid Wightman,

Re:Enumerating Badness by RAMMS+EIN · 2008-07-23 17:50 · Score: 4, Interesting

Well, two points here.
First of all, security and spam are not the same. If one security threat makes it through to you, your security has been compromised. If one spam message makes it through to you, it's a little annoying, but no disaster. If, on the other hand, your "spam filtering" causes a legitimate message not to reach you, this is much worse. For spam, you err on the safe side by letting the message through. In security, you err on the safe side by blocking it.
Secondly, while mjr's 6 "dumb ideas" aren't going to give you perfect security, it's not obvious how you _would_ get that, nor that you should not implement any of those ideas. For example: enumerating badness is certainly not going to allow you to recognize and stop all badness. However, it isn't clear how you _would_ do that. How do you determine if something should or shouldn't be allowed to enter your system? Perhaps having a list of things you _don't_ want on your system could be helpful.
Enumerating badness certainly seems to work pretty well for email. With software, you can (really!) get away with making a list of what _is_ allowed on your system, and refuse everything else. With email, you actually _want_ messages you have never seen before from people you have never seen before, about things you have never talked about before. At least, most people do. On the other hand, spammers will often send lots of somehow similar messages. My spam filter, which I train based on lists of good and bad messages, correctly recognizes all good messages and something like 99% (it varies a bit) of bad messages. It doesn't keep the spam out, but it reduces it by a factor 100, without losing me any good messages. Is this a Dumb Idea?

--
Please correct me if I got my facts wrong.
Re:Enumerating Badness by mjensen · 2008-07-23 23:56 · Score: 1

From that link:
There's an old saying, "You cannot make a silk purse out of a sow's ear." It's pretty much true, unless you wind up using so much silk to patch the sow's ear that eventually the sow's ear is completely replaced with silk.
Read that.....He's talking about making a sow's ear from a silk purse and has his idea backwards.
There's a lot of (sometimes technical) statements on that site that show the creator made it without thinking very much about it.

Standard form by Eighty7 · 2008-07-23 16:23 · Score: 5, Funny

Your post advocates a

(x) technical ( ) legislative ( ) market-based ( ) vigilante

approach to fighting spam. Your idea will not work. Here is why it won't work. (One or more of the following may apply to your particular idea, and it may have other flaws which used to vary from state to state before a bad federal law was passed.)

( ) Spammers can easily use it to harvest email addresses
( ) Mailing lists and other legitimate email uses would be affected
( ) No one will be able to find the guy or collect the money
( ) It is defenseless against brute force attacks
(x) It will stop spam for two weeks and then we'll be stuck with it
(x) Users of email will not put up with it
( ) Microsoft will not put up with it
( ) The police will not put up with it
(x) Requires too much cooperation from spammers
(x) Requires immediate total cooperation from everybody at once
( ) Many email users cannot afford to lose business or alienate potential employers
( ) Spammers don't care about invalid addresses in their lists
( ) Anyone could anonymously destroy anyone else's career or business

Specifically, your plan fails to account for

( ) Laws expressly prohibiting it
(x) Lack of centrally controlling authority for email
( ) Open relays in foreign countries
( ) Ease of searching tiny alphanumeric address space of all email addresses
( ) Asshats
( ) Jurisdictional problems
( ) Unpopularity of weird new taxes
( ) Public reluctance to accept weird new forms of money
(x) Huge existing software investment in SMTP
(x) Susceptibility of protocols other than SMTP to attack
(x) Willingness of users to install OS patches received by email
(x) Armies of worm riddled broadband-connected Windows boxes
( ) Eternal arms race involved in all filtering approaches
( ) Extreme profitability of spam
(x) Joe jobs and/or identity theft
( ) Technically illiterate politicians
( ) Extreme stupidity on the part of people who do business with spammers
( ) Dishonesty on the part of spammers themselves
( ) Bandwidth costs that are unaffected by client filtering
( ) Outlook

and the following philosophical objections may also apply:

(x) Ideas similar to yours are easy to come up with, yet none have ever been shown practical
( ) Any scheme based on opt-out is unacceptable
( ) SMTP headers should not be the subject of legislation
( ) Blacklists suck
( ) Whitelists suck
( ) We should be able to talk about Viagra without being censored
( ) Countermeasures should not involve wire fraud or credit card fraud
( ) Countermeasures should not involve sabotage of public networks
(x) Countermeasures must work if phased in gradually
( ) Sending email should be free
( ) Why should we have to trust you and your servers?
( ) Incompatiblity with open source or open source licenses
( ) Feel-good measures do nothing to solve the problem
( ) Temporary/one-time email addresses are cumbersome
( ) I don't want the government reading my email
( ) Killing them that way is not slow and painful enough

Furthermore, this is what I think about you:

(x) Sorry dude, but I don't think it would work.
( ) This is a stupid idea, and you're a stupid person for suggesting it.
( ) Nice try, assh0le! I'm going to find out where you live and burn your house down!

Re:Standard form by Heembo · 2008-07-23 20:41 · Score: 2

You are the wind beneath my wings. That was my most favorite Slashdot post, ever.

--
Horns are really just a broken halo.
Re:Standard form by Quince+alPillan · 2008-07-24 00:48 · Score: 1

I think you missed a couple, since this is specifically about a client side blacklist.
Your post advocates a
(x) technical ( ) legislative ( ) market-based ( ) vigilante
approach to fighting spam. Your idea will not work. Here is why it won't work. (One or more of the following may apply to your particular idea, and it may have other flaws which used to vary from state to state before a bad federal law was passed.)
( ) Spammers can easily use it to harvest email addresses
(x) Mailing lists and other legitimate email uses would be affected
( ) No one will be able to find the guy or collect the money
( ) It is defenseless against brute force attacks
(x) It will stop spam for two weeks and then we'll be stuck with it
(x) Users of email will not put up with it
( ) Microsoft will not put up with it
( ) The police will not put up with it
(x) Requires too much cooperation from spammers
(x) Requires immediate total cooperation from everybody at once
(x) Many email users cannot afford to lose business or alienate potential employers
( ) Spammers don't care about invalid addresses in their lists
(x) Anyone could anonymously destroy anyone else's career or business
Specifically, your plan fails to account for:
( ) Laws expressly prohibiting it
(x) Lack of centrally controlling authority for email
( ) Open relays in foreign countries
( ) Ease of searching tiny alphanumeric address space of all email addresses
( ) Asshats
( ) Jurisdictional problems
( ) Unpopularity of weird new taxes
( ) Public reluctance to accept weird new forms of money
(x) Huge existing software investment in SMTP
(x) Susceptibility of protocols other than SMTP to attack
(x) Willingness of users to install OS patches received by email
(x) Armies of worm riddled broadband-connected Windows boxes
(x) Eternal arms race involved in all filtering approaches
( ) Extreme profitability of spam
(x) Joe jobs and/or identity theft
( ) Technically illiterate politicians
(x) Extreme stupidity on the part of people who do business with spammers
( ) Dishonesty on the part of spammers themselves
(x) Bandwidth costs that are unaffected by client filtering
( ) Outlook
and the following philosophical objections may also apply:
(x) Ideas similar to yours are easy to come up with, yet none have ever been shown practical
( ) Any scheme based on opt-out is unacceptable
( ) SMTP headers should not be the subject of legislation
(x) Blacklists suck
( ) Whitelists suck
( ) We should be able to talk about Viagra without being censored
( ) Countermeasures should not involve wire fraud or credit card fraud
( ) Countermeasures should not involve sabotage of public networks
(x) Countermeasures must work if phased in gradually
( ) Sending email should be free
( ) Why should we have to trust you and your servers?
( ) Incompatiblity with open source or open source licenses
( ) Feel-good measures do nothing to solve the problem
( ) Temporary/one-time email addresses are cumbersome
( ) I don't want the government reading my email
( ) Killing them that way is not slow and painful enough
Furthermore, this is what I think about you:
(x) Sorry dude, but I don't think it would work.
( ) This is a stupid idea, and you're a stupid person for suggesting it.
( ) Nice try, assh0le! I'm going to find out where you live and burn your house down!

Wish they'd add my ex-gf to the list... by skaet · 2008-07-23 16:33 · Score: 4, Funny

... then they could warn the poor bastard she's going to attack next.

--
There is no knowledge that is not power.

I stand corrected by Jane+Q.+Public · 2008-07-23 17:04 · Score: 1

I should not have written "any", much less emphasized it. Nevertheless, there is a strong tendency, and it behooves designers to take this into account. Probably I have become cynical, because so many otherwise intelligent people do not quite grasp the subtleties of predictive statistics and refuse to acknowledge this problem, even though it can be demonstrated with nothing more than simple middle-school-level math.

Re:I stand corrected by caffeinemessiah · 2008-07-23 17:40 · Score: 1

Thank you for a well-thought out response -- a rarity here these days it seems.

--
An old-timer with old-timey ideas.

Haha by Jane+Q.+Public · 2008-07-23 17:07 · Score: 1

People who do not have at least a basic grasp of statistics should never be allowed to be politicians. Half or even most of what they do involves statistics to some degree.

Wouldn't it be funny... by PC+and+Sony+Fanboy · 2008-07-23 17:37 · Score: 1

to have someone poison the 'predictive' list, and suddenly behind such a system would lose access to google, the pirate bay, or demonoid? (c'mon, those are like, the only 3 sites other than slashdot I use!)

comparing firewall logs .. by rs232 · 2008-07-23 19:50 · Score: 1

"the new HPB service will employ a link analysis algorithm to cross-compare firewall logs"

Snoooze ..

--
davecb5620@gmail.com

re: enumerating logic .. by rs232 · 2008-07-23 20:02 · Score: 1

"Well, two points here .. First of all, security and spam are not the same"

Identifying spam is actually 'enumerating badness', which does lead to losing legitimate messages.

--
davecb5620@gmail.com

Distributed Universal Reputation System by Colin+Smith · 2008-07-23 20:13 · Score: 3

That's what we really need... (baggsy on the acronym BTW)

A network of mathematical values which define reputation relative to one another. We have a number of attempts at this in place just now, not the least of which are Slashdot Karma, Google Pagerank, Stumbleupon etc. The thing is that what may be a good reputation to one person may well be the antithesis to another, so simple averaging is inappropriate. Richard Dawkins for example is someone who will have a very high reputation among certain groups and very low among others.

I should be able to see a relative reputation of someone/thing based on those other things which I hold in esteem and the things/people which they hold in esteem.

Decidedly non trivial. We haven't actually worked it out in The Real World (tm) either, relying on branding instead.

--
Deleted

Spoof City, here we come! by the_pete · 2008-07-24 00:27 · Score: 1

Wait until somebody spoofs somebody else's IP address and throws "attacks" with it at a few of the networks that submit logs. That would effectively block the IP from the spoofed address as the system would predict that the host is an attacker. Since TCP allows us to spoof almost any IP we want, we could get creative and spoof the addresses of the submitting members or even dshield itself.

Re:eat my shorts slashdot !! by spazdor · 2008-07-24 04:58 · Score: 1

Suck my double-precision floats, AC!

--
DRM: Terminator crops for your mind!

Slashdot Mirror

Researchers Create Highly Predictive Blacklists

71 comments