Spam Archive opening FTP service December 4
Saint Aardvark writes "The FTP archives for spamarchive.org will be opening on December 4, according to this Wired article. But there already appear to be some archives available." I tried saving my spam for awhile just for giggles, but seeing that file grow to 100+ megs made me so angry I had to delete it. Currently getting ~200 spam every day, and now often they attach images so they are 100k+. Yay Internet!
It's not the net's fault. Blame (or shoot) the spammer's.
Wouldn't this spam archive be a form of free advertising?
who actually gets loads of spam every day?
I get about 3 per day (3 too many!)
You always hear about these poor suckers getting 200 or so a day, but how many of us actrually have to put up with that much stuff? If I got that much, I'd just switch email accounts, cos I just wouldn't put up with it.
I'm not defending spam here, but I'm just kinda curious how much people actually do get on average.
Is the poster's comprehension of time a bit twisted?
Really, all you need to do is manage your address properly from the beginning, don't do obvious spam-lure tactics with it, use sneakemail/other aliasing and you're set.
Seriously ... in the last year, maybe 3 total spams have come to my main address. (They're all the same spam too. Something about skin care. Weird.)
I think the idea behind their site is nice, but I also think that more and more, people are realizing that the only way to really effectively block spam is to use whitelists -- no fancy schmancy algorithm is going to block spam for long.
It's a shame, because I'm pretty sure that ceaseless, unrelenting, brutal torture of known spammers would be equally effective, but is unfortunately illegal.
evil adrian
Maybe we'll get some of the more creative spamers to run a "best of spam" series. coming to a mailbox near you this holiday season.
What sucks is that you'll get modded down. I bet every Canadian finds this hilarious. I know I do, but unfortunately, I'm not modding.
I think they should just go ahead and provide a subscription email service. That way, people can get the spam right in their inbox, instead of having to download it through ftp.
"I'd rather have a full bottle in front of me than a full frontal lobotomy"
Is there so little happening in the news, or did we hit some bizarre wormhole, and we are now going back in time ?
At this point, I think that in the last 2 days over 70% of the stories have been dups.
know what SpamAssassin does, but what I do know is that it works marvelously. Occasionally, a rogue spam does make it into my inbox without being tagged. I wonder, will the opening of the Spam Archive be beneficial to the SpamAssassin developers, or does the SpamAssassin algorithm rely only on new stuff?
Previous article about SpamArchive.ORG opening...
Trolling using another account since 2005.
I tried saving my spam for awhile just for giggles, but seeing that file grow to 100+ megs made me so angry I had to delete it.
Good thing too, it's probably illegal to save it anyway, as seen on slashdot today.
You fucken loser - it shows how much of a geek you are when you think that someone is bragging when they talk about how much spam they get.
Awww. CmdrTaco has finally installed a filter.
First they get rid of Jon Katz, now CmdrTaco is filtering his emails - as soon as Timothy starts checking for dupes we'll have to start finding new ways to take the piss :o)
Avantslash - View Slashdot cleanly on your mobile phone.
The email to the fake account can be discarded in any case, so you don't get more junk email this way.
I get that much on my PERSONAL account, and i also 'manage' spam for a 10K user base..
Somedays, ALL I get done is dealing with spam.
Too bad we cant bill them back for my salary, and lost network resources, like we can do for un-requested faxes.
And arrest them for sending porn with out verifying a person's age. Around here, you would be either fined ( bookstore ) or arrested ( individual ) for trying such a stunt in 'real life'.
---- Booth was a patriot ----
I have hundreds to spare..
Is this to provide a amusement to future anthropologists and social historians?
Rich.
libguestfs - tools for accessing and modifying virtual machine disk images
This is no dupe, it's a followup on a previous story
"Currently getting ~200 spam every day, and now often they attach images so they are 100k+. Yay Internet!"
:-)
Nope thats actualy whats known as a Pr0n mailing list
Burt "Out of my mind back in 5 minutes"
(Raises hand)
I do. Let's check. This morning I have:
30 spams that are not directly addressed to me,
130 spams that are directly addressed to my Verio email address,
5 spams addressed directly to my personal address.
Hmmm so I think I know what the problem is.
Verio sold my email address to every spam-merchant in the world.
- For the complete works of Shakespeare: cat
I wonder...
If (or when) everybody starts using whitelists, could we not recieve spam like viruses - from friends?
E.g., your friend sending you an email promoting this very special swedish p-enlarger, and without you noticing, your user-friendly Outlook has forwarded this mail to all your friends!
Or would that - finally - be illegal?
I also manage email for 10,000+ users. And I do a lot more than that; it simply does not take that much time if you handle things properly.
For corporate-wide spam blocking, sendmail has some great spam filtering features via DNS Black Lists (dnsbl). I use spamhaus.org and relays.osirusoft.com.
Add these lines to your sendmail.mc:
FEATURE(dnsbl, `sbl.spamhaus.org', `"550 Mail from " $&{client_addr} " rejected, see http://www.spamhaus.org/"')dnl
FEATURE(dnsbl, `relays.osirusoft.com', `"550 Mail from " $&{client_addr} " rejected, see http://relays.osirusoft.com"')dnl
There goes 90+% of the problem. After that, spamassassin handles the 10% that trickles through quite nicely.
If you don't use sendmail, all other modern mail relays can handle this problem in similar ways.
If people are going to use this archive to automatically induce rules for recognising junk mail (e.g. using naive bayes or ripper), then they will also need at least as many examples of legitimate mail.
Of course it could be useful for evaluating classifiers built using smaller corpora.
Now does this make EVERY email you receive spam?
Regardless, it works. I have never received spam through their service.
So far I've been restricted to only getting loads of spam every day, but now I can download some too!
LOL....AC
Of course a blacklist like this will be better than an algorithm for the one reason that if everyone has access to this algorithm to filter their mail, then spammers could possibly just keep sending an e-mail to themself and having it be filtered by all of the different filter algorithms and changing it a bit each time until he/she has custom-tailored that spam to get through all of the filters
Close the world.
"I'm pretty sure that ceaseless, unrelenting, brutal torture of known spammers would be equally effective, but is unfortunately illegal."
...Remember - most of these spammers base their operations out of China. So what we could do is somehow convice them to go there (offer them something they cannot refuse - a week's worth of unlimited serverfarm and bandwidth usage or something like that). Once they are there, we can inform the government that several dozen Falun Gong supports are in country trying to insight rebellion. Then you will get your wish.
To make laws that man cannot, and will not obey, serves to bring all law into contempt.
--E.C. Stanton
Gopher would be perfect for this type of thing! Why on earth are they using FTP?
What about all the Foreign spam out there that doesn't use standard ascii like the archive seems to contain?
Almost all of my spam is from taiwan or china and sadly enough yahoo mail doesn't provide any good way to filter this out when the messages have fake headers. If I could simply filter on something in the Received path then it would help, but all they allow you to do is the From address as far as where the message came from.
"Not knowing when the dawn will come, I open every door." - Emily Dickinson
- wget spamarchive
- grep emailaddresses spamarchive
- mail emailaddresses
- ???
You know the rest...google your verio address.
Are you sure you investigated exactly
what osirusoft does?
I fint it unfortunate that so many
administrators seem to put in osirusoft
as a blacklist without examing what it
does. Osirusoft combines the blackhole
listing of many many other blackhole
listings, one of which is unfortunately,
SPEWS. SPEWS in my opinion is
overzealous with blacklisting and it
is unfortunate that osirusoft includes
them in its list. To read more about
the problem, read this posting
here
here is a relavent quote...
ii. a grep on osirusoft - which yields about 1/2 the messages -
but.. when there's a false positive, there's a really good chance that
it's in this group - and of this class of false positives, there's a close
to 100% liklihood that it's SPEWS that's given the false positive
You can alos check out antispews.
Outlook users on Windoze can use Cloudmark's SpamNet, which does exactly that. They are working on a version for Outlook.
http://www.cloudmark.com/
Linux users can use Vipul's Razor:
http://razor.sourceforge.net/
patent this idea: authoring of commercially-oriented unsolicited email specifically formatted to defeat X antispam measure (like spamassassin say).
Another idea might be to protect spam utilities using the DMCA -- if you use it, you're not allowed to figure out how it works, and you're not allowed to circumvent its spam protection.
Thought I doubt either would work, it'd be ironic to use stupid laws for protection for a change.
The Right Reverend K. Reid Wightman,
Well, some people ask for it by using their personal email account for signing up on sites, posting on usenet etc.
Yeah, like those rape victims that were asking for it by wearing short skirts.
Nobody 'asked for it'. Don't you even resent the fact that spammers have made it impossible to post on Usenet with a legitimate e-mail address? Doesn't it piss you off that you have to be paranoid if some less-computer-savvy friend tells some web site to mail an article to you or sends you an online greeting card? Don't you get annoyed that every e-mail address that you post, no matter for what reason, get spammed?
Blame the criminals, not the victims.
hehe, I repeat. I use yahoo mail. Maybe that's my problem right there.
I did like it when I was using unix based mail and could procmail everything. *sigh*
When/if I get a newsystem maybe I'll leave my current one up as a permanent dedicated mail client.
"Not knowing when the dawn will come, I open every door." - Emily Dickinson
A message in the archive would have the following structure
Where I have replaced every name before a @ with SSS-PRIVATE. What do you think ?
I wouldn't give my spam archive if my emails privacy was not protected.
Note this message is not a spam.
Men are born ignorant, not stupid; they are made stupid by education. Bertrand Russel
It doesn't take a rocket scientist to learn how to block 99.99% of this crap.
1. Block any chinese domain.
2. Filter the keywords: sex, viagra, printer toner, extend, penis, enlargement, vitamin, from the subject line and block them.
3. Block anything with no subject or no sender.
4. Use software like http://www.mailwasher.net/ to manage your email.
... Governments are instituted among Men, deriving their just Powers from the Consent of the Governed...
On my older @usa.net account I would get 50+ spam messages a day. I closed it after they started charging more for their e-mail accounts. I now have a few e-mail accounts at my domain name for my website, and I get on average 10 a week. Mac OS X's Mail.app spam filter correctly sorts out the spam around 90% of the time, and I've only had one false positive. It's getting better as I use it. I'm happily impressed. my @lycos.com gets 0 spam. Zero. All spam effectively gets cut out by their spam filter and makes it into my junkmail box. (It's auto-deleted after a certain number of days, I don't know how many though.) I've been really impressed with the lycos.com mail, however. Best free mail I've found.
Just another use for spam (jaufs)
Sex - Find It
I'm sure much leakage is because of underhanded ISPs, companies selling email, and the like.
But in my case--and many people's--the main problem is that I am a public personality. I do things where there is good reason to disclose my email address to strangers (in my case, because I am a writer). A lot of those strangers write me for very legitimate reasons, but obviously once an email is made public you cannot keep it to only the good guys.
It doesn't apply so much to me personally, but a similar situation is where email addresses are listed in directories--company, organizations, and so on. In those cases also, you need to publish your email to let legitimate correspondence contact you.
I've always been a little puzzled by the (somewhat naive) folks who think to answer the spam problem by hiding their email from everywhere it might leak. There are various tricks for doing this, false addresses, complex usernames, different accounts, etc. That only really works for people--typically college kids or younger--who never need to DO anything in the world. For the rest of us, hiding an email address would be like hiding our snailmail address from business contacts, because we might get junk mail from releasing it.
Buy Text Processing in Python
While a handful of experts and analysts have applauded the project, the reaction in chat rooms and on weblogs has been muted.
:)
"There's absolutely no reason to believe that the spams collected here will be any 'better' a sample than those collected by opening a random Hotmail account," read one posting on Slashdot.
If that's your comment, smile, you're in wired. Good to see such cynicism speaking for the whole
You have paid for a total of 0 pages and so far 0 have been used up (0 today).
I put up with it because everyone who knows me knows not to use my account from my ISP, and mainly, because I'm too damned lazy to get the ISP to kill off the e-mail account.
:P
'sides, it's good for a laugh now and then, like when I get those 'legal highs' spams. Those're hillarious, I tell you.
Of course, on my 'real' account, I've decided to start forwarding any incoming spam to certain organizations. No, my friend, I most certainly did not sign up for Euro Farm Sex.
This might be very slightly offtopic, given that we're talking about a spam archive here and not about the mechanics of spam itself, but I'm curious.
This story is about someone who tried a little experiment: she wanted to see if the "click here to unsubscribe" link in most spams REALLY worked. So she tried the link and got INUNDATED with MORE spam.
Anyone have experience with this? A friend of mine agrees--she says that hitting the "Unsubscribe" link just verifies that your address is in fact a real and active one.
I always thought that was bullshit, because spammers don't seem to care whether addresses work or not (see The Story of Nadine. Any comments?
--Theresa
Angry IT woman in big clompy boots. And talking lint!.
I found this script on the SpamAssassin mailing list. It's been pretty interesting to use.
/var/log/mail are fucking SPAM!!!
/etc/postfix/main.cf all running.
I ran the script a few minutes ago on a machine that I host for a friend (web counter service, her websites/etc - she's been on the net for many years with her own domain/etc)
Total Messages...: 14950
Clean Messages...: 6413
Spam Messages....: 8537
Spam Percentage..: 57 percent
57% of the email out of nearly 15,000 emails in
That is *ridiculous*
This is with SpamAssassin, Razor, Pyzor, and several RBL lists in
The spam still gets through. It sucks up bandwidth. It sucks up resources. It's really offensive sometimes. Spammers know there's no federal legislation in place to block them, so they go on their spammy ways.
Spammers are scum. They *do not care* that you don't want their spam.
If you look at them, it looks like the headers have been hand parsed. No doubt for privacy reasons. But for example, there are no RECEIVED headers, which even though often forged are needed to test RBL heuristics and many other DNS tests.
Each archive should tell you whether EVERY mail has been hand-verified to be spam. This is not clarified.
The archives are not "versioned". They should be, so if any of the corpuses need to be corrected, people know.
Overall it looks clumsily put together.
agreed. in the past week i have gotten a few of these types of messages. they are not just designed to hide the spammers origin, but to also trip up filters. very annoying... fortunately popfile classifies them correctly though.
Large print giveth, and the small print taketh away
...if all the thousands of man hours put into filtering, blacklists, etc., were spent creating and installing a new authenticating mail transport prococol. I know, I know, just like IPv6, it would take years to get everyone to switch over. But right now spam costs lots of money and is just plain annoying, and the situation doesn't seem to be improving much.
I say we go for it. Why not build in an easy-to-use encryption scheme too, so all the Carnivore/RIAA/etc crap won't work?
Slashdot moderator(s), please kill any stories about this commercial anti-spam software company that is running a "trojan" site to get everyone to help improve their software for free.
Via internic, I found that the domain was registered by planetdomain. Via planetdomain, I found that the domain was registered by some guy who has an e-mail account at ciphertrust.com. Upon checking ciphertrust's website, I see that they offer a "secure e-mail gateway product that protects against spam".
Vic
Proposed Additions to the PDP-11 Instruction Set:
PI Punch Invalid
POPI Punch Operator Immediately
PVLC Punch Variable Length Card
RASC Read And Shred Card
RPM Read Programmers Mind
RSSC reduce speed, step carefully (for improved accuracy)
RTAB Rewind tape and break
RWDSK rewind disk
RWOC Read Writing On Card
SCRBL scribble to disk - faster than a write
SLC Search for Lost Chord
SPSW Scramble Program Status Word
SRSD Seek Record and Scar Disk
STROM Store in Read Only Memory
TDB Transfer and Drop Bit
WBT Water Binary Tree
- this post brought to you by the Automated Last Post Generator...