Human-Powered Spam Filtering
arturs writes "A company called eProvisia
started an unusal business: they filter out spam not by using complicated algorithms, but human beings... It costs around $20/year - is the war against spam over?" It's an interesting idea - the privacy concerns are big of course, but how would this stack up to, say SpamAssassin or a suite like Barracuda's Spam firewall. We tested the Barracuda device - great integration of OSS software, with a nice interface. Update: 09/20 15:12 GMT by J : Corrected price of Spam Eradicator.
1. The only way that they'll be able to do this at a good cost is to use some kind of third world labour with a first language that isn't english. 2. Given that my baysian spam solution seems to be better at sorting spam than me (I've accidently deleted items which were not spam before), then I'm not entirely sure that a stranger could do better. They offer a 100% guarantee. I doubt they'll be able to offer a 100% service.
Training monkeys for world domination since 1439
No kidding, that second paragraph (well, actually sentence) has high buzz word content yet actually only says, we do anti-spam. I have a feeling marketing and technical don't agree on much at this company.
MailWasher Pro from Firetrust has an option similar to that known as FirstAlert, you sign up for a year membership and use their app to submit spam type messages to them, later, a human verifies that the e-mail is indeed spam and adds it to their database. Once added, any MWP users using FirstAlert will hit the database, see that the message is there and act accordingly (often times deleting without even showing it to the user). It works... ok, about 30% of my spam is nuked this way, the built in Bayesian filtering catches another 40% or so, and the DNS blacklists catch most of the rest. Of the last two groups I verify manually, but have come to trust FirstAlert... it's just a shame it's not getting the high %'s it used to.
Help Brendan pay off his student loans
That second paragraph, in particular, would keep me as far away from them as possible.
And yet those are exactly the words that HR departments are looking for on resumes. As long as you can use stupid words that really don't work in any other situation you can get your foot in the door.
"24 hours a day" * 30 days/month = 720 hours
$20 per month / 720 hours = about 3 cents an hour.
Since they say they begin "manually reviewing, hand-picking and approving important correspondence", how does this work? To pay someone $6/hour, they'd need to be reviewing at least 200 mailboxes simultaneously. My confidence level of their accuracy under these circumstances would be considerably -lower- than a software solution.
~ Whence do you come, slayer of men, or where are you going, conqueror of space?
Look at this about page.
Im going to include their footnotes on that pge in parentheses and bold.
Privately funded in 1993, now with customers in 40 countries(Not all currently recognized by UN) and over $67 million(Palmyra Atoll dollars) in cash reserves, the company experienced a phenomenal growth
if its setup properly and bayes is enabled. Since Aug. 1st i've recieved 1800 emails flagged as spam... A few false positives but I have my threshold set pretty low. About 140 emails have gotten through and soon i'll take that folder and process it. Before I started processing spam that wasn't caught by spamassassin about 4-5 were getting through a day.. Now its once every few days.
"Thanks to the remote control I have the attention span of a gerbil."
from my understanding, all gmail users that click the little "report spam" button are essentially helping build the database and increase the effectiveness of its filters... though I guess they must be careful because this could potentially generate a lot of false positives.
--- If we knew half the things we shouldn't we'd stop wishing we knew it all
There can be a huge advantage of looking over data manually.
My job is focused around looking at data to find problems. Many companies by high cost and very fancy data verification tools, address cassing software etc... Then they run these things on automatic on all of there data. What happens is that there data slowly becomes unreliable over time because they are paying their data entry a pittiance or are simply outsourceing it to a foreign nation who has people who do not speak the language, as much as spell it.
So, having someone like me who can clean the data (even sensitive data) to quickly check for errors, find common errors, and write novel solutions for them is a pretty big business. I deal with crap sent from data entry day after day and turn it into data that can be used, and is highly accurate.
So, I can definately see someone using a human as an intermediary between spam.
I could see this working this way. If the email comes from someone "trusted" in the company, it gets passed along without a human filter. Any email addresses that are not trusted would come by a physical person.
This way, if "Bob, your investment advisor" emails you, then that data would be sent directly to the company without human interference.
but, if a mail comes to sales@mycompany.com, then you would have a person scan that e-mail and verify its spammyness. This way, any spam that would get through would have to be from someone trusted. If someone is trusted, then they are not likely to be spam.
It could be a very good system, but I doubt companies would want to allow email to be looked at this way. Consumer responce data, address data, etc.. I can see, but not really e-mails unless there was a pressing need to stop it.
If you don't vote, you don't matter, so don't waste your time telling me your opinion
Actually, LLC's are the smart way to do a partnership. In a partnership, all principles enjoy equal responsibility for mishaps. In LLC's, all principles enjoy shared responsibility.
I guess the best way to sum it up would be to quote my Business Legal Environment professor: "...and I hope that now you all have a clear understanding of partnerships. Now let me give you a word of advice, never form one."
This article has recently been linked from Slashdot. Please keep an eye on the page history for errors or vandalism.
And people were upset when it was 'discovered' that GMail was going to programatically 'read' your email to provide contextual advertising (and spam filtering) as an invasion of privacy? Here we will have actual _people_ reading your private correspondences. No thanks.
A piece of software that is filtering your spam is not being paid to do it. Yes it's creators were perhaps paid, but at least they are one step removed. A paid human-based service doing spam-filtering, however, would have a direct motive to want as much spam flying around as possible, so as to have as many potential customers as possible. I wouldn't be surprised if this company was created by and is being funded by the spammers themselves as just one more way to make money from spam! .
Jiggity
Why can't we use spelling checkers to filter spam? It seems all the spam now uses mispelled words and numbers in words to trip other filtering methods. So measure percent words mispelled or with numbers in them and above a particular threshold consider it spam.
Not only that, but the contract is "governed by and construed in accordance with the laws of Uninhabited Sovereign Territory of Palmyra Atoll, ..."
Many popular anti-spam systems, e.g. those implemented by webmail services, are already indirectly human-powered. Users classify their own spam emails and the everyone benefits system-wide without privacy concerns.
I'd say the system works pretty well. My Yahoo account, which was unusable after being harvested from my Usenet postings, is usable again. I just checked, and I have 426 messages in my bulk (spam) folder and 9 in my inbox. Of the nine, half (ok, 4!) are auto-responses from mail daemons to messages I never sent, while the other half are spam that escaped the filters. Not bad at all for a few days' worth of mail.
I think a sensible business model is for the webmail services to leverage their huge, continually updated, spam database and license them to ISPs, who can then filter spam at the server level before users download anything. I think that's much more elegant than software+community based solutions implemented at the user level.
Imposing Libertarian views on everyone online since 1992.
From the CIA World Factbook on Palmyra Atoll:
Translation of eProvisia's four-page web site : We're from the Government, and we're here to help you.
I expect the same is going on here. Sign up for their service, and not only do you lose your $20, but you end up on a load more spammer's lists as well.
As seen here the Palmyra Atoll is nothing more then an inhabited ring of island a thousand miles south of Hawaii. There is no population, no government, no money, and very likely no eProvisia. The site is a complete farce, and is probably run by spammers collecting email addresses and names.
Ah the good old times - the last company I worked for was called "Synergize Solutions". It was during the "boom" times so we had "free donuts Friday".
> One Palmyra Atoll dollar = 17 pieces of mithril, or approximately twenty
> kilograms of fairy dust. There's no such thing, people. This is a joke.
Actually, mithril does exist. It's also known as titanium ore or titanium
steel (depending on whether it's being mined or whether a worked object is
being described).
It's Adamantium that hasn't been invented yet. Unlike mithril, adamantium is
not vulnerable to the liquid nitrogen freeze-and-shatter attack. It is
speculated to contain plastic polymer in addition to metal alloy, but we don't
know how to actually make it.
HTH.HAND.
Cut that out, or I will ship you to Norilsk in a box.
This article is not a troll. This is a very cleverly written ad for Barracuda.
I immediately realised such a business would never thrive, because:
Yes, it's a hoax. Which could be immediately deduced from the fact that it is not viable business (especially with the price they quote).