Block Spam Bots With Free CAPTCHA Service
Chirag Mehta writes "I just released a freeware service called BotBlock (barebones demo) that lets site owners copy/paste a few lines of PHP code and insert a CAPTCHA image-verification system into any web form. The amount of form spamming by bots is on a rise. While remedies exist for MT blogs, a more efficient solution is to use image-verification or text-identification. Used for a while by sites like Yahoo! (scroll to bottom), Hotmail and patented in 2001 by AltaVista, CAPTCHAs are now being used more widely. PARC also came up with two algorithms Baffletext and Pessimal Print. The technology always existed, but until now required the site owners to install image libraries and understand how to generate images that cannot be OCR'ed. With BotBlock it is like inserting a page counter."
What about people who are blind or visually impared? Does your implementation take that into account?
Prevent email address forgery. Publish SPF records for y
much better than blacklists and captcha is a bayesian filter.
blacklists are innaccurate: blacklisted words can be misspelled and pass through.
captcha discriminates against the disabled and cuts them off from online discussions.
James Seng has crafted a good bayesian filter for movable type.
Some of the examples on their site take a lot more time and mental effort than just looking at a word and typing it. I would be very bothered if I had to take one of those little tests just to fill out a form.
Scratched Emulsion
How will people with a text-based or non-visual web browser be able to site the site?
I tried to sign up with a forum this weekend, and I couldnt tell the letters, Couldnt tell the Zero from an "O". Only a minor problem, still has a few bugs to be worked out. But its nice to have real time authorization, instead of waiting for email to authorize the accout.
Also lots of services, are there any good free downloadable php addons?
For my GPL'ed PHP Captcha sofware:
http://sourceforge.net/projects/session-captcha/
patented in 2001 by AltaVista
If AltaVista patented it, does BotBlock license the patent? Or will this service be rather short-lived?
Opinions on the Twiddler2 hand-held keyboard?
...the images here here are absolutely unreadable. If I had to use this to subscribe to a site or forum, or fill out a form, I'd just say "screw it", and wander on down the 'net.
The entire point of the image was that it couldn't be read by machines, by providing alt text you've just removed that restriction and the image's usefulness along with it.
The poster knew this. It was either a joke or a troll, or both.
I used to bulls-eye womp-rats in my pants
Not that I really looked at how configurable this is, but...
...seems to me this BotBlock thingy wouldn't be that hard to decode, juding by the example, at least.
Ssooo, I bet it's feasible to figure out where the codeword starts on the pic. And since the font is easy I guess you can figure out each of the letters.
And if all this fails, get what you have, make your bot click the image and try again :)
Leaves me wondering whether the point of these things is to actually make it impossible to programmatically figure out the text or just make it hard enough so that most spammers won't bother.
Now having it make sounds, as mentioned somewhere above, that'd be really interesting.
.SIGSEGV
If you come up with a suite of questions. the spammer can come up with a suite of responses.
You (and parent poster) have some good points here. Something you're missing, though -- you're still thinking in terms of a large service that can be reused by lots of websites.
Suppose the system only offered the framework, and you had to provide (and rotate) the questions yourself for your own website. I'm thinking of writing a filter question into my forms, since I hate those text recognition things (my eyesight's not very good, so obscured text with negligible contrast is very difficult to read)... I'd just ask the user to enter in the last word on the bottom of this page, or the name of this website, or something like that.
Because I did it myself, and it's using questions specific to my website, it would be very difficult for a tool to:
a) detect that I'm using a filtering mechanism
b) answer any of the questions (since they would NOT match familiar patterns)
c) successfully submit any form without human attention to configure it.
The weak point of any spambot is that it must scale. It must be able to successfully spam millions of forms to make up for the poor response rate. Anything we can to do defeat scaling will help.
My approach is a little dicey as a general solution, since it depends on the site developer or maintainer to write good questions... any suggestions? What about a field that must be left blank, or the form will be rejected?
There are only 10 types of people: those who understand decimal, those who don't, and, uh, 8 other types I forget.
BotBlock offers the easiest to OCR CAPTHCA I ever cracked! Is there a market for selling a BotBlock crack as a web services to spammers?
It seems like all these clever bot deflectors are really intelligence tests of one form or another. That they discriminate against the blind, non-English-speakers or people with lower IQ is a shame. Bot makers will now work hard to OCR given classes of text-image-disruption algorithms or answer given classes of common sense questions. This means we will have an arms race of smarter bots and tougher tests.
At some point the tests will be so tough and the bots will be so good that many people will be thwarted while many bots will get through. At that point will we concede that computers have passed some form of Turing test?
Two wrongs don't make a right, but three lefts do.
I'm working on another version, which I believe is unique at this point. (At least I didn't find anything like in on Google a few weeks ago).
See a sample at the link below. (DISCLAIMER:: This site is a small self run hosting company, and has "sales" links, and is of commercial nature. So if you're going to get all pissed off because I am trying to feed my kids please do not click through. The sample does not collect or log anything outside of what Apache routinely collects. ) http://webshowhost.com/main.php?smPID=PHP::ui_huma n_verify.php&caseFlag=SAMPLE
What makes this implementation unique is that in the pattern user must identify color and characters. It combines multiple levels of recognition. The user must understand the concept of COLOR and the characters. This should make it particularly difficult for SPAM bots to dicipher, since color is very subjective. I am posting this here mainly to establish prior art (as I have not seen any test use these concepts before) in case some joker tries to patent this variety of CAPTCHA.
My variety integrates into a toolkit I've developed, but basically uses imagemagik montage to fuse pre-rendered image bitmaps into a single JPEG.
It is obviously weak in the sense that it discriminates against blind folks and illiterate folks. On the bright side it has definately eliminated ALL of my spam!
If your interested in this contact me at captcha1@webshowpro.com ** Note you'll have to verify yourself with the prototype system to sendmail to that account.
I'll do my best to provide you with the relevent code. I don't have time at this point to lead a project (as my company is a oneman show barely scraping by at this point). So my apologies in advance if I cannot support the code to your satisfaction.
And if they do, the worst they'll do is try to sell us penis enlargement pills, which is still preferable to a Terminator style apocalypse.
Even if you had an image that was 0% readable by OCR, image verification only stops "pure bot" spamming. It does not stop someone writing a helper or proxy app that presents them with a list of 1000 images that they type out in a very efficient manner. This could mean the difference between a million and a thousand spams per hour, but that's still a thousand spams per hour. And if you dismiss this as something that nobody would bother to do, you obviously don't know anything about spammers...
This is not the greatest sig in the world, this is just a tribute.