Defeating Captcha

Old news is no news. :-( by XorNand · 2005-08-24 05:25 · Score: 4, Informative

# Q. Where is the code? # A. No code is available yet. I am still pondering the pertinence of allowing code in the wild. The good old full-disclosure debate... If you think I should release the code for PWNtcha, feel free to explain your arguments to me.

::sigh:: The blurb leads one to believe that there's a new script kiddie tool in the wild. This is just someone's experiment with OCR and some AI. (And an old project at that; I remember reading this site about six months ago while working on my own Captcha implementation). There's a handful of researchers around the world doing the same type of work, including at team at UC Berkeley that devised a system that they claimed was 92% accurate... back in 2003. All in all, this isn't all that newsworthy.

--
Entrepreneur : (noun), French for "unemployed"

Re:Old news is no news. :-( by Cujo · 2005-08-24 05:28 · Score: 2, Interesting

The blurb leads one to believe that there's a new script kiddie tool in the wild.
I doubt it. I'm willing to give him the benefit of the doubt and assume he's just trying to make sure what he's doing is responsible by releassing the code. And what he's doing at this site is mainly pointing out the weaknesses in some common captchas.

--
Helium balloons want to be free.
Re:Old news is no news. :-( by drgonzo59 · 2005-08-24 06:54 · Score: 2, Insightful

The problem with with captcha stuff is that if it is so good that if the current OCR cannot read it, it is probably bad enough that even humans cannot read it.
I saw a couple of sites a while that used some captchas that you could barely read, which made it annoying and unusable.
What would make it much more difficult is if they combined captchas with pictures, or ask people a simple question with a captcha that would have a common sense answer. Like "what is 2+2=" and then alternate it with forms like "what is two plus two equal to" and such, combine such questions with stuff like "what color is the sky?" or "what is the 1st derivative of x^n with respect to x"... well, ok, maybe not this one...
Or how about blending images together. For example a picture of a dog and a cat on some background, also both transperenlty super-imposed with a small overlap. Then ask the question name the two animals in the picture?
How about asking the user to make a mouse gesture in an applet. (Did someone already implement this?). For example: "draw a circle with a small triangle in the middle" or "draw number '4'", then let the server use OCR to validate.
Re:Old news is no news. :-( by feargal · 2005-08-24 07:35 · Score: 3, Interesting

The problem with blending images and so on is that blind people still cannot see them.
This slide demonstrates the problem beautifully, I think.

With regard to the simple questions, that is indeed what I do, some simple trivia, and some basic maths, and the library is called SimpleQuestions.

"What colour is the sky?" is actually one of the questions, and the maths question do indeed vary in form, from expression to natural language.

The problem with the drawing requirement is that you're now blocking people who cannot draw.

--
"A goldfish was his muse, eternally amused"

mirrored by Anonymous Coward · 2005-08-24 05:25 · Score: 5, Informative

here

What Captcha is... by geders · 2005-08-24 05:27 · Score: 5, Informative

Whew, I had never even heard of Captcha before...

A captcha is a type of challenge-response test used in computing to determine whether or not the user is human.

Re:What Captcha is... by jd · 2005-08-24 05:43 · Score: 2, Funny

A test for humanness will not be convincing until it cuts out 70% of AOL users and 58.2% of Belgium. (58.2% of Belgian users would work, too.)

It would also have to be impossible for lawyers, tax collectors, marketroids and politicians to use. (Taxes are important, I'm just not convinced anyone in the IRS is biologically related to life on this planet.)

As of this time, Captcha fails this test and therefore is quite unsuitable. A better test would be a short quiz on the meaning of that day's Dilbert cartoon.

--
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
Re:What Captcha is... by slavemowgli · 2005-08-24 06:34 · Score: 2, Informative

I can understand AOL users, but... Belgians? Huh? Why Belgians? I've been to Belgium, and it's actually a very nice country with very nice (in general) people. Or are there any cliches I'm not aware of?

--
quidquid latine dictum sit altum videtur.
Re:What Captcha is... by toggleflipflop · 2005-08-24 06:34 · Score: 2, Informative

>A test for humanness will not be convincing until it cuts out 70% of AOL users and 58.2% of Belgium. (58.2% of Belgian users would work, too.)

Just got ditched by your Belgian girlfriend or what did we deserve this statement for?

At least we got good-tasting beer that can help you feel less bad about whatever is bothering you :-)

greets,
Tom
Re:What Captcha is... by The-Bus · 2005-08-24 07:54 · Score: 2, Interesting

God bless your monks.

Literally.

--
Small potatoes make the steak look bigger.
Re:What Captcha is... by Elwood+P+Dowd · 2005-08-24 08:39 · Score: 2, Insightful

Dunno what grandparent's problem is, but there's plenty of good beer here in the US too. We don't judge Belgian beer by Stella Artois, and y'all don't need to judge ours by Budweiser.

So long as we're talking about beer and not politics, America is fine.

--

There are no trails. There are no trees out here.
Re:What Captcha is... by cayenne8 · 2005-08-24 08:46 · Score: 2, Insightful

"That's the joke. Belgium is a very pleasant, mostly harmless country..on the whole Belgians themselves are extremely polite, well mannered..."
And...they do make GREAT beers!! Strong beers...
Which may in fact, explain the strange mayo on the french fries thing......
:-)

--
Light travels faster than sound. This is why some people appear bright until you hear them speak.........

spammer's low-tech way by Anonymous Coward · 2005-08-24 05:28 · Score: 5, Interesting

A while ago, I remember hearing about how some spammers whould post the Yahoo Mail (or other free email services) Captchas on the registration forms on pr0n sites. The pr0n registrants would have to fill out the Captcha, but this would then be used by the spammer to get around the Captcha without any fancy software.

Re:spammer's low-tech way by merreborn · 2005-08-24 05:36 · Score: 2, Informative

The best part is that *no* advance in captcha technology can really fix this. It's no longer a race against OCR technology, the whole can't be plugged by switching to object-based (rather than text based), neither can it be stopped by switching to audio-based captcha.
Re:spammer's low-tech way by jesup · 2005-08-24 05:51 · Score: 3, Insightful

It's trivial to hack a browser (hell, you don't even have to actually hack it, just know how it works) to snag the image for you. Then repeat as per grandparent (have a unwitting (or witting) human do it for you).

Next stage: make the captcha Java code that generates the warped image dynamically. Reponse: send the JS to the unwitting human.

Next stage: make the Java code generate the token using something intrinsic to the machine running it (IP, etc, etc). Response: snatch the image from display ram to present to the unwitting human.

Next stage: include in the image information about what the image is for (site, etc). Response: block those parts, or use witting humans who don't care or are otherwise paid (in porn, 3rd-world wages, etc).

You can make it progressively harder, but you can't make it impossible. You might be able to make it hard enough, though.
Re:spammer's low-tech way by Drooling+Iguana · 2005-08-24 06:22 · Score: 2, Insightful

By making everyone so pissed off at the state of the computer industry that they go back to using an abacus and slide rule?

--
... I'm addicted to placebos
Re:spammer's low-tech way by Goaway · 2005-08-24 06:22 · Score: 4, Informative

It originated as an off-hand remark by someone - maybe Cory Doctorow, I forget - as an example for a theoretical way to break captchas. This was quickly misremembered and blown out of proportion by people wanting to seem clever on Slashdot.
Re:spammer's low-tech way by Gordonjcp · 2005-08-24 06:30 · Score: 3, Funny

It's very difficult to get around this. Even using things with no text at all, such as the Cwazymail images, you still have this gaping hole that ne'er-do-wells will get in through.
Re:spammer's low-tech way by dgatwood · 2005-08-24 06:35 · Score: 4, Insightful

Most of these techniques could be defeated with a simple color filter, sadly.... Regardless, crypto is a really good comparison because a lot of crypto can be broken with statistical techniques, and in that regard, getting past Captcha grids can be done using very similar methodology.
Take a histogram of... say a hundred random subregious within the image of varying sizes and shapes. Sort colors by the number of these subregions in which they appear. Assume that colors that appear in every block (or above some threshold... say 90%) are background. Replace them all with white. Assume that colors that appear in only some of those blocks are foreground. Replace those colors with black. Do your OCR.
To some extent, you can get around that by masking parts of the text using the same color or by adding chunks of background in the same color, but this is only of limited effectiveness. The only way you can really defeat even the most basic stochastic analysis is by making the color information change from one side of the picture to another. Even then, unless this is done randomly in a dynamic fashion, once you manually figure out the gradation once, the mechanism is broken.
Basically, these things don't work even at a conceptual level. The fundamental problem is that you have a choice: either require the person to do something that doesn't require thought or require the person to solve problems that require logical thought.
In the case of the former, it can be obscured easily, but the level of thought needed can be easily simulated by a computer program, and any algorithm one could write to fool that program is inherently reversible. If the noise level is sufficient to make this impractical, it also will be unlikely that a human can read it, though with multiple tests, this could still work---more on this later..
In the case of the latter, the limitations to the reasonable size of the problem space mean that, while the computer can't simulate the intellect needed to actually figure out the example, it can trivially store a list of all of the problems and their answers and simply regurgitate the right answer on command, in much the same way that most lower animals can be trained to regurgitate an action on command even though they do not actually understand what the command means.
The only potentially viable mechanism for doing this sort of thing involves dynamic creation of the images using random number generators to perturb the image in ways that are of similar color to the test, using color variation on the text to fool stochastic methods, using foreground masking of the text (i.e. lines that go in front of the text, not just behind it), and using a wide enough variety of fonts, some of which should be things like cursive fonts with variable baselines. That really makes OCR mad.
If you do all of those things, you -might- have something that could only be broken by a computer a third of the time. The problem is that it could only be broken by a -human- about half of the time. If you do multiple tests, you should be able to establish a reasonable threshold above which the antagonist is likely to be a human rather than a piece of software, though even then, you will have to algorithmically change it frequently or else computers will eventually overtake humans no matter what your algorithm... because, quite frankly, computers are a lot better at DSP than we are. :-)

--
Check out my sci-fi/humor trilogy at PatriotsBooks.
Re:spammer's low-tech way by bobbozzo · 2005-08-24 08:55 · Score: 2, Informative

Udi Manber (while he was chief scientist at Yahoo) mentioned it was happening to Yahoo, during a presentation at UCR.

--
Nothing to see here; Move along.
Re:spammer's low-tech way by McGregorMortis · 2005-08-24 09:11 · Score: 4, Insightful

The thing is, then, the porn site asking you to solve the captcha doesn't know the answer themselves. You can screw 'em by giving the wrong answer.

They'll waste their resources trying to spam with the wrong answer, and you'll still get your porn fix.

rock paper scissors... by jpellino · 2005-08-24 05:29 · Score: 5, Funny

captcha stops bots
pwntcha breaks captcha
slashdot cremates pwntcha

--
"Win treats sysadmins better than users. Mac treats users better than sysadmins. Linux treats everyone like sysadmins."

Re:rock paper scissors... by swelke · 2005-08-24 06:45 · Score: 4, Funny

Working that backwards: Slashdot cremates pwntcha, un-breaking captcha. Un-breaking captcha un-stops bots. Therefore, slashdot un-stops bots. I was starting to think the whole slashdot system was just an automated method of destroying the internet; now I have proof. Thanks.

--
Have you ever wondered How to Take Over

Hmm by sexyrexy · 2005-08-24 05:30 · Score: 2, Interesting

While it is an interesting project from a hobbyist and academic standpoint, I'm not really sure what practical value it holds (unless the intent is to sell a mature algorithm to spammers, which is not the case since the project is being published). This is nothing more than a personal scripting project - no new forray into new concepts of computer science or pattern recognition; no new breakthroughs of computer-based heuristics.

--

Rex is 09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0

Re:Hmm by barawn · 2005-08-24 06:50 · Score: 2, Interesting
I'm not really sure what practical value it holds

Well, if you read the site, there's a list of reasons why certain captchas are bad.

For instance:
- Too few fonts (or only one font)
- Constant rotation (or no rotation)
- No deformation
- Constant colors
And a list of reasons why certain captchas are good. It's a pretty good summary of the strengths (and weaknesses) of a lot of them.

One thing you may notice is how complicated (and difficult to read as a human!) some of the broken ones are (like linuxfr.org, or vBulletin), and how easy to read (yet hard to defeat!) the ICQ one is.

One easy thing to take away from this page would be: if you have to have one, for crying out loud, use a ton of fonts and a ton of backgrounds.

ADA by dnoyeb · 2005-08-24 05:31 · Score: 5, Insightful

Having a legally blind mother that uses the web, I wonder how captcha complies with the Americans With Disabilities Act (when used by American companies of course)?

Is it compatible with BLINUX? I think by definition it is not.

Perhaps I should ask, what alternate method of identification do sights employ to take into account blind users and the ADA?

Re:ADA by jpatters · 2005-08-24 05:37 · Score: 2, Interesting

Audio captchas?

--
"Remember, there never were pineapple-almond cookies here."
Re:ADA by donnyspi · 2005-08-24 05:37 · Score: 4, Interesting

Instead of an image based Turing test like Captcha, I just have the last question on a log in screen or form be a randomly selected super easy question. For example, "Spell the number 7" or "What is the next logical number in the sequence 1, 3, 5, 7, ...? Check it out here: http://www.donnyspi.com/contact.php
Re:ADA by guardian-ct · 2005-08-24 05:51 · Score: 2, Interesting

Livejournal has a "If you can't read the text, type "AUDIO" and take a sound test instead." thing, and other sites have other ways around the visual test.

Unfortunately, not all sites have non-visual humanity tests.
Re:ADA by Tumbleweed · 2005-08-24 05:53 · Score: 5, Funny

I wonder how captcha complies with the Americans With Disabilities Act

Simple - they just use ALT text for the image! :)
Re:ADA by TheRaven64 · 2005-08-24 05:57 · Score: 4, Funny

Hmm. Done right, you could weed out bots and stupid people. Excellent!

--
I am TheRaven on Soylent News
Re:ADA by La+Gris · 2005-08-24 06:00 · Score: 2, Interesting

This is a real problem for visualy impaired and not only blinds.

Distored fonts, noisy lines, random dots and low contrast used in such pictures, makes it at least very hard or impossible to read.

Accessibility recommandations and W3C standards would require such important content, to be backuped with alternate formats like an audio record.

I believe these rules should apply not only to government sites.

But, I know no site, providing alternativ audio captcha for now. My husband and I, require a tier person to read most captchas actualy.

--
Léa Gris
Re:ADA by JadeNB · 2005-08-24 06:11 · Score: 2, Insightful

This solution is interesting, but surely not scaleable -- while captchas are, by design, easy for computers to generate but hard for them to solve, the same thing that prevents computers from solving `easy' problems will presumably also prevent them from generating `easy' problems.
Re:ADA by aardvarkjoe · 2005-08-24 06:16 · Score: 2, Insightful

For fun, I tried plugging five questions from your page into google. Of the five, three were answered directly by google, and one had the answer in the summary for the first result. Creating a parser to determine the right answer from the google results would take some work, but I would bet that a 50% accuracy rate is not unreasonable. A first, fairly obvious method, would be to take the summary of the first google result, remove all of the words that appeared in the original question, and pick from the remaining words.
Of course, as long as your system isn't widely used, nobody will bother to create tools to defeat it.

--

How can we continue to believe in a just universe and freedom to eat crackers if we have no ale?
Re:ADA by Anonymous Coward · 2005-08-24 06:31 · Score: 2, Funny

"What is the next logical number in the sequence 1, 3, 5, 7, ...?"

11. Oh, wait, you're not using octal?
Re:ADA by moeffju · 2005-08-24 09:30 · Score: 2, Informative

There are several programs doing the TREC (Text REtrieval Conference) Question Answering track that give you an accuracy of 80% upwards, and that's for hard questions like historical data on a huge corpus.

--
follow me on Twitter: http://twitter.com/moeffju

Consider the problem by ReformedExCon · 2005-08-24 05:31 · Score: 5, Insightful

The problem is that people are using robots to work in an autonomous manner to find ways around typical human limitations (we can only send several hundred emails a day, robots are not so limited). So people want to stop these "cheater" by making the user prove that they are a human rather than a robot.

Is this really a good thing, though? Even on a site like Slashdot, in a story about defeating bots, the very first comment in this story is posted by a bot. How ironic is that? What is accomplished by banning users who can't read these "captchas" (what a horrendous fake word)? Nothing, apparently, as the story says. It only serves to annoy legitimate users and does nothing to hamper illegitimate robots.

The solution is not this sort of halfway measure. The solution is to make it simply not worth the effort to be a nuisance on a discussion forum. I suppose that requires a glut of intelligent posters, but with the entire citizenry of the Internet available, that can't be so hard.

--
Jesus saved me from my past. He can save you as well.

Re:Consider the problem by A+beautiful+mind · 2005-08-24 05:57 · Score: 5, Insightful

"What is accomplished by banning users who can't read these "captchas" (what a horrendous fake word)? Nothing, apparently, as the story says."

I actually disagree. The captcha method reduces spam load for most sites down to zero. Only the bigger sites need to worry, because spammers may set up a site to specifically target them by rerouting captchas. That's not the case with 99% of the websites using captchas, it's just not worth the effort.

It's sorta like a copy protection: if it stops 90% of the people, then it's good enough.

--
It takes a man to suffer ignorance and smile
Be yourself no matter what they say

Mod parent up by XNormal · 2005-08-24 05:32 · Score: 3, Interesting

It's a cheap and scaleable method to defeat such algorithms. There will always be enough humans willing to do this for very little reward (some free pics).

--
Stop worrying about the risks of nuclear power and start worrying about the risks of not using nuclear power.

Rock paper scissors snorkel by Wilson_6500 · 2005-08-24 05:33 · Score: 2, Insightful

Uh, that game doesn't work unless, say, bots stop Slashdot. Otherwise everyone just picks Slashdot and it's fifth grade all over again.

It is patented by dmeranda · 2005-08-24 05:34 · Score: 3, Informative

This is a good study of how hard it is to design secure systems. It's just like a non-cryptographer trying to create their own cipher, only in the visual processing world. Sadly, the article does not touch on non-visual captchas, which are alternatives for the blind. It would also be interesting to see what Jakob Nielsen might have to say on this technology from a usability perspective.

Of course, one of the primary bad things is that the concept of a captcha is patented, and the patent language is very broad. US Patent# 6,195,698

Also see the Wikipedia article for more information.

Heh by hungrygrue · 2005-08-24 05:35 · Score: 4, Funny

Well I'm glad someone is writing code to solve those "prove you aren't a script" images, because a lot of times I can't quite figure them out myself.

"Q. What is your favorite color?.. No on second thought, nevermind that. What is written in this blob?"
A. I'm not sure, is this a rorschach test? Oh, I know 4 - 3 - Two flies mating - U - V - Giant Nose - X."

Its bad idea for several reasons by bogie · 2005-08-24 05:38 · Score: 4, Insightful

Chiefly among them is sometimes you can't tell what the fucking words are. Within the last few months on more than one occasion I simply could not read the letters because they were so distorted and the lines overlapped the letters too much. No fun redoing a web form over and over because you can't figure out what the hell the verification box says.

I can't imagine how people with difficulties cope with this.

--
If you wanna get rich, you know that payback is a bitch

Re:Its bad idea for several reasons by 0xABADC0DA · 2005-08-24 06:13 · Score: 2, Interesting

The sites with really good captcha's should run anti-captcha's... to filter out the *reallly* hard to read ones. =P

But there are still a lot of ways that haven't been used yet to make the image hard to read for the computer but less hard than the expreme distortions, such as overlapping letters and words. For example, if say only 25% of a word is covered up by another word on top of it, it should still be very easy for a normal person to read both words. Or use different colors and transparency. Or chain capchas together, for example one captcha that says "green" or "small" and another full of letters of various color/size/whatever. Then ask the user to enter the right code (ie, so they have to use reasoning instead of just pattern recognition).
Re:Its bad idea for several reasons by bill_mcgonigle · 2005-08-24 08:13 · Score: 2, Interesting

No fun redoing a web form over and over because you can't figure out what the hell the verification box says.

Yahoo! does this and it's asinine. I hit a captcha today that clearly had a ` character in it, but apparently it was a 'confuser' line, not a `. The rules for what character sets are valid are not given, so you don't know if punctuation is valid or not. Apparently it's invalid. How about case? A c and a C are pretty hard to discriminate when they're rendered along a Bezier curve.

Clearing the web form is no hinderance at all to a robot, but makes life difficult for humans. There's no excuse for pissing off users unnecessarily.

The Yahoo! web team is going down hill. The Groups code used to be able to register e-mail addresses with a '+' in it, but that broke recently. You can't get an e-mail into their bug support system. I've tried. I've argued with the helpbots. I lost.

--
My God, it's Full of Source!
OUTSIDE_IP=$(dig +short my.ip @outsideip.net)

Re:From the site... by Anonymous Coward · 2005-08-24 05:40 · Score: 3, Insightful

And then again, maybe he isn't. It doesn't really matter which library he uses for image import, does it? I mean, the interesting part would be the data structures and algorithms used in the "reverse-mapping" from image data to text. It's doubtful that the rudimentary processing methods provided by ImageMagick (although often a god-send of convenience and compatibility) would help here.

Not that this would stop you from plugging some random open-source software package. Even though your plug will probably do more Good-For-The-World than the rest of the discussion in this thread combined, your motives are still strange to me.

OCR wins by marked23 · 2005-08-24 05:40 · Score: 3, Funny

Once all these new algorithms get integrated into OCR software... OCR software might just work.

Interesting flash-based captcha by fahrvergnugen · 2005-08-24 05:41 · Score: 4, Interesting

I just saw a great flash-based Captcha designed to combat just this sort of attack. The test was composed of white text on a white background. Colored shapes of various sizes swirled in the background behind the text in a pseudo-random pattern, and the text was visible or obfuscated depending on whether there was a shape behind it at the moment. After watching it for a few minutes to see if there were any obvious flaws, I noticed that the entire phrase was never visible all at once.

A little patience was required, but I was able to verify in less than 10 seconds. Animation seems to be very useful for this kind of application.

--
Even Jesus hates listening to Creed.

Re:Interesting flash-based captcha by JimmehAH · 2005-08-24 05:55 · Score: 5, Insightful

You could just write the bot to decompile the .swf file and grab the string (or vector/raster representation of the text) from that.

Flash is a bad format to use for a CAPTCHA from a security and accessibility point of view.

Re:From the site... by tcopeland · 2005-08-24 05:47 · Score: 2, Insightful

> It doesn't really matter which library he
> uses for image import, does it?

I'd be interested in knowing what it is... but I may well be the only person on the planet that is interested.

> your motives are still strange to me

Most of the time I don't understand them myself!

--
The Army reading list

Try AuthImage for WordPress with a little tweaking by PeeAitchPee · 2005-08-24 05:52 · Score: 2, Interesting

Having to wade through 60+ spam comments a day on a WordPress blog (with all the stock antispam options enabled) just sucked . . . and the blog didn't even get much traffic (PageRank of 4). I installed the AuthImage plugin and used it on its stock settings, and for awhile didn't get a single bit of spam. Then, magically, it started up again. It seems some industrious little script kiddies have written a crawler to massively bombard AuthImage-enabled blogs with words from the stock word list. I switched from the wordlist file to randomly-generated strings and increased the size of the image for readability, and I never had another piece of comment spam in that blog again.

As for blind folks, I suppose every webmaster has to make that decision based on their target demographic, but I've seen a few text-only captchas that work well enough ("What color is an orange?") but will inevitably have the same limitation as the AuthImage word list above.

Easiest way to Defeat Captchas by Bondolo · 2005-08-24 05:53 · Score: 4, Interesting

Put up a "free" pr0n site.
Require visitors to the pr0n site to process a captcha before viewing the pr0n. In reality they are proxy processing a captcha for another site (paypal, hotmail, yahoo, etc.) which they never see.
Profit!

Captchas are next to useless and for the visually impaired very frustrating. One more of a example of a technology which annoys everyone and yet doesn't really stop the determined miscreant. <cough>airport shoe inspections</cough>

--
-- "Most people prefer a popular myth to an unpopular truth"

Captchas = Turing test by G4from128k · 2005-08-24 05:54 · Score: 4, Insightful

As with the Turing test, the entire purpose of a captcha is to distinguish humans from machines. As captcha-defeaters improve, the captchas will need to become more and more sophisticated and require more and more human or human-like intelligence to process. This arms race will culminate in a Turing test-like approach for discerning natural intelligences from artificial ones.

The ultimate irony may occur when the first human-intelligent computer is created by a spammer for the purpose of assaulting our collective intelligences with their commerical drivel. Given the increasing value of online commerce and Google page ranking, there's probably more money in AI for captchas than AI for academic research.

But before captchas get that sophisticated, the system will become self-defeating as the number of real humans defeated by captchas exceeds the number of AIs repelled by them.

--
Two wrongs don't make a right, but three lefts do.

Commentary on w3's captcha-inaccessibility page by davidwr · 2005-08-24 06:01 · Score: 2, Informative

The main article refers to Inaccessibilyt of Visually-Oriented Anti-Robot Tests, which deserves a read and commentary.

Among the claims:
- captchas are inaccessbile to the blind - true
- a horde of human beings can decode the entire library over time - only true if the images are recycled, not if they are created on-demand or for one-time use.

It also discusses some of the side-effects of making access to real humans harder, or harder for a class of users such as the visually impaired. For example, I've seen sites that say "If you cannot read this, call this phone number for access." Too bad for you if you don't have a phone.

As alternatives, it offers
- logic puzzles
- sound output
- credit-card validation
- live operators
- limited-use of unverified accounts, such as throttling for email
- behavior and heuristic analysis
- already-established credentials, such as single-sign-on systems or public-key-based systems
- biometrics

The article briefly discusses the pros and cons of each.

I rate its conclusion

"Visual verification alone is known to create problems with users. It is imperative that site designers take the needs of users with disabilities into account, and it is likewise hoped that one or more of these potential solutions can make that process easier."

as: insightful +5 obvious -1.

The article as a whole gets an "informative +5."

--
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.

Re: Disabilities by chato · 2005-08-24 06:07 · Score: 2, Informative

The W3C proposed in 2003 a number of Solutions for the Inaccessibility of Visually-Oriented Anti-Robot Tests, including logic puzzles, audio captchas, credit card validation, etc. It is interesting that they also show how a federated identity system can help users with disabilities.

Re:From the site... by the_mad_poster · 2005-08-24 06:18 · Score: 5, Informative

http://www.gh-sts.com/captcha.txt

This is what slashdot's previous iteration of a captcha looked like in an in-memory associative array after the intersecting lines had been removed and a de-skewing algorithm applied. There was actually a version of the code after that which properly picked out where the lines actually intersected the letters and didn't erase the intersecting section to create those gaps.

Before they switched to the newest CAPTCHA system, I was breaking their CAPTCHAs with a modified SS.pl script with almost 100% accuracy (it had a little trouble properly splitting up the text when a j or other similar character wrapped partially under another letter).

Of course, the new CAPTCHAs are much harder. I can't even read some of them myself, but the point is that breaking CAPTCHA that people can easily read usually isn't really that hard.

Yes, I used ImageMagick's Perlmagick library.

--
Alito: A vote for Alito is a punch in the eye to put that bitch back in her place!

Is that goatse I see? by themightythor · 2005-08-24 06:23 · Score: 3, Funny

In the table for "Cwazymail", I was trying to figure out what the pictures were. One's an elephant, one's an owl, and one is a man pulling apart his anus. Great!

Re:The GOATSE picture is NOT in the mirrordot by SoCalChris · 2005-08-24 06:34 · Score: 2, Informative

Nice, the site owner probably added it when he added the notice to slashdot readers.

That's why by Phil+John · 2005-08-24 06:47 · Score: 2, Interesting

all captchas should timeout after, oh, say 10 minutes?

In all honesty, do you really think you're going to get that many people to regularly visit a pr0n site? The sector is extreemly cut-throat and vastly bigger than the market can justifiably support (hence why many pr0n sites close each month).

The only way to get to the top of the engines in the first few months would be to use PPC advertising (costs money). After that, even if you get to the top of the SERPS by using nefarious means, you'll need to give people a viable reason to sign-up to your service, i.e. you'll need content which costs money (unless you want to steal it, at which point you can probably expect some real mean types to track you down and kill you, them porn businesspeople are crazy).

--
I am NaN

Re:Yet another problem hashcash can solve by xenocide2 · 2005-08-24 06:48 · Score: 2, Funny

I'm not sure what Hashcash does, but it sounds like I've already got a great idea for a counter-program: Hashcache.

--
I Browse at +4 Flamebait

Open Source Sysadmin

The linked page is NSFW by poincaraux · 2005-08-24 06:50 · Score: 4, Informative

Editors -

Please don't link to the goatse man without at least some warning.

Thanks.

Re:BFD by SComps · 2005-08-24 07:04 · Score: 3, Funny

that would be a draft beer yes?

--
The world according to SComps

Goatse Man by Inda · 2005-08-24 07:29 · Score: 5, Informative

Thanks for linking the Goatse Man image in the article. Oh how I've missed being tricked into viewing thee.

The link is not work safe.

--
This post contains benzene, nitrosamines, formaldehyde and hydrogen cyanide.

Re:Goatse Man by Siva · 2005-08-24 07:44 · Score: 3, Funny

But the image is distorted, so while you might be able to determine what it is, your Manager-Bot won't.

--

Keyboard not found.
Press F1 to continue.

Re:Okay, I know I'm going to feel like an idiot, b by operagost · 2005-08-24 07:58 · Score: 2, Funny

I'm from Holland. Isn't that veird?

--

Gamingmuseum.com: Give your 3D accelerator a rest.

Re:From the site... by cHiphead · 2005-08-24 07:59 · Score: 3, Informative

THIS IS ONE GIANT TROLL ARTICLE! LOL!

About 3/4ths down the page there is a goatse picture, and the caption at the top thanks the GNAA. Wake up slashdot.

--

This is my sig. There are many like it, but this one is mine.

Here's an idea by 5n3ak3rp1mp · 2005-08-24 09:29 · Score: 2, Interesting

I thought about this problem on a recent trip to the urinal and here's what I got.

1) Get (or construct) a large database of nouns of well-known objects (car, orange, bottle, phone, pencil, brick, cup, etc. etc.)

2) Retrieve image references from a (safesearch-enabled) Google image search for a random noun from your database. Pick randomly from the result set.

3) Present images to the user. "These are pictures of a..."

4) My next strategy was to figure out a combinatorial way to increase the number of possible replies so that an attacker couldn't simply create a database of knowns (such as a hash database of images)

What do you smart fellers think? other than google being pissed for scraping their site

Totally fake by VAXGeek · 2005-08-24 14:07 · Score: 4, Insightful

This article is a fraud. No source is presented, and goatse.cx is displayed in the examples. This whole thing was contrived just to get goatse.cx in a legitimate front page post. Best troll in years.

--
this sig limit is too small to put anything good h

Re:Prime Numbers? by ChadN · 2005-08-24 15:21 · Score: 2, Informative

1... is... not... a... prime...

For info on why, see the mathworld prime number entry.

Interestingly, it says that, at one time, 1 was considered prime and 2 was not. Pretty amazing, considering importance of the Fundamental Theorem of Arithmetic.

--
"It's overkill, of course. But you can never have too much overkill." - Anonymous Slashdot Coward

69 of 430 comments (clear)