Next-Generation CAPTCHA Exploits the Semantic Gap
captcha_fun writes "Researchers at Penn State have developed a patent-pending image-based CAPTCHA technology for next-generation computer authentication. A user is asked to pass two tests: (1) click the geometric center of an image within a composite image, and (2) annotate an image using a word selected from a list. These images shown to the users have fake colors, textures, and edges, based on a sequence of randomly-generated parameters. Computer vision and recognition algorithms, such as alipr, rely on original colors, textures, and shapes in order to interpret the semantic content of an image. Because of the endowed power of imagination, even without the correct color, texture, and shape information, humans can still pass the tests with ease. Until computers can 'imagine' what is missing from an image, robotic programs will be unable to pass these tests. The system is called IMAGINATION and you can try it out." This sounds promising given how broken current CAPTCHA technology is.
The general public will not know what "geometric" means*.
This Captcha suffers from the same old problem. As Captchas get harder more humans will fail them.
*or annotate... or centre
There are shills on slashdot. Apparently, I'm one of them.
who needs to write CAPTCHA exploits when you can just hire 50 chinese kids for 3 cents per day to create email accounts and send spam out for you?
All they need to do is offer free porn to people who solve the captchas and embed the captcha in their site. It doesn't matter how sophisticated the test is or hard it is for a machine to do it, they all have that fatal flaw.
Then there's also the option of paying Warcraft gold farmers to solve captchas and take a break from the game.
Any captcha with multiple choice answers is not a good one. 20 choices? So the computer gets by 1/20 of the time. Hmmm, how many attempts does it take to get 1000 e-mail accounts? As for "geometric center" note that all the images are rectangular. I haven't tried it, but writing a program to pull out all possible rectanges and then sort them on size, and pick the center of the one of the larger rectangles should do it. Why not a captcha that works with google. "Describe in one or two words what is in this picture", then use a google like search to match up the actual description with what the person typed. Person types "Dog" picture is a "Labrador Retriever" match.
TODO: create/find/steal funny sig.
Like airport security, CAPTCHA puts a tremendous burden on the innocent people just because they cannot detect the terrorists.
How is CAPTCHA broken and how is it "technology"?
It is not broken because it works as it is suppose to. I would think the correct term would be "solved" or "been overcome".
Technology-wise, CAPTCHA is a workaround, not a solution. The real problem is automated bots manipulating forms where the webmaster only wants humans. Detecting whether or not the visitor is an automaton would be the solution, but because people have apparently given up on this, they have resorted to trying to detect whether or not the visitor is human.
It is imaginable to create a model that describes speech characteristics in general and computer speech characteristics in particular. Any sound sample could compared with the two models. If it fits the wider speech model but not the computer speech model, then you would call it human speech. QED.
The ability to distinquish between two things does not imply that you'll be able to generate them effectively (unless the search space is very narrow). Imagine it this way: you can probably distinguish Chinese from Spanish. That does not imply you speak either language.
the image is huge. plus its two steps. also, the annotation part... i wasn't actually *sure* i was answering correctly. it looked like they were near water... boat was an option... didn't look like a boat... but nothing else really made sense... well, 'cept there was a guy in the picture and "man" was a choice as well... but i went with boat cause the guy didn't seem to be the focus. nonetheless, it required effort to reason it out. i don't want my captcha taking up more than 2 seconds, let alone like 30 seconds.
Spammers will still just pay sweatshop workers to solve these, won't they? What does this solve?
Pretty soon they'll just set up a "free porn" site - free access so long as you solve a captcha to get in.
It's been threatened and talked about before, all it needs is something "unbreakable" like this to actually make it happen.
No sig today...
Ok, so I was able to do the image analysis one, where they take an image, muck with the color, draw a bunch of black lines over it, and then ask you to annotate it with a word from a list.
This is no better, and may be worse, than what we have now, for two reasons.
1) If you fill in the gaps programmatically, and then make the image grayscale, you probably have something you can use for image matching.
2) Much more severely: The interface reduces the number of possible answers by multiple orders of magnitude. For the one I saw I think there were 10 or 15 answers. Even if you kick image recognition to the curb and randomly choose an answer, you'll be right 1/15 times. It'd be trivial to write a program to harvest hundreds of accounts in a day by just picking random answers. Hand that off to a botnet or similar, and this becomes a minor speedbump.
~D
This sig has been enciphered with a one-time pad. It could say almost anything.
How do you protect the sign-up page to get an OpenID? With a captcha?
Reduce, reuse, cycle
I for one welcome this development. The more complex are CAPTCHA to solve, the less is the number of idiots in the tubes.
I do not believe in karma. "Funny"=-6. Do good and forbid evil. Yours, Oft-Offtopic Flamebaiting Troll.
Wikipedia does this by restricting what new accounts and non-logged-in accounts can do.
If free mail servers put restrictions on what new accounts could do, with an override to anyone who is willing to go to a lot of trouble to prove they are human, it would short-circuit the spammer problem.
If Yahoo, Gmail, etc. all limited you to 10 outgoing mail recipients a day until you had both 1) had the service for 1 day and replied to 10 messages, AND limited you to 100 outgoing mail recipients a day until you signed up to be a "high volume sender," it would cut most spammers off at the knees. Depending on the service, being a "high volume sender" may involve turning over a credit card number and may not be free. Some services may give "loyalty awards" to long-term customers by removing this restriction for people who have had their accounts for 6 months and show a heavy non-spammy ad-revenue-generating usage pattern.
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
I know this was meant as a funny comment, but if you'll notice, of the replies to your comment there are as many different responses as there are posts!
The more we "exploit the semantic gap" the more problems like this are likely to arise.
The answer is "Slashdotting", but where do I type it? I can't figure this CAPTCHA out...
FWIW you don't need a dedicated HTTP proxy, as SSH has a built-in SOCKS proxy. Try it out some time: ssh -D 1080 remote.tld and configure your browser of choice to use SOCKS on localhost port 1080. For other apps that don't have native support for proxying, check out proxychains (on Unix). Not only great for browsing at work, but also a godsend for unsecured wireless nets.
Did anyone else go "What the F***?!" when they read the instructions? That was the crappiest most pain-in-the-ass 'captcha' I've ever seen. Geometric Center of any image you can figure out? Annotate the image? not exactly simple or unobtrusive, is it? I got it right away, maybe it'd be ok if someone can write layman-legible instructions. I just guessed at when the heck they were asking for...but it wasn't immediately obvious. Anyone care to enlightned me as to what's so wrong with the current (10 or 20 different types) of Captchas? I don't see them being broken/spammed all over the place, so it's boviously not too bad...