Next-Generation CAPTCHA Exploits the Semantic Gap
captcha_fun writes "Researchers at Penn State have developed a patent-pending image-based CAPTCHA technology for next-generation computer authentication. A user is asked to pass two tests: (1) click the geometric center of an image within a composite image, and (2) annotate an image using a word selected from a list. These images shown to the users have fake colors, textures, and edges, based on a sequence of randomly-generated parameters. Computer vision and recognition algorithms, such as alipr, rely on original colors, textures, and shapes in order to interpret the semantic content of an image. Because of the endowed power of imagination, even without the correct color, texture, and shape information, humans can still pass the tests with ease. Until computers can 'imagine' what is missing from an image, robotic programs will be unable to pass these tests. The system is called IMAGINATION and you can try it out." This sounds promising given how broken current CAPTCHA technology is.
The general public will not know what "geometric" means*.
This Captcha suffers from the same old problem. As Captchas get harder more humans will fail them.
*or annotate... or centre
There are shills on slashdot. Apparently, I'm one of them.
It's already spotted that I am a computer and it won't even load.
who needs to write CAPTCHA exploits when you can just hire 50 chinese kids for 3 cents per day to create email accounts and send spam out for you?
Why don't we take a note from TV and have the user sing the missing lyrics of a classic hit. Even if they don't pass, it will make for much more fun around the computer, especially at the office.
Invenio via vel creo
All they need to do is offer free porn to people who solve the captchas and embed the captcha in their site. It doesn't matter how sophisticated the test is or hard it is for a machine to do it, they all have that fatal flaw.
Then there's also the option of paying Warcraft gold farmers to solve captchas and take a break from the game.
Alternative URL: http://wang.ist.psu.edu/docs/projects/imagination.html
www.purevolume.com/martyd
Any captcha with multiple choice answers is not a good one. 20 choices? So the computer gets by 1/20 of the time. Hmmm, how many attempts does it take to get 1000 e-mail accounts? As for "geometric center" note that all the images are rectangular. I haven't tried it, but writing a program to pull out all possible rectanges and then sort them on size, and pick the center of the one of the larger rectangles should do it. Why not a captcha that works with google. "Describe in one or two words what is in this picture", then use a google like search to match up the actual description with what the person typed. Person types "Dog" picture is a "Labrador Retriever" match.
TODO: create/find/steal funny sig.
...but some more info here as well as a (ugh) [a href="http://wang.ist.psu.edu/imagination/imagination.ppt">powerpoint and a user study with some samples.
Slashdot: stuff for news, nerds that matter, matter for news, stuff that nerd
This just reaffirms the article's conviction that the CAPTCHA is broken.
It annoyed me mightily the day slashdot introduced captchas for comments when you weren't already logged in. And somehow broke the login process from lynx.
Lynx is the geek slacker's greatest tool, when run in an ssh session from your home server, not only is the traffic unloggable (except for "he's calling home a bit") but it even looks like work to the uninitiated.
It is imaginable to create a model that describes speech characteristics in general and computer speech characteristics in particular. Any sound sample could compared with the two models. If it fits the wider speech model but not the computer speech model, then you would call it human speech. QED.
The ability to distinquish between two things does not imply that you'll be able to generate them effectively (unless the search space is very narrow). Imagine it this way: you can probably distinguish Chinese from Spanish. That does not imply you speak either language.
Ok, so I was able to do the image analysis one, where they take an image, muck with the color, draw a bunch of black lines over it, and then ask you to annotate it with a word from a list.
This is no better, and may be worse, than what we have now, for two reasons.
1) If you fill in the gaps programmatically, and then make the image grayscale, you probably have something you can use for image matching.
2) Much more severely: The interface reduces the number of possible answers by multiple orders of magnitude. For the one I saw I think there were 10 or 15 answers. Even if you kick image recognition to the curb and randomly choose an answer, you'll be right 1/15 times. It'd be trivial to write a program to harvest hundreds of accounts in a day by just picking random answers. Hand that off to a botnet or similar, and this becomes a minor speedbump.
~D
This sig has been enciphered with a one-time pad. It could say almost anything.
How do you protect the sign-up page to get an OpenID? With a captcha?
Reduce, reuse, cycle
Wikipedia does this by restricting what new accounts and non-logged-in accounts can do.
If free mail servers put restrictions on what new accounts could do, with an override to anyone who is willing to go to a lot of trouble to prove they are human, it would short-circuit the spammer problem.
If Yahoo, Gmail, etc. all limited you to 10 outgoing mail recipients a day until you had both 1) had the service for 1 day and replied to 10 messages, AND limited you to 100 outgoing mail recipients a day until you signed up to be a "high volume sender," it would cut most spammers off at the knees. Depending on the service, being a "high volume sender" may involve turning over a credit card number and may not be free. Some services may give "loyalty awards" to long-term customers by removing this restriction for people who have had their accounts for 6 months and show a heavy non-spammy ad-revenue-generating usage pattern.
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
I can already see how this is going to go.
"You stole my sig!"
"No I didn't."
"Yes you did, it's exactly the same as mine!"
"No it isn't."
"Yes it is!"
"No it isn't. Look, mine is in two lines."
"That hardly makes a difference."
"Yes it does!"
"No it doesn't."
- None can love freedom heartily, but good men; the rest love not freedom, but license. -- John Milton
The saddest poem
I like this better:
http://www.hotcaptcha.com/