Slashdot Mirror


Google ReCAPTCHA Cracked In New Automated Attack

An anonymous reader writes: A trio of security researchers have devised a new automated attack that can break the CAPTCHA systems employed by Google and Facebook. On Google's reCAPTCHA system, researchers recorded a 70.78 percent success rate over 2,235 CAPTCHAs. Average CAPTCHA solving time was 19.2 seconds. They achieved a better success rate on Facebook's system, where they had a success rate of 83.5 percent on over 200 CAPTCHAs, but this was mainly because of higher quality images, and photos were selected from different topics, and were also easier to recognize and classify. For attackers, the whole automated system would cost only $110 a day, per IP address, and would allow them to crack around 63,000 CAPTCHAs in 24 hours from one IP address without being detected and getting banned.

66 comments

  1. turk by Anonymous Coward · · Score: 0

    would anonymous turk not be cheaper?

    1. Re:turk by AvitarX · · Score: 1

      you think 5 a cent on Mechanical Turk?

      --
      Wow, sent an e-mail as suggested when clicking on "use classic" banner, and got a fast response that addressed my msg
  2. dammit by Kkloe · · Score: 3, Funny

    now how are we going to stop terminator infiltrators at the door when skynet rises

    1. Re:dammit by Thanshin · · Score: 1

      "You're in a desert, walking along in the sand when all of a sudden you look down and see a tortoise. It's crawling toward you..."

    2. Re:dammit by Anonymous Coward · · Score: 1

      know your paradoxes

      1. Stand still
      2. Remain calm
      3. Scream

      "This statement is false!"
      "New Mission: Refuse this mission!"
      "Does a set of all sets contain itself?"

      Aperature

      Yeah, they screwed up the Barber paradox. :(

    3. Re:dammit by Big+Hairy+Ian · · Score: 2

      "You're in a desert, walking along in the sand when all of a sudden you look down and see a tortoise. It's crawling toward you..."

      Holden: Describe in single words only the good things that come into your mind about... your mother.

      Leon: My mother?

      Holden: Yeah.

      Leon: Let me tell you about my mother.

      I fairness to the guy any member of the Palin family would probably have had the same response

      --

      Build a Man a Fire, and He'll Be Warm for a Day. Set a Man on Fire, and He'll Be Warm for the Rest of His Life.

    4. Re:dammit by Thanshin · · Score: 2

      1. Stand still
      2. Remain calm
      3. Scream

      Paradox? This looks like standard operational meeting procedure.

    5. Re:dammit by Anonymous Coward · · Score: 0

      Wow, still haven't gotten over your Palin Derangement Syndrome? How you sought professional help?

    6. Re:dammit by lgw · · Score: 1

      This was played so well in Ghost in the Shell SAC, where the more advanced AIs took out a less advanced AI this way, mocking it for not being able to handle such a simple trick. Tachikomas remain my favorite AIs from all SF, and the story had the best telling of how dealing with rogue military AIs would realistically go (no spoilers).

      --
      Socialism: a lie told by totalitarians and believed by fools.
  3. Captcha cracking using AI is a losing battle by 140Mandak262Jamuna · · Score: 4, Interesting

    Captcha generation can be scaled up quite cheaply and the cracking it automatically does not scale well. But why bother to create a complex system to mimic a human brain, when human brain itself is available for hire for a pittance? You could hire someone in India to manually solve some 30 to 60 captcha an hour for about 100 Rs per hour, or less than $1.50. This method of cracking captcha is unbeatable because, you can not make Captcha more difficult without hampering legitimate users.

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
    1. Re:Captcha cracking using AI is a losing battle by Anonymous Coward · · Score: 0

      the fucking recapcha is already unusable. 9 of 10 times i get something totally unreadable and just random letters similar to others and overly distorted you can't possibly figure it out.

    2. Re:Captcha cracking using AI is a losing battle by Anonymous Coward · · Score: 0

      You could however make captchas more difficult for someone who is not from the user's social environment, kind of like the age verification questions that games used: Ask for pop culture references that someone younger than a certain age wouldn't know, or in this case, someone from India would have difficulty answering quickly.

    3. Re:Captcha cracking using AI is a losing battle by Anonymous Coward · · Score: 1

      From the article: "Our completely offline captcha-breaking system is comparable to a professional solving service in both accuracy and attack duration, with the added benefit of not incurring any cost on the attacker." From the researcher paper: "When taking into account the flexibility, 321 (44.3%) of the captchas were solved. The average solving time for the challenges that received a solution was 22.5 seconds. While the accuracy may increase over time as the human solvers become more accustomed to the image reCaptcha, it is evident that our system is a cost-effective alternative."

    4. Re:Captcha cracking using AI is a losing battle by 140Mandak262Jamuna · · Score: 1

      True for the current generation of the captchas. Once google improves its captcha, the cost of upgrading the cracking software and training it would be very high. Human beings, should adapt instantly.

      --
      sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
    5. Re:Captcha cracking using AI is a losing battle by Anonymous Coward · · Score: 0

      So make CAPTCHAs that are hard for typical humans in India and related countries to solve.

      Fast speech. Use knowledge that only a real person from that country would know.

      Yes, yes, it's not a full-proof solution. But slowing them down is costing them money and reducing their efficacy (possibly to the point of negating the attack vector for common situations).

    6. Re:Captcha cracking using AI is a losing battle by wvmarle · · Score: 1

      Better use the bot.

      It's been a while I've had a ReCAPTCHA to solve, but the stats posted seem much better than my attempts (maybe 30% correct, after numerous reloads to get something I think I can read... so frustrating). Those Indians may get better with practice but those modern day captchas are just getting too hard for normal humans to solve!

    7. Re:Captcha cracking using AI is a losing battle by joboss · · Score: 1

      You can crowd source. I've done personal security experiments with this kind of thing. You basically just proxy the captcha and make a frame work for doing that. You have a normal captcha system that before making a captcha first checks a queue of ones already available and uses one of those if present. The user input is forwarded and the response used. If there is nothing in the queue it uses the normal captcha mechanism. Obviously you want some kind of timeout. The bot/s can simply keep getting new captchas until one is fullfilled. The bot puts captchas on the queue, and takes answers off the queue. The bot also puts results on the queue. The captcha library pulls captchas from the queue and puts answers on the queue. You have several methods of dissemination. First you can put it on your own sites. Second you can disseminate the library normally and webdevs will simply download and use it, similarly you can offer it as a service (API, etc) and they may never figure it out. Finally you can put it on hacked websites. I came up with this solution years ago but have never heard of it being used. I expect that eventually I will. Especially since I posted it. Someone will tell me of a case or go implement it themselves. Someone is going to get rich now with their captcha breaking API.

    8. Re:Captcha cracking using AI is a losing battle by bhcompy · · Score: 1

      Please select all images that reflect different forms of Goku

    9. Re:Captcha cracking using AI is a losing battle by Anonymous Coward · · Score: 0

      It's not a new concept. I've heard people talking about using high traffic shady (porn and warez) sites for this for at least a decade. I don't know if anyone has ever actually implemented it.

    10. Re: Captcha cracking using AI is a losing battle by Anonymous Coward · · Score: 0

      Please assign a name to a friend's picture

    11. Re:Captcha cracking using AI is a losing battle by aliquis · · Score: 1

      80% accuracy rate is better than I have with some of them.

      Some are simply too hard.

  4. Deep Learning/Neural Net by wardrich86 · · Score: 2

    Wouldn't it be neat if Google's very own system was being used to crack their CAPTCHA system?

    1. Re:Deep Learning/Neural Net by shawn2772 · · Score: 1

      Wouldn't it be neat if Google's very own system was being used to crack their CAPTCHA system?

      What's cool is that Google's reCAPTCHA system is being used by Google Maps to improve street address localization, using images of street numbers captured by StreetView cars. People are asked to extract numbers from images that Google's automated number extraction system couldn't get, or wasn't sure about its results. Yes, this means the first few times a given image is presented to a human, the system isn't sure what the correct answer is which means it passes some people/bots it should not, but it also means that to beat reCAPTCHA, you need to build a better, cleverer OCR system for street numbers than Google has.

      This means that Google's next step is to get the details from the researchers and use the information to improve its own number-recognition system, which will simultaneously make Google Maps work better and improve reCAPTCHA... at least until the automated system is roughly as good at recognizing street numbers as humans are. That, of course, is the logical conclusion of the captcha race, computers will get better at whatever tasks are presented, until we reach a point that we have no way to distinguish between a bot and a human.

      It's a form of the Turing test, I suppose.

    2. Re:Deep Learning/Neural Net by religionofpeas · · Score: 1

      which will simultaneously make Google Maps work better and improve reCAPTCHA.

      I don't know if reCAPTCHA will be improved. It will make it harder, for sure, but it's already dangerously close to the point where it's getting too hard for humans.

    3. Re:Deep Learning/Neural Net by Anonymous Coward · · Score: 0
    4. Re:Deep Learning/Neural Net by shawn2772 · · Score: 1

      which will simultaneously make Google Maps work better and improve reCAPTCHA.

      I don't know if reCAPTCHA will be improved. It will make it harder, for sure, but it's already dangerously close to the point where it's getting too hard for humans.

      Perhaps. I don't have any trouble with reCAPTCHAs. I do with other captcha systems, but Google's seems pretty easy for humans to me. So far.

    5. Re:Deep Learning/Neural Net by wardrich86 · · Score: 1

      It's been a long time since I've seen a Captcha that wasn't a simple "[ ] I am not a robot" followed by a simple "Click on all the pictures of [FOO]" if it happens to think you might be a robot.

    6. Re:Deep Learning/Neural Net by Wycliffe · · Score: 1

      Yes, this means the first few times a given image is presented to a human, the system isn't sure what the correct answer is which means it passes some people/bots it should not

      No, it doesn't. That's why recaptcha always presents you two images. One image is the test where they know the correct answer because X number of people have already told them the correct answer while the other image is the image that they don't yet know the correct answer for. You really only need to know the answer to one of the images to pass but you never know which one. If one of the images is completely unreadable then it is likely that it is the unsolved one so just making something up for that spot will likely still let you pass.

      Recaptcha is a great platform because it's not a computer trying to beat a computer so it's easy to stay one step ahead of any computer program. For instance it would be easy to switch it to "identify this object" or even "identify this emotion". Computer vision has a long way before it can identify photos as well as humans.

    7. Re: Deep Learning/Neural Net by Anonymous Coward · · Score: 0

      Mostly, really recently, I've just had to click on "I'm not a robot" and I'm done.

  5. Hmmm .... by gstoddart · · Score: 1

    So I'm a little rusty on doing shady things on the intertubes which could get me banned ...

    For attackers, the whole automated system would cost only $110 a day, per IP address, and would allow them to crack around 63,000 CAPTCHAs in 24 hours from one IP address without being detected and getting banned.

    And I would be doing this ... why?

    So I can spam Google and Facebook? Really, it's that lucrative that you'd spend $110/day/IP?

    I've never even seen a Captcha for Google, and I really have no idea of when you'd see them, or why you'd pay to break them.

    Is the interweb so utterly broken that people are paying to get past to spam discussion boards? Oh, hell, what am I saying, of course it is.

    --
    Lost at C:>. Found at C.
    1. Re:Hmmm .... by pla · · Score: 2

      I've never even seen a Captcha for Google, and I really have no idea of when you'd see them, or why you'd pay to break them.

      If you do a bunch of searches in quick succession, it will occasionally ask you to solve one. Seems kinda random, though, some days I can search for half an hour as fast as I can type without getting one, while others I get a captcha after my third attempt to refine the results.

    2. Re:Hmmm .... by Anonymous Coward · · Score: 1

      And I would be doing this ... why?

      If you have to ask, you'll never know...

    3. Re:Hmmm .... by gstoddart · · Score: 2

      If you have to ask, you'll never know...

      Google has a Room of Requirement?

      Oh, man, I never get to have any fun.

      --
      Lost at C:>. Found at C.
    4. Re:Hmmm .... by Anonymous Coward · · Score: 0

      "Is the interweb so utterly broken that people are paying to get past to spam discussion boards?" No, reCAPTCHAs are used to protect government portals sometimes. Nobody's using automated attacks to post forum spam. They use it to go around login screens and CloudFlare Tor limits.

    5. Re:Hmmm .... by Anonymous Coward · · Score: 0

      I'm curious why they think they can crack 63000 captchas in 24 hours from one IP address without being detected. Seems like it's be pretty easy to detect. Maybe google's current limitations are set to allow 64k captchas in 24 hours, but that's WAY more you should be receiving. Even with NAT or IPs getting reused on wifi networks, I suspect it would be fairly unusual to see even 100 captcha's successfully completed in 24 hours.

    6. Re:Hmmm .... by Anonymous Coward · · Score: 0

      No. So you can spam slashdot (if they used google's recaptcha, or facebook's). There are a TON of sites that use them to prevent spammers, just like the captcha you get here on slashdot if you try and post anonymously without logging in.

    7. Re:Hmmm .... by AvitarX · · Score: 1

      My phone uses carrier grade NAT.

      I don't know how many devices per an IP, but it seems plausible that thousands could happen.

      --
      Wow, sent an e-mail as suggested when clicking on "use classic" banner, and got a fast response that addressed my msg
    8. Re:Hmmm .... by Anonymous Coward · · Score: 1

      It's partly related to the search terms aswell. If you use things like site: or inurl: the risk of a captcha is much higher.

    9. Re:Hmmm .... by Anonymous Coward · · Score: 0

      Outside of securities, which would be easier to circumvent through hack, it is useful for hiding identity when committing to hack research. For pure monetary, many contests, etc, are on timers, with the capthas somewhat used to slow down entries and allow for more, while cutting down on users doing multiple entries, however, all of these actions can be scripted until they hit captcha, so if you can integrate a bot, even if you utilized the service for one day, it could be rather lucrative. Interestingly, if one were to go the route of paying someone for this service, that would be in itself, a compromise of their own securities and may leave a trail.

  6. Cost analysis from article differs from summary by Hwaguy · · Score: 3, Interesting

    I'm not sure where the the article summary got its notion about the costs. The article doesn't address that- instead it spoke to how much could be made selling the service. From the article:

    Assuming a selling price of $2 per 1,000 solved captchas, our token harvesting attack could accrue $104 - $110 daily, per host (i.e., IP address). By leveraging proxy services and running multiple attacks in parallel, this amount could be significantly higher for a single machine.

    I think the authors of the article were trying to communicate how much money they could make selling this 'service' to other unsavory agents. It could be a lucrative business given the assumed market rates of $2 per 1k, and the mentioned optimizations could make it even more attractive. It makes me wonder if you could set up the whole thing in a cloud computing environment like AWS and come out ahead.

  7. 70% That's better than I can do by Registered+Coward+v2 · · Score: 2

    trying to enter them as a real human being. Seriously, the captcha system is broken because as long as there is a monetary value to breaking it someone will, even if it is simply paying a few cents per capture to break them to a human in some low wage country. The only authentication system I have seen that didn't rely on a separate hardware device for authentication that was worth a damn were those that, rather than requiring a selection or inputing what you see on screen, asked a question that only someone family with the topic would know. For example, I've seen engineering bulletin boards ask for the name of a specific type of beam, automotive ones that ask something unique to the marque, etc. so automating a process to gain entry is not practical. Of course, one you know the answer you can easily create multiple accounts, but these boards also limited posting ability for a set period of time and or required a secondary confirmation before gaining full access to limit the drive by spamming of EXCELLENT QUALITY!!! YOU BUY CHEAP!!! DESIGNER!!! posts.

    --
    I'm a consultant - I convert gibberish into cash-flow.
    1. Re:70% That's better than I can do by wvmarle · · Score: 1

      For example, I've seen engineering bulletin boards ask for the name of a specific type of beam, automotive ones that ask something unique to the marque, etc. so automating a process to gain entry is not practical.

      Works mostly due to obscurity and there being millions of tiny boards out there, a single human intervention and it's completely broken.

    2. Re:70% That's better than I can do by shawn2772 · · Score: 1

      The only authentication system I have seen that didn't rely on a separate hardware device for authentication that was worth a damn were those that, rather than requiring a selection or inputing what you see on screen, asked a question that only someone family with the topic would know. For example, I've seen engineering bulletin boards ask for the name of a specific type of beam, automotive ones that ask something unique to the marque, etc. so automating a process to gain entry is not practical.

      Nonsense. Those are weaker than the general-purpose ones. They draw on knowledge from a relatively obscure area, but it's very unlikely that they have a wide selection of questions/answers. All you need is a knowledgeable human to work his or her way through the question database providing answers for the bot to use, and it's broken. Of course, the value of creating large numbers of fake accounts on such systems is so small that it doesn't matter. Honestly, their goal probably isn't to keep out bots at all, but to make the forum hard for people outside of their target audience to access.

    3. Re:70% That's better than I can do by Registered+Coward+v2 · · Score: 1

      For example, I've seen engineering bulletin boards ask for the name of a specific type of beam, automotive ones that ask something unique to the marque, etc. so automating a process to gain entry is not practical.

      Works mostly due to obscurity and there being millions of tiny boards out there, a single human intervention and it's completely broken.

      I concur, which is why the decent ones add in a another layer to try to frustrate the drive by's.

      --
      I'm a consultant - I convert gibberish into cash-flow.
    4. Re:70% That's better than I can do by Registered+Coward+v2 · · Score: 1

      The only authentication system I have seen that didn't rely on a separate hardware device for authentication that was worth a damn were those that, rather than requiring a selection or inputing what you see on screen, asked a question that only someone family with the topic would know. For example, I've seen engineering bulletin boards ask for the name of a specific type of beam, automotive ones that ask something unique to the marque, etc. so automating a process to gain entry is not practical.

      Nonsense. Those are weaker than the general-purpose ones. They draw on knowledge from a relatively obscure area, but it's very unlikely that they have a wide selection of questions/answers. All you need is a knowledgeable human to work his or her way through the question database providing answers for the bot to use, and it's broken.

      While I agree it certainly can be broken, and once an answer is identified it is a very weak system, but then again a bot would have to look for clues in questions and understand context because the even if you have the same words in a question they could have different answers depending on context. A second layer of validation is easier for smaller sites but then they aren't really targets as you point out.

      Of course, the value of creating large numbers of fake accounts on such systems is so small that it doesn't matter. Honestly, their goal probably isn't to keep out bots at all, but to make the forum hard for people outside of their target audience to access.

      Certainly, most are so obscure they aren't worth the trouble, so making it labor intensive operation for a small return is counter productive; though I would guess bots are outside their target audience as well.

      --
      I'm a consultant - I convert gibberish into cash-flow.
    5. Re:70% That's better than I can do by Anonymous Coward · · Score: 0

      I went to sign up on a forum for a differential equation solver library (FEniCS). The captcha asked me to solve an integral. I was just a sysadmin trying to get this software working for my customers. Luckily I have a degree in scientific computing so I was able to solve it, but I found the approach you describe rather obnoxious in this case. The domain knowledge approach also discourages noobies from participating.

  8. Use newer system by Anonymous Coward · · Score: 0

    reCaptcha has a newer system where no image is shown but instead the user clicks a checkbox. It performs some math to slow down attackers since it requires client compute time.

    reCaptcha was defeated years ago using simple open source OCR libraries.

    The newer style (client-side compute time cost) achieves the only realistic goal which is to slow down spammers since captcha cannot stop them.

    1. Re:Use newer system by war4peace · · Score: 1

      I guess you're referring to the "I am not a robot" checkbox?

      --
      ...gis sdrawkcab (usually not responding to ACs; don't bother posting as AC)
    2. Re:Use newer system by VorpalRodent · · Score: 1

      Yes. They refer to this as "NoCaptcha".

      --
      Take it to the limit, everybody to the limit, come on, everybody fhqwhgads.
  9. Bots can be easily beaten by fb0r · · Score: 1

    Just make the recaptcha a minimum length of twelve characters, use sans serif only and make it consist solely of capital i's and lower case l's.

  10. Right by Anonymous Coward · · Score: 0

    and would allow them to crack around 63,000 CAPTCHAs in 24 hours from one IP address without being detected and getting banned.

    That's an interesting assumption. It would be ridiculously easy to detect.
    Route the incoming requests evenly across your pool of servers depending on the range it's IP address belongs to.
    Each server keeps a count of requests for each incoming IP Address. 1 byte per unique IP address let's say. You could store the entire range of IPv4 in 4GB of storage. If you've got a 1024 servers in your pool that's only 4MB.
    Reset the count every so often and start keeping count for however long required. If an IP hits a preset limit, add a rule to the firewall.
    That just happened. (drops mic...walks off stage)

    1. Re:Right by Anonymous Coward · · Score: 0

      Do I get a software patent awarded for that?
      That's the problem with the patent system. Patents allowed for retardedly simple shit.

    2. Re:Right by KGIII · · Score: 1

      That you, not logged in, Karmashock?

      --
      "So long and thanks for all the fish."
  11. No human could possibly solve that by Anonymous Coward · · Score: 0

    Pretty soon, the test will have to be inverted to detect that you are human only if you get the captcha WRONG.

  12. Oh crap... by Ecuador · · Score: 1

    I already have trouble deciphering some captchas, having need for a second (or even third) try in some occasions, which is REALLY annoying. If they make them any harder, I think only bots will be able to solve them and not humans...
    Wait, is that the next generation of captcha? You are human if you fail?

    --
    Violence is the last refuge of the incompetent. Polar Scope Align for iOS
    1. Re:Oh crap... by gstoddart · · Score: 1

      Well, it's simple ... hire a bot to solve the captchas for you.

      --
      Lost at C:>. Found at C.
    2. Re:Oh crap... by operagost · · Score: 2

      I already tried to do that, but in order to sign up for the bot, the company made me solve a captcha.

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
    3. Re:Oh crap... by gstoddart · · Score: 2

      It's bots and capchtas all the way down. ;-)

      --
      Lost at C:>. Found at C.
  13. I'm dyslexic - can I buy a copy of this software? by Anonymous Coward · · Score: 1

    Seriously, I hate the CAPTCHA. I have a very hard tiem reading them. Some site, it takes me 4 or 5 tries to get it right. You hit the "voice" and half the time I cannot understand it. There are many types of dyslexia - they all seem to be gathered under one umbrella diagnosis - but sometiems, for some of us, who mispronounce - mishear the exact same words we mistype and spell

    CAPTCHA for people like me who are dyslexic is exactly the same thing as putting child proof/resistant tops on bottles of pain killers for people suffering arthritis. There are days I am so fustraited with the CAPTCHA , I can only imagine in my head that the people who came up with this crap are either brain dead stupid or very sadistic.

    Oh yeah, for the Slashdot CAPTHCA, I i had to play the sound as well as look at the word to get it. It's not as bad as some others, but still a PITA.

  14. Obsolete in a few years by Anonymous Coward · · Score: 0

    Once a cryptocurrency can actually handle microtransactions (maybe bitcoin this summer?) then we won't need CAPTCHAs at all - just charge $0.01 per submission or something. Instead of making spammers burn the cost, have them send it to the would-be victim.

  15. Re:I'm dyslexic - can I buy a copy of this softwar by KGIII · · Score: 1

    Heh... I'm partially colorblind and it appears to be getting worse with age. I can usually still recognize patterns but I have issues with certain colors and various shades. I have, on the other hand, learned to not argue with people when they tell me something is a different color than what I said it was. At first, I thought people were just fucking with me. It wasn't too bad when I was younger but it's not that great now. Blue, gray? Yellow, orange? Red, orange? Fucked if I can be certain. I seriously thought people were just fucking with me at first. But no, no... I'm partially colorblind.

    --
    "So long and thanks for all the fish."
  16. Re:I'm dyslexic - can I buy a copy of this softwar by Anonymous Coward · · Score: 0

    your post shows you are doing well. congratulations! keep on trucking, man!

  17. Robot better than humans by loufoque · · Score: 1

    Those numbers are better than I am at this.