Slashdot Mirror


Yahoo CAPTCHA Hacked

Hell Yeah! reminds us of a 2-week-old development that somehow escaped notice here. A team of Russian hackers has found a way to decipher a Yahoo CAPTCHA, thought to be one of the most difficult, with 35% accuracy. The Russian group's notice, posted by one "John Wane," is dated January 16. This site hosts a rapidshare link to what looks to be demonstration software for Windows, and quotes the Russian researchers: "It's not necessary to achieve high degree of accuracy when designing automated recognition software. The accuracy of 15% is enough when attacker is able to run 100,000 tries per day, taking into the consideration the price of not automated recognition — one cent per one CAPTCHA."

50 of 252 comments (clear)

  1. I thought those things were already broken by Anonymous Coward · · Score: 5, Funny

    by having a teenage boy do it in exchange for letting him see porn.

    1. Re:I thought those things were already broken by 2.7182 · · Score: 4, Insightful

      I think the parent is serious. The idea is that your robot goes and grabs the images that needs to be decoded. Then on another website, it is presented and you can see free porn if you type in the word. I've heard of this but never read about it. Sounds like a good idea. Anyone know what this is called or some references ?

    2. Re:I thought those things were already broken by Rageon · · Score: 2, Insightful

      No idea where I first read this, but I too remembering reading something very similar to the "solve the captcha for porn" idea.

    3. Re:I thought those things were already broken by rthomas6 · · Score: 4, Informative

      http://news.bbc.co.uk/2/hi/technology/7067962.stm
      Here is a link to a BBC article about something like that. It's a Windows program that rewards typing in captchas by showing a woman that takes off progressively more and more clothes.

    4. Re:I thought those things were already broken by 2.7182 · · Score: 2, Informative
    5. Re:I thought those things were already broken by kesuki · · Score: 4, Interesting

      that's why it costs 1 cent per 1 captcha, the overall cost of webhosting the porn for exchange boils down to 1 cent per solved captcha. obviously, if you're hosting on root-kited windows boxes in the us (the highest rate of infection is in the us) the cost is still about 1 cent per one captcha because the cost of paying hackers to keep a bot net sizable enough comes to about the same cost.

      especially with sp3 coming out now, the cost of bot nets is higher, since sp3 offers a 'easy' bot net removal path, since staying off-line long enough to get all sp2's flaws patched is crucial in preventing reinfection. believe me, having a root-kit installed is easy even for a veteran computer guy to miss.

      i have dvd's i burned almost 3 years ago that reinfect any windows machine with a root-kit, and are un-readable in linux, apparently the root-kit was using some hooks in nero burning rom to 'randomly' pick a burn project and put the root-kit installer on there so when windows tried to auto run it would install the root-kit, then show the 'window' that normally shows up on auto-run would show up. the rootkit took an 'extra' session, that was transparent, eg: it would only show using burning software to read the track data, for the burned cd or dvd. no additional files showed up in windows, but the extra session made it unreadable to linux.

      also, the root-kit only runs in a 'blank' screen saver, which it protects and makes sure loads when the system is idle, so it never sends data when the user might be there to notice. and i think it sends the data as like, internet explorer, to bypass firewall rules. since none of the firewalls i tried could block it. i actually only found the original root kit when a second root-kit moved the first root-kit's files to the recycle bin. other than that none of the root kit scanners that were recommended to me could even detect this thing. only the 'symptoms' and the fact i could 'remove them' by staying off-line and not using my old discs were proof that i had a root kit.

      symptoms included, auto-run becoming disabled, screen saver always resetting to 15 minutes (only when both root-kits were on there), and the 'desktop' showing up 2-3 times a day when in full-screen games (also only with both root kits), and finding root-kit files in recycle bin(only found on networked systems with the root kit, and didn't return on reinstall of both root-kit, likely was a 1 time 'bug' that was fixed later on)

      so yeah, I didn't notice it for 3 years. Not that i usually have to deal with virus, but in the past I had only ever had to deal with 3 virus and in my 15 years online. and the third one was really a root-kit. I've also been using open-source software for 11 years, so that probably helped, of course, one of the virus was one that affected my open source software, the other 2 were windows based.

      it's still easy to miss windows root-kit's nowadays, especially when hackers have root-kits that aren't published, and they use scripts to make the exe's have unique signatures (using compiler tricks) for known root-kits.

    6. Re:I thought those things were already broken by novakyu · · Score: 3, Informative

      that's why it costs 1 cent per 1 captcha, the overall cost of webhosting the porn for exchange boils down to 1 cent per solved captcha. Er, where did you get that number? At Nearly Free Speech, it only costs $1 / GB (of transfer), and that's how much it would cost nearly anywhere else (or even less!), if you use significant amount of bandwidth.

      I don't know exactly how large porn images are, never having looked at them, but if you guess a round number of 0.1 MB per picture, it's only about $0.0001, or 0.01 cent per captcha. I suppose it's better than nothing, but it's not yet very cost-prohibitive.
    7. Re:I thought those things were already broken by Anonymous Coward · · Score: 3, Funny

      I don't know exactly how large porn images are, never having looked at them.

      Posting on /. and you've never seen porn? Bullshit.

    8. Re:I thought those things were already broken by nb+caffeine · · Score: 3, Funny

      Maybe he only watches movies

      --

      "Something's wrong with you...and I hope we never do meet again." - Deftones When Girls Telephone Boys
  2. Hey by Misanthrope · · Score: 5, Funny

    They're used to seeing Cyrillic, the captcha has got to be easier to read!

    1. Re:Hey by Janek+Kozicki · · Score: 4, Interesting

      The 3D captcha seems to be a good solution here (that's a link from wikipedia article)

      You pick several 3d models, like people, chairs or flowers. Name all their parts, like "chair leg", "human head" etc. The CAPTCHA is generated by placing a several 3D models randomly rotated on a scene and rendering them with easily readable letters "A", "B" placed on the named parts. The captcha questions are: "what is the letter on human head", "what is the letter on chair leg", etc..

      People can answer pretty easily. The 3D models are always randomly placed and rotated on a scene, so bots have a problem.

      --
      #
      #\ @ ? Colonize Mars
      #
  3. Not really news by Anonymous Coward · · Score: 5, Insightful

    A few months ago Yahoo introduced a CAPTCHA to prevent bots entering their chatrooms. Within a few days every room on yahoo was filled with bots once more, and still are to this day.

    Given the current situation of the chat rooms on yahoo, it comes as no suprise at all that the other parts of the Yahoo system are inadequately protected from bots either.

    1. Re:Not really news by Hojima · · Score: 2, Insightful

      Probably the best thing I can come up with in order to prevent bots is have a recognition question of some sort. Just have a picture of something simple and ask what it is (a dog for instance), or have a very simple question like, "Is Paris Hilton a whore?"

    2. Re:Not really news by ookabooka · · Score: 2, Interesting

      Heh, yeah. . . .I used to hook up my computer using Rybka to yahoo chess. I played against other bots, other players(always a glorious win), and tolerated the unending spam from other bots that would just want you to go to some porn website. Eventually, they instituted a CAPTCHA. . .Oh noes, my bot was broken. Turns out I could just manually enter the CAPTCHA and grab the session ID info before the applet loaded and forward that manually to the bot. Once I'm "logged in" with the bot, it's no big deal. Point is: If a spammer has to type in one CAPTCHA and can then spam for days in God knows how many chat rooms. . is it really that effective? Should we interrupt logged in users with more CAPTCHA's? Quite the interesting problem indeed, perhaps some sort of feedback where people would mark someone as a bot, if enough people did it, it would present the bot with a CAPTCHA. *shrug*

      --
      If you are about to mod me down, keep in mind that this post was most likely sarcastic.
  4. captcha security by primadd · · Score: 2, Interesting

    I did my own captcha, but I'm not sure how much its worth - figured any non-standard one is better than none (or a std one).

    Please take a look - are the effects actually helping the recognition process?

    --
    social bookmarking widget for your site

    1. Re:captcha security by Kaitnieks · · Score: 2, Informative

      The letters are too far away from each other - makes it easy to separate them for proccessing. In fact, the only challenging aspect for OCRs in your captcha is the letter rotation/skewing. However, I don't think anyone will bother to write a captcha OCR for your site, unless it's Yahoo sized.

    2. Re:captcha security by Carnildo · · Score: 3, Informative

      The character outlines are nicely distinct, which means that even basic OCR software should be able to break the CAPTCHA. Since it's so easy to break, you want to hide it from any bots that come by: remove all references to "captcha" from the page source, and you might want to move the HTML for the image away from the HTML for the entry box.

      --
      "They redundantly repeated themselves over and over again incessantly without end ad infinitum" -- ibid.
    3. Re:captcha security by yani · · Score: 4, Informative
      Although it seems counter-intuitive, character recognition (even with your filtering) is a relatively easy problem for a computer to solve. The hard problem is segmentation. It is relatively easy for a human to segment characters when they are somehow joined together, by artifacts or occlusion, it can be very hard to do with current methods.

      Hence all good modern captchas have moved away from character recognition captchas (such as yours) to segmentation based captchas. You only need to read the wikipedia article on CAPTCHAs to see some examples: http://en.wikipedia.org/wiki/Captcha.

  5. Re:Gentlemen, start your spambots by xaxa · · Score: 3, Insightful

    Natural language processing etc:

    To register, answer these questions and click the button on the right
    What colour are buses in London?
    What is three times three?
    [Red] [Green] [Blue]

  6. That's really impressive. by heyguy · · Score: 5, Insightful

    I've found Yahoo's CAPTCHA to be really annoying. I probably get it wrong about 20% of the time because the picture is so distorted (and I've been surprised that I got it right a lot of the time). I even considered writing them an email complaining about it, but then I realized they probably don't give a crap.

  7. Lets all say it togeter. by twotailakitsune · · Score: 2, Insightful

    We hate CAPTCHA. Most thing they do to make it difficult for computers to decode, make it a lot more difficult for humans to decode. Most of them are not usable by text browsers (dah), and the blind. Some have audio that is hard for people to hear, and sill easy for computer to decode. Last, CAPTCHA's are so over used that people just do them without thinking. For all you know that Porn/ware site is using you to do CAPTCHA for them. Not that it is needed. This is just one more nail in the CAPTCHA coffin.

  8. Only Yahoo? by Sigma+7 · · Score: 4, Informative

    33% of Yahoo capitchas isn't really impressive - you still get a large quantity of negative hits, and unless you have an array of IP addresses (most people don't), there will still be a large quantity of addresses registered from a given IP. Also, a large quantity of negatives would cast doubt on any positive matches from the same IP.

    Also, Yahoo captchas aren't that "hard" - they are black text from known font pools on a white background that get slightly warped and have black lines drawn on some characters. This is hardly strong since it doesn't hit all letters within the word (which is done by reCAPTCHA) or use a large font-pool variety.

    Even the Slashdot Captcha is harder - it hits the whole image and uses different fonts within the word.

  9. Re:Gentlemen, start your spambots by SoupGuru · · Score: 5, Funny

    That reminds me of the age check for Leisure Suit Larry back in the day... Who knew that the desire of a horny teen to see pixellated boobs would lead to history research?

    --
    What doesn't kill you only delays the inevitable
  10. Re:Malware by wellingtonsteve · · Score: 4, Funny

    without a chord is fine... ...it's when you're missing a cord that you need to worry

  11. Re:Gentlemen, start your spambots by paeanblack · · Score: 4, Insightful

    To register, answer these questions and click the button on the right
    What colour are buses in London?
    What is three times three?
    [Red] [Green] [Blue]


    Yes, those are undoubtedly hard questions for a computer. How, exactly, do you plan to generate billions of these questions? For a CAPTCHA to work, it must still be hard even if the generation algorithm is public knowledge.

  12. Re:Gentlemen, start your spambots by driftingwalrus · · Score: 3, Insightful

    What about introducing spelling and grammatical errors? This would be difficult for a computer to interpret, but doable for a human.

    --
    Paul Anderson
    "I drank WHAT?!" -- Socrates
  13. 35%??? by wbren · · Score: 3, Informative

    I'm impressed. That's better than I can do. Some CAPTCHAs take me five or six tries to get right.

    --
    -William Brendel
    1. Re:35%??? by GiMP · · Score: 3, Insightful

      I agree, that is better than I normally do as well. Maybe someone could make this a firefox plugin so that mere mortals can actually access webpages that use CAPTCHAs.

      It is sad because with corrective lenses, my vision is 20/20, and I'm highly technical. I should not have any problems with CAPTCHAs; However, my grandmother is another story. She has poor vision, can't figure out how to do a carriage return on her computer, has difficulty understanding the concept of scrollbars, and I'm sure would not be able to deal with even the easiest CAPTCHAs in use today. This is not usability. Granted, given the choice between SPAM or CAPTCHAs, I'll chose the lesser of the two evils...

  14. Re:Google Hacks by Anonymous Coward · · Score: 2, Interesting

    Are you bashing MS just to bash them. Honestly, their so called 'stupid system' is the best thing I've seen out there. Please enlighten me wise one, and link me to a better alternative.

    p.s. How do you know that Gmail accounts haven't been hacked into? Do you have data validating this?

    It's not a challenge to bash MS, that comes way to easy, but to add some useful content to /. , might be a challenge for yourself, wise one.

  15. Re:Malware by bcdm · · Score: 2, Funny

    Hey now, be fair...what's the point of bungee jumping if you can't have "Thunderstruck" or similar playing on the way down?

    Jumping without a chord would be no fun at all.

    --
    I can has sig?
  16. Warning on playing with the demo by xynopsis · · Score: 5, Insightful

    Did anyone notice that the image recognition code is imported from a binary DLL? I was under the impression that the Russian hackers would provide the source for the recognition code as well. But then, the people who released this are only interested in generating as much spam. Why should you trust them? You would be foolish enough to _not_ execute your test program that imports this dll in a vmware instance instead of your actual machine. Anybody done a comprehensive strace to determine sockets/descriptors opened by using this dll?

  17. Re:Gentlemen, start your spambots by LordLucless · · Score: 2, Insightful

    Not really. After a couple of (thousand) runs through, the attacker would have a reasonably accurate database of the questions. They can then analyze the text to find the nearest match to one of the questions in its database.

    --
    Just because you're paranoid doesn't mean there isn't an invisible demon about to eat your face
  18. Lease time on a botnet... by POttedPOrk · · Score: 2, Insightful

    Botnets have a whole bunch of IP addresses. Simply deploy your Yahoo CAPTCHA cracker code on a botnet that some other fine internet entrepreneur has assembled, and it doesn't matter how many negatives you generate because they will be from a variety of hosts. Certainly with 33% success rate, you're doing pretty well, especially considering your typical spray-and-pray spam blitz.

  19. Re:Gentlemen, start your spambots by General+Wesc · · Score: 2, Funny

    What about introducing spelling and grammatical errors? This would be difficult for a computer to interpret, but doable for a human.

    Yeah, that would solve the problem until someone developed an automated program to check spelling and grammar, which I'm sure is near-imposible. (By the way, does anyone know why there's a red line under that last word? Is my screen screwed up?)

  20. Re:Gentlemen, start your spambots by TubeSteak · · Score: 2, Funny

    What about introducing spelling and grammatical errors? This would be difficult for a computer to interpret, but doable for a human. LoL! I find ur 1d3as fascntng, & wood lik 2 sbscrbe 2 YR noozl3ter.
    kthxby
    --
    [Fuck Beta]
    o0t!
  21. Re:Dynamic forms? by Loplin · · Score: 2, Interesting

    >What about the form that is around the captcha, generally a new account application, etc? What if those were to be made dynamic so the automated software trying to look for a hard-coded form fail?

    Even if this were dynamic, there is only so many possible methods of displaying a form while still letting it be decipherable by a human. Given this limited set of possibilities, the programmer of a spam bot needs only to take into account any possible page mutations. More likely though, the spammer doesn't even look at a certain spot on the page; they probably do a little javascript to search the DOM for all text boxes and all images and ignores any images it already has copies of, the remainder image is likely the captcha. Then they would just search for context clues around the text boxes to see which box is most likely to be the one that accepts the captcha answer.

    >Or better yet, have questions that modern computer AI has yet to break. Show a picture of a circle and ask "is this round?" or "is this not round?". Generally make the questions a bit more complex as AI gets better.

    This is also suffers from the problem of limited number of possibilities. If someone can spend time putting questions in, someone can spend time filling in answers, and they only have to fill in answers once, after that, the bot can remember them for the next time it sees the same question.

    If some sort of AI was used that could ask common sense questions, like cyc, the problem would be that the spammers have access to the very same AI.

    The leading thought is that AI is not going to create better CAPTCHAs, but that bots that break CAPTCHAs are going to create better AI.

    >I wonder if there could be some sort of AI research project that works in conjunction with a captcha system.
    Not exactly AI, but the reCACPTCHA project does uses CAPTCHAs to decipher text that OCR programs can't when scanning books.

  22. Gee, Ya THINK by buss_error · · Score: 3, Insightful

    Yahoo!'s captcha has been hacked, perhaps not as well, in the past. I've seen open http proxies pounding away at Yahoo to the tune of 100,000 per hour and more. Hotmail's is broken, so are others. The real shame is that the Storm Worm controllers are being protected by a national government and law enforecement system.

    So what's the answer?

    I'm sure I don't know. I do know that the wild west theory of accepting any kind of behaviour isn't acceptable. I know that some minimum standard of what's allowed and what isn't is going to have to take place. Where these limits are placed is a thing for a global conversation, and there will be differances of opinion.

    Is cracking a captcha acceptalbe? Is phishing and identity theft acceptable? Is fraud and uncontrolled spam acceptable? What limits, and on what actions?

    I'm just not that smart. But I think we can agree on a few things. Let's start to find out what those things are... and acting in concert with other network operators to enforce those standards. Fail to meet them, and your network routing gets dropped...

    --
    Necessity is the plea for every infringement of human freedom. It is the argument of tyrants; it is the creed of slaves.
  23. Other interesting work on CAPTCHAs by ChoppedBroccoli · · Score: 3, Interesting

    Segmentation and intersecting arcs can be difficult for automated attacks: http://portal.acm.org/citation.cfm?id=1054972.1055070

    You know those annoying flash advertisement games (shoot the monkey for a free iPod)? Well, they could potentially be adapted for CAPTCHAs as well: http://cups.cs.cmu.edu/soups/2006/posters/misra-poster_abstract.pdf

  24. Re:Gentlemen, start your spambots by omeomi · · Score: 5, Insightful

    Not really. After a couple of (thousand) runs through, the attacker would have a reasonably accurate database of the questions. They can then analyze the text to find the nearest match to one of the questions in its database.

    That's true. I've found, however, that introducing custom spam blocking methods, such as this, no matter how easy to break, often does a better job at stopping spam bots than more robust publicly available methods. For a target as big as Yahoo, this probably won't work, but I've found on PHPbb for instance, instead of using any of the publicly available captchas, which are easily defeated by bots, creating a simple question of this sort does wonders for bot-blocking. Even if it's just one question. If your site isn't big enough to be specifically targeted by bot farmers, sometimes a simple solution is better than a more complex one that everybody else is using.

  25. Re:Gentlemen, start your spambots by Artefacto · · Score: 2, Funny

    That's still not as good as this solution. I can't understand why it's not widely adopted.

  26. Re: Imposible red lining. by bornwaysouth · · Score: 2, Funny

    Red lining ( a motoring term) comes from tiping too fast, typing to fist, typing two farst, um, using more than one finger per hand.

    The key is to never type faster than your brains alpha rhythm. Otherwise, you slide into a meditative zone known as 'T-pool bimbo limbo'. On the other hand, I've generally found typists to be saner than managers, so maybe the mediative zone is a defense mechanism. The frontal cortex contemplates what's for dinner tonight while some low reptilian region recognizes scrawled letters and types them.

    Which leads back to the main topic.
    What is the lowest animal life that could be trained to log into Yahoo?

  27. Yahoo fails even with captcha by MeditationSensation · · Score: 2, Informative

    If you've ever tried the Yahoo chatrooms, you know they're overrun by spam bots. The problem wasn't with the captcha, it was that it challenged users only once and at the beginning of the session. So as long as your spam bot didn't appear idle or lose connection, it could stay on indefinitely. Now with the captcha broken, spammers don't even have to do captchas manually.

  28. Random Coloration Photos by copponex · · Score: 2, Interesting

    (if anyone uses this and makes a million, at least cut me in 10% for the idea)

    I gather the last frontier for computers is image recognition. I'm not sure of the state of image processing, but if you could randomly color simple pictures (one flower, one pen, one cup (NO PUN INTENDED)) into about twenty different shades, and get about a hundred different photos, and just start rotating two or three a week in. So the user sees a small photo with radio boxes below:

    The cup is ()red ()blue ()green ()purple ()orange ()yellow orange
    The flower petals are ()orange ()blue ()brown ()black
    The pen is ()grey ()black ()yellow

    You could even start throwing in random names for the colors (silver, charcoal, etc.) using it in sentences, combine with shape guesses (the longer pens are what color? the biggest cup is what color?) Either that or use tiny bits of flash with motion. (the bouncing flower is what color? the flashing red object is what?)

    I say a few thousand different sites armed with the same "screen green" paint and tens of thousands of different photos could throw up somewhat of a roadblock.

    What say ye?

    1. Re:Random Coloration Photos by jsoderba · · Score: 3, Insightful

      I say that a lot of people are color blind.

    2. Re:Random Coloration Photos by mgblst · · Score: 2, Insightful

      Well, it is about time we got rid of those mutants anyway. Nobody is interested in what they have to say.

  29. Re:Gentlemen, start your spambots by nazanne · · Score: 3, Interesting

    That has been my experience, too. I admin a small bb and was having horrible problems with spam sign ups. CAPTCHAs didn't slow the spammers down at all. I went to a simple question that will be easily known by all of my target audience but probably won't be known by someone half way around the world entering CAPTCHAs for a penny a piece and allowed any spelling that is even close. I haven't had any spammers sign up for a couple years now. That obviously won't work for a major target like YAHOO though.

  30. Re:Gentlemen, start your spambots by aliquis · · Score: 5, Funny

    Just put some hard to read perl code in there and ask the user to say what it does. If the answer is correct it's a bot, if the answer is wrong it's probably a human ;)

  31. What about accessibility by kylehase · · Score: 2, Informative

    The topic of "are you human" was covered on Security Now a while back and someone brought up a great point. Tools to deter bots also makes it difficult for accessibility software since they use many of the same concepts as bots. Even audio captchas are no longer a strong bot deterrence.

    With advocacy groups like the National Federation of the Blind suing Target for their inaccessible website it'll be a very tough challenge to develop new good captchas while maintaining accessibility to everyone.

    On another note, could an organization representing the mathematically challenged sue companies using math captchas?

    --
    You want fun, go home and buy a monkey!
  32. Re:Gentlemen, start your spambots by goatpunch · · Score: 2, Informative

    I have a little site, only really intended to share stuff with family and friends, served with custom scripts. I couldn't believe it when it was targetted by spammers. I could even see the test posts they made, checking to see if html was allowed etc., before unleashing the the bot to post dozens of links a day.

  33. What about i18n? by gr8dude · · Score: 2, Informative

    As these CAPTCHAs get more complicated, it becomes more difficult for non-speakers of the language to interpret them.