A Vision For a World Free of CAPTCHAs

← Back to Stories (view on slashdot.org)

A Vision For a World Free of CAPTCHAs

Posted by Soulskill on Friday April 24, 2009 @06:12PM from the is-that-an-oh-or-a-zero dept.

An anonymous reader writes "Slate argues that we're going about verifying humans on the Web all wrong: 'As Alan Turing laid out in the 1950 paper that postulated his test, the goal is to determine whether a computer can behave like a human, not perform tasks that a human can. The reason CAPTCHAs have a term limit is that they measure ability, not behavior. ... the random, circuitous way that people interact with Web pages — the scrolling and highlighting and typing and retyping — would be very difficult for a bot to mimic. A system that could capture the way humans interact with forms algorithmically could eventually relieve humans of the need to prove anything altogether.' Seems smart, if an algorithm could actually do that."

131 of 168 comments (clear)

Min score:

Reason:

Sort:

Just a Thought... by ryanleary · 2009-04-24 18:14 · Score: 5, Insightful

It seems to me that if you can design an algorithm to verify how humans interact with a computer, it should be relatively trivial to engineer an algorithm that mimics this interaction?
Maybe someone smarter than I could clarify?
1. Re:Just a Thought... by Nazlfrag · 2009-04-24 18:17 · Score: 5, Insightful
  
  Using anything other than a human to judge the behaviour puts it outside of the Turing test. So not only does their proposed solution not match the goal they set, it should indeed be defeatable by another algorithm.
2. Re:Just a Thought... by l3prador · 2009-04-24 18:25 · Score: 4, Insightful
  
  Yep. If you can characterize the behavior pattern enough to automatically determine that it's "human-like," then you can automatically generate "human-like" behavior. The only way around it that I can see is if there is some sort of asymmetrical information involved, such as the invisible form honeypot mentioned in TFA--the website's creator (and thus the bot-detection script) knows that there is an invisible form present, but it's difficult for a script to see without rendering the site in standards compliant CSS.
3. Re:Just a Thought... by Z00L00K · 2009-04-24 18:25 · Score: 1
  
  Aren't many of those things like captchas circumvented by a trial and error methodology?
  What if you get three tries and then a blacklisted IP address? Not that the poster will realize that it's blacklisted, just that the tries to crack the captcha won't work, even if it's the correct answer.
  
  --
  If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
4. Re:Just a Thought... by roblarky · 2009-04-24 18:25 · Score: 1
  
  Right. The problem can only be solved if the Internet removes all forms of anonymity. Otherwise, it's just jumping through hoops which a bot can emulate.
5. Re:Just a Thought... by Anonymous Coward · 2009-04-24 18:39 · Score: 3, Insightful
  
  So if I have an algorithm that can verify an integer factorization quickly, it means there must be an algorithm that can factor any integer quickly? How would that work?
6. Re:Just a Thought... by MichaelSmith · 2009-04-24 18:57 · Score: 1
  
  Right. The problem can only be solved if the Internet removes all forms of anonymity. Otherwise, it's just jumping through hoops which a bot can emulate.
  We could see zombies skimming cemeteries for unused human identities.
  
  --
  http://michaelsmith.id.au
7. Re:Just a Thought... by RiotingPacifist · 2009-04-24 18:59 · Score: 1
  
  but it's difficult for a script to see without rendering the site in standards compliant CSS.
  But with many OpenSource web browsers, would it be that hard to work out what is rendered and what is not? it seams that bots could even run a hidden tab of firefox/chrome on a victims computer if they had to. I suppose it does make cracking capatchas computationally more difficult but isn't OCR much more intensive than rendering a page (wait why not just put capatchas in terribly codded flash apps)?
  
  --
  IranAir Flight 655 never forget!
8. Re:Just a Thought... by RiotingPacifist · 2009-04-24 19:01 · Score: 3, Insightful
  
  If you have a botnet then a single computer probably dosen't need to try a site more often than a human would.
  
  --
  IranAir Flight 655 never forget!
9. Re:Just a Thought... by cjfs · 2009-04-24 19:03 · Score: 2, Funny
  
  It seems to me that if you can design an algorithm to verify how humans interact with a computer, it should be relatively trivial to engineer an algorithm that mimics this interaction?
  Maybe someone smarter than I could clarify?
  You're looking at this all backwards. This isn't the humans attempting to prevent access to the bots. It's the bots getting the humans to speed up their evolutionary arms race.
  Think of it, bots trying to determine bot from non-bot. Bots honing their human-infiltration skills vs the best of the bots. It'll be the greatest leap since spam filtering. We'll^WThey'll be getting +5s again on Slashdot in no time!
10. Re:Just a Thought... by julesh · 2009-04-24 19:16 · Score: 3, Insightful
  
  It seems to me that if you can design an algorithm to verify how humans interact with a computer, it should be relatively trivial to engineer an algorithm that mimics this interaction?
  Maybe someone smarter than I could clarify?
  Sometimes it's easier to write an algorithm that checks that something is correct than to generate that something in the first place. An example: if you have a public key, checking a message is signed with it is fairly easy; signing a message with it is hard, because it requires you to factor the key.
  I see no evidence that "human behaviour" is such an algorithm. It might be, but we're way too far off understanding it to be able to make any sensible guesses in this field.
  A simplified approach is doomed to failure; simplified human behaviour is much more likely to behave like you suggest than like public keys, I think. Also, because different people interact with their browser in different ways, how do you cope with that? I tend to navigate via keyboard, so would the script reject me because I tabbed to the form field (thus jumping directly to it) rather than scrolling circuitously to reach it? I also make far fewer typos than average and type faster than the average user, so is this going to count against me?
11. Re:Just a Thought... by major_fault · 2009-04-24 19:19 · Score: 5, Insightful
  
  No algorithm will do. Ultimately the question that must be solved is whether the user is malicious or not. Best possibilities so far are the tried and true invitation system and excluding malicious users from the system. Malicious users are also users who keep including other malicious users. Easily detectable with proper moderation system that needn't be gotten into right here and now.
12. Re:Just a Thought... by Devout_IPUite · 2009-04-24 19:29 · Score: 2, Informative
  
  Factoring an integer has one answer. Trial and error doesn't work. Scrolling and clicking tempos have many answers, trial and error does work.
13. Re:Just a Thought... by phantomfive · 2009-04-24 19:37 · Score: 1
  
  The human brain is works on an algorithm that is Turing complete. It is also unlikely that the human brain has any algorithmic capability that a computer does not have, so it is reasonable to say that
  
  Any captcha that can be solved by a human, eventually will also be solvable by a computer.
  
  --
  Qxe4
14. Re:Just a Thought... by 1+a+bee · 2009-04-24 19:49 · Score: 4, Insightful
  
  So if I have an algorithm that can verify an integer factorization quickly, it means there must be an algorithm that can factor any integer quickly? How would that work?
  The anonymous poster makes a good counter argument against the idea that the algorithm must be easily defeatible: just because you have an algorithm that detects human behavior does not imply you have an algorithm that emulates the human behavior detected by the original algorithm.
  In fact, there are many, so-called, one-way (correct terminology?) algorithms. So, for example, for a given file, it's easy to compute its MD5; harder to compute a file for a given MD5 (though doable). And of course, the AC's better example which is impossibly hard in reverse for composite numbers made from very large prime factors.
  So no. Labeling the idea flawedbydesign is jumping the gun--logically, speaking.
15. Re:Just a Thought... by martin-boundary · 2009-04-24 20:04 · Score: 1
  
  it should indeed be defeatable by another algorithm.
  
  True. Let's say you have a test T in mind. This test will have some inputs I1,...,In which represent some observations coming from the keyboard and the mouse input obtained from some websurfer. If a computer tries to pass the test T, all it has to do is know the observations I1,...,In that are being looked for and simulate plausible values.
  What are plausible values? To obtain them, all you have to do, before the test T goes live, is ask some humans to act normally, and observe the quantities I1,...,In. That's how you calibrate the test T.
  But here's the thing: an attacker can do that too. He observes some friends to get plausible values of I1,...,In and once those values are known, an algorithm can simulate those values and pass the test T.
16. Re:Just a Thought... by Joce640k · 2009-04-24 20:10 · Score: 4, Interesting
  
  I disagree. I don't think there's anything terribly un-mimicable about the way humans interact with web pages.
  Besides, have you considered the effect of false positives (which will be many)?
  With a captcha it's a black/white decision and people know why they passed/failed.
  In the world being proposed in the article people will have to sit dejectedly wiggling their mouse while a web page decides if they're human or not based on some unknown criteria. Pass or fail? It's up to the machine.
  After two or three sessions of this people will be running away screaming from your web pages.
  
  --
  No sig today...
17. Re:Just a Thought... by bytesex · 2009-04-24 20:13 · Score: 1
  
  The only thing I can think of that could break this, is lack of efficiency on the human's part. That is, if the test, or the judgement takes time, then this is time that automated algorithms usually do not have. They want to inject, mass-mail, or do whatever they maliciously want to do, quickly. But then again, they might not.
  
  --
  Religion is what happens when nature strikes and groupthink goes wrong.
18. Re:Just a Thought... by 1+a+bee · 2009-04-24 20:56 · Score: 1
  
  I disagree. I don't think there's anything terribly un-mimicable about the way humans interact with web pages.
  Maybe, maybe not. The point was that claiming
  
  it should indeed be defeatable by another algorithm
  is not a logical slam-dunk.
19. Re:Just a Thought... by Z00L00K · 2009-04-24 21:37 · Score: 1
  
  That's assuming the botnet is targeting a single site or only a few sites.
  
  --
  If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
20. Re:Just a Thought... by Joce640k · 2009-04-24 23:31 · Score: 2, Insightful
  
  I'd say it's a lot more of a slam-dunk than this:
  "Read heavily distorted text on random patterned backgrounds with added noise and geometric figures drawn across it"
  My real problem with the proposal is with the false positives. There's no clear feedback to let a user know *why* he's not being allowed into the system, it's just that the machine doesn't like the look of him.
  
  --
  No sig today...
21. Re:Just a Thought... by buchner.johannes · 2009-04-25 00:06 · Score: 1
  
  Using anything other than a human to judge the behaviour puts it outside of the Turing test. So not only does their proposed solution not match the goal they set, it should indeed be defeatable by another algorithm.
  I imagine there will have to be a new job description for the webmaster ...
  
  --
  NB: The message above might reflect my opinion right now, but not necessarily tomorrow or next year.
22. Re:Just a Thought... by mrsquid0 · 2009-04-25 00:37 · Score: 1
  
  Perhaps the verification algorithm could reject any"one" who behaves too much like the algorithm expects a human to.
  Seriously though, this sort of verification method seems like it would be easy to defeat.
  
  --
  Just because you are paranoid does not mean that no-one is out to get you.
23. Re:Just a Thought... by cskrat · 2009-04-25 00:44 · Score: 2, Interesting
  
  The anonymous poster that you're responding to was actually the one to introduce the word "quickly" to the discussion.
  That being said, I think the method proposed at the end of the article is flawed in that the algorithm is reversible and facing the wrong direction.
  Assuming that the website in question only has access to the message information passed to the GUI window of the browser by the OS, (I'm sure as hell never installing a browser with ring 0 access to my system) it would be fairly trivial to produce an AI algorithm to replicate that behavior. A few hard coded target parameters and a bit of randomization would sufficiently emulate a human based on gathered metrics of a small sample, possibly as small as just one, human subjects. And don't forget that spammers don't need anywhere near a %100 success rate to be viable.
  The checking process, on the other hand, would require a very large, heterogeneous sample of human subjects to determine the limits, distribution, and correlations of tested metrics. A team of statisticians and psychologists would be required to analyze the data so that it can be converted into a working algorithm by software engineers. That's an enormous amount of man hours just to produce the system. Assuming, however, that the system is produced in spite of it's high development cost, it would still be computationally expensive to analyze each potential human to see if it's generating a valid combination of metrics.
  Think of it this way, It's trivial for me to write a PHP script to quickly generate valid XML markup to send to a remote system. Parsing a string of potential XML on the other side, however, is more computationally intensive and the algorithms to do it are more complex, especially if you consider the complexity of any prebuilt parsing tools, such as regular expression tools, as being part of the overall algorithm complexity. While, granted, a parser can be reasonably expected to run in linear time, the script to produce XML can be reduced to constant time if optimized for a specific purpose.
  
  --
  My God! It's full of eval()'s.
24. Re:Just a Thought... by postbigbang · 2009-04-25 00:50 · Score: 1
  
  Consider that IPV6 takes care of the details; no Turing Test needed. At some point, you won't be able to spoof an address without setting off a last-mile or even core-located router alarm. Once you kill NAT, we're all exposed like worms after a shovel turn in the garden. The need for CAPTCHA and other algorithms that authenticate humanity will be reduced to simply partitioning your machine if it turns out to be a spam or other bot. Get your machine clean, or you don't logon. At least that's the concept until it gets cracked.
  I believe as long as there are psycopaths and sociopaths, Turing's Tests will be hacked. Why not just actually give up the misbelief that you're anonymous on the Internet?
  
  --
  ---- Teach Peace. It's Cheaper Than War.
25. Re:Just a Thought... by jonaskoelker · 2009-04-25 01:28 · Score: 4, Funny
  
  There's no clear feedback to let a user know *why* he's not being allowed into the system, it's just that the machine doesn't like the look of him.
  So it's like dating? ;)
26. Re:Just a Thought... by Jeff+DeMaagd · 2009-04-25 01:55 · Score: 1
  
  I think this makes sense, though that has to mean that the legitimate user has to find someone they know that is part of a community. I think this is going to keep out a lot of good users.
  Anyways, I thought CAPTCHAS usually aren't solved by machines, so trying to deliver a Turing-like test isn't going to solve the problem.
27. Re:Just a Thought... by Endo13 · 2009-04-25 02:33 · Score: 1
  
  The reason it's never going to work is because unlike passwords/encryption/captcha methods it's not something that can be continuously changed or updated when it gets compromised. Even if just one large company uses it you still know that eventually the algorithm will get out into the wild. If everyone uses it, the algorithm will be trivially easy for anyone to get their hands on almost immediately. And just like it's trivially easy for a computer to "crack" an encryption if it has the key, it's also going to be trivially easy for it to mimic behavior that passes the test if it has the algorithm.
  So basically, yeah it can work until the first time the algorithm gets into the wrong hands. Then it's permanently useless.
  
  --
  There is no -1 Disagree mod. Slashdot.org/faq defines mod options. USE IT.
28. Re:Just a Thought... by xaosflux · 2009-04-25 02:39 · Score: 1
  
  In fact, there are many, so-called, one-way (correct terminology?) algorithms. So, for example, for a given file, it's easy to compute its MD5; harder to compute a file for a given MD5 (though doable).
  
  There are almost no situations (Rulling out EXTREMELY small files) where that would be true. It is doable to create a COLLISION with an MD% hash, not recreate the source, it is by nature lossy.
29. Re:Just a Thought... by xaosflux · 2009-04-25 02:40 · Score: 2, Funny
  
  And yes, that must be a capital "5" !
30. Re:Just a Thought... by hughperkins · 2009-04-25 03:23 · Score: 1
  
  Yes, and ultimately intelligent computers will actually improve spam detection, since they will be able to analyse, understand, posts sufficiently to check whether they are reasoned posts relevant to the thread, or spam about pharmaceutical enhancements.
31. Re:Just a Thought... by dcollins · 2009-04-25 03:24 · Score: 2, Informative
  
  The anonymous poster makes a good counter argument against the idea that the algorithm must be easily defeatible: just because you have an algorithm that detects human behavior does not imply you have an algorithm that emulates the human behavior detected by the original algorithm.
  That's vaguely clever, but it doesn't really pass the sniff test. While "one-way" or "trapdoor" functions may or may not exist, they appear to be pretty rare. That's why it's such a big deal when computer scientists identify a new possible trapdoor function. The chances that any randomly-chosen process happens to be trapdoor (for example, verifying human mouse gestures on a webpage) is monumentally unlikely.
  
  Trapdoor functions came to prominence in cryptography in the mid-1970s with the publication of asymmetric (or public key) encryption techniques by Diffie, Hellman, and Merkle. Indeed, Diffie and Hellman first coined the term (Diffie and Hellman, 1976). Several function classes have been proposed, and it soon became obvious that trapdoor functions are harder to find than was initially thought.
  
  http://en.wikipedia.org/wiki/Trap_door_function
  
  --
  We know where leadership by an anti-intellectual "strongman" who scapegoats minorities and likes boisterous rallies goes
32. Re:Just a Thought... by bishiraver · 2009-04-25 03:42 · Score: 1
  
  Using javascript to record certain events: random clicks on the page, scroll actions, and snapshots of the mouse x/y position every 5 seconds or so.
  Using xmlhttprequest to send this data to a server that determines whether the behavior fits, within a margin of error, to a markov model built via previous human interaction in the page.
  Of course, if the automated blogspam bot ever got ahold of the markov model, it would be able to generate 'believable' interaction with the page by creating a markov chain.
33. Re:Just a Thought... by Pentium100 · 2009-04-25 04:37 · Score: 1
  
  So, what you are saying is that you would IP ban those who spam. OK. Why is IPv6 necessary? Oh, you don't want to ban entire networks that are behind NAT? OK, but IIRC, with IPv6 you can change the IP of the computer at will (well, part of the IP anyway), so you would still need to ban entire networks (using the part that does not change) or the bot will just change IP of the machine...
  
  Why not just actually give up the misbelief that you're anonymous on the Internet?
  Because even if the government knows who I am and where I live, the other internet users do not (or I hope so). There is one thing if the government comes after me if I do something they do not like, but it's worse if any psycho can come after me with an axe just because I said something he did not like.
34. Re:Just a Thought... by LordLimecat · 2009-04-25 05:49 · Score: 1
  
  harder to compute a file for a given MD5 (though doable)
  If you can come up with an algorithm to do this, you will have the most efficient compression algorithm ever. MD5 is one way, and lossy--thats why its a hash.
35. Re:Just a Thought... by pmarini · 2009-04-25 06:47 · Score: 1
  
  How would that work?
  I'm not sure yet, give me a few days to write an algorithm to verify your question... :-)
  
  --
  Can I put a spell on those who can't spell?
  Your wheels are loose and they're losing their grip, good you're there.
36. Re:Just a Thought... by pmarini · 2009-04-25 07:08 · Score: 1
  
  evolution changes the calibration, unless genetists find its formula.
  now, if in the future we'll create a savepoint and let geno-human and simo-human evolve separately from that point on, then the question of what defines "human thought" really becomes... irrelevant
  
  --
  Can I put a spell on those who can't spell?
  Your wheels are loose and they're losing their grip, good you're there.
37. Re:Just a Thought... by dissy · 2009-04-25 07:33 · Score: 1
  
  The only way around it that I can see is if there is some sort of asymmetrical information involved, such as the invisible form honeypot mentioned in TFA--the website's creator (and thus the bot-detection script) knows that there is an invisible form present, but it's difficult for a script to see without rendering the site in standards compliant CSS.
  The one website I had to make where the signup page (Thus the bot attacked form) had quite a few text input boxes at the owners request.
  To handle the auto-tag reading bots, which will either look for standard text element names like 'username', or the human bot owner will pull one copy of the page to see what you personally named that field and program the bot accordingly, i setup a season system with happy randomness in it.
  Basically the only one single tag that is the same on every load of the page is a hidden form element named USID (Unique season ID) which was randomly generated, linked in SQL to the other real form tags, and expires after a set time.
  It then each page load generates random 8 char alphanumeric 'names' for all the other form elements, and adds those names with the USID in the database. IE for your load it might be 'jHy698as' for username, '9saf7aA3' for password, etc. The next page load will have different names, but also random.
  If the USID sent back is valid and not expired, then it will match 1:1 those tags, and if Any tag not listed is sent back with data, it is flagged as a possible bot. Two of those in a row with different SUIDs, or 5 in a row on the same SUID, the SUID is expired and the IP is added to a block list for 15 minutes.
  With some random other data that basically wastes a kb or two of bandwidth, the html is different enough hopefully for a bot to be unable to parse out form element positions enough to auto detect the random names for it to form the same match-up table. And since they are different on each page load, even a human writing down the text element names wouldn't make it too far.
  If things don't match up in a way that could possibly be a season timeout, then it doesn't get counted against the host/ip, Just In Case(tm)
  There is a bit more logic to help identify if a user hits back and the form is loaded different but some elements may be cached, it won't flag as a bot then either.
  Sadly, the website was never all that high in popularity so chances are it was overkill, but it was at the site owners request so we did just that.
  Not being a high profile site its hard to compare directly how effective this method is or would be, but the bots that brute force accounts all have failed so far. His only issues seem to be humans making multiple accounts, which of course none of these solutions are designed to even address.
38. Re:Just a Thought... by aynoknman · 2009-04-25 07:57 · Score: 1
  
  It seems to me that if you can design an algorithm to verify how humans interact with a computer, it should be relatively trivial to engineer an algorithm that mimics this interaction?
  I just thought of new career plans for all those Chinese gold farmers.
  
  --
  We need a "+1 -- nice sig" moderation.
39. Re:Just a Thought... by blahplusplus · 2009-04-25 08:05 · Score: 1
  
  Personally I don't nkow why they just don't use pictures of things and randomly circle some item in the picture that has a tagged string to it the picture and then type in.
  Right now they are using distorted letters and numbers, it seems to me that using pictures and asking questions about the nature of the objects ('randoly') circled would be a lot better..
  because algorithms would have a tough time desciphering what object might be circled or pointed out in the picture,
  Or you could use pictures to "suggest" something like have a Dog pic with a plus sign and a dog "running" pic with a plus sign, and then have a house
  Then ask the question, what does this picture suggest in terms of what it is trying to say?
  "The dog ran home", then sample it and tune it on thousands of real people.
  That might be a better approach.
40. Re:Just a Thought... by RiotingPacifist · 2009-04-25 13:15 · Score: 1
  
  No it assumes the opposite, it assumes a botnet is targeting many sites!
  if you have 1000 computers, then you can attack every site you want 1000 times before a single computer has to attack the same site twice. If you only attack a few sites most of the time the bots will be inactive, its only if your attacking many sites that your not wasting time.
  
  --
  IranAir Flight 655 never forget!
41. Re:Just a Thought... by Brian+Gordon · 2009-04-25 14:26 · Score: 1
  
  What does multiple possible answers have to do with anything? The correct riposte to ryanleary is to point him to NP which is a whole class of decision problems defined by the fact that they're simple to verify but hard to solve.
  
  Honestly, did he even use his head? How does he think his computer can verify an SSL cert in a fraction of a second when it's common knowledge that they take a long time to crack?
  
  Also the article's idea is awful. Hey here's a bot that could defeat the algorithm: record one human pointing around the page and scrolling, then play it back a billion times to register a billion accounts. And what about browsers with Javascript disabled? I should start submitting slashdot's front page to thedailywtf..
  
  Still, constructively using data from the roving mouse pointer is a really cool idea. I bet you could train a neural net to analyze people psychologically based on their restless cursor habits. Reminds me of the video game they used in Ender's Game to profile kids' every reaction :) Elder Scrolls or GTA or Deus Ex (or Fable?) could be interesting tests as well.
42. Re:Just a Thought... by amRadioHed · 2009-04-25 14:29 · Score: 1
  
  The reason that won't work is because how do you get a computer to dynamically generate these puzzles? The trick is to find something that is easy for a computer to create but not easy for it to reverse. Most common sense AI type puzzles require as much AI to create as they need to solve.
  
  --
  We hope your rules and wisdom choke you / Now we are one in everlasting peace
43. Re:Just a Thought... by Z00L00K · 2009-04-25 17:03 · Score: 1
  
  The usual method of a bot is to use the same bot against a single site for several repeated attempts, so that reasoning doesn't hold.
  There may be another bot in the net that targets that site, but then that bot may run the same logic or similar logic. This since the nodes in the bot net aren't doing communication between each other otherwise the traffic for control would be horrible.
  So your reasoning doesn't hold.
  And I didn't claim perfectness, just another spanner in the works of spam bots that will slow them down.
  
  --
  If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
44. Re:Just a Thought... by nicuramar · 2009-04-25 18:55 · Score: 1
  
  Actually, all P problems (which are "easy to solve") are also NP ("easy to verify"). But NPC then (NP and "no easier" than any other NP problem)... unless P=NP :)
45. Re:Just a Thought... by acheron12 · 2009-04-25 20:44 · Score: 1
  
  One advantage: you can slow them down. Humans can't post thousands of spam posts per minute. For a bot to mimic a human, they also have to avoid doing that kind of behaviour. While that doesn't eliminate the problem, it certainly reduces it.
  
  --
  there is no god but truth, and reality is its prophet
46. Re:Just a Thought... by mpeskett · 2009-04-26 10:59 · Score: 1
  
  If there's a limited number of question/answer pairs then it can be broken by having a human solve each one once (or have a computer trial and error it) until they know all the answers.
  It could work for a small low-profile site that isn't a big enough target to have that effort directed at breaking its Captcha, but the big players need something more dynamic.
Not so sure by Misanthrope · 2009-04-24 18:19 · Score: 4, Insightful

Assuming you could write an algorithm to determine humanistic behavior, it stands to reason that you could write a bot to fool the initial algorithm.
1. Re:Not so sure by Misanthrope · 2009-04-24 20:28 · Score: 1
  
  I have a PhD from Tongue Firmly in Cheek U.
2. Re:Not so sure by noppy · 2009-04-24 22:01 · Score: 1
  
  Assuming you could write an algorithm to determine humanistic behavior, it stands to reason that you could write a bot to fool the initial algorithm.
  Write a bot that surfs all day at /.
3. Re:Not so sure by TheRaven64 · 2009-04-24 22:55 · Score: 3, Insightful
  
  Not true. For example, any NP-complete problems can be solved in polynomial time on a nondeterministic Turing machine, but a solution can be verified in polynomial time on a deterministic Turing machine. There are lots of examples of this kind of problem, for example factoring the product of two primes or the travelling salesman problem. In a vast number of cases, it is easier to test whether a solution is correct than it is to produce the solution. Even division is an example of this; it is easier to find c in a*b = c than it is to find a in c/b = a.
  Of course, as the other poster said, there is no evidence that 'seeming human' is in this category, and it's a very wooly description of a problem so it is probably not even possible to prove one way or the other.
  
  --
  I am TheRaven on Soylent News
4. Re:Not so sure by smallfries · 2009-04-24 23:45 · Score: 1
  
  Even division is an example of this; it is easier to find c in a*b = c than it is to find a in c/b = a.
  
  That would be quite hard to prove... ;)
  
  --
  Slashdot: where don knuth is an idiot because he cant grasp the awesome power of php
5. Re:Not so sure by Pentium100 · 2009-04-25 04:40 · Score: 1
  
  no, it's actually the same number of steps:
  multiplication:
  1.get a calculator
  2.type first number (a)
  3.press *
  4.type second number (b)
  5.press =
  division:
  1.get a calculator
  2.type first number (a)
  3.press /
  4.type second number (b)
  5.press =
  See, 5 steps for both.
6. Re:Not so sure by TheRaven64 · 2009-04-25 06:32 · Score: 1
  
  No it's not, it's the set to first-year computer science students to prove (or was when I was was an undergrad).
  
  --
  I am TheRaven on Soylent News
7. Re:Not so sure by smallfries · 2009-04-29 05:45 · Score: 1
  
  There is no known lower bound for division, so what kind of proof is used?
  
  --
  Slashdot: where don knuth is an idiot because he cant grasp the awesome power of php
I read something about this by gcnaddict · 2009-04-24 18:21 · Score: 4, Interesting

I remember reading... I can't remember if it was a post about an algorithm already written or a proposal for an algorithm which would run alongside a CAPTCHA through the entire registration process, but the basic premise was just that: measure the entropy and fluidity of human movement and determine whether or not the user is a bot based on whether or not the user fits typical random human usage patterns.

I also remember the writer of the post noting that this kind of system would basically stretch the human-unwittingly-answers-CAPTCHA out such that humans would have to do the entire setup process manually instead of just the CAPTCHA, thus defeating the point of automated setup.

Does anyone have this article? I can remember reading it but I can't find it.

--
Viable Slashdot alternatives: https://pipedot.org/ and http://soylentnews.org/
1. Re:I read something about this by fahrbot-bot · 2009-04-24 18:45 · Score: 1
  
  ...algorithm ... which would run alongside a CAPTCHA through the entire registration process, ... measure the entropy and fluidity of human movement and determine whether or not the user is a bot based on whether or not the user fits typical random human usage patterns.
  
  Ya. I don't think I'll be whitelisting *that* in NoScript... :-)
  
  --
  It must have been something you assimilated. . . .
2. Re:I read something about this by abolitiontheory · 2009-04-24 19:25 · Score: 4, Insightful
  
  In addition to this, what about those humans who just happen to fall into the seemingly 'mechanical pattern' that a computer registrant would? I know some parents of friends who very meticulously and methodically fill out forms, reading every box and explanation to ensure that they're inputting the right data.
  Any computer judgment of what is authentically human is in a way a reverse Turing test. It's a computer judging if humans are behaving enough like humans. The problem here is too many degrees of separation: a very specific type of human [engineer] designs a computer to assess the 'humanness' of other humans actions. Any such assessment would be based on certain assumptions and biases about how humans act. It sounds like putting a document through Google translator into another language and then back again, before turning it in for a final grade.
3. Re:I read something about this by canthusus · 2009-04-24 19:47 · Score: 1
  
  I can't find the article itself, but there's a short summary of it here.
4. Re:I read something about this by adamofgreyskull · 2009-04-24 21:20 · Score: 1
  
  In addition to this, what about those humans who just happen to fall into the seemingly 'mechanical pattern' that a computer registrant would? I know some parents of friends who very meticulously and methodically fill out forms, reading every box and explanation to ensure that they're inputting the right data.
  
  Even the most "mechanical" of your friends wouldn't download the page, parse it in its entirety without scrolling the page in their browser, then enter all form fields in a fraction of a second, before submitting it. In fact what you're describing is probably exactly the kind of thing that the test would detect as normal human behaviour. Scroll down, read field label, read form field explanation, type answer into form field, scroll down, repeat.
  
  The tricksiness of defining a useful (i.e. easy for a human to pass, difficult for a machine to pass) test will be in measuring, "by how much did the browser viewport move that time?", "how fast did they type that word into the field", "did they need to scroll the page to see the field", "is the scroll exactly 20px every time?", "how much time has elapsed since the viewport was last scrolled?" etc. All of which will have to be measured client-side, *ahem*. THEN, you have to feed that into your algorithm and determine how human those inputs make the form submitter. The test could be calibrated by having a number of known humans fill out the form and observing the inputs you get, how much variance there is etc.
  
  The simplest version of the proposed test is to calculate the amount of time between a computer X requesting the form and computer X submitting the form. If you've recorded the time of the fastest human as 30 seconds, then you prevent all form submissions before 30 seconds has elapsed. But that's a single data point and if you were writing a bot,it would be trivial to put in a wait time of 30 seconds between form load and submission, if you were willing to wait. Similarly, it will be possible to emulate a human browsing a form and submitting it...but it would hopefully involve a lot more time, effort and money than is economical for the spammer...
5. Re:I read something about this by TheRaven64 · 2009-04-24 23:01 · Score: 2, Insightful
  
  It's a nice idea, but unfortunately it's easy for a computer to work around. How does the client-side JavaScript know how much the page has been scrolled? Because the browser tells it. There is nothing stopping a bot from downloading the page and then submitting the same HTTP requests that the client-side JavaScript would (or even running it in a VM and injecting DOM events into it with some random wait events). Once you know the algorithm used on the server to determine whether something is human, it's easy to work around it. In your simple example, the client just needs to sleep for 30 seconds between downloading and submitting the form - one line of code to program, while the test is likely to need at least four lines. This limits the number of registrations a single bot can do in a single day, but only to one site - the bot can overlap its requests so that it's hitting 30 sites at once, and then it's back up to one spam per second. Or, it may keep using the slow approach, making its traffic harder to spot.
  
  --
  I am TheRaven on Soylent News
6. Re:I read something about this by caramelcarrot · 2009-04-25 00:23 · Score: 3, Interesting
  
  Last time this came up, I suggested the idea of constant bayesian analysis on HTTP logs to determine the likelyhood of the current user being a bot.
  
  It could take things into account like if the user bothered to visit previous pages, request images, the time between requests etc. You could then either just make the webserver kill the connection, or you could add a function to your preferred web language (e.g. PHP) that returned the probability that the current user is a bot, and so redirect them to a more annoying turing test or block them.
  
  This'd also work pretty effectively if people wanted to stop scrapers and bots in browser games. Of course a bot could mimic all this, but it'd raise the cost of entry significantly - and it might end up being that the bot is no more effective than a human working 24/7, though even then you'd need to be changing ips constantly.
  
  I was thinking of trying to implement this over the summer, based on comment spam bots on my website, all without any need for client-side spying
7. Re:I read something about this by Atraxen · 2009-04-25 00:31 · Score: 1
  
  Plus, there are hardware based differences in interaction that modify your reading/interaction behavior. Analyzing mouse cursor movements for a trackball, mouse, and touchpad will likely give very different results - and that's assuming they're being moved the same way. When I'm reading with a mouse, I tend to 'follow along' on the page - with a trackball, I park the cursor to the side - with a touchpad, I tend to move in blocks. Add enough variables, and you can model any behavior (at the risk of losing the ability to probe correlation of real factors) - by adding enough exceptions to the algorithm to handle all these cases (and all the others) it strikes me as unlikely that the algo would be able to distinguish between humans and bots.
  And if it does, the spammers will probably write a trojan that watches for the user generating a login, and swaps the interaction with the captcha the spammer wants solved. Reminds me of the good ole days of Cold War Arms Racing!
  
  --
  Be careful of your thoughts; they could become words at any minute...
8. Re:I read something about this by Pentium100 · 2009-04-25 04:44 · Score: 1
  
  Don't forget tablet PCs with their touch screens. In that case the mouse pointer is jumping from point to point.
alternate captcha based ways by mehrotra.akash · 2009-04-24 18:48 · Score: 1

if there was a way for a computer to determine that the behaviour is human, wouldnt the computer be able to do it anyways?? but what about tricks like telling a user to leave a particular field blank and filling it on the next page instead this field could be indicated by a captcha which contains a URL on opening the URL you get another captcha which has a number, u leave that numbered field empty if the 2nd captcha is entered wrong, then you have to repeat the process from the beginning and fill in 2 captchas on the 2nd page and so on this way most humans would be able to do it in 1-2 attempts, but bots doing it the hit and trial way would be stuck with 1000's of different captchas also, having a central database of all the types of captchas and mixing displaying 2-3 different types would be effective as bots are designed for one type of captcha only, arent they
capture and copy by tmk · 2009-04-24 18:49 · Score: 1

A system that could capture the way humans interact with forms algorithmically could eventually relieve humans of the need to prove anything altogether.'
This system could also reproduce human interactions. So it's only time until this behavourial approach stops working.
BTW: I don't want slashdot to check how I scroll the page, nor is my typing and retyping business of anybody but me. Imagine you can't comment anywhere because you block Google Analytics.
If an algorithm can be made to detect it by rolfwind · 2009-04-24 18:52 · Score: 1

doesn't that just mean a computer can also feed the correct data in, defeating it?
Anyway, the little tests these days are stupid and annoying, and perhaps for some people, getting impossible to do. Perhaps instead of the test being administered at the point of registration, new accounts at places should be automatically monitored for type of activity.
For instance, if the first post at a forum has any links to blacklisted ad sites (could be EasyList USA, whatever), it's probably safe to just kick it out automatically. And things of that nature. Or just the old sign up with a credit card and charge onetime $0.41 trick (or whatever to just cover min fees) to keep bots out of the community's hair.
I'm sure other solutions will have the old How-To-Fix-Email-response "Yes, but your idea won't work because (Mark random amount of 100 checkboxes)"
1. Re:If an algorithm can be made to detect it by kvezach · 2009-04-24 22:49 · Score: 1
  
  doesn't that just mean a computer can also feed the correct data in, defeating it?
  
  Unless P == NP, checking a solution can some times be a lot easier than actually generating a solution. Consider, for instance, a hash like SHA-1. The whole point of a secure cryptographic hash is that checking if a certain hash matches that corresponding to a document is very easy, but crafting a document that matches an already specified hash is very hard.
2. Re:If an algorithm can be made to detect it by Spasemunki · 2009-04-25 00:09 · Score: 1
  
  But this problem isn't checking a number theoretic property; it's applying a heuristic to a small pool of data points that may have been passed to you by a hostile reporter. Nothing indicates that this problem is significantly harder to provide a solution to programatically than to check programatically. Plus, the attacker gets a free oracle that tells you if you've created a good set of attack data. Let one real person register through the system, capture their data, and add a small amount of randomness to the timing and it would appear that you have permanently broken the system; I don't know how you could exclude an attacker doing that without also excluding a lot of actual users.
Tech Support by cjfs · 2009-04-24 18:53 · Score: 5, Funny

I can see it now: "have you tried moving your mouse around randomly?", "how about clicking on a few different parts of the page then making coffee?", "still not working? Try slamming the mouse down several times", "okay, as a last resort click on the tabloid pop-up."
1. Re:Tech Support by ElectricTurtle · 2009-04-24 19:24 · Score: 1
  
  Ugh. Mod parent up for truth.
  
  --
  I support the Slashcott and will not be reading or commenting from 2/10/14 to 2/17/14. Beta is steaming pile of dog shit
Modelling behaviour by pfafrich · 2009-04-24 18:55 · Score: 1

The tricky part of the an alternative solution seems to be modelling human behaviour - in order to detect if something is human or not your need to have a pretty good model of what humans do. I suspect there would be a lot of variation in the sort of way people interact, if I'm feeling sleepy I would present a very different profile of use to when I'm on task and in flow. A program to do this will probably have to be statistical in nature with some sort of confidence intervals of humanness. Maybe it will need some Cluster Analysis. This all makes for some pretty hard code and I'm not convinced the difference between two humans will be smaller than the difference between human and bot.

--
There are four sorts of people in the world: fools, lunatics, idiots and morons. - Umberto Eco, Foucaut's pendulum.
All those CAPTCHAs... by __aaclcg7560 · 2009-04-24 18:56 · Score: 1

You mean I didn't need a new pair of glasses every time I couldn't read on of those CAPTCHAs? I want my money back.
Response Times by Anenome · 2009-04-24 19:00 · Score: 1

Seems some things should be easy. There's a certain minimum amount of time that it takes a human to tab from one field to another as they fill in data, even if they're pasting info in. Even just slowing down bots to the speed that a human could reasonably do a task would put a dent in the problem =\

--
"I Don't Have Enough Faith to be an Atheist"
1. Re:Response Times by Jason+Pollock · 2009-04-24 19:11 · Score: 1
  
  The problem is already easily parallelised. If it takes you 10s to fill in a form, and it isn't using any CPU (you're sleeping), then run a couple of thousand attempts in parallel. You get the _exact_ same throughput as you do if they are all run serially.
  For batch processes, latency isn't really an issue, it just means you need to do more transactions at once.
2. Re:Response Times by rdnetto · 2009-04-24 20:30 · Score: 1
  
  Then limit it to one attempt per IP address to prevent the parallelization. The only downside would be that this would also block people behind NAT, since they would have the same address.
  
  --
  Most human behaviour can be explained in terms of identity.
3. Re:Response Times by Jason+Pollock · 2009-04-24 21:30 · Score: 2, Informative
  
  These guys have botnets, and with networks like Tor, you can't limit access to one IP. Besides, if you've got captcha that is being attacked, to limit them by IP, you need to send them all through a single location to perform the detection, completely breaking your load balancing. It becomes a DoS target.
  Basically, the attacker has more machines, more IP addresses and more time than the target.
  Even if I only have one machine, that's fine, I attack 10 or 100 sites instead of just yours. Or, I use a network like Tor and select random out proxies. The only problem? All of my compatriots will be doing the same.
  The target won't see any real decrease in attacks, they will only lose all of their corporate customers who are unable to access the network from home (or dorms, or school, or libraries).
What does it mean to be human? by mcrbids · 2009-04-24 19:01 · Score: 4, Insightful

It's a lot tougher do define what a human is than it may seem on the surface, and the difference between man and machine will, by definition become more and more blurred until there is no effective difference.
It's an idea that I've become familiar with esp. aftre reading 'The Singularity is Near' by Ray Kurzweil. As our technology advances, we'll find that our capabilies beyond our technolgy will diminish. Machines have long ago surpassed our running speed (cars/planes/trains) and our ability to farm/grow food (tractors) and our ability to hurl object (guns) and swim (boats) but we've always had the ability to out-think our machines.
Increasingly, this isn't true.
We've already shown that SPAM filters are good enough to be more accurate than the people who read the messages. Machines have long been better than people for math-related stuff, keeping track of stuff, and the like, but now we're getting close to the threshhold for image processing and character recognition. It's already true for voice recognition. Captcha is, therefore, doomed to fall eventually as we approach the singularity, and is already pretty weakened. The next question is, therefore simple: what does it mean to be human?
Remember Lt. Commander Data on Star Trek, trying to be human? It's quaint largely because he/it was a minority on he show, but in reality the machine will outnumber us by a wide margin - they already do!
So what does it mean to be human?
If you have a prosthetic leg, are you still human?
If the leg has a CPU in it, are you still human?
If the CPU is more powerful than your mind, are you still human?
If the chip is wired into your mind, are you still human?
If you use the CPU as though it were part of your mind, are you still human?
If you have transferred modt of your thinking to the CPU, are you still human?
If you transferred all your thinking to the CPU and rarely use your 'wet' brain, are you still human?
If you find th

--
I have no problem with your religion until you decide it's reason to deprive others of the truth.
1. Re:What does it mean to be human? by Devout_IPUite · 2009-04-24 19:32 · Score: 2, Interesting
  
  I might recommend http://en.wikipedia.org/wiki/Homosapien for further reading on this topic. Clearly, you are not a human no matter how smart you are if you're a computer. Are you a person? Well, depends how you define 'person'.
2. Re:What does it mean to be human? by alx5000 · 2009-04-24 19:36 · Score: 1
  
  If the leg has a CPU in it, are you still human?
  Maybe. Maybe not. I'll be back.
  
  --
  My 0.02 cents
3. Re:What does it mean to be human? by martin-boundary · 2009-04-24 22:08 · Score: 1
  
  So what does it mean to be human?
  
  Born of a human mother. Take that, mister data!
4. Re:What does it mean to be human? by Squeeonline · 2009-04-24 23:39 · Score: 1
  
  So what does it mean to be human?
  Born of a human mother. Take that, mister data!
  So if you replace the mothers placenta with a machine that is linked to her brain so the rights chemicals are transfered, are you still human when you are born?
  
  If you replace that mothers mind with a computer because it will do things right, are you still human when you are born?
  
  The same argument applies to biochemistry. At what point does a group of self replicating molecules constitute life? At what ratio of silicon:organic flesh is considered still human.
5. Re:What does it mean to be human? by Hurricane78 · 2009-04-25 00:49 · Score: 1
  
  You are still human, if you fit the definition of a Homo sapiens (eg flesh, genetics, e.t.c.). Simple as that. :)
  But you can be a person, without being human.
  The error in this whole discussion stems from a false dichotomy. There is no "or" in the human/machine question.
  Humans are some biological form of machine. And most importantly: You can be part anything. Part human, part machine, part manbearpig. :)
  
  --
  Any sufficiently advanced intelligence is indistinguishable from stupidity.
6. Re:What does it mean to be human? by mcrbids · 2009-04-25 01:50 · Score: 2, Interesting
  
  Yes but machines can't sue for damages. If you crash your car, no matter how 'smart' it is, it won't take you to court for driving drunk.
  But what if the car had an intelligence directly derived from a real person, like the logical progression of an amputee to a full machine above?
  That's the point of this discussion. If you develop software that so closely emulates the human mind that it (and anybody else talking to it) can't tell the difference, is it human? If the software is a direct descendant of a 'natural born' human, is it human? Can it sue? Can it get married?
  (kinda makes the whole gay marriage thing pale, huh?)
  
  --
  I have no problem with your religion until you decide it's reason to deprive others of the truth.
7. Re:What does it mean to be human? by maxume · 2009-04-25 02:24 · Score: 1
  
  It's kind of odd that you are mixing a discussion of an anachronism like marriage (special government treatment of marriages deemed acceptable by society is the anachronism, not the part where a couple decides to commit to each other) with your enthusiasm for the future.
  
  --
  Nerd rage is the funniest rage.
8. Re:What does it mean to be human? by martin-boundary · 2009-04-25 12:57 · Score: 1
  
  It depends on what you consider human. It seems to me that you've already decided for yourself to accept as human certain genetically engineered living beings with prosthetics, or whose origins are machine enhanced. Beings who others might prefer to call cyborgs, or post-human, etc.
  I think it makes more sense overall to keep the definition of the word human essentially biological, precisely so that we may be able to give more precise description of variations.
  One of the characteristics of prosthetic enhanced individuals is that their needs and requirements for survival are slightly different from regular human beings. For example, if you have a pacemaker, there are some dangers such as magnetic fields you should be aware of. As technology develops in the future, I expect that keeping a single word for a wide range of living beings will introduce more issues than it solves.
hmmmm by thatskinnyguy · 2009-04-24 19:21 · Score: 1

A system that can determine whether or not a user is human would have built-in characteristics as to what a human would do in such a situation. What's keeping someone from taking that same algorithm and adapting it for means other than their intended purpose?
If a machine knows what to do, another machine can take advantage of that.
Obligatory: import skynet; blah

--
The game.
The judge is a computer by DeadboltX · 2009-04-24 19:29 · Score: 1

If the judge of the test is a computer, then the test will always be passable by a computer.
1. Re:The judge is a computer by rcastro0 · 2009-04-25 01:16 · Score: 1
  
  >If the judge of the test is a computer, then the test will always be passable by a computer.
  You are missing the point. It is not about making a failproof system. And you are stating the obvious. Any conceivable system could be fooled, either with human or computer judges. But the issue here is finding less obtrusive ways to detect spambots, screenscraper scripts, etc... without giving up too much detection efficiency.
  
  --
  Quem a paca cara compra, paca cara pagará.
Javascript will kill this idea. by MasterOfDisaster · 2009-04-24 19:39 · Score: 1

Everyone has been focusing on the how easy/difficult it would be to reverse this hypothetical algorithm that would determine based on your use of a webpage if you're human or not... ...I see a more fundamental problem. This is on the internet, so they have basically 3 options on how to implement this.
1) server side. The only variable you could track is time between page requests. Don't see how that could possibly be enough information
2) Client side JS. Simple, just modify the JS to return &isHuman=true
3) Client side JS acting as a keylogger, sending back for server side verification. Harder to defeat, but you'll lose my business, the business of all of my friends, and have a horde of angry nerds picket your offices.
Also, this doesn't take into account any edge cases, for example if I've already been to your site, surf straight to /contact.html and paste in a email I previously wrote in Word(err, excuse me, OOo)

--
The opinions in this post are ficticious. Any similarity to actual opinions, real or imagined, is purely coincidental.
1. Re:Javascript will kill this idea. by mrbene · 2009-04-24 20:05 · Score: 1
  
  JavaScript can send the datapoints to the server as events without waiting for navigation events - think AJAX.
  As for current real world implementation, I've only seen a bank site that uses a Flash app for login, where they measure the typing cadence for user name and password.
  Having an unexpected cadence does not prevent log in, but does inform the server to later do additional identity validation, and you get prompted from a pre-configured pool of questions when you try to do things like send money to that guy who just needs to bribe the official, and then the wealth of a princess will be yours...
2. Re:Javascript will kill this idea. by Spasemunki · 2009-04-24 20:50 · Score: 1
  
  That still leaves the system vulnerable to a replay attack- modify the client-side JS to record the sequence of events in a successful login, and then play them back later. If you're using the timing between events to determine if a user is human, you have the problem that Ajax runs asynchronously; the timing between events being sent to the server isn't going to be the same as the timing between events. You'll probably also want to batch sending events so that you keep the app responsive. Both those things mean you need to record timing information in the data you send over, which means that you can't trust the timing data because it comes from the client.
human usage patterns might vary too much by Anonymous Coward · 2009-04-24 19:45 · Score: 1, Insightful

I think there might be so much variation in human usage patterns, who all need to be accepted by the algorithm, that it should make it easy to simulate a behaviour that stays within those bounds.
On the other hand, if the algorithm doesn't allow much derivation, it will annoy a lot of people, who get falsely detected as bots. It might hit handicapped people or old people first then.
Simple, no? by CaptSaltyJack · 2009-04-24 19:48 · Score: 1

Just use Javascript, watch for either some mouse movements or onBlur/onFocus.. and if those are present, then isHuman will == 1, and you pass that to the server side. Actually, you'll want to have some obscure variable name to make it less obvious.
1. Re:Simple, no? by audunr · 2009-04-24 21:48 · Score: 1
  
  Actually, you'll want to have some obscure variable name to make it less obvious.
  Like isDancer?
Strength in unity-in-diversity by brettz9 · 2009-04-24 19:49 · Score: 2, Insightful

The problem with a lot of sites dealing with spam is that they are using the same software that tries to solve everything at the top. Uniformity doesn't help.
But leaving people to their own devices to create or adapt their own forum/blogging/wiki software is not a good solution either. Uncoordinated diversity leaves a lot of people to fend for themselves.
Having unity-in-diversity (a common strength across systems and organisms), however, might well solve the problem.
If forum/blogging/wiki software creators would give sites the opportunity to make (and be able to change) their own set of question and answers for first-time-users (and not trouble them after that), I think bots would be hard-pressed to be programmed to interpret all such site-specific questions on their own. If bots could actually be programmed to intelligently answer all such human language questions, I think the bot-makers could be making a lot more dough in legitimate business...
1. Re:Strength in unity-in-diversity by omuls+are+tasty · 2009-04-25 04:28 · Score: 1
  
  Uncoordinated diversity leaves a lot of people to fend for themselves. Having unity-in-diversity (a common strength across systems and organisms), however, might well solve the problem.
  Now you wouldn't happen to hold a degree in politics or economics, would you?
DO YOU KNOW HOW OLD THIS IS? by Jane+Q.+Public · 2009-04-24 20:03 · Score: 1

The idea that behavior was a better judge of identity than "biometrics" is old old. I wish I could remember the name of the program, but there was a Gnu / Unix utility that measured word frequency, letter frequency, the amount of delay between pressing any two letter combinations on the keyboard, and more... all put together to verify identity. And it worked quite well. I think that program is close to 20 years old.

Biometrics fails for the same reason it always has... as soon as someone comes up with a halfway reliable way to identify somebody, others come up with a fairly reliable way to fake the system. But micro-delays on the keyboard, etc. make for a pretty individual signature.
Vision = Fail by ZeroNullVoid · 2009-04-24 20:04 · Score: 1

Simple enough
April 1 was at the beginning of the month... by Torodung · 2009-04-24 20:30 · Score: 1

Wouldn't the ability to collect biometric information require a fairly potent piece of spyware to be loaded on the client system? How would a user, or even a security professional, easily tell the difference between a keylogger that reads our actual strokes, and one that is just timing the key presses?
Sounds like a kernel mode device that would have be part of the input drivers. It's an attack surface, IMO. I would think it's safer to have an separate input device for biometric authentication only than attempt to biometric metadata from highly sensitive input devs like keyboards and mice.
I did enjoy the 'honeypot field' example (in TFA). I suspect it is probably easily defeated, unfortunately. If the field is hidden on the page, can't we write a bot to detect that physical fact, or any source code (javascript?) that hides it. How do you obfuscate something like that without serving it with the page?
Sounds to me like CAPTCHA still wins. Oh well, I didn't expect much. ;^)
--
Toro
1. Re:April 1 was at the beginning of the month... by guyminuslife · 2009-04-24 21:19 · Score: 1
  
  I think the deadline for making meta-April Fools jokes must have also passed. And yes, there's a deadline. "April Fools was last year!" so he says.
  
  --
  I don't believe in time. It's a grand conspiracy designed to sell watches.
2. Re:April 1 was at the beginning of the month... by perryizgr8 · 2009-04-24 23:48 · Score: 1
  
  Wouldn't the ability to collect biometric information require a fairly potent piece of spyware to be loaded on the client system? How would a user, or even a security professional, easily tell the difference between a keylogger that reads our actual strokes, and one that is just timing the key presses?
  Sounds like a kernel mode device that would have be part of the input drivers. It's an attack surface, IMO. I would think it's safer to have an separate input device for biometric authentication only than attempt to biometric metadata from highly sensitive input devs like keyboards and mice.
  I did enjoy the 'honeypot field' example (in TFA). I suspect it is probably easily defeated, unfortunately. If the field is hidden on the page, can't we write a bot to detect that physical fact, or any source code (javascript?) that hides it. How do you obfuscate something like that without serving it with the page?
  Sounds to me like CAPTCHA still wins. Oh well, I didn't expect much. ;^)
  -- Toro
  you don't actually hide it. you write above the field "Please leave this field empty."
  
  --
  Wealth is the gift that keeps on giving.
Spam Karma? by nilbog · 2009-04-24 20:43 · Score: 2, Informative

It seems like the old Spam Karma module for Wordpress did this. It calculated how long they were on the page vs. how much they had typed, how fast they typed, and a bunch of other factors before it ever hit a captcha. Back when I used wordpress I remember being it pretty accurate too.

--
or else!
voice recording by Ofloo · 2009-04-24 21:35 · Score: 2, Insightful

Think of every behavior as a voice recording, record and replay ! And there you go bots are able to mimic.
External authentication by Jeppe+Salvesen · 2009-04-24 21:55 · Score: 1

Captcha's etc won't work perfectly. Ever. There are always bot(net)s that are able to defeat them. If you use software to make the lettering difficult to read, you can still write software to read it. Like the algorithms, we detect the order in the chaos..
So let's just face it:
The internets needs a unified authentication system if we are to kill spam. If there was a unified authentication system, you would't need to store your passwords around the internet, and your mails would be tracable to you.
So, let those who need anonymity create their own solutions for interacting anonymously.

--
Stop the brainwash
Not a great idea by jgoemat · 2009-04-24 22:03 · Score: 3, Interesting

The article did have links to some interesting topics, such as google experimenting with image orientation as a test. The premise of using how a user interacts with a page is deeply flawed though. There's not even a need for an algorithm or program to 'figure out' the captcha, just record how an actual user interacts once and you can send the same exact thing every time to pass the test. The reason this works is because the 'question' doesn't change. This would be like showing the same text captcha every time. If they ignore identical values being sent, the values can just be fudged a bit.
1. Re:Not a great idea by perryizgr8 · 2009-04-24 23:50 · Score: 1
  
  the image orientation test seems to be the real answer. man, google has some smart people.
  
  --
  Wealth is the gift that keeps on giving.
Use Turbo Tax Lately by SunSpot505 · 2009-04-24 22:17 · Score: 2, Interesting

When I posted question to the Turbo Tax community forum it asked a simple question as a CAPTCHA. Seems like an easy enough solution, and it changes each time to foil a persistent brute force attack.

Of course I'm sure it's only a matter of time before someone has an algorithm smart enought to answer questions. And I suppose that a botnet with enought time would work too. Still an interesting approah I thought.
"Scrolling and typing" by Arancaytar · 2009-04-24 22:21 · Score: 2, Insightful

The user's local behavior before form submission is detectable only via a client-side script. There are therefore two ways this can go.
1.) You maintain accessibility standards and make the client-side script optional. The effectiveness of this approach is comparable to xkcd's "When Littlefoot's mother died in /Land before Time/, did you feel sad? (Bots: NO LYING!)
2.) You require client-side script execution in order to submit the form. The effect is a lot of pissed-off users with NoScript or non-compatible Javascript interpreters (IE or the rest, depending on which one you support).
This idea is basically like visual captchas, but instead of the visually impaired, you're screwing everyone without Javascript.
There is one aspect of user behavior that can be detected, however, and that is the time passed between the user requesting the form and submitting it. From an AI perspective, humans spend an eternity typing, so setting a minimum delay between request and submission will slow the bot right down - especially with a flood control that requires a delay before submitting the next form. Slashdot does both of these things already, by the way.
Google Groups' implementation by tfg004 · 2009-04-24 22:51 · Score: 1

Some time ago I already noticed that Google Groups has implemented a bot detection based on behaviour.

However, often when I browse through a google group in an efficient way, google thinks I'm a bot and blocks me for quite a while. The only way around is to work inefficiently on purpose, by making my clicks as rondom as possible with as random as possible time intervals. This costs me at least five times as much time as it would cost me the efficient way.
This is very annoying, so I think it would be better for them to ditch the behaviour detection and just rely on properly designed captcha's.
Another flaw in this idea by olddotter · 2009-04-24 23:34 · Score: 1

The captcha is entered into a field and submitted to the web server. However our random highlights, backspacing, scrolling etc. all happens in the browser on our system. The web server (thank ______ ) doesn't know about any of that, it just sees the end result. So it doesn't have access to any of that data, to make any kinda of determination. Currently only malware would be collecting this data and sending it somewhere. So the proposal here is to be human verified by malware.
There are other flaws that others have pointed out.

--
Think Deeply. ...
Generate a textual representation by dvdx · 2009-04-25 00:58 · Score: 1

Generate a textual representation of user's action on the site, including also timing between clicks, scrolls and so on (but not just as plain numbers, use some words to *describe* relationship between time of actions).
Whenever user posts content, feed the report, perhaps including also the post, to a spam filter (like CRM114?), to check whether the description matches human, or mechanical behavior. Train the filter on posts it got wrong.
The tricky part is how to describe the action in a meaningful way.
1. Re:Generate a textual representation by John+Hasler · 2009-04-25 02:08 · Score: 1
  
  > Train the filter on posts it got wrong.
  Which would happen often enough to satisfy the spammers. The bots need only succeed a few percent of the time in order to be effective. That's how they are defeating CAPTCHAs.
  
  --
  Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.
Not really seeing a difference by monoqlith · 2009-04-25 02:34 · Score: 1

Not really seeing a difference between behavior and ability.
Any action that you perform is behavior, and, obviously, if you perform an action you are also capable of performing it. A behavior is therefore an ability. Any algorithm that tries to distinguish between human behavior and computer behavior is still a reverse Turing test.
Given that, testing the quirky way humans navigate through the web is arguably even flimsier test than the captcha. There is a certain degree of randomness, but nothing that rand() can't imitate to fool what would have to be an algorithm based, somewhat, on measuring randomness with a limit to its sensitivity so that false positives can be reduced to a reasonable level.
Here's a test by monoqlith · 2009-04-25 02:37 · Score: 2, Funny

Can Slate stop writing articles about shit it doesn't know about?
Re:Two more angles no one seemed to take... by Fantastic+Lad · 2009-04-25 03:04 · Score: 1

This is an insightful post. Too bad it was posted AC. A waste of mod points.
-FL
Blame stupid people ... by drydino · 2009-04-25 03:07 · Score: 1

Like many laws intended to prevent undesirable behavior (e.g. gun control, sale of illegal drugs, etc.), CAPTCHAs only block the casual (law abiding) user. It is regrettable that there are so many stupid people is this world. If stupid people didn't respond and make SPAM profitable, SPAM would have died out years ago. Too bad we can't outlaw stupid people!!
I kinda like the current scheme. by Fantastic+Lad · 2009-04-25 03:27 · Score: 1

Some notes in no particular order. . .
1. I kind of like winning the Turing Test. It makes me feel human. Some days, before the coffee kicks in, this is a plus.
2. It's funny when I can't read the secret warped word. It throws me in existential questioning for about half a second.
3. I like the new idea of having to describe a randomly rotated 3D image. That's a cool system which I'd like to see implemented, though I can't imagine it will be very long before it too is solved.
4. I find it funny that proving one's "Human-ness" is easier to do with a basic kindergarten reading or shape-recognition test than with the old Star Trek method of demonstrating an understanding of Love or being able to write an opera or such. --Especially since you can fairly easily program a computer to compose random Haiku.
5. An interesting test would be to write a short paragraph and ask the potential human how they feel about it. You could probably weed out trolls as well as computers that way. Or potentially learn something disturbing about the head-space of the webmaster and/or feel like a total outsider when you fail at multiple choice emotions.
6. Whatever the case, I think it's pretty sci-fi that we've gotten to the point where major effort is being spent to out-smart AI's. William Gibson and Niel Stephenson keep getting closer to having described our Now.
-FL
Mod parent up, they're right. by Animats · 2009-04-25 03:28 · Score: 1

Can Slate stop writing articles about shit it doesn't know about?
Right.
First, most of the things Slate suggests have been tried. Timing human input behavior is in use already, and attacks already do some randomization there.
Second, despite what the Slate article quotes, the CAPTCHA for Gmail has been cracked. The success rate is only 20%, but because the cracker is embedded in a botnet, that's good enough to survive IP blacklisting. MessageLabs says Gmail spam went from 1.3 percent of all spam e-mail in January to 2.6 percent in February.
All the proposed tasks - recognizing people, cats vs dogs, etc. - can be done by computers at the 20% accuracy level or better. So that's not going to work.
ReCAPTCHA isn't very good in practice. You get two words, one of which was recognized by an OCR program and one of which wasn't. You only have to re-recognize the one which some OCR program already got to pass the CAPTCHA. If you can do that, you have a 50% chance of success.
Then there are the outsourcing services. "We are 35 seater call center located in Hyderabad, we would be interested." The going rate is US$0.001 to US$0.003 per CAPTCHA solved successfully. There are always ads on GetAFreelancer for CAPTCHA solving. Read Black Hat World for sources.
Replay attack? by EdMcMan · 2009-04-25 03:32 · Score: 1

What stops someone from recording a human looking at the page, and then replaying that behavior from a bot?
Also, will humans actually want to send the information needed for this to remote websites? I don't really want a website to know what part of the page I'm looking at.
Can't trust the client.... by Joe+Mucchiello · 2009-04-25 04:24 · Score: 1

Regardless of the Turning Test aspects of this, forms are filled on the client. This hypothetical algorithm would also be running on the client. The server can't trust any "Yes this is a human" that comes from the client. So even if you could make this algorithm it would not solve the intended problem.
To "Anonymous Coward" by Jane+Q.+Public · 2009-04-25 04:27 · Score: 1

They hadn't pointed it out by the time I posted this.

Nevertheless, microdelays and such are not "biometrics". They are behavior-based. The fact is, though, that people generally find this kind of behavior-based approach harder to fake or mask than actual biometrics. That is where the difference lies: in the difficulty of obscuring who you are.
Sounds good ... for about a year ... by gordguide · 2009-04-25 05:22 · Score: 1

This whole thing is a moving target.
Anything your algorithm can do, my algorithm can do too.
Might work for a while, though, but then again, so did CAPTCHAs.
Wait, did I just say "so did CAPTCHAs"? What I meant was, so are CAPTCHAs, because everyone is still using them, even though they don't work.
Which is the real problem ... not only is the whole thing a moving target, but tackling the problem only works when everyone actually moves.
Remember, it's measure --> countermeasure.
All this really means is now everyone gets to live like we really are in a 1960's spy movie. Sure hope that's what everyone wanted.
...no? by Miv333 · 2009-04-25 06:41 · Score: 1

A bot will crack it regardless. There is nothing that can be done, other than remove the bot creators.
Honeypots are the ANSWER! by StCredZero · 2009-04-25 06:54 · Score: 1

Honeypots are the Answer! You simply have pages and options which are just distasteful to humans, the reasons for which are not comprehensible to machines! The machines will give themselves away because they cannot distinguish the distasteful options.
Example: A page of Markov-chain nonsense in an otherwise informative website.
This page would be generated using the same technology that spammers use to get past spam filters. Only a real human being or an AI that can achieve some sort of comprehension will be able to tell that it's full of nonsense. Programs that are trying to simulate human browsing behavior will "dwell" on this page, even though it's junk, and give themselves away.
I think this sort of "spam inoculation" can be done in a way that it doesn't detract too much from the website's quality as a whole, much as vaccines incorporate bits of pathogens without harming the patient.
Another approach by martin_dk · 2009-04-25 07:22 · Score: 1
Though the CAPCHA problem is interesting, I think we will se other ways to skip these showstoppers in near future.
Sooner or later google or somebody else will provide a service that will return information on the likeliness that you are human and that your account has not been taken over by malware. Perhaps a kind of an expanded OpenID which may return information on your behavior on several other websites or in the physical world.
Certain actions could provide "human" credits (or some similar or detailed concept)
- Transfer of cash to an certain account
- Having an active account on facebook or some other social platforms
- Approval by some webcam-chat service
- Solving stupid capchas like everybody is doing now
Go ahead make a list of actions that will make it unfeasible for anybody to automate.
Once you have your credits you may use your ID to bypass captchas. Im sure there are clever ways of solving issues about being anonymous when using the ID.
Thinking about new ways of designing exotic captcha puzzles is just plain waste of time
Holy crap by thePowerOfGrayskull · 2009-04-25 09:35 · Score: 1

20 or more of the top-level posts on this page are all "Well yeah, but if a computer can test it, then a computer can emulate it." I'd ask if anybody bothered reading other comments before they posted, but I already know the answer (this /is/ slashdot after all).
On to the topic at hand: this is impractical for another, less complex reason. From what I've been seeing, most of the "bot" registrations these days are not bots, they're people. If those who wish to can pay someone a couple dollars a day to spam registrations and comments, there's really not much defense against it.
1. Re:Holy crap by thePowerOfGrayskull · 2009-04-26 14:28 · Score: 1
  
  Dude, that was funny.
2. Re:Holy crap by peoplesign · 2009-04-26 23:06 · Score: 1
  
  Thanks man: I posted anonymously for non-obvious reasons. Truth be told that was my first comment on /. and I was ignorant of the implications of posting as 'Anonymous Cringer'.
  
  Since you thought it was funny I'll repost the AC comment (sorry in advance if that's against the rules)
  --
  
  He-man, for a meat-head you bring up a good point. The sweatshops are a real problem; however, there is a defense against them.
  
  Consider this: the folks working there generally can't read English. Yet they're cracking CAPTCHAs on sites designed for English-literate visitors!
  
  Why not use a CAPTCHA that requires English literacy to solve? Maybe show a picture of Battle Cat and ask them to select a matching English phrase, such as "ferocious feline" or "green puss" from a list containing other non-matching phrases. But wait, such a CAPTCHA exists, it's available for free for immediate install on a php website, and there's a live demo at peoplesign.com!
  
  Yes He-man, with peoplesign you DO have the power to stop spam-baron Skeletor's sweatshop armies! Note that peoplesign also works great against Skeletor's conventional spamming threats such as hover robots and spiderdroids.
Yes, correct terminology: "one-way" by jonaskoelker · 2009-04-25 10:18 · Score: 1

In fact, there are many, so-called, one-way (correct terminology?) algorithms.
Background: I'm doing my phd in crypto. I use terms like one-way function (and one-way {,trapdoor} permutation.
Won't work. by wiedzmin · 2009-04-27 05:46 · Score: 1

I have tried the keystroke dynamics authentication systems for example, and my personal opinion is that they don't work. In my opinion, if one human can implement a solution - another human will be able to implement a bot to bypass it. The only way you will be able to defeat bots is to create something that constantly permutates and advances, making development of bots that can defeat it in its current form if not impossible, then at least inefficient. Anything more permanent, will eventually be defeated as we can see on example of CAPTCHA, DVD and BluRay (the latter actually might have something going for it).

--
Bow before me, for I am root.