Slashdot Mirror


Experimenting On Mechanical Turk

itwbennett writes "In a recent article, Dr. Markus Jakobsson, a Principal Scientist at PARC, offers some tips on effectively running human-subject research studies on Amazon's Mechanical Turk. '...[B]enefits [include] very low experiment costs, quick turn-around rates, and relatively simple approvals from human subjects boards. But you have to be careful to avoid bias and error.' says Dr. Jakobsson. For example, in many situations subjects may be biased just from knowing that they are participating in a study, or by knowing the goals of a study. To avoid this bias, you need to 'convey a different task to your subject than what you are observing — essentially deceive them — to see how they react when faced with the situation of interest. Consider a study of user reactions to phishing sites. You may, for example, say that you are studying the common reaction to online e-commerce sites, and ask them to rate how helpful various sites are, with a free-text input field where they can add other observations. You first show them three or four legitimate websites, asking them to rate and describe them; then you show them a phishing site and do the same. Will they tell you that this is a site run by fraudsters? If they do, they noticed signs of fraud without you prompting them.'" The author also gives tips on avoiding cheaters, and determining how much to pay and when.

46 comments

  1. I'm still waiting... by mirix · · Score: 3, Funny

    For mechanical turkish delight.

    --
    Sent from my PDP-11
  2. Man... by ZekoMal · · Score: 3, Interesting

    The title makes it sounds so exciting, like we're experimenting with our robotic Turks.

    But the statistician within me is also fascinated with this. It always made me wonder, as the human mind can precondition itself. The study about whether or not prayer helps the sickly followed this mindset, and since the sick humans had no idea what the study was about, when they were told that people were praying for them they thought they were much sicker and actually recovered more slowly. If they hadn't told the sick humans, however, would there have been a large difference?

    Math and the brain, it's amazing how they meet.

    1. Re:Man... by Idiomatick · · Score: 4, Funny

      "likely to be fucking Christians"

      They better be married then!

    2. Re:Man... by biryokumaru · · Score: 0, Offtopic

      Wow.

      I mean... just wow.

      If there is anyone on Slashdot who is deserving, in the extreme, of their own criticism, it's you.

      I mean... just wow.

      Seeing as you clearly don't have a firm grasp of the English language, it is probably best in the future that you don't start off any arguments critiquing valid grammar choices. Being a non-native speaker is not a reasonable defense for you right now.

      --
      When you're afraid to download music illegally in your own home, then the terrorists have won!
    3. Re:Man... by shic · · Score: 1, Interesting

      The study about whether or not prayer helps the sickly followed this mindset, and since the sick humans had no idea what the study was about, when they were told that people were praying for them they thought they were much sicker and actually recovered more slowly.

      I've recently read about this - in anticipation of seeing Richard Dawkins speak next month. In spite of 'Christian origins' I've always considered evolution to be as rock-solid a theory as mankind has discovered... but, on reading this prominent Atheist's work, I find myself aligning more and more with the perspectives of those Dawkins dismisses as 'Creationists'. Dawkins' arguments are so unbelievably poor, it seems to me that - in effect, if not by intention - he serves as a fifth-columnist for religious cults in undermining the intellectual integrity I associate with science.

      I strongly suspect that the benefit of prayer is predominantly for those who pray as opposed to those for whom prayers are directed. I definitely see benefits from 'meditation' - and expect that ritual observance is beneficial in its own right... allowing people to 'do something' when they feel helpless... it likely also has subconscious influences which are significant. Perhaps the act influences the praying to subtly alter their practical behaviour? Similarly, being prayed for by a bunch of strangers who clearly have no particular reason to care must feel depressing - the idea that "it's come to this" could easily be a negative influence - especially for those who aren't immersed in the culture.

    4. Re:Man... by Arthur+Grumbine · · Score: 3, Funny

      Math and the brain, it's amazing how they meet.

      (Brain is walking past dark alley)
      Math: Pssst!! Hey, you!
      Brain: Huh? Wh-- me?
      Math: Yeah - you! You wanna try some good shit? I got some seriously advanced Number Theory, here. You totally have to try this shit!
      Brain: Nah, thanks anyway, man, but I'm not into that hard shit.
      Math: You sure?! This shit's fucking amazing, man. Real pure, man, grade A. Some of this shit will blow your fucking mind!
      Brain: Nah it's cool, thank--
      Math: That's cool, that's cool, man. It's not for everybody...how 'bout some primo Euclidean Geometry? You ain't never calculated the sum of the squares of the sides of a right triangle until you've demonstrated the Pythagorean Theorem. This shit's a trip!
      Brain: Hey, yeah, I'd be down for some of that...
      Math: Excellent! Now don't worry about nothin' - the first book is on me... friend. (puts his arm around Brain)
      Brain: Hey, man, you're all right, y'know...

      --
      Now that I think about it, I'm pretty sure everything I just said is completely wrong.
    5. Re:Man... by Anonymous Coward · · Score: 0

      Maybe you should develop a mechanical turk HIT that will help you identify the nationality of a potential 'virtual' employee.

      While divining their country of origin, you could also determine what might induce them to purchase your product or service.

      Please Anonymous Coward, share with us your company name, products or services that you offer, so that we (Americans) can reciprocate and have nothing to do with you.

      I'd write more, but I have to drive 5 miles while eating a Big Mac in my SUV with the "Jesus Loves You" bumper sticker so I can meet my lawyer and arrange for a summons to be served to /. to determine your IP address info in order to sue your @ss.

    6. Re:Man... by ZekoMal · · Score: 1

      Meditation is a lot about clearing your mind and simply relaxing, which does the body a lot of good. Praying makes us feel good about ourselves because we feel we've done something. Maybe then we should assume that prayer and the like does not have divine power, but that the sense of relaxation or helping someone else makes us feel better.

      Now there's a study worth doing. Another study worth doing would be to see if loved ones praying for someone without that someone knowing would indeed speed up their recovery or not.

      Either way, prayer does not literally move mountains, and its power is not as strong as one might think. From "every prayer of the righteous is heard and answered by God" to "every prayer of the righteous may make the praying person feel better about themselves". Seems a little...placebo-y.

    7. Re:Man... by ZekoMal · · Score: 1

      Which is why I'm a skinny, semi-fit woman that has never sued anyone, is atheist, and pretty high up there in IQ-skill (if you believe that arbitrary system). Man I love when people throw broad sweeping generalizations at someone and miss the mark on each one.

      What's that you say? I didn't address the "math" issue?

      What's that? Every language has subtle nuanced differences, such as the various forms of Chinese, French, and Spanish to name a few? Geographical location alters the way in which we speak? Some historians believe that Americans speak a less bastardized form of old English because England has had near-constant contact with multiple languages?

      But go ahead and believe that my lack of an "s" at the end of "math" makes me a fat, stupid, Christian with an IQ of 30 flipping burgers at McDonalds whilst suing the bejeezus out of everyone. Totally makes sense, oh troll of trolls.

    8. Re:Man... by tsm_sf · · Score: 1

      I like how people will assume that a country twenty or thirty times larger than theirs is completely homogenous. Must be all the alcohol, bad dentistry, and knife crimYOU SEE WHAT I DID THERE?

      --
      Literalism isn't a form of humor, it's you being irritating.
    9. Re:Man... by tibman · · Score: 2, Funny

      When I was a stupid kid I worked at McDonalds : /

      But i did walk two miles there and back everyday after school! Bought a huge trampoline with the first paycheck :) Then later a rusted pile of metal i mistook for a car... oh the memories.

      --
      http://soylentnews.org/~tibman
    10. Re:Man... by Anonymous Coward · · Score: 0

      5 times the population, motherfucker. Or do you regard Canada as being a bigger country than the US?

    11. Re:Man... by Anonymous Coward · · Score: 0

      For those who wish to learn the fine art of trolling, let this be a prime exhibit. Well written, some spelling/grammar mistakes (but not too many), and a hint of believability.

      My hat's off to you sir.

    12. Re:Man... by Anonymous Coward · · Score: 0

      Uh, "larger" was being applied to the word country, not population.... so... yes?

    13. Re:Man... by shic · · Score: 1

      LOL, very Dawkinsian. The problem with the divine is that, by definition, it defies scientific investigation. Sure, you might be able to set up these experiments - but you won't be able to overcome my objection that correlation is not causation.

      This relates to other ideas I've been pondering of late (of a more earthly variety...) relating to economic policy. Essentially, central bankers are faced with something of a 'divine' problem in the task they're set. There are 'levers' that can be pulled, and there are statistics that can be watched - but it is impossible to say, with any scientific certainty, that any intervention had the desired effect... since there is no control-experiment... no parallel universe in which a different decision can be made with which to compare. This is rather like the prayer experiments... where, if you insist on a scientific-sounding explanation, the effect of praying might be subtle and indirect - like the proverbial butterfly wings in chaos theory. Until you can provide a parallel universe with a different praying behaviour... it is impossible to confirm or refute the value of the activity to the extent that any thinking scientist should be happy to agree.

      Divine questions are about establishing what should be - independent of what is or what was. Simply thinking about these things has an impact on our environment - even if that is restricted to a few neurons in a skull... I think it would be very foolish to rule out any wider influence... irrespective of whether existing scientific theories can explain a mechanism. Our theories approximate reality - not the other way around.

  3. Well now the cat is out of the bag... by Aldenissin · · Score: 2, Interesting

    I often see websites where I feel there is a hidden agenda other than to make me happy enjoying the content to make the website owners some money. But people get savvy over time.. and it will start to get obvious. One thing is for sure.. you can only do this at most a few times before individuals start to figure it out, or you have get a new pool every time. Unless, you are looking for possible even more real world scenarios where there are a lot of websites that are frauds...

    --
    Like a city whose walls are broken down is a man who lacks self-control.
    1. Re:Well now the cat is out of the bag... by PopeRatzo · · Score: 3, Insightful

      you can only do this at most a few times before individuals start to figure it out

      No, with most people you can do it over, and over, and over.

      It's why Fox News makes money.

      --
      You are welcome on my lawn.
  4. Huge Problem with this study by Anonymous Coward · · Score: 3, Interesting

    If I was in a study gauging the helpfulness of various websites and one I was shown a fake website, I would simply assume they ran out of, or could not have access too the real site and so were using a mockup.

    If the mockup was particularly bad I might tell them, but otherwise I'd probably chuckle to myself and then just rate it as usual.

    There is a huge margin of error with this type of thinking.

  5. you may want to scrub. by Anonymous Coward · · Score: 2, Funny

    i make my mechanical Turk have sex with my mechanical JD.

  6. Great Article, however... by smitty777 · · Score: 5, Interesting

    Great article, however you should realize it's impossible to completely avoid bias. For example, he has restricted his population to MTurk users while generalizing to the population of web users. He also "weeded out" the lazy people for the convenience of his experiment - aren't there lazy people in the real world?

    --
    "Before God we are all equally wise - and equally foolish"
    Albert Einstein
    1. Re:Great Article, however... by radtea · · Score: 5, Insightful

      Great article, however you should realize it's impossible to completely avoid bias.

      It's also impossible to completely avoid banal truisms masquerading as insight, but that doesn't stop us from trying.

      --
      Blasphemy is a human right. Blasphemophobia kills.
    2. Re:Great Article, however... by Idiomatick · · Score: 1

      I'd mod you up but I wasn't sure if you were shooting for redundant too.

    3. Re:Great Article, however... by smitty777 · · Score: 1

      Awww...don't be a hater. The whole article is on avoiding bias - don't you see a little irony here?

      --
      "Before God we are all equally wise - and equally foolish"
      Albert Einstein
    4. Re:Great Article, however... by Anonymous Coward · · Score: 0

      Truisms? Doubleplusgood newspeak!

  7. Better idea: take a research methods class by pridkett · · Score: 5, Insightful

    The subject basically says it all. If you're conducting a study, either for academia or industry, do yourself a favor and take a good research methods class. I can't count then number of promising studies I've rejected for publication because their methods were poor. While Dr. Jacobson makes some good points, most of them are pretty obvious to anyone who has taken a good class on creating experiments with humans -- for example, deception is a cornerstone of many human studies. Also, for you budding young scientists, make sure you get IRB approval before conducting your study. I reviewed a paper where the authors were clearly from a University and I had questions about the ethics of their methods so I asked for the IRB data for the study...whoops, they never got it. The paper was withdrawn by the authors shortly after.

    --
    My Slashdot account is old enough to drink...
    1. Re:Better idea: take a research methods class by sbump · · Score: 1

      While Dr. Jacobson makes some good points, most of them are pretty obvious to anyone who has taken a good class on creating experiments with humans

      I did not find the article trivial like that. There's a lot of useful thinking the author has put into Turk specifically---and how it intersects with research methods, IRB, etc. For instance his own experiments to help think about what to pay people. (Not a lot of detail given on how he did this, but helpful information nonetheless.) Also helpful technical information on how, within the structure of MTurk, to maintain contact with subjects for follow-ups, make payments efficiently, etc.

    2. Re:Better idea: take a research methods class by mikael · · Score: 2, Informative

      I've done 'turking' a few times - mainly transcription work and some paragraph writing. For an individual, whether a task is worth doing really depends on how quickly they can do it. The going rate would be around $20/hour.

      Simple quick repetitive tasks pay the least, while long creative tasks pay the most.
      Quick tasks might just pay 10 cents and require the 'turker' to look at an image of a road from a car and click yes/no buttons to say whether there are road markings or traffic lights. Some people are good at doing this kind of repetitive work, so they can keep going for hours and hours.

      Creative work usually consists of paragraph writing, and would involve writing 500 words or so for $10, with a particular set of words repeated a specified number of times. A person with English as a first language could do that within 30 minutes, so that's a quick little earner.

      There are legends about how the first Amazon Tasks were recognizing music artists and songs from a 10 second snippet of audio. For each task, turkers were being paid 50 cents. People were quick to realize that a single four minute track was being automatically chopped up into 20+ such snippets, so if they got one snippet right, the odds are they could get the next 20 or so tasks done as well. Another nice little earner for $10 every 30 minutes.

      Transcription work involves listening to a multi-minute segment of audio and converting it into written text. Tricky work as you have to write down the time, who is speaking and what they said, taking into account regional accents. Alternatively, you could work as an editor and edit together 20 or so transcriptions to form a single consistent transcript that covered an hour or so. Just 3 minutes work would take half an hour, but with bonuses that would make $10 for 30 minutes work.

      A search for Steve Fossett was performed using Amazon turk. They took satellite photos of the desert, diced them up and had people look at each square. Many other wrecks were found, but not Steve, as the crash site was actually up at 10,000 feet in the mountains.

      --
      Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
    3. Re:Better idea: take a research methods class by edcheevy · · Score: 1

      Thank you! I wish I had mod points. It's a nice article but you'll get all of this and more in a decent research methods class.

  8. Old concept, applied to the web. by Aladrin · · Score: 4, Interesting

    When I was a kid, my parents received a free episode of some potential new comedy show to evaluate for them. We watched it, commercials and all. (Yeah, the commercials were a surprise.) At the end, unsurprisingly, the questions were actually about the commercials and only the last was about the show. It didn't fool us at all, of course.

    --
    "If you make people think they're thinking, they'll love you; But if you really make them think, they'll hate you." - DM
    1. Re:Old concept, applied to the web. by Anonymous Coward · · Score: 0

      Any chance the show was "Dads"? Apparently that show was used in exactly the manner you described for a number of years to test various commercials.

  9. Re : by Anonymous Coward · · Score: 0

    But I am a digital Turk you insensitive clod ! Allah cezanizi verir insallah ...

  10. what the fuck? by Anonymous Coward · · Score: 0

    The ability to ignore extraneous data is basically what separates the good mind from the bad. If some procedure requires me to concentrate on X, and Y is not part of X, there's no way in hell I'm going to concern myself with Y unless it's threatening to shoot me in the face. Put at another way, I aim to be highly perceptive where I need to be, and in doing so I aim to filter out everything else. I conjecture that this is how many people who are focused in their work tend to act.

    Applied to this example, when entering personal data on the web I *am* on the lookout for phishing attempts. When doing a supposedly controlled study, I'm not. Because I am focused, I would be likely to succeed in the real-world test but fail at the deceptive one.

    (Actually, this isn't quite true, because when I am given a study to take part in I *always* add a "how might the tester be trying to deceive me?" factor to my response. But if your subject is always looking for ways you might be trying to deceive him, he's again not a valid subject. In conclusion, all controlled tests based on deception are invalid.)

  11. apropos of nothing having to do with mech turks by PopeRatzo · · Score: 1

    Some funny shit right there. I also like the story about the third amendment rights group.

    --
    You are welcome on my lawn.
  12. MPU Re:Better idea: take a research methods class by cnoocy · · Score: 1

    Thank you for saying this. IRBs exist for a reason and it's concerning to see an article like this essentially saying "Hey kids! Go do some human experimentation!"

    --
    This sig is not the Zahir. Lucky for you.
  13. Why is this news? by AtomicDevice · · Score: 2, Interesting

    This is like psych 101, don't tell subjects what they are doing, or they'll just game you and your results will suck. Add internet. Same concept

    --
    Ze Atomic Device! It iz Ztolen!
  14. What the hell is mechanical turk? by damn_registrars · · Score: 1
    I had to read to the second paragraph to find:

    Amazon Mechanical Turk, or MTurk for short, is a cloud computing platform that permits outsourcing of tasks to other users, using a built-in payment scheme to compensate users. People (often referred to as "Turkers") perform MTurk tasks, which are called Human Intelligence Tasks (HITs), and are paid just a few cents for completing them.

    Although I'm still not sure what is mechanical or turkish about it. The Amazon part apparently refers to the fact that payment is made in way of credits to Amazon.com.

    --
    Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
    1. Re:What the hell is mechanical turk? by Anonymous Coward · · Score: 0

      Mechanical Turk in Sweden would be a pizza making robot.

    2. Re:What the hell is mechanical turk? by pamar · · Score: 2, Informative

      ...

      Although I'm still not sure what is mechanical or turkish about it. The Amazon part apparently refers to the fact that payment is made in way of credits to Amazon.com.

      Amazon got the name from this: http://en.wikipedia.org/wiki/The_Turk

  15. Biased in the other direction by SleazyRidr · · Score: 1

    There've been enough comments about why you need to trick your participants, so I won't go into it, but the specific content of this study made me think.

    I usually go to websites thinking that they are probably going to scam me, and don't let my guard down until they've demonstrated that they aren't. But in this situation, where I'm being directed to it as part of a survey on e-commerce sites, I'd probably show a lot less scepticism at the start, and would be less likely to flag it as if I had just stumbled onto it normally.

    Ah, the things we have to take into account.

  16. blah by Anonymous Coward · · Score: 0

    You think a bunch of indians would care? As long as they get the 1 cent they will move on fast and type whatever.

  17. What's a Mechanical Turk? by Tetsujin · · Score: 2, Funny

    Is it something like an Electric Monk?

    --
    Bow-ties are cool.
  18. +1 to the /. crew... by Undead+NDR · · Score: 1

    ...for spotting the chess reference in the name.

  19. No IRB? by GregNorc · · Score: 1

    Contrary to the article (at CMU at least) we _are_ required to run our Mturk studies past the IRB.

    They all get rubber stamped 99% of the time, but it's still an important formality.