Slashdot Mirror


Many Surveys, About One In Five, May Contain Fraudulent Data (sciencemag.org)

sciencehabit writes: How often do people conducting surveys simply fabricate some or all of the data? Several high-profile cases of fraud over the past few years have shone a spotlight on that question, but the full scope of the problem has remained unknown. [Tuesday], at a meeting in Washington, D.C., a pair of well-known researchers presented a statistical test for detecting fabricated data in survey answers. When they applied it to more than 1000 public data sets from international surveys, a worrying picture emerged: About one in five of the surveys failed, indicating a high likelihood of fabricated data.

115 comments

  1. 97% Consensus by Anonymous Coward · · Score: 1

    Well, we know one does for sure.

    1. Re:97% Consensus by frovingslosh · · Score: 4, Funny

      Hey, we thought that 1 in 5 sounded like a nice believable number. How dare you question it. We looked at at least three surveys to come up with this.

      --
      I'm an American. I love this country and the freedoms that we used to have.
  2. Of course. by Anonymous Coward · · Score: 0

    Our survey shows many surveys contain fraudulent data. And I am lying right now.

    1. Re:Of course. by Anonymous Coward · · Score: 0

      No you aren't. But I am.

    2. Re:Of course. by Anonymous Coward · · Score: 0

      Everything I post on slashdot is a lie

    3. Re:Of course. by jrumney · · Score: 1

      The lie in this case is the 1 in 5 figure. In reality it is much higher.

    4. Re:Of course. by goose-incarnated · · Score: 1

      The lie in this case is the 1 in 5 figure. In reality it is much higher.

      Much higher than you think if you take into account surveys done by people who need to see a certain result - look at the survey which claims that 1 in 4 women are sexually molested or raped on campus.

      One of the questions, IIRC, was "While drunk, did you ever have sex with someone you wouldn't normally consent to have sex with?", and counted that as a rape. Even if all other parts of the survey work were accurate, it's easy to get the result you want by asking the right question.

      --
      I'm a minority race. Save your vitriol for white people.
    5. Re:Of course. by beastofburdon · · Score: 1

      Not to mention the fact that their definition of rape excluded the possibility of men being included unless they were raped by another man.
      Then there are the domestic violence surveys that count using logic to counter a woman's argument as domestic violence as well as a man withholding sex from a woman. Funny how neither of those counted when it was a woman using the same tactics against a man. There were many more tactics used to inflate the figures, but I do not have time to go through that list, it would take all day.

  3. Gee, you don't suppose respondents lie? by Anonymous Coward · · Score: 4, Interesting

    When I take most surveys I answer calculated to confound the test as much as possible, on the assumption that anyone could be doing so; and thus, to accelerate the process is to bring it to the attention of the researchers faster. The problem is that it's an inherently flawed method.

    1. Re:Gee, you don't suppose respondents lie? by fustakrakich · · Score: 1

      The problem is that it's an inherently flawed method.

      The problem is that people believe what they see and hear

      --
      “He’s not deformed, he’s just drunk!”
    2. Re:Gee, you don't suppose respondents lie? by Anonymous Coward · · Score: 0

      ...I'm pretty sure even kids who make surveys are aware people can say whatever they want to on them. You don't need to "school" them on that.

    3. Re:Gee, you don't suppose respondents lie? by Anonymous Coward · · Score: 1, Interesting

      Respondent's lie and researchers lie. I've been unable to find it now, but a month ago I came across a essay from an established sociologist who was bemoaning the current number of researchers in his field who were conducting shit studies in the furtherance of promoting their political and cultural ideology. He examined a few prominent studies that were debunked when it came to light that the researchers had either manipulated, or dropped data points that conflicted with their study's conclusions, or outright fabricated data. I wish I could find it because it was very illuminating about how younger social/psychological science researchers are teaming up with journalists that have the same ideological bent to manipulate and shape public opinion.

    4. Re:Gee, you don't suppose respondents lie? by ShanghaiBill · · Score: 2

      When I take most surveys I answer calculated to confound the test as much as possible

      If a few people did this randomly, then it wouldn't skew the results much. But it is not random. Liberals are more willing to participate in surveys, and more willing to answer honestly. Conservatives tend to be more cynical and calculating. Other factors skewing the results are that Democrats are more likely to be home, more likely to answer the phone, and more likely to participate in social media polls. Republican are more likely to let their phone calls roll over to voice mail, but also more likely to have more than one phone line. Professional pollsters try to take all these factors into account.

      The statistical test in TFA won't fix the problem. It will just give the fakers another tool. Just make up some fake data, apply the test, and if it fails then manipulate the data and try again.

    5. Re:Gee, you don't suppose respondents lie? by codeAlDente · · Score: 2

      Probably not the article you're talking about, but same conclusion, and a good read: http://prospect.org/article/ac...

      --
      He once inserted random mutations into his code, just so he could have the experience of debugging.
    6. Re:Gee, you don't suppose respondents lie? by codeAlDente · · Score: 1

      That is quite interesting. Can you link to a paper, etc., that calculates the adjustments for liberal vs. conservative survey-taking patterns? I would be interested to see the magnitude of this effect.

      --
      He once inserted random mutations into his code, just so he could have the experience of debugging.
    7. Re:Gee, you don't suppose respondents lie? by The+MAZZTer · · Score: 1

      Google does the whole survey thing with its Google Opinion Rewards app, with Google Play Store credit as an incentive. They address this problem by asking you a bogus question like "Have you been to any of the following locations recently?" and then listing locations that do not exist. If you answer in the affirmative, they know you're lying and cut you off from surveys in the future.

    8. Re:Gee, you don't suppose respondents lie? by EndlessNameless · · Score: 1, Insightful

      Your response will most likely be washed out by a sea of honest responses.

      Most participants respond to the best of their ability---although it cannot be assumed they are always correct. Respondent error, even about themselves, is more common than outright deception.

      Researchers are aware that participants lie due to self-deception, social desirability, deliberate sabotage, and other reasons. Surveys often incorporate measures to detect deception.

      If you answer in a nonsensical fashion, the worst you'll do is reduce the likelihood of the results being significant---assuming your results are contrary to the hypothesis and are not eliminated for being invalid or an outlier. It's not like the entire survey method is going away; its weaknesses are well-known, and it is generally used as a last resort.

      Note that this behavior can only hurt honest scientists. If someone is manipulating data, they will purge your result or fabricate enough desirable results that your behavior is irrelevant. You are literally helping no one.

      --

      ---
      According to the latest ruleset, this post should be modded as Vorpal Flamebait +5.
    9. Re:Gee, you don't suppose respondents lie? by ShanghaiBill · · Score: 2

      Can you link to a paper, etc., that calculates the adjustments for liberal vs. conservative survey-taking patterns?

      Sorry, I have seen these issues mentioned several places, including Nate Silver's blog, but I have never seen them actually quantified.

      I think the most famous skewed poll was in 1948. Phone surveys predicted a decisive win by Dewey, but instead Truman was re-elected. The reason was that back in 1948, households with lower incomes (and more likely to vote Democratic) often didn't have a phone.

    10. Re:Gee, you don't suppose respondents lie? by codeAlDente · · Score: 1

      No worries, just curious. I've read Nate's blog since we learned in the 2012 election that he's the first to successfully apply Bayes rule to political science. That part of his model is probably proprietary.

      --
      He once inserted random mutations into his code, just so he could have the experience of debugging.
    11. Re:Gee, you don't suppose respondents lie? by gumbi+west · · Score: 1

      The problem is that it's an inherently flawed method.

      Yet Nate Silver accurately predicted the last two presidential election outcomes. Put another way, just because you're a skeptic doesn't mean you're not the one making shit up and chasing ghosts that aren't there.

    12. Re:Gee, you don't suppose respondents lie? by Anonymous Coward · · Score: 0

      "The problem is that people believe what they see and hear"

      Well, I can see you or hear you so I don't believe you.

    13. Re:Gee, you don't suppose respondents lie? by jrumney · · Score: 1

      I think the problem is that surveys always return an unusually high number of responses by 104 year old female construction workers from Kazakhstan holding PhDs, so researchers have come to think that it is normal to get so many responses from that group.

    14. Re:Gee, you don't suppose respondents lie? by Anonymous Coward · · Score: 0

      Two elections is hardly a statistical trend.

    15. Re: Gee, you don't suppose respondents lie? by cthulhu11 · · Score: 1

      I see so many flaws in surveys, including multiple choices that don't include my answer, and page after page of ranking grids. Worse yet are the ones that show four product models -- say, server computers -- with random combinations of features and prices, asking me to rank. If they want to know what matters to me, just ask, don't try to trick it out of me. Other surveys lie about duration. All ways that people stop trying to answer and instead just pick randomly or on a straight line to get through the damned thing. A few have questions like "select D for this row" to try to detect this, but the practice is far from consistent and of limited efficacy.

    16. Re: Gee, you don't suppose respondents lie? by Anonymous Coward · · Score: 0

      What the fuck has the issue of fabricated data in academic surveys got to do with your homophobia?

    17. Re:Gee, you don't suppose respondents lie? by jc79 · · Score: 1

      As a gross estimate, the odds are 1/4 that he would have predicted both elections successfully. I'd give it a bit longer before deciding that he's got all the answers.

    18. Re:Gee, you don't suppose respondents lie? by gumbi+west · · Score: 1

      There were 50 states and he nailed them all.

  4. Self referential? by goombah99 · · Score: 4, Insightful

    23.7 % of statistical analyses make up their statistics.

    --
    Some drink at the fountain of knowledge. Others just gargle.
    1. Re:Self referential? by butzwonker · · Score: 1

      Also, 94.2% of all statistics quoted on /. are ad hoc inventions.

    2. Re:Self referential? by codeAlDente · · Score: 1

      No, 65.6% of these stats appear to be ad hoc but in reality there is an agenda behind them. 76.4% of the time that agenda includes a bad joke. Moo.

      --
      He once inserted random mutations into his code, just so he could have the experience of debugging.
    3. Re:Self referential? by twotacocombo · · Score: 1

      60% of the time, statistics lie every time.

  5. at least i dont eat flesh from my feet by Anonymous Coward · · Score: 0

    flesh feet

  6. only 1 in 5 fraudulent? by Anonymous Coward · · Score: 0

    I bet its higher than that.

    1. Re:only 1 in 5 fraudulent? by penguinoid · · Score: 4, Informative

      The rest are merely intentionally misleading. You can get just about any answer you want, without making up data, by carefully selecting your questions and your survey population.

      --
      Don't waste your vote! Vote for whoever you want, unless you live in a swing state it won't matter anyways
    2. Re:only 1 in 5 fraudulent? by Anonymous Coward · · Score: 0

      What I came here to say.

      Mods do the needful.

    3. Re:only 1 in 5 fraudulent? by Anonymous Coward · · Score: 0

      The rest are merely intentionally misleading. You can get just about any answer you want, without making up data, by carefully selecting your questions and your survey population.

      Manipulation like this is exactly why statistics as a whole should carry about as much weight as a patients astrological sign in a hospital.

      Defining statistics accurately? Manufactured Bullshit for numbers sake.

    4. Re:only 1 in 5 fraudulent? by david_thornley · · Score: 1

      There's nothing wrong with statistics as a field. There's plenty wrong with how some people apply it.

      A couple of sayings come to mind: "Figures don't lie, but liars figure", and "He used the statistics as a drunk uses a lamppost: for support, not illumination."

      --
      "When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
  7. Obgliatory Simpsons by Anonymous Coward · · Score: 0

    "You can come up with statistics to prove anything, Kent. Forfty percent of all people know that."

    1. Re:Obgliatory Simpsons by HiThere · · Score: 1

      You can lie about anything. Claiming that it is a statistic doesn't make it anything other than another lie.

      It's also true that statistics are easy to misuse, but the problem isn't (usually) lies, it's usually people not understanding what they are doing...or at least so I believe.

      Statistics was invented to solve practical problems, specifically to win gambling games without cheating. It works. (And the original games it was developed for have for some reason become unpopular, or had their rules changed.) This doesn't mean you can't use it inappropriately, i.e. in places where it doesn't work, but the same is true of the inverse square law.

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
  8. One in 5 surveys incompetently fraudulent by Diss+Champ · · Score: 5, Insightful

    It's not that only 1 in 5 surveys may contain fraudulent data, it is that the fraud is only incompetent enough to be caught by this method in 1 in 5 surveys.

    1. Re:One in 5 surveys incompetently fraudulent by codeAlDente · · Score: 1

      Exactly, and there is a negative correlation between the amount of incompetence and the amount of funding.

      --
      He once inserted random mutations into his code, just so he could have the experience of debugging.
    2. Re:One in 5 surveys incompetently fraudulent by EndlessNameless · · Score: 1

      They could have both false positives and false negatives.

      They eliminated studies which had a prima facie case for having highly similar responses. However, it is possible for them to miss a study which generates fairly consistent response patterns for non-obvious reasons.

      I can buy the 17% number. A sizable majority are legitimate scientific endeavors, but there are enough bad actors that you need to actively seek them out.

      I seem to recall about 9% of Americans have a felony conviction. If you assume that's the dumber half of the felons being caught, you end up in roughly the same ballpark.

      --

      ---
      According to the latest ruleset, this post should be modded as Vorpal Flamebait +5.
  9. I'm not actually surprised ... by gstoddart · · Score: 4, Insightful

    Like it or not, a lot of public opinion polls are paid for by people who want to support a specific point.

    Public opinion polls these days are as much PR and marketing as anything else.

    Honestly, Pew makes money doing this stuff; honest player or not, they have a vested interest in keeping up the belief that their stuff is honest, unbiased, and accurate.

    But I'm entirely willing to believe opinion polls are carefully crafted, or sneakily tweaked, to arrive at the conclusions they've been commissioned to a arrive at.

    --
    Lost at C:>. Found at C.
    1. Re:I'm not actually surprised ... by Anonymous Coward · · Score: 1

      Right, like the Gartner Magic Quadrant shit that most C-suite people drool over. I've talked to several software vendors who have told me that Gartner approaches them on the side and allows them to "buy up" their rankings for the right price.

    2. Re:I'm not actually surprised ... by gstoddart · · Score: 3, Insightful

      Right, like the Gartner Magic Quadrant shit that most C-suite people drool over.

      Or bond ratings agencies.

      I suspect most people, except the people who cite those things, have long since assumed they're full of shit and the conclusions are paid for.

      Why would you assume it's honest and objective information? Someone has to make money off it.

      And if you don't like that, start your own foundation or think tank, and have them publish stuff to your liking.

      Sorry, it's all PR and marketing. It sure as hell aint facts or accurate predictons.

      --
      Lost at C:>. Found at C.
    3. Re:I'm not actually surprised ... by Anonymous Coward · · Score: 0

      Nearly all consumer communication has become promotional, rather than informational. Big difference.

  10. Something's Fishy... by Frosty+Piss · · Score: 1

    ...a pair of well-known researchers presented a statistical test for detecting fabricated data in survey answers...

    That sounds a little suspect...

    --
    If you want news from today, you have to come back tomorrow.
  11. Lies by Anonymous Coward · · Score: 0

    Obligatory:

    Lies, Damn Lies, and Statistics

  12. Which means... by Anonymous Coward · · Score: 0

    There's a 76.3% chance your statistic is true. And a 23.7% of a paradox at play.

  13. May not be fraud - simply incompetence. by gurps_npc · · Score: 4, Informative
    There are a lot of ways you can screw with a study. For example, you get dramatically different answers by rephrasing the question.

    If you ask: "Do you believe that mothers should be able to legally murder their babies within 2 months of the creation of life?" you get a very different answer than if you ask "Do you believe that women should have the legal right to abortion when the fetus can be demonstrated to show no brain activity more significant than that of a snail."

    This might be intentional, or simple unconscious bias.

    --
    excitingthingstodo.blogspot.com
    1. Re:May not be fraud - simply incompetence. by Anonymous Coward · · Score: 1

      Never attribute to incompetence that which can easily be explained by malice.

      Consider how loaded and/or leading these questions often are. How unwanted results are known to be thrown out. How population samples are carefully selected to maximize the odds of a certain result.

      To assume that fraudulent data would be some mere accidental whoopsie -though perhaps claimed as such when those defrauding us are caught- would be not only blind, but total sensory deprivation.

    2. Re:May not be fraud - simply incompetence. by Anonymous Coward · · Score: 0

      All we are thinking is in benefit of the woman's health .

    3. Re:May not be fraud - simply incompetence. by Anonymous Coward · · Score: 0

      Sure, you can change a study by asking a completely different question. It is not a baby until it is born. Before that, it is a fetus. So people are bound to interpret the first question as "kill the baby withing 2 months of being born". A fetus exists 2 months after conception - but it is not 'a baby' at that stage. A life form yes - a baby no!

    4. Re: May not be fraud - simply incompetence. by Anonymous Coward · · Score: 0

      Fetus is an anachronism. We use the word baby to describe that stage of life now. It's a living language, for better or for worse.

    5. Re: May not be fraud - simply incompetence. by david_thornley · · Score: 1

      Who uses the word "baby" that way? That's an important question here. Also, what do "we" do to distinguish a "baby" still inside a woman from one that's been born?

      --
      "When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
  14. 4chan by Anonymous Coward · · Score: 0

    4chan is known to trolls lots of these things often with hilarious results.

  15. Including This Study? by Jason+Levine · · Score: 1

    So one in five studies might contain false data. Essentially what they are saying is that there's a 20% chance that they are lying about there being a 20% chance of them lying.

    --
    My sci-fi novel, Ghost Thief, is now available from Amazon.com.
    1. Re:Including This Study? by codeAlDente · · Score: 1

      No. TFA states exactly what they are saying: "About one in five of the surveys failed, indicating a high likelihood of fabricated data."

      --
      He once inserted random mutations into his code, just so he could have the experience of debugging.
  16. Be careful...read what they actually said. by sbaker · · Score: 2

    What they ACTUALLY said was that in surveys conducted in the Western World - only 5% failed their test - but in developing countries - the number was 26% of faked surveys.

    Then, they also say that the KIND of survey matters. Their approach is to say that if 85% of answers are identical between two or more respondents then the result is likely to be faked...but they recognize that (for example) in a health survey, all of the healthy people will answer identically to questions about how healthy they are. So that kind of survey is excluded.

    So if the research is to be taken at face value, then in the Western world, one in twenty of *some* classes of survey are probably faked. But they looked at 1000 surveys to arrive at that number - we don't know what fraction of those came from the developing world. If all you're interested in is Western World surveys - then maybe the sample size is very small. Given that there are some classes of survey that are known to be excluded - is it possible that they included a few of "the wrong kind" in their sample.

    All surveys have an error bar of a few percent - this is a survey about surveys.

    I think the conclusion here is that you should ignore surveys carried out by dubious agencies in the developing world. I don't think you should conclude that surveys done by reputable agencies in the western world are unreliable.

    --
    www.sjbaker.org
    1. Re:Be careful...read what they actually said. by codeAlDente · · Score: 2

      This is not a survey about surveys. This is a statistical test of the number of identical responses in a survey, and the likelihood that such uniformity could have resulted by chance. That probably indicates not just fraud, but incompetent fraud (or, to be fair, perhaps people in developing countries are just much more likely to answer independent questions in exactly the same way, and the researchers drastically underestimated the independence of these questions). Seems like grading math or physics homework though - it's a lot harder to demonstrate when the top students are copying answers from each other.

      --
      He once inserted random mutations into his code, just so he could have the experience of debugging.
    2. Re:Be careful...read what they actually said. by Anonymous Coward · · Score: 0

      I do statistical survey methodology research field for a living, and think this is overhyped a bit.

      Kuriakose and Robbins developed some procedure for detecting fake surveys based on duplicate response patterns, and they've applied it to a bunch of datasets, claiming certain rates of fraudulent survey results in different parts of the world.

      Although I think their method is sort of interesting, I'm really totally not convinced it's a valid test of fraud. Does a dataset failing this test increase the probability of fraud? Sure. But I sense that their method is really flawed and prone to false positives.

      Why? All sorts of reasons. The first is that they basically developed their test on random data (correlated and uncorrelated), which human responses are not. People are not all that different in the end--they're not randomly responding--and they also do funny stuff, like get bored and give the same response over and over again. Their method doesn't seem to account for any of this at all. My guess is that their method is prone to false positives when respondents are giving fake responses, for example (i.e., to mess with the surveyor, or because they're ashamed, or whatever).

      The linked Science paper quotes someone from Pew who pointed out that their test doesn't even account for the number of questions on a survey or the number of respondents. So, if you're doing a relatively smallish survey on a huge number of people, what do you think is going to happen to the number of duplicates?

      Really, the method seems ridiculously naive.

      My guess is that it helps give a rough guesstimate of the relative ratio of problematic surveys in different areas geographically, but I would not use this to identify a fraudulent survey. I'm sure if you combed through the surveys that were flagged as fraudulent, sure you'd find some that were. But my guess is you'd also run into totally normal surveys, and also data processing errors, or poorly designed surveys, etc.

    3. Re:Be careful...read what they actually said. by codeAlDente · · Score: 1

      I dunno, surely it can't be claimed to be a 100% valid test, but it got peer reviewed and published in a stats journal. And it's really an effort to figure out the extent of the problem, not to make a diagnosis of fraud in any specific case. And if false positives are caused by a bunch of people giving fake responses in all the same way, I'd say the test works as advertised, and the notion of a false positive should be revisited.

      --
      He once inserted random mutations into his code, just so he could have the experience of debugging.
  17. Leading questions are also an issue by Anonymous Coward · · Score: 1

    A few years ago a survey said 2/3 of the Danes supports nuclear power. Two days later another survey said 1/3 supports it. A journalist then figured at least one of them is wrong and started digging. It turned out that they didn't answer the same question. 2/3 said yes when asked if nuclear power can be used to reduce worldwide CO2 emissions. 1/3 said yes to Denmark gaining nuclear power for environmental reasons. Despite not answering the same question, the press releases and following headlines made it look like they did.

    There is way more to this than "not in my backyard" to this issue. For starters Denmark can't get rid of nuclear waste. Despite the politicians declaring it a nuclear free country, somebody got away with building 3 test reactors "for research purposes". Today there is 233 kg nuclear waste and it turns out that there is nowhere to get rid of it. Being islands of mud placed by the glacier during the ice age, there is no stable rocky underground to store it. With that in mind and the huge debate about it, I'm more surprised 1/3 even said yes to nuclear power. Main argument (in the media) is that the world leading country in wind energy shouldn't start focusing on something it has no experience in. From an engineering perspective, the high voltage network needs to be rebuild to handle a single source of production rather than distributed production, which would be at least as expensive as the powerplant itself. The list of reasons other than "not in my backyard" for saying yes to nuclear power elsewhere and not in Denmark seems almost endless.

    Back on topic, US had a leading question not long ago. 30% of the republicans wants to bomb Agrabah, the city Aladdin lives in. It makes good headlines, but looking at the survey, it had 50 questions, 49 serious. Question 34 (or was it 32?) asked if Agrabah should be bombed or not. Based on wording and context, at least 30% read that as "should (middle east city pointed out by US military) be bombed?". It had absolutely no hint of the city being fictional.

    1. Re:Leading questions are also an issue by Anonymous Coward · · Score: 0

      Just dump all your waste in the middle of Greenland, do it discreetly and no one will notice.

    2. Re:Leading questions are also an issue by Anonymous Coward · · Score: 0

      The people of Greenland are furious that the Danish government agreed to a US base in the middle of Greenland during the cold war. It's not about the base itself, but rather that they set up a nuclear reactor to power the base. Greenland didn't know about it, but the US has recently revealed it existed. It's still unknown if the government knew about the reactor when they agreed to the base.

      Hell would break loose if nuclear dumping were to be discovered. As for the nuclear waste, the US took the spent fuel rods (weapon grade?). In fact they demanded them at some point. This leaves waste with significantly lower halflife and the plan is now to store it "temporally" in a building for 100 years as this will solve most of the radiation issue.

    3. Re:Leading questions are also an issue by EndlessNameless · · Score: 2

      It had absolutely no hint of the city being fictional.

      If they're willing to bomb a city and kill people without even knowing the specific reason for that bombing, their ignorance is truly dangerous to the world.

      If they cannot immediately recall the "where" and "why" to justify homicide, they deserve to be embarrassed.

      The only conceivable justification for their position is the lack of an option for indicating "unsure" or "no opinion". And I'd be shocked if a modern survey didn't include that.

      --

      ---
      According to the latest ruleset, this post should be modded as Vorpal Flamebait +5.
    4. Re:Leading questions are also an issue by Anonymous Coward · · Score: 0

      Now we know the real reason the glaciers on Greenland are melting.

    5. Re:Leading questions are also an issue by Anonymous Coward · · Score: 0

      it had 50 questions, 49 serious. Question 34 (or was it 32?) asked if Agrabah should be bombed or not. Based on wording and context, at least 30% read that as "should (middle east city pointed out by US military) be bombed?".

      That is no excuse for the respondents. They recommended bombing a city they did not know anything about. There are plenty of cities with middle-east names (because they are in the middle east.) Many are in countries not involved in war with the U.S., or even allied with the U.S. So recommending bombing a place 'with a middle-eastish name' is outright silly. How about bombing Riyadh? Might make those shale oil businesses incredibly profitable . . .

  18. How ironic would it be... by McPierce · · Score: 1

    ...if the data from THIS survey was deliberately falsified to see if anybody actually checked the sources?

    --
    Darryl L. Pierce "What do you care what people think, Mr. Feynman?"
    1. Re:How ironic would it be... by codeAlDente · · Score: 1

      THIS is not a survey.

      --
      He once inserted random mutations into his code, just so he could have the experience of debugging.
    2. Re:How ironic would it be... by McPierce · · Score: 1

      You're right, I was mistaken in calling it a "survey". But my point was, did anybody verify their data? :D

      --
      Darryl L. Pierce "What do you care what people think, Mr. Feynman?"
    3. Re:How ironic would it be... by codeAlDente · · Score: 1

      Yes, it is a good point, and that's why we should be glad that the researchers eschewed Pew's threats and published their paper anyway. Hopefully other researchers will follow up using these, or perhaps more sophisticated, methods to identify the extent of the problem.

      --
      He once inserted random mutations into his code, just so he could have the experience of debugging.
    4. Re:How ironic would it be... by Anonymous Coward · · Score: 0

      Correct, and it is also published in a Scientific Journal, subject to critique by others. (Some of the others may actually have a brain, as opposed to...)

  19. This seems the hard way easier method: by evolutionary · · Score: 1

    Would be easier to write fixed questions wouldn't it? Also harder for the less observant to spot. :D Surveys in my mind go into the category of statistics, which can lie in a number of ways. Surprised at this news.

    --
    "Imagination is more important than knowledge" - Einstein
  20. How many? by JustAnotherOldGuy · · Score: 2

    Many Surveys, About Eight In Three, May Contain Fraudulent Data

    --
    Just cruising through this digital world at 33 1/3 rpm...
    1. Re:How many? by Anonymous Coward · · Score: 0

      One in five may contain fraudulent data, the other 4 do certainly contain fraudulent data

  21. All surveys are not the same by drew_kime · · Score: 2

    "Do you prefer chocolate or vanilla?" is different from "Do you support Falun Gong?" Opinion surveys always have to account for the confounding factor that each respondent may be more likely to provide the socially acceptable answer than their true opinion. The stronger the social stigma associated with the question, the more likely this will be a problem.

    This new test is a useful addition to the data analysis process, but doesn't "prove" anything. The challenge is how to refine the technique. If you want to eliminate "false positives" you would need some way to identify "true positives". And if we had a way to do that, we wouldn't need to do surveys.

    Bottom line: Surveys don't prove anything. At best they point to interesting ideas for future study.

    --
    Nope, no sig
    1. Re:All surveys are not the same by Anonymous Coward · · Score: 0

      "Do you prefer chocolate or vanilla?" is different from "Do you support Falun Gong?" Opinion surveys always have to account for the confounding factor that each respondent may be more likely to provide the socially acceptable answer than their true opinion. The stronger the social stigma associated with the question, the more likely this will be a problem.

      This new test is a useful addition to the data analysis process, but doesn't "prove" anything. The challenge is how to refine the technique. If you want to eliminate "false positives" you would need some way to identify "true positives". And if we had a way to do that, we wouldn't need to do surveys.

      Bottom line: Surveys don't prove anything. At best they point to interesting ideas for future study.

      Right, there was a recent survey of Americans where something like 50% said they went to church every Sunday.
      The real number was something closer to 20%, but the people didn't want to admit that.

      Similarly, a survey would probably tell you that 0% of the population had ever been arrested.

    2. Re:All surveys are not the same by Anonymous Coward · · Score: 1

      Right, there was a recent survey of Americans where something like 50% said they went to church every Sunday.

      The real number was something closer to 20%, but the people didn't want to admit that.

      There is actually statistical models and proven theory behind filtering out noise like that. On top of that, it also provides an indication of uncertainty and provides an interval where the right answer likely is. The problem is that statistical analysis and the math involved is fairly complex. Getting people with a Ph.D. would be good and master of science would do too, but looking at how long it takes to handle the data and the amount of skilled people, you get a max for how many surveys can be made. Sadly the amount of surveys used in press releases is far greater than that number, telling that most are done by unskilled people.

      For the church result, a proper result would be cross referenced with say a church door counter. If 500 says they attend that church every sunday, but only 150 shows up, then we have an indication of how many tells the truth. A more trustworthy survey has say 5000 answers and then 100 interviews to detect how trustworthy the 5000 people can be placed into the groups.

      The modern approach of paying people to answer by hitting checkboxes on a webpage is pure garbage. People aren't selected randomly and you have no control if you get semi random people or you got hit by a flashmob from (name of politician)'s fan club or PETA or whoever could have an interest in screwing a result. During the last election in the UK, all the surveys agreed that tories and labour were tied. labour only got 2/3 of what the tories got. The only survey to predict that was rejected for being wrong considering how much off it was compared to all the others. However that one was the only one based on phone call interviews while the rest were based on internet surveys. I think they ended up with a committee on how to make surveys trustworthy again because the biggest loser were the surveys (lib.dem. might disagree).

    3. Re:All surveys are not the same by drew_kime · · Score: 1

      Similarly, a survey would probably tell you that 0% of the population had ever been arrested.

      "We surveyed a thousand prison inmates. 99.9% reported they were actually innocent. The exception was some guy named 'Andy'."

      --
      Nope, no sig
    4. Re:All surveys are not the same by EndlessNameless · · Score: 1

      If they publish their method with sufficient details for others to duplicate it, the cheaters will be able to use it as well.

      If they vet their fabricated data to ensure it passes muster, we will have a real problem.

      I fear this could end up like the arms race between malware coders and antivirus vendors. Because I doubt the good guys would have much of a chance here either.

      --

      ---
      According to the latest ruleset, this post should be modded as Vorpal Flamebait +5.
  22. Research show by Anonymous Coward · · Score: 1

    Over half of all analysis of surveys is flawed.

  23. Everyone does it. by Harold+Halloway · · Score: 1

    I once had a girlfriend who was doing her PhD. She habitually 'nudged' her data co-ordinates closer to the line of best fit. Other data she merely fabricated.

    1. Re:Everyone does it. by HornWumpus · · Score: 1

      That was just plane wrong. You have to get your doctorate before you are qualified to doctor data.

      --
      John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
  24. And the rest... by fuzzyf · · Score: 1

    The other 4 of 5 is just biased by wording the question in a carefull way or with an odd scale.

  25. Survey says by NotPeteMcCabe · · Score: 1

    A survey showed that 1 in 5 surveys have fraudulent data. The other 4 surveys had different results.

  26. Half the marketing departments I've seen by xxxJonBoyxxx · · Score: 4, Interesting

    Roughly half the marketing departments at companies I've worked for have used half-baked surveys to gather statistics so the company name and the statistic get repeated in the industry over and over again.

    This often happens like this: "At (industry conference) this year, let's pass out a survey asking whether or not someone has every heard of a coworker getting hacked by (whatever threat our product purports to mitigate). Survey goes out to already half-paranoid people walking by, and the entire marketing and sales department fills one out that says 'yes I have'. A week later a press release goes out that says "(company) surveyed (# of people) IT managers and other attendees at (conference) and found that (high percentage) had direct knowledge of a coworker getting hacked by (threat)." Very often this stuff gets picked up by the press, bloggers and even other competitors, and the essentially made-up stat gets repeated and repeated until some people even think its true.

    Examples:
    - http://www.tripwire.com/compan...
    - http://www.prnewswire.com/news...
    - https://www.voltage.com/breach...

  27. 70% of the time... by Anonymous Coward · · Score: 0

    It works every time.

  28. Just look at all the ones that claim Trump... by Anonymous Coward · · Score: 0

    has more than single digit support. Corporations like Gallop support him and exaggerate his support.

    1. Re: Just look at all the ones that claim Trump... by Anonymous Coward · · Score: 0

      Single-digit support is correct. The corporations that control the news know their ratings will go up if he is elected so they lie.

    2. Re: Just look at all the ones that claim Trump... by Anonymous Coward · · Score: 0

      Hollywood is too since they know people will seek solace in entertainment.

    3. Re: Just look at all the ones that claim Trump... by Anonymous Coward · · Score: 0

      The corporations hate us and want us to suffer so they want him to win. Want him to win.

    4. Re:Just look at all the ones that claim Trump... by Anonymous Coward · · Score: 0

      has more than single digit support. Corporations like Gallop support him and exaggerate his support.

      That doesn't match the election results well. He won 3 out of 4 states (he was one delegate from a tie in Iowa). You need something better to claim the single digit support that "it's common knowledge" when it goes against both surveys and election results.

      My guess is the single digit support is wishful thinking from AC.

    5. Re: Just look at all the ones that claim Trump... by Anonymous Coward · · Score: 0

      Single-digit support is correct.

      This. I don't know anyone that supports him. He has no support, but the media is propping him up.

    6. Re: Just look at all the ones that claim Trump... by Anonymous Coward · · Score: 0

      The media is propping him up, Charlie! I don't know anybody that supports him, Charlie! Look, Charlie! A liopleurodon! A magical liopleurodon!

    7. Re:Just look at all the ones that claim Trump... by Anonymous Coward · · Score: 0

      Trump's single digit support is true, it's the middle one.

  29. I do that by Tablizer · · Score: 1

    When surveys ask for personal details that I don't want to supply, I often put crap data in if they don't offer a way to bypass it, such as a "Do not wish to state" option among the choices. For example, age, occupation, and income level.

    1. Re:I do that by Anonymous Coward · · Score: 0

      The study in the article deals with researchers who fabricate data.

      Your individual response is nothing more than statistical noise. That said, they will have better data if you decline to participate rather than falsify your response.

      Age and income are often useful for correlation. If you are worried about privacy, those two bits of information are fairly innocuous relative to their utility for research.

  30. Survey Says! by tekrat · · Score: 1

    How does this affect the outcome of every episode of FAMILY FEUD??? Maybe the Jones family won after all?

    --
    If telephones are outlawed, then only outlaws will have telephones.
  31. How many... by Anonymous Coward · · Score: 0

    ...were technology surveys by Gartner that were sponsored by Microsoft?

  32. False Data! For shame by Anonymous Coward · · Score: 0

    Now any paper on sociology, psychology, education or law has to be destroyed and the tests/surveys repeated under strict supervision.
    Political surveys, media surveys, advertising surveys ( "three in ten dentists use KY...") must all be suspected of being fraudulent.
    By law, if they are using false information claims for advertising, then they are commiting a crime.
    Media and politics is already distrusted so much that the owners/CEOs hide.....

    Blame it on lawyers and the mentality of "how should this actually look? Not like this! I'll just fix it..."

    Lack of integrity, honesty and common sense are contributing factors.

  33. Who takes a 100 questions survey? by camg188 · · Score: 1

    In a 100-question survey of 100 people, for example, fewer than five people would be expected to have identical answers on 85 of the questions.

    Who in the world takes a 100 question survey? Every survey I've been asked to take has been less than 10 questions.

  34. Re:Would "climate change" data pass this test? by Anonymous Coward · · Score: 0

    There is no data that will ever meet your standards. You'll always find some reason to conclude that the data you're reviewing are incorrect or have been "tampered with" or whatever unless they don't show any warming trend or you can find some some way to illustrate that any warming trend is beyond human control (bonus points if you can show that it's temporary).

  35. "Many Surveys" by Anonymous Coward · · Score: 0

    How many? Something like One on Five?

  36. Maybe it means groupthink... by Anonymous Coward · · Score: 0

    Or that third world totalitarian governments have succeeded it brainwashing their people, to the point that they all answer questions the same?
    Of course, here in America we aren't brainwashed. Instead, they give Kardashians.

  37. Don't know how many fraudsters there are out there by StevenMaurer · · Score: 1

    But I can say that on every porn site that asks, I can truthfully say that I was born on January the 1st, 1927.

  38. Actually... No. Exactly opposite of that. by denzacar · · Score: 1

    Public opinion polls these days are as much PR and marketing as anything else.

    Honestly, Pew makes money doing this stuff; honest player or not, they have a vested interest in keeping up the belief that their stuff is honest, unbiased, and accurate.

    But I'm entirely willing to believe opinion polls are carefully crafted, or sneakily tweaked, to arrive at the conclusions they've been commissioned to a arrive at.

    This is NOT AT ALL about public opinion polls which "people who want to support a specific point" would pay to skew in a certain direction NOR is it about polls that are designed (or "sneakily tweaked") to create certain results.
    It's not even about confirmation bias by the pollsters or researchers leaching into the data.
    This is about finding cases where a pollster would just sit down and fill out a survey after survey by themselves instead of going door to door.
    I.e. Charging for a field survey, forging the data and then pocketing the money.

    Further more, it is not about most kinds of "public" opinion polls.
    The results show that what most people here would consider "public opinion polls" are actually mostly free of such fabrication.
    It's that "other public" whose opinion polls are the issue.
    I.e. The "developing world" public.

    From TFA:

    That made the results all the more worrying: Among 1008 surveys, their test flagged 17% as likely to contain a significant portion of fabricated data.
    For surveys conducted in wealthy westernized nations, that figure drops to 5%, whereas for those done in the developing world it shoots up to 26%.

    Surveying communities in the developing world often requires face-to-face interviews, going house-by-house in dangerous environments.
    So one of the inevitable problems, Robbins says, is "curbstoning" where an interviewer sits on the curb and invents survey responsesâ"often duplicating answersâ"in order to avoid risk or save time.

    --
    Mit der Dummheit kämpfen Götter selbst vergebens
  39. Yes, Pew did it. Then they pulled it offline... by denzacar · · Score: 1

    From TFA:

    During her turn on the stage, Kennedy mounted an attack on the test's methodology. For example, she points out, it does not account for the number of questions on a survey, the number of respondents, nor other factors that can skew the results. She also takes exception to the 85% similarity threshold. "I would choose a different threshold depending on the population and the survey," she says. By putting a number on the extent of data fabrication across all surveys, "they took it too far," Kennedy says. Pew's rebuttal is now online.

    Some at the meeting saw merit in both sides of the fight. Rather than overestimating data fabrication, the method of Kuriakose and Robbins "very likely underestimates the true extent of the problem," says Michael Spagat, an economist at Royal Holloway, University of London, who has investigated high-profile cases of possible data fabrication in war zones. Yet Kennedy's response impressed him, too. "I think the Pew paper is interesting and made some good points," he says. "Specifically, there isn't a hard and fast cutoff beyond which you know there is fabrication." Overall, however, Spagat remains very concerned about data fabrication in surveys. "Robbins and Kuriakose have uncovered a massive problem and the Pew paper doesn't change that."

    Nothing was settled by the end, says the meeting's co-organizer Steven Koczela, president of the MassINC Polling Group in Boston and a previous survey research leader for the U.S. State Department. The case laid out by Kuriakose and Robbins "seems unassailable to me," he says, "but [Pew] are giving it their level best."

    --
    Mit der Dummheit kämpfen Götter selbst vergebens
  40. I have personally falisfied survey data by LennyDotCom · · Score: 1

    I used to own a polling company with my wife Tracy Costin The people that sponser the survey's are always asking for more then is realisticly possible they will be making changes to survey questions right up to the last minute then they want a certain number of responsise you can only cal a certain number of people in a certain geographic area once use use up all your numbers and you can't get anyone else to answer questions what do you do? plus they are always in a hurry you have to complete the survey in a very short time. you can only call between certain hours. it's really a high pressure biz you get a lot of serveys that are incomplete it's really frustrating a little cut and pate of partial srveys can go a long way you don't have to make up data just good old cut and paste

    --
    http://Lenny.com
    1. Re:I have personally falisfied survey data by LennyDotCom · · Score: 1
      --
      http://Lenny.com
    2. Re:I have personally falisfied survey data by careysub · · Score: 1

      Based on the questions I see on surveys, and comparing to the quality of writing displayed here, I totally believe that you used to own a polling company.

      --
      Starships were meant to fly, Hands up and touch the sky - Nicky Minaj
  41. Our survey said... by Anonymous Coward · · Score: 0

    An online survey found that 92847639 people per day didn't read long numbers

  42. Well that explains... by otaku244 · · Score: 1

    ...why only 4 out of 5 dentist approve of Crest White Strips

    --
    Mod me down, I shall become more off-topic than you could possibly imagine.