Slashdot Mirror


How 136 People Became 7 Million Illegal File-Sharers

Barence writes "The British government's official figures on the level of illegal file sharing in the UK come from questionable research commissioned by the music industry. The Radio 4 show named More or Less examined the government's claim that 7m people in Britain are engaged in illegal file sharing. The 7m figure actually came from a report written about music industry losses for Forrester subsidiary Jupiter Research. The report was privately commissioned by none other than the UK's music trade body, the BPI. The 7m figure had been rounded up from an actual figure of 6.7m, gleaned from a 2008 survey of 1,176 net-connected households, 11.6% of which admitted to having used file-sharing software — in other words, only 136 people. That 11.6% was adjusted upwards to 16.3% 'to reflect the assumption that fewer people admit to file sharing than actually do it.' The 6.7m figure was then calculated based on an estimated number of internet users that disagreed with the government's own estimate. The wholly unsubstantiated 7m figure was then released as an official statistic."

34 of 313 comments (clear)

  1. Re:If it's bogus, it's probably too low. by SoupGuru · · Score: 2, Insightful

    That's what I was thinking. The summary makes it seem that estimating the number that high is outrageous. I certainly wouldn't wager any money that it's significantly higher than actual piracy.

    --
    What doesn't kill you only delays the inevitable
  2. Re:Story meaning? by Lemmy+Caution · · Score: 5, Insightful

    Because statistics are hard and outrage is easy.

  3. What's the confidence interval? by Anonymous Coward · · Score: 4, Insightful

    Whenever you estimate a statistic like that, you should also indicate the level of uncertainty surrounding the estimate. Why are they not reporting the upper and lower bounds of the confidence interval surrounding that estimate?

    1. Re:What's the confidence interval? by caffeinemessiah · · Score: 3, Insightful

      Whenever you estimate a statistic like that, you should also indicate the level of uncertainty surrounding the estimate. Why are they not reporting the upper and lower bounds of the confidence interval surrounding that estimate?

      Perhaps because it's hard to come up with confidence intervals when you admit to fudging your own data by bumping the estimate up by almost five percentage points.

      --
      An old-timer with old-timey ideas.
  4. Meaningless admission by Trevin · · Score: 5, Insightful

    Using file-sharing software does not equate to sharing files illegally. I admit to using BitTorrent to download Fedora ISO's, and there's nothing illegal about that.

    1. Re:Meaningless admission by Anonymous Coward · · Score: 1, Insightful

      They'd probably backtrack to only sharing music online... but some big names at the moment (Nine Inch Nails, Radiohead) have legally released their own music as free torrents.

      The music industry is STILL stuck in the "downloading"="file sharing"="always illegal" mindset and they show no sign of shifting from that idiotic viewpoint.

    2. Re:Meaningless admission by 4D6963 · · Score: 2, Insightful

      Yeah, sure, because I'm sure there's SOOOO many people who use BitTorrent only to download free linux ISOs and never ever download movies, series, porn, games, books, music.

      Translation : the number of people who only do that are insignificant. It would be dishonest or delusional to disagree.

      --
      You just got troll'd!
  5. This calls for a quote by SilverHatHacker · · Score: 2, Insightful

    "If they facts don't fit the theory, change the facts."
    ~Albert Einstein

    --
    Funny may not give karma, but +5 Informative never made anyone snort coffee out their nose.
  6. file sharing software=pirate??? by Anonymous Coward · · Score: 3, Insightful

    using file sharing software does not mean you pirate software or media.....

  7. Re:Story meaning? by caffeinemessiah · · Score: 5, Insightful
    Argh, where to begin?

    The summary tries to paint this study bad because it "downsides" the amount of filesharers

    I presume by "downsides" you mean "reduces"? Well the summary says "That 11.6% was adjusted upwards to 16.3% 'to reflect the assumption that fewer people admit to file sharing than actually do it.'" So they actually UPPED the number of filesharers. This is objection #1 to "good research":
    1. You do a survey to objectively measure the support of your hypothesis
    2. The survey of a tiny sample indicates that filesharers are a pretty low percentage
    3. You "adjust" this number -- otherwise known as "fudging the data" -- to better reflect your own hypothesis.

    The same tactics in any scientific endeavor would get your papers retracted, your funding canceled, some sort of disciplinary action initiated, etc.

    The second objection, and this applies to other studies too that try to make grand claims from small samples, is that it's A SMALL SAMPLE. For your survey to be representative, your sample has to be representative. It's also difficult to choose people independently at random, and without that assumption, all your basic statistics fall apart. Perhaps they went through a list of BT subscribers and pulled names at random -- but what if downloaders are overrepresented amongst BT subscribers? What if they only polled home internet users, but then used the "total number of internet users" -- which includes corporate subscribers -- to come up with their 11mil number? There are other possible, non-numerical issues too. What if the respondents confused downloading from bittorent with downloading from iTunes?

    If you want many other examples of "bad science", read Ben Goldacre's blog

    --
    An old-timer with old-timey ideas.
  8. Re:the story title is kind of lame by Abreu · · Score: 4, Insightful

    Is it ok to change "11.6%" to "16.3%" based on a "hunch"?

    I'm not a statistician, this is an honest question

    --
    No sig for the moment.
  9. Re:the story title is kind of lame by Trepidity · · Score: 3, Insightful

    If there was some previous result that only 2/3 of filesharers admit it when asked, then an upwards revision by 1/3 in an estimate would be defensible. A "hunch" is not quite as good evidence. of course.

    I was objecting mainly to the "how 136 people became 7 million" title, which to my ears reads mainly as a criticism of the sample size. But whatever the problems with this estimate, the sample size wasn't really among them.

  10. Re:the story title is kind of lame by caffeinemessiah · · Score: 5, Insightful

    Is it ok to change "11.6%" to "16.3%" based on a "hunch"? I'm not a statistician, this is an honest question

    IAAS, and the answer is no. That goes for the GP as well -- no one is contesting estimation theory, just that the fundamental assumptions are so grossly unmet in this "study" as to render it meaningless. And as someone else already commented, it's dangerous here because it's going to dictate public policy.

    If you're going to "adjust" your objective findings, based on some bizarre assumption that a certain percentage of people will lie about file sharing, then why do a survey at all if not to create mathematical/sciency-sounding smoke and mirrors?

    --
    An old-timer with old-timey ideas.
  11. Re:Story meaning? by Anonymous Coward · · Score: 5, Insightful

    1. the same size is small.. probably too small to make the claims they did.

    First statistics lesson I ever had, first thing the professor did was make an estimate based on 10 people about the whole population. He was correct, by the way. He went on to rant that anything that uses large amounts of people (by which he meant more than at most a few dozen) was not proper statistics. If you simply count everybody, it should be called "counting", you see, not statistics.

    2. they altered the numbers on an estimate of how many people fileshare on the assumption that the number was under-reported.

    And since they are right that the number turned out to be bigger in other studies, slightly. It seems a reasonable adaptation. It's easy to say it's unreasonable, of course. But they are absolutely correct that the number is most likely smaller. So how much should they adjust it ? Like I said, it seems a reasonable adjustment. Not absurdly high, not absurdly low.

    3. conflict of interest... it's like the tobacco industry sponsoring studies claiming that smoking doesn't have anything to do with lung cancer... there is significant reason to believe that the study carries significant bias in favor of their conclusion and must at the least be repeated by other sources.

    There don't exist studies that have no bias. Either research is funded by companies, or it's funded by government. Both have serious axes to grind, mostly pertaining to political ideology. If business intrest groups would not fund research we'd never have even the semblance of unbiased research that we have.

    By the way, who should pay for studies ? Obviously the government has a vested interest in more legislation. The ifpi (us dept) has a vested interest in creating legal instruments to counteract filesharing. And the filesharers have a vested intrest in more "privacy", and legal instruments against ISPs (for the same reason a thief wants privacy, obviously, let's please not start the "what about those who only share openbsd", we all know that's not the filesharers being talked about).

    How about we do the sane thing, and let all of them fund studies. Then read them all, and see what we believe to be true.

    Just because people are biased, by the way, does not mean the truth can be biased. We are simply limited to imperfect instruments for reading the truth. Truth is absolute, and the number of filesharers is just a single number, not 2, not 5. And yes, we'll probably need a better definition and classification than "filesharers". The effects of filesharing are negative for artists (certainly for pop artists), and especially for the "music industry". There can be little doubt about that. How much damage is done, is anyone's guess. But by criticizing their observations, AND listening to them criticize our observations, we can hope to get closer to the real truth.

  12. Why the BBC rocks by MosesJones · · Score: 5, Insightful

    This is yet another example as to why the BBC is the finest broadcasting and journalistic organisation on the planet (I've never worked for them, sold to them or have any other financial connection other than the license fee).

    They actually investigated something created by an industry group and found it to be bollocks and then reported it. The BBC are arguably the most "socialist" organisation in the democratic world (funded by a tax on everyone for the benefit of everyone) and yet they still question and challenge everything.

    The US seriously needs something that questions vested interests and rubbish statistics as much as the BBC. Jon Stewart and Bill Maher are just comedians and FoxNews is just comedy.

    Given a choice between the first amendment and the BBC, I'll take the BBC; its demonstrated more freedom of speech in a week than the US media has in a decade.

    --
    An Eye for an Eye will make the whole world blind - Gandhi
    1. Re:Why the BBC rocks by vivaelamor · · Score: 3, Insightful

      Oh come on, the BBC have reported this number many times since it was first used and you sing their praises because Radio 4 happens to do a show devoted to statistics? I wonder just how much time they will devote to debunking this statistic considering how many times they have quoted it.

      Just because the BBC is better than the US networks doesn't mean we should be proud, personally I'm appalled at how low the bar is set.

    2. Re:Why the BBC rocks by houghi · · Score: 2, Insightful

      The BBC are arguably the most "socialist" organisation in the democratic world [...] and yet they still question and challenge everything.

      What does their political thinking have to do with whether they challenge anything or not? I would call many unions more socialist then the BBC. And they do challenge everything all the time as well (Rightfully or not is another discussion).

      I think you confused "socialist" with "socially engaged" which is not the same thing.

      --
      Don't fight for your country, if your country does not fight for you.
  13. Re:Story meaning? by Volante3192 · · Score: 5, Insightful

    And since they are right that the number turned out to be bigger in other studies, slightly. It seems a reasonable adaptation. It's easy to say it's unreasonable, of course. But they are absolutely correct that the number is most likely smaller. So how much should they adjust it ? Like I said, it seems a reasonable adjustment. Not absurdly high, not absurdly low.

    Here's where I find a major problem. You do not fudge your data. Period. These other studies may show higher numbers, but do we have proof they weren't fudged as well?

    There's too many stories about companies performing pharmecutical trials and then throwing the data away because it didn't present a positive light.

    If you're going to adjust numbers, you better have a damn sound reasoning for it rather than "we have a hunch people lied, so..."

  14. Re:Wait, you believed them? by mdwh2 · · Score: 5, Insightful

    Indeed, let's look at the maths - supposing each person only shares 24 mp3s. By US standards at least, that's a cost of $1.92 million. So with 7 million file sharers, that's $13.44 trillion.

    Now let's check out http://en.wikipedia.org/wiki/List_of_countries_by_GDP_(nominal) - wow, these 7 million people are causing damage to the UK economy equal to almost 5 times the entire GDP of the UK...

  15. Re:Story meaning? by vikstar · · Score: 2, Insightful

    Every politician should undergo a statistics examination as a prerequisite.

    --
    The question of whether a computer can think is no more interesting than the question of whether a submarine can swim.
  16. Re:Story meaning? by Anonymous Coward · · Score: 4, Insightful

    The study is effectively making the assumption that filesharing = copyright infringement.

    I have a very hard time believing that the vast majority of people that use any filesharing application do so exclusivley for legit and non-copyright infringing purposes.
    Given the vast quantity of content, I seriously doubt that very many people go through any sort of hassle to determine what is legit and what is not, which results in virtually everyone obtaining material that is copyrighted, regardless whether they know (or care). Given that, I think its a fair guess on their part that yes, most people that claim they are using file-sharing software do so to obtain material illegally.

    I just don't understand the stance that most people on this board seem to take regarding this issue. How can everyone be so supportive of what very obviously amounts to theft? It appears to me that somehow people think it is their "right" to obtain copyrighted material for free. I just don't buy for a second that people who claim to only use file-sharing apps for legitimate purposes only actually do so.

    If you do indeed use all file-sharing applications for 100% legit purposes, please educate me what you use these services for that makes them so very essential to cause these very emotional posts here.

  17. Re:Story meaning? by Runaway1956 · · Score: 2, Insightful

    Yes, it makes a difference. When the lobbyists stand in front of lawmakers, those lawmakers want to know the real size of the problem. If the industry's lobbyists have to say, "We think we are losing almost a million pounds each and every year to piracy", lawmakers are going to be mildly concerned. However, if they lie, and claim that they are losing BILLIONS of pounds, those lawmakers realize that the tax collectors are losing a huge sum of money.

    When you want action, you always exaggerate your losses and/or the governments benefit.

    I think that claims in the us are 42 billion dollars lost annually. I followed THOSE studies back once, to find where the figures came from. That number is totally unsubstantiated as well, almost entirely based on guesses, estimates, and even false assumptions. One study after another cites the previous study, and almost no one knows where that 42 billion dollar figure came from, but it's impressive, so everyone continues to quote it.

    --
    "Windows is like the faint smell of piss in a subway: it's there, and there's nothing you can do about it." - Charlie Br
  18. Re:Story meaning? by vivaelamor · · Score: 2, Insightful

    Nice calculator, I think the GP's main point though was that there is no evidence of a properly selected sample. You would be right in saying that the sample size has very little to do with anything compared to whether the sample is biased or not.

  19. Re:Story meaning? by vivaelamor · · Score: 3, Insightful

    To me, the number is meaningless in itself. The fact that government agencies have been using the number is the issue. Either they knew that the number was wrong or they didn't bother checking it. Both possibilities can point to incompetence or malice and reflect very badly on the people responsible.

    You might be happy with government by making shit up and gut feelings but for the rest of us this is a good example of why government has no respect.

  20. Re:Story meaning? by hot+soldering+iron · · Score: 4, Insightful

    If they're going to "adjust" the numbers, why did they even bother doing the research at all? Why not just come out and say,"We didn't like what the numbers said, so we threw them away and we're making a WAG with some bullshit we're pulling out of our ass." I understand that they're a research (read "marketing") company, and so are constitutionally incapable of telling the plain truth because they could burst into flames, but it would be a new experience. And fun to watch!

    --
    When you want something built, come see me. If you want correct grammar and spelling, get a F*ing liberal arts student.
  21. Re:Story meaning? by Anonymous Coward · · Score: 0, Insightful

    most reasonable slashdotters advocate for reformed copyright pertly because of the unenforceable nature of longer copyright terms. many such as myself support the concept of a shorter more reasonable copyright term

    Why would the solution to something that is not easily enforceable be to make it legal? At the end of the day, the people that produced the content should be entitled to reap any benefits from it. If the market decides that it does not like the content, it will not pay for it. With this particular issue, it simply became trivial for virtually anyone to obtain copyrighted material illegally, to the point, it seems that the solution advocated is to simply not make it illegal to freely distribute someone else's work.

    encourage the advancement of the arts.

    I don't see how this relates. It seems to me that these are two different issues, one economical and one political. Nobody is going to stop the advancement of the arts if it is made more difficult to share copyrighted content. If as an artist, you really do give your content away for free, you always will be able to do so. Its not like artists are actively prevented from creating content.

    most of the anger is directed toward the music/movie industry's response to piracy- weaken/destroy fair use, demonize all p2p [possibly restricting its use in the future out of fear] suing people as a scare tactic, excessive/un-constitutional fines, DRMed media etc...

    As someone who makes a living creating copyrighted content, I don't see why these tactics are unreasonable. I don't see how anyone can even attempt to argue away the fact that vast amounts of content is distributed illegally. I would argue that the benefit gained by legit users of file sharing networks is far smaller than the damages incurred on the holders of copyrighted material. This is obviously just my own opinion, as the whole point of this thread is unreliability of statistics surrounding use or legitimate use of file-sharing networks, as it is very difficult to accurately estimate actual damages. However, I think we can all agree that sales lost due to file-sharing networks is greater than 0, which means it impacts someones attempt to make a living. It is even more difficult to attach a value to the legitimate uses of file-sharing networks, but if you can point me at examples of how file-sharing systems have a positive economic impact on anyone, please let me know. This opinion is simply based by observing everyone that I know personally. The argument that "No sales are lost because I wouldn't have purchased it anyway" is BS. At least in my town, ALL CD stores but one have been driven out of business, and virtually everyone I know has stopped purchasing CD's (its much easier to download what you want). I know ONE person that has significant amount of digital music and has purchased it all on Itunes, everyone else, not so much. If you start talking about software, it gets even worse. (It is profoundly confusing to me how people that work as software engineers have no issue pirating other peoples software)

  22. Re:Story meaning? by dlthomas · · Score: 2, Insightful

    It does not "obviously" amount to theft. It *is* illicit, and it may be immoral (see Free Rider Problem), but it is not theft. If I steal 10 M&Ms from you, you have 10 fewer M&Ms - not the case if I download your song, in which case you have less than you otherwise would have *if and only if* I would otherwise have paid for it. This clearly is not the case for, say, college students with tens of thousands of dollars "worth" of media on their hard drive.

    As for legal uses of "file-sharing" technologies, well - how about the entire world-wide web? We're sharing files...

    Specifically P2P file-sharing technologies? Linux ISOs and WoW updates, to name two common legal uses.

    Finally, I for one have an emotional reaction to assertions that technology should be restricted unless I can make you understand what it is for - and I don't even personally use any P2P software at the moment.

  23. Re:Story meaning? by Raistlin77 · · Score: 4, Insightful

    most of the anger is directed toward the music/movie industry's response to piracy- weaken/destroy fair use, demonize all p2p [possibly restricting its use in the future out of fear] suing people as a scare tactic, excessive/un-constitutional fines, DRMed media etc...

    ...I don't see why these tactics are unreasonable...

    So, just so that you can protect your "copyrighted content" from being stolen by someone other than me, you believe that it is "reasonable" to use bogus or flawed "research" to fool the government into a) taking away my legal rights (fair use); b) criminalizing software that can be and is used for legal purposes (P2P); c) abuse our legal system (suing people as scare tactic/impose excessive/unconstitutional fines); and d) crippling your "copyrighted content" so that I cannot exercise my right of fair use after I have purchased your "copyrighted content" (DRM/refer back to a) )?

    It is even more difficult to attach a value to the legitimate uses of file-sharing networks, but if you can point me at examples of how file-sharing systems have a positive economic impact on anyone, please let me know.

    Really? So you don't see value in a content provider being able to reduce operating expenses by distributing their content via P2P? Just because you are too lazy to do a simple search using any common search engine doesn't mean such examples don't exist. And why exactly does it have to have a positive economic impact on anyone - why does it have to have any economic impact at all? There are many things that have neither a positive economic impact nor any economic impact whatsoever, should those be illegal too?

  24. Re:Story meaning? by MeNeXT · · Score: 2, Insightful

    He was correct, by the way. He went on to rant that anything that uses large amounts of people (by which he meant more than at most a few dozen) was not proper statistics.

    10 people could in no way make any reasonable assumption of any population. He was wrong. First and foremost 10 randomly selected people of any population would not represent any complete demographics of any nation. Rich, poor, middle class, professional, artist, unemployed, computer literate, music fan, and student does not come close to representing the whole segment of any nation. I am positive that you could fill in hundreds more. Some are more prone to "Download than others". The combinations and permutations may affect the results. The sample size needs to be large enough and random to arrive at any reasonable accuracy.

    --
    DRM? No thanks, I'll just get it somewhere else...
  25. Re:Story meaning? by blueg3 · · Score: 4, Insightful

    Just as easily as a random sample can accurately reflect a population as a whole, it can equally be skewed to be a completely inaccurate representation of the real world.

    If by "just as easily" you mean "with an enormously lower probability", then yes. But then, that's what a statement of margin of error says.

    Statistics isn't all that complicated, and what a statistical measure means can be both demonstrated and proven. You don't need to get all faux existential about how "it's all just a bunch of crap, man". You don't know what you're talking about.

    Also, entropy? No such thing as random? Really? Don't inject physical phenomena you clearly don't understand in a discussion about pure mathematics.

  26. Re:Story meaning? by koxkoxkox · · Score: 2, Insightful

    Just as easily as a random sample can accurately reflect a population as a whole, it can equally be skewed to be a completely inaccurate representation of the real world.

    Of course, but not more than 5% of the time. Please read a bit more about statistics and maybe listen to real statisticians instead of journalists before spewing so much hate. Statistics don't try to prove anything, they are just tools we can use to make decisions.

  27. Re:Story meaning? by Runaway1956 · · Score: 2, Insightful

    You seem to be working from an assumption of some sort. How about we consider the taxman coming to your place of business to conduct an audit. He finds that you owe 7 million dollars in taxes, instead of the 4 million dollars that you claimed. Can you argue that the numbers aren't off by an order of magnitude? "But, sir, my numbers are only wrong by about 75%, this isn't fraud!" Or, we can work those same numbers backward - "But, sir, my numbers are only off by about 50% - of course it's not fraud!" Good luck with that, huh?

    The numbers are fraudulent, plain and simple. As others have pointed out, anyone in the scientific field(s) would be laughed out of academia for submitting such flawed numbers and such flawed reasoning.

    "In fact, unless they only surveyed people WITH internet access,"
    BTW - TFA specifically says that everyone in the survey had internet access.

    There is no line of work on planet earth where people are permitted to do such obviously fraudulent math. If similarly flawed mathematics were applied to a construction job by a bunch of backwoods hill billies, they would soon be out of business.

    You simply cannot justify the numbers with any sort of logic. Attempting to do so is an exercise in fraud.

    --
    "Windows is like the faint smell of piss in a subway: it's there, and there's nothing you can do about it." - Charlie Br
  28. Re:Story meaning? by JAlexoi · · Score: 2, Insightful

    The problem is that the government is taking numbers from a statistical research paper, that was commissioned by a biased entity.
    We all know, that statistics tend to sway towards the point of view of the paying party.
    And then we get a problem with Wikipedia Source paradox. The state takes numbers from a biased corporate study and present's them as fact, then that entity will say that "According to the government..." and present that study as official government sponsored study.

  29. Re:Story meaning? by FrankieBaby1986 · · Score: 2, Insightful

    Because 1,176 people is a miniscule (0.0168 %) amount compared to 7 million, so there is room for a LARGE margin of error, in either direction. The sample size is too small for the number of people they are trying to represent.

    --
    ERROR: SIG NOT FOUND (A)bort, (R)etry, (F)ail?: