Slashdot Mirror


Google Says Almost Every Recent 'Trusted' DMCA Notices Were Bogus (torrentfreak.com)

Reader AmiMoJo writes: In comments submitted to a U.S. Copyright Office consultation, Google has given the DMCA a vote of support, despite widespread abuse. Noting that the law allows for innovation and agreements with content creators, Google says that 99.95% of URLs it was asked to take down last month didn't even exist in its search indexes. "For example, in January 2017, the most prolific submitter submitted notices that Google honored for 16,457,433 URLs. But on further inspection, 16,450,129 (99.97%) of those URLs were not in our search index in the first place."

46 of 83 comments (clear)

  1. That's what you get for wording the DMCA that way by Opportunist · · Score: 5, Insightful

    Why should I check my demands, just carpet bomb them with everything and whatever hits is fine by me.

    How about this: Whatever the punishment for not following through with a rightful demand should be meted out for every bogus one made in bad faith.

    --
    We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  2. Stop accepting takedown notices from BSers by presidenteloco · · Score: 5, Insightful

    Google should tweak their algorithm to block takedown requesters who are spamming google with generic requests.

    --

    Where are we going and why are we in a handbasket?
    1. Re:Stop accepting takedown notices from BSers by Baron_Yam · · Score: 5, Insightful

      IF Bogus_Takedown > 0
          SET Require_Review = True
          SET Charge_Fee = True

      When their DMCA takedown request processing rate drops from millions per month to a few per day and comes with a bill for the reviewer's time, maybe they'll smarten up.

    2. Re:Stop accepting takedown notices from BSers by Baron_Yam · · Score: 1

      That's the beauty of cutting them off from the automated submission system after a very low threshold of bogus submissions (by percentage, quantity, or a combination).

      It doesn't matter if they have an ulterior motive, they're shut down and have to pay a premium going forward... which means if they want to keep it up they'll be paying Google to employ extra verifiers and nobody else is affected.

    3. Re:Stop accepting takedown notices from BSers by budgenator · · Score: 1

      Sounds more like an example of http://example.com?query=chess... not returning a 404 error, and some clown firing off a takedown notice and collecting a bounty from the music publisher.

      --
      Apocalypse Cancelled, Sorry, No Ticket Refunds
    4. Re:Stop accepting takedown notices from BSers by nobuddy · · Score: 1

      Even easier.
      Require studios to register for automated submission.
      Studio abuses system- kick them out and require manual submission via snail mail.

      Studio will either stop abusing, or be stoppped from abusing. Win-win

  3. It seems obvious that... by Baron_Yam · · Score: 5, Insightful

    DMCA takedown requests for non-existent URLs, especially at a 99.97% invalid rate, should be evidence that the requestor is not properly verifying their DMCA claims and should:

    A) Lose their right to continue to submit 'trusted' DMCA takedown requests

    B) Be charged under the DMCA for filing false claims.

    But we know that will never happen.

    1. Re:It seems obvious that... by Mashiki · · Score: 4, Interesting

      Well if companies decided to start charging them because of the number of false claims, this problem would likely fix itself in the span of a couple of weeks. You can bet some company would stomp their feet and take it to court, and the court would likely agree that with the high percentage of false claims that the company has a reasonable expectation to recoup losses from false claims.

      --
      Om, nomnomnom...
    2. Re:It seems obvious that... by squiggleslash · · Score: 4, Interesting

      Nobody is making the claim that the URLs are non-existent, simply that Google has not indexed them. I'd imagine a lot of copyright infringing file sharing sites have a robots.txt that blocks Google et al to keep them under the radar.

      --
      You are not alone. This is not normal. None of this is normal.
    3. Re:It seems obvious that... by Baron_Yam · · Score: 5, Insightful

      OK.... So how is it Google's responsibility to remove something from their index that isn't in their index?

    4. Re:It seems obvious that... by Anonymous Coward · · Score: 1

      Asking them to not index it?

    5. Re:It seems obvious that... by AuMatar · · Score: 2

      1)Is it actually possible to know if a given URL has been indexed? Searching for it isn't sufficient, as it may just not be in your results.

      2)It may well be their repsonsibility to blacklist it from future indexing.

      --
      I still have more fans than freaks. WTF is wrong with you people?
    6. Re:It seems obvious that... by laughingskeptic · · Score: 2

      The search "site:nefarious.fileshare.biz/a/b/c filetype:ext filename" will not give hits if the file in question is not indexed.

    7. Re:It seems obvious that... by AuMatar · · Score: 1

      But it may be indexed and just not yet appearing. It takes time for data to propagate through their system. You can have two people search for the same query at the same time and get different results.

      --
      I still have more fans than freaks. WTF is wrong with you people?
  4. Weak/nonexistent punishments for faulty notices by barc0001 · · Score: 5, Insightful

    If notices had to be signed as accurate by an officer of the company that sent them under threat of perjury, and if it was possible for someone who was affected by a bogus notice to then start the wheels on a process that would see said officer of the company end up doing a few days in jail for that perjury I would think we'd see bogus notices drop to near zero.

    But of course that would only happen in a world where the lawmakers actually cared about doing what was right, not what their donors wanted.

    1. Re:Weak/nonexistent punishments for faulty notices by Bruce+Perens · · Score: 1

      All patent applications are signed under penalty of perjury. However, the US Patent and Trademark office disbanded its enforcement department in 1974. So, you can perjure yourself on a patent application with impunity.

      Unless it's testimony in a criminal case, or the perjury trap in front of a grand jury, or something they want to prosecute like lying on your tax form, the Federal government is in general lassiez faire about perjury, or even encouraging of it with their reluctance to prosecute, especially perjury committed by a so-called intellectual property holder.

    2. Re:Weak/nonexistent punishments for faulty notices by Anonymous Coward · · Score: 1

      Oh forgot to add:

      105th United States Congress: House 55R/45D, Senate 52R/48D. So R was in control of both the House and Senate.

    3. Re:Weak/nonexistent punishments for faulty notices by barc0001 · · Score: 1

      Exactly my point. I knew they were technically already perjury but it doesn't mean anything if it is never enforced.

    4. Re:Weak/nonexistent punishments for faulty notices by GrumpySteen · · Score: 1

      Won't someone think of the poor strawmen who are being hurt by these malicious facts?

  5. Re:Grammar cop says "Every" is singular by damn_registrars · · Score: 1

    That's because Google Knows That Slashdot Editors Don't Know What Their Job Is About

    --
    Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
  6. Solution: suspend "trusted" status by davidwr · · Score: 1

    When you (Google) detect that a medium- or high-volume submitter has "obvious flaws" in more than a very small percentage of its submissions, suspend their "trusted sender" status until the flaws are fixed.

    Ditto if more than a small percentage of a medium- or high-volume submitter's requests get overturned or "overturned by default" by an un-challenged counter-notice.

    For low-volume submitters, the "kick out" threshold would need to be much higher, something like "5 bad submissions out of the last 10 or 10 bad submissions out of the last 100," along with a "you can't become 'trusted' until we see at least 10 consecutive submissions that aren't challenged and at least 90% of the ones submitted in the last month are okay."

    If the legal requirements of the DMCA prevent this, then take "bad submitters" to court and get a court order declaring that you (Google) can ignore requests from that submitter that haven't been approved by the court.

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  7. Re:That's what you get for wording the DMCA that w by Anonymous Coward · · Score: 2, Insightful

    I don't have to imagine. I just have to wait for it to get Trump's signature instead.

  8. Re:Grammar cop says "Every" is singular by mykepredko · · Score: 2

    When were the standards set higher?

  9. Re:What is the benefit? by squiggleslash · · Score: 3, Interesting

    Nothing. But the URLs are not non-existent, just not indexed by Google, probably due to a robots.txt blocking access to the sites concerned.

    --
    You are not alone. This is not normal. None of this is normal.
  10. Costing to the RIAA vrs Ignoring? by mykepredko · · Score: 3, Insightful

    When I RTFA, it seems like there are 100s of millions of DCMA requests going out each year - 35 million in the "latter half of September 2016"?

    Even if "detection" (I don't know how good their detection process is if 99%+ of the claims are not found by Google) filing costs came out to a dollar per then I would think the cost to the industry would be $250+ million.

    Looking at Warner Music revenues (the first company I could think of: https://www.statista.com/stati...), I would think that the cost of filing all these claims amounts to around 5% or so of total revenue.

    Two things came to mind thinking about this:
    1. The music company accountants should be going apeshit over the cost of this program and the damage to the company's bottom line.
    2. There's a whole bunch of money to be made creating an application for generating DCMA demands that reduces the number of DCMA requests (and the total cost of the requests) by 10x or more.

    1. Re:Costing to the RIAA vrs Ignoring? by green1 · · Score: 3, Insightful

      You grossly overestimate the cost per notice. To the rights holder the cost is basically zero. the automated system that google set up gets automated complaints generated by an automated system at the company submitting the complaints. There was a one time setup fee for developing the system, and it can run indefinitely in a closet somewhere unmonitored for decades.

      Do you really think there'd be this many takedown requests if if cost actual money to process them?

    2. Re:Costing to the RIAA vrs Ignoring? by mrchaotica · · Score: 1

      You grossly overestimate the cost per notice. To the rights holder the cost is basically zero.

      Well, the copyright holder first has to determine whether the content is actually infr-- (snicker, choke, guffaw)

      Sorry, I just couldn't get that whole sentence out while keeping a straight face.

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    3. Re:Costing to the RIAA vrs Ignoring? by gnasher719 · · Score: 1

      Well, the copyright holder first has to determine whether the content is actually infr-- (snicker, choke, guffaw)

      No, the copyright owner has to check that the content would be infringing if it was the content that they think it is. For example if there are two songs with the same title, and you own the rights to one of them, it is not illegal to send a notice for a song with that title - as long as you own the one that you think it is, even if you're wrong.

  11. Re:Grammar cop says "Every" is singular by davidwr · · Score: 1

    by mykepredko ( 40154 ) on Thursday February 23, 2017 @02:17PM (#53919283) Homepage

    When were the standards set higher?

    Before 2002-11-14?

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  12. A story as old as the DMCA. by Anonymous Coward · · Score: 1

    Got hit with a bogus violation years ago.

    A law firm sent out official looking paperwork en masse, to podunk regional ISPs claiming zomg copyright infringement from various IPs.

    Turns out, the law firm was unaffiliated with the rights holder of the songs in question (from an artist I've literally never heard); indeed, the law firm was not even an actual law firm. They did have a nice seedy looking form where you could pay $100 to make your violation just magically go away, however.

    My ISP wasn't interested in that, they were merely covering their own ass, but they were happy with my excuse of, "I DUNNO I PLUGGED IN A WIRELESS THINGER MAYBE I WAS HACKED?"

  13. Ban that idiot by AndyKron · · Score: 1

    The most prolific submitter needs to be banned from the Internet forever.

  14. Re:What is the benefit? by green1 · · Score: 2

    When submitting that many million requests, the odds that that high a percentage are legitimate but not in the index is pretty low.
    My bet is that the submitter just used an algorithm to generate possible names just in case, rather than actually find real infringers. For example, you know that www.questionablesite.com tends to host some content you don't like, so you do this:
    - www.knownquestionablesite.com/bignamemovie.html
    - www.knownquestionablesite.com/b1gnam3m0v13.html
    - www.knownquestionablesite.com/big_name_movie.html
    - www.knownquestionablesite.com/big-name-movie.html
    - www.knownquestionablesite.com/big.name.movie.html
    - etc

    Now you don't know if they were even going to host your movie, but instead of checking, you just spam DMCA takedown requests because they're free and there's no downside if the site doesn't exist.

  15. Re:Grammar cop says "Every" is singular by mykepredko · · Score: 1

    ;^)

    Nice to know I've had an impact.

  16. Those who can't get over it have to wait line by raymorris · · Score: 1

    Those who are having trouble getting over it are getting in a line, a British line to be exact. They are being queued. :)

    1. Re:Those who can't get over it have to wait line by jabuzz · · Score: 1

      You assume that stuff outside England is "English".

    2. Re:Those who can't get over it have to wait line by Coren22 · · Score: 1

      http://www.pbs.org/pov/bibliob...

      I don't know, 99% isn't too bad. It looks like it is about equal with most of Europe, though some countries are better.

      --
      APK likes to ask for responses to the same things over and over. Maybe he just likes the responses?
  17. Bogus URLs by DrYak · · Score: 1

    Yup it's basically that.

    With the additional peculiarity that here, "www.knownquestionablesite.com" will spit a valid page with suggestions, no matter what you throw as a name afterward (even if "big.name.move.html" isn't in their database, it would still give a list of not necessarily related download links).

    So they are not exactly issuing DMCA about links that don't even exist (these URLs do not return 404s), only DMCA about links that are not in google database (random links that elict a random answer from the website).

    The claim is borderline bogus because, as mentioned, the website return random non related download suggestions. So the website is not necessarily infringing on the DMCA submitter's IP. On the other hand, as the result page is random, Google can't prove that the submitter didn't get their IP showing by random chance on the result page on the precise occurence when they tested the URLs about which they decided to file the complain.

    So currently Google is deciding to accept the submissions. But that could easily get changed in the future.

    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
  18. Re:That's what you get for wording the DMCA that w by sjames · · Score: 2

    Legally defined bad faith is hard to prove. We would need some metric, like number or proportion of bogus requests where bad faith is legally presumed. Of course, if a reasonable person hearing/viewing the target of the takedown knows it isn't the complainant's property, that too should be presumed to be bad faith..

  19. Numbers = $ by Princeofcups · · Score: 1

    Someone charges by the numbers for those take down notices. Someone at the media conglomerates got fleeced.

    --
    The only thing worse than a Democrat is a Republican.
  20. What is bogus? by AmericaRunsOnDunkin · · Score: 1

    The question I have is: what makes a notice "bogus"? It could mean at least any of the following: the URL never returned any useful content (returns a 404 or similar); the URL used to return content, but does not currently; the domain is not responding; the URL is not indexed by Google; the URL returns content, but none that contains a whiff of the copyrighted material; or other possibilities.

    If the only thing "bogus" is that the URL is not indexed by Google (but does impermissibly contain copyrighted material), then this shouldn't be a problem. Google quickly scans the URL, sees that it's not indexed, and tosses the request. There should be no need for human intervention. If they wanted to, Google could even add the URL to a list of DMCAed URLs to prevent possible indexing in the future. All automated responses, very cheap to implement.

    As for why copyright holders would do this, it's not difficult to see. Say you find a URL infringing your material. You don't know who indexes it. You can check Google, Yahoo, Bing, etc - but they are constantly updating their results, and it may not show up at the time you search. Especially if pirate sites appear and disappear quickly. So you send DMCA notices to every major search engine so the ones which index it stop and the ones which haven't indexed it don't pick it up later. Multiply this by 1,000 and you see why it's infeasible to verify that every takedown notice is currently active in Google's index.

    If you want to talk about whether the DMCA process itself is reasonable, that's another conversation. But given the current system and the definition of "bogus" implied by the summary, this approach doesn't look unreasonable.

  21. Re:What is the benefit? by Chalnoth · · Score: 1

    If infringing content is usually hidden behind robots.txt blocks, then this makes a fair amount of sense.

    I suspect it would mean that in order to find the infringing content, the people making the takedown requests are likely violating the robots.txt. I doubt that's enforceable, unfortunately.

  22. Re:That's what you get for wording the DMCA that w by Opportunist · · Score: 1

    When 99+ percent of your claims are bogus, it's pretty much a given that you're not acting on good faith.

    Just leave the bad faith out if it bothers you. You make a bogus claim, you pay.

    --
    We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  23. Re:That's what you get for wording the DMCA that w by sjames · · Score: 1

    Sure, to you and me it's a given. But it needs to be written into law so we don't get bizarro decisions in court.

  24. Re:That's what you get for wording the DMCA that w by Solandri · · Score: 1

    That's probably why Google is publicizing this. To point out that the DMCA badly needs a disincentive for filing false takedown claims. If only 0.05% of claims are even factually correct (not even considering if they're legally valid), that's a huge problem with the law.

  25. Re:What is the benefit? by AmiMoJo · · Score: 1

    Some sites generate pages for every URL you try to access on their domain. site.com/movie-name.html is auto-generated when someone tries to access it and not stored anywhere. Google's bot recognizes this and doesn't bother listing the page, but the dumb DMCA bots are apparently not so smart.

    That's why sites with zero actual content can attract hundreds of millions of DMCA notices. I do wonder if they are honeypots designed to waste the bot's time or testbeds for anti-DMCA-bot technologies.

    --
    const int one = 65536; (Silvermoon, Texture.cs)
    SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
  26. Re:That's what you get for wording the DMCA that w by Opportunist · · Score: 1

    That's why our judges get some leeway when it comes to interpreting laws. Else you get people who search for loopholes and get away with it.

    --
    We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.