Slashdot Mirror


Dropbox's New Policy of Scanning Files For DMCA Issues

Advocatus Diaboli (1627651) writes "This weekend a small corner of the Internet exploded with concern that Dropbox was going too far, actually scanning users' private and directly peer-shared files for potential copyright issues. What's actually going on is a little more complicated than that, but shows that sharing a file on Dropbox isn't always the same as sharing that file directly from your hard drive over something like e-mail or instant messenger. The whole kerfuffle started yesterday evening, when one Darrell Whitelaw tweeted a picture of an error he received when trying to share a link to a Dropbox file with a friend via IM. The Dropbox web page warned him and his friend that 'certain files in this folder can't be shared due to a takedown request in accordance with the DMCA.'"

43 of 243 comments (clear)

  1. Later Dropbox! by Anonymous Coward · · Score: 5, Insightful

    Its been nice while it lasted, now on to other services!

    1. Re:Later Dropbox! by noblebeast · · Score: 5, Interesting

      MEGA is looking like a better alternative every day. End-to-end encryption, and 50GB(!) free storage.

      --
      Its not so bad as long as you can keep the fear from your mind.
    2. Re:Later Dropbox! by MightyYar · · Score: 5, Funny

      And then mail it to your friends and colleagues? Might slow things down a bit, but it makes me feel nostalgic. Just today I considered faxing something, just for the pure walk down memory lane. Beeep. Beeep. Beeep. X-FER FAIL.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    3. Re:Later Dropbox! by mister_playboy · · Score: 4, Informative

      For the kind of usage TFA talks about ("public" links), I recommend 1fichier.

      50GB(!) max file size, unlimited storage, 60 days retention for free users.

      https://www.1fichier.com/

      --
      Do what thou wilt shall be the whole of the Law ::: Love is the law, love under will
  2. Two solutions (Encrypt or leave) by kye4u · · Score: 5, Insightful

    If you are determined to use drop box, use an open source software as 7zip that will encrypt and zip. Otherwise, stop using drop box and move on to something else. One of the consequences of using the magical cloud is that your are bound to somebody else's rules for how they manage your data. Also note that those rules are subject to change at any time, and you don't have any say in those changes (I guess the only option is to speak with your wallet and move to greener pastures).

    1. Re:Two solutions (Encrypt or leave) by Xest · · Score: 4, Interesting

      I stopped using DropBox when it's Android app started asking for access to my contacts etc.

      Anything that asks for permissions unnecessary to its key purpose is dead to me.

    2. Re:Two solutions (Encrypt or leave) by Sockatume · · Score: 4, Informative

      Isn't that so that you can send links to contacts? Android has no granular permissions support so if you ever want to be able to email a link from the app, you have to grant that permission.

      --
      No kidding!!! What do you say at this point?
    3. Re:Two solutions (Encrypt or leave) by jellomizer · · Score: 2

      For an app intended to share data with different people, being able to access your contacts would make the program easier to use assuming that you are sharing data with people on your contact list.

      That said most apps work if you say No. I wouldn't call it an unnecessary request to ask for permission.
       

      --
      If something is so important that you feel the need to post it on the internet... It probably isn't that important.
    4. Re:Two solutions (Encrypt or leave) by aviators99 · · Score: 3, Insightful

      If you encrypt, it's not very convenient to do what the person in the article did: link to a video. His IM buddy would have to download/decrypt before seeing the video. Your point is well-taken, of course. But leaving for another cloud provider is likely not going to make things any better. Cloud storage, by its broad definition, is sacrificing security for convenience (to some extent). You can certainly mitigate that via encryption, but at the loss of much of the convenience, especially when it comes to this particular use case, which is the sharing of a video.

    5. Re:Two solutions (Encrypt or leave) by iq-0 · · Score: 2

      One of dropbox's key features is it's ability to share your files. So I hardly think access to your addressbook is really wrong. If they'd be sending that data to their server or whatever that would be unacceptable.
      You should actually be more annoyed with the Android permission system in this case, because it doesn't let you prohibit that part of the functionality. The current permissions system is that you must allow all permissions an app might need, eventhough you'll never use (or want to use) that part of it's functionality. Even delaying the accepting of the permssion would in many cases be preferable for these kinds of permissions that are related to your specific use-case for that app.

      --
      "Moo!" -- Anonymous Cow
    6. Re:Two solutions (Encrypt or leave) by Chrisq · · Score: 3, Insightful

      Isn't that so that you can send links to contacts? Android has no granular permissions support so if you ever want to be able to email a link from the app, you have to grant that permission.

      Its a shame that you cannot just deny that right and have it fail if you ever tried the email functionality. Or even let the application know what's granted so that it can disable the email options.

    7. Re:Two solutions (Encrypt or leave) by Xest · · Score: 4, Interesting

      Yes I believe that's the claim, but I'm more than content to just have a "Copy link to clipboard" button so I can paste it wherever I want - all they need to do is let me take the link where I want.

      Too many companies use such data for other purposes in the background (and ship your contacts etc. off to their servers) that it's a poison chalice to even ask for such permissions if it's not necessary to the underlying point of the application.

      I get that they want to make it easier for some users and I fully sympathise with the usability reasons for doing so, but ultimately when they do shit like this it just reinforces my view that it's not a permission I can trust most such companies with.

      They say they'll never do something, and they resist for a while, then they finally break, "just this once" they tell themselves. Like fuck "just this once".

      I used to have the Facebook app on my phone and I did give that permission - not because I trust them, but because I was going in knowing full well what they were going to do with it, but I drew the line at that app when it started asking permission to draw over other apps and such - what the fuck? No. Just no. There's not a chance in hell you're having permissions to view and render over the pixels on screen on my banking app or whatever.

      Now I'm far more tough with apps in general, which is why I wouldn't touch drop box anymore with this permissions change. Tired of being told our data wont be read, will be held securely and then suddenly such data turns up in completely unrelated places, like when contacts I only had through my MSN messenger list magically turned up as recommendations on LinkedIn despite me never having given permission for MS to share that data with LinkedIn nor LinkedIn permission to receive that data from MS.

      I used to be more laissez faire with my data, because I was lazy enough to put convenience over privacy, but each time I gave a company the trust they asked for based on the assurances they gave they really did lie and abuse it, so fuck them.

      Even something as innocent as a university course I did in my spare time has me getting text messages (2), e-mails (about 5), phone calls (7 of - land line and mobile), letters through the post (3) telling me to fill in the UK's student survey. Eventually I relented, any other comments? Yes, "Fuck your survey, all data I filled in is false. Leave me alone". Apparently I should've opted out of said survey, now if only I was ever given that choice.

      You literally can't put your data anywhere anymore without it being used to harass you. The convenience is no longer worth the inevitable follow on harassment which is anti-convenient, it's a distraction, a disruption, a pain in the fucking arse.

      I buy a TV and I have to give a postcode and house number so they can pass it on to the TV licensing authorities "It wont get used for junk mail, just for licensing" and what comes through the door after a year? "Your warranty is due to expire, your TV wont be covered if it breaks blah blah blah" - no it's fucking not, I'm covered by the consumer protection act you lying dipshits. Last time I bought one I gave the shop the postcode and number of their very own store, knowing full well the question would be coming having looked it up before hand, amusingly my theory that the sales drones would be too fucking dumb to notice was proven right.

      So it may be to let you more conveniently send a link directly, but you always pay in the end, that convenience doesn't come free, you lose the time gained by that convenience dealing with advertising crap, being sent friend invites from people you don't want, sorting junk mail into a recycle bin and phoning them to ask never to spam you again, or dealing with security nightmares because some retard company holding far more of your data than it ever needed got hacked.

      And that's why they can take their lame little "share this" or whatever button and fuck themselves with it.

  3. You wanted privacy? by DMacedo · · Score: 3, Interesting

    This is news, in the sense that Dropbox now actively crawls your files (DMCA still went about for publicly listed files anyway).

    But my question is why are there people in the tech industry still surprised by the fact that Dropbox does not encrypt it's users's files and can read them outright...
    That's how they do sharing between users, as well as file deduplication (Which probably works best for larger copyrighted files, funnily enough!)

    I still use Dropbox, and promote it slightly: with the stern advise to use it simply as a convenient way of sharing crap, but treat it as a "public USB drive"!

    Just never, ever, store sensitive data, like your business or evil masterplans, or your personal/bank/etc account details on it. But if you're sharing that MP3 you recorded on yesterday's block party, go right ahead!

    1. Re:You wanted privacy? by Ash+Vince · · Score: 4, Informative

      This is news, in the sense that Dropbox now actively crawls your files (DMCA still went about for publicly listed files anyway).

      You obviously didn't bother to read the article.

      The truth is that they always scan every single file uploaded to make sure they do not already have a copy of that file stored on their network. If they do, they throw your copy in the bin and just add an extra link to that stored copy in your account. That keeps their data usage lower as it means they never store duplicate copies of the same file, even if they are uploaded by completely different people.

      So there is no crawling involved, this was done at the point of upload. They found that the same file had already been uploaded by someone else, shared, and that user got the shared copy of that file DMCA'd. Once a file has been DMCA'd in their system it seems it is blocked from being shared so only people uploaded that file also get to download it.

      --
      I dont read /. to RTFA, I read /. to offend people in ignorance.
    2. Re:You wanted privacy? by suutar · · Score: 2
      1. Yes.
      2. Negligible. (I calculated in another post that the odds of a hash collision for SHA-1 and a trillion files was about 1 in 2^79; I have since learned that they actually use SHA-256, so make that 1 in 2^175).
      3. If you think it's worth worrying about then it's a legitimate concern for you; I wouldn't worry about it.
  4. Encrypt with publicly known key by Anonymous Coward · · Score: 2, Interesting

    All that's required of users is to use a encryption mechanism, even weak, to encrypt said files prior to uploading.

    You could potentially even use an encryption key as weak as "password" because DropBox aren't going to be in the business of guessing encryption keys (won't have the CPU grunt) so anything is going to deceive them - potentially even just XOR. Or even use the file's name.

    The only downside will be that DropBox will be just that little bit harder to use without some sort of application to make encryption and decryption of files easy.

  5. Re:Huh? by Sockatume · · Score: 2

    The DMCA is concerned with whether Dropbox is hosting an infringing file, not who they may be hosting the file for or for what purpose. Unfortunately this approach is forced upon Dropbox by a US law passed in an era of dial-up modems.

    --
    No kidding!!! What do you say at this point?
  6. Re:That's it by Richard_at_work · · Score: 5, Insightful

    But this isn't new, its been going on since Dropbox implemented their DCMA violation checking system a few years ago, and you can see *why* they are doing it.

    Lets clarify a few things for those that aren't going to RTFA - this isn't for private shared folders, or for folders within your own Dropbox. This is for when you create *public* links, by either using the "Shared Links" facility or when you create a public link from the old style Public folder.

    Thats it. The files Dropbox is including in these scans are *publicly linked* to - and they are fair game if Dropbox wants to stay ahead of the legal system on this front. Dropbox has no idea that you only intend to share it with yourself, or one other person, and there is no mechanism by which you can ensure that yourself anyway.

    Yet again its forced outrage against basically something which is common sense - if the file has been taken down before, its going to be again, and the less man power Dropbox expends while handling DCMA requests the better for them as a company.

  7. Not as bigger deal as it sounds if you RTFA by Ash+Vince · · Score: 4, Insightful

    This whole issue can be summarized as:

    1) User wants to ignore copyright law and share something they have no legal right to via a public service
    2) Public service being used has no idea how many people will want to access the shared resource but they do know it is copyrighted as they auto match everything uploaded so they can avoid keeping to separate copies of identical files and save storage space and had a DMCA take down request for that same file previously.
    3) Public service errs on the side of not getting their arse sued off by the various content owner conglomerates legal attack dogs and refuses to allow the file to be shared even though the person who uploaded it can still see it.

    All in all seems pretty reasonable. Until copyright law is changed (like that is ever going to happen) dropbox have to follow it to the letter. I suppose they could have avoided the whole thing by storing more data and then not doing the duplicate file scan thing but even that is no guarantee it would prevent them from being sued to oblivion.

    The only safe option for them that would also keep things private would be to use encryption keys that were only kept in the client. That way if you needed to share a particular folder you selected to store that under a different encryption key, and gave that key to other person / people who needed to access it.

    The big problem with this is that it then becomes more awkward to provide web access to the files. People are comfortable remembering a username and password, they are not so comfortable remembering a bunch of encryption keys. If you store the encryption keys on a server at your end anywhere then you can access the files so you therefore get the legal responsibility to make sure your system is not being used to flout copyright law. The only legal way to run this sort of service and not be liable for it's misuse is to design it in such a way that you cannot see what is being stored at all.

    --
    I dont read /. to RTFA, I read /. to offend people in ignorance.
    1. Re:Not as bigger deal as it sounds if you RTFA by erroneus · · Score: 2

      You don't "suffer." That's the first lie. If it's suffering to create, then you're doing it wrong.

    2. Re:Not as bigger deal as it sounds if you RTFA by Uberbah · · Score: 2

      YANAL.

      And you are?

      The DMCA states that companies must take reasonable steps to prevent reuploading. Designing a system with the express purpose of not being able to prevent uploading would be thoroughly illegal.

      [Citation needed]

  8. Only publicly shared files are scanned by Anonymous Coward · · Score: 2, Informative

    Publicly shared files that match known hashes are restricted, but not deleted, and any file can be shared to anyone privately without restriction, just not publicly to the world. Not much of a story. Read TFA.

  9. Re:Huh? by Ash+Vince · · Score: 2

    What does somebody else's data have to do with your data?

    There is no "your" data or "there" data. There is only dropbox data. It seems at the point you upload a file they check it to see if they already have a copy and of they do they just add a pointer to the existing file rather than store a fresh copy.

    And what if there is a hash collision?

    By the sounds of it they must actually do a direct file compare rather than use a hash. They probably use some kind of hash to narrow down the options of stuff to compare it with but in the fallback case of a hash collision, and both files being exactly the same size they must have to do an exact comparison. That probably does not happen very often though and it sounds like this is process is only done once at the point a file is stored.

    --
    I dont read /. to RTFA, I read /. to offend people in ignorance.
  10. Re:That's it by Dcnjoe60 · · Score: 2

    Do you know if dropbox is trying to determine what is a DMCA violation and stopping the share or if they have received actual takedown notices? I ask because if somebody shares something and dropbox recieves a takedown notice, then I would be okay with that. On the other hand, if they are trying to police what is out there, I'm not sure how they can make that determination or why they would stop at just shared content.

    Not trying to troll or inflame the discussion, just actually wondering how the process works.

  11. Re:Drop box .... Meh! by wonkey_monkey · · Score: 4, Informative

    Drop Box is nothing more than a gussied up repackaging of a SFTP or FTPS and a nice fancy ol' GUI.

    The post office is nothing but a gussied up repackaging of walking to your friend's house and giving him the letter yourself.
    The fax machine is nothing but a waffle iron with a phone attached!

    No, it's slightly more than that.

    You set up a server for SFTP or FTPS and download a nice, friendly little program called FileZilla.

    ...and then? Will Filezilla run on startup, settle itself inconspicuously in the systray without a running window you could accidentally close, connect to the SFTP server, download files automatically to local directories so they're instantly accessible, then monitor, sync and notify you of any changes? Will it allow you to dish out invitations to share directories and files direct from your desktop, and manage those permissions for an unlimited number of users and directories?

    --
    systemd is Roko's Basilisk.
  12. Re:Huh? by iq-0 · · Score: 2

    Part of it is in the 'terms of service' where you specifically allow dropbox to do certain things (like deduplication and retention after you've deleted it).

    They're not actively searching *your* files to seek out these violations, they got a specific complaint about that file's data, which they are obliged to make publicly inaccessible. If you also share that file's data than that too is, according to the DMCA, in infringing and is prohibited from being shared.

    About the hashes: they most certainly only use to hashes to find candidates for deduplication. All files with the same hash are most likely first compared byte-for-byte before they're really considered the same.
    The 'takedown' probably happens on the deduplicated file's entry in some database, where it's marked as a 'DMCA violation'. Any attempt to access it via a share will notice that flag and show the appropriate message. They wouldn't need to actually "go through your files" to look for violation, but in case they want to they can simply look who has a reference to the deduplicated file and whether or not it's shared by them in order to notify them of the fact (in that case they's still not be going through your files, but just following the link back to your account).

    They are actually very correct about it, since they only disable the sharing, not your access to the file (since that is yours and thus not necessarily infringing on the DMCA). They are just not allowing you to use their service to distribute a copyrighted work about which they we're told it's not allowed to be distributed by them.

    --
    "Moo!" -- Anonymous Cow
  13. Re:OwnCloud by heypete · · Score: 4, Interesting

    This is what OwnCloud is made for.

    I know not everyone is able to set up their OwnCloud server. There are places that will host it and set it up for you.

    OwnCloud is great, with one exception: the slightest change to a file necessitates an upload of the entire file. Dropbox does delta syncs using a modified version of rsync, so it only uploads change portions of a file.

    For typical files and fast connections, the lack of delta sync is tolerable, but when you're dealing with large files or slower transfer speeds it's an issue: if you, for example, you keep a large TrueCrypt container file in OwnCloud and make a change to a small file stored in the container, OwnCloud needs to reupload the entire container. Dropbox would just update the blocks that changed.

    Until OwnCloud implements some sort of delta sync functionality it is considerably less practical than Dropbox.

  14. Re:That's it by hattig · · Score: 2

    Or you could read the article and get answers immediately.

    They use file hashes of previous DMCA requests when new files are shared. If it transgresses, it's blocked just like this situation.

    It's not "policing", it's blacklisting the sharing of specific files via comparing file's hash against a list of blacklisted hashes.

    I just hope they're not using CRC16.

  15. Re:Huh? by StripedCow · · Score: 2

    But computing a hash-value IS going through your files.

    What if they use a hash that is computed like this:
    1. compute md5sum of the data
    2. make the last bit zero or one, depending on whether the file has some interesting property.

    Suddenly, they can profile you based on "hash-value" alone.

    --
    If Pandora's box is destined to be opened, *I* want to be the one to open it.
  16. Re:Drop box .... Meh! by Alioth · · Score: 4, Funny

    > Viola!

    I fail to understand what a stringed instrument, slightly larger than a violin, has to do with it...

  17. You Can by brunes69 · · Score: 4, Informative

    It's called AppOps. Was in Android hidden, then removed, but still ships in standard Cyanogenmod.

  18. Whomade the DMCA complaint ? by Alain+Williams · · Score: 2

    The image of the error message did not say who, or which corporation, had made the DMCA complaint. I thought that in order for something to be taken down under the DMCA the user had to be told who was complaining.

    In this case: the user admits that the file was something that he should not be sharing, but there have been cases where the DMCA is being used to prevent legal files - in a case like that the user must be told who is complaining so that they can challenge the DMCA complaint.

  19. Well duh by DrXym · · Score: 3, Informative
    Anyone who uploads copyright infringing content to a cloud server and entrusts it to the care of a company is an idiot. There are various ways that files could be scanned simply from looking at the filename or hash all the way through to analysis of the tag / contents / watermark.

    And DropBox is probably the most benign of mainstream cloud hosts. Google, Amazon, Apple and Microsoft all sell content and sign voluminous contracts for the sale of said content. It's not hard to imagine that they would or could be obliged to scan for infringing content and notify the content providers when they find any.

  20. They're using hashes by Quila · · Score: 4, Informative

    Change a character in the metadata fields, hash changes. If they're scanning the actual video portion of files, add a byte at the end. I don't think that would affect playback.

  21. Re:That's it by bberens · · Score: 2

    If you're not distributing copyrighted material I fail to see how this could be a problem in practice. You'd have to create a public link to your copyrighted file and that link would have to somehow wind up in the hands of the MPAA or other representative of copyright holders.

    --
    Check out my lame java blog at www.javachopshop.com
  22. Re:That's it by TangoMargarine · · Score: 3, Insightful

    If you're not distributing copyrighted material I fail to see how this article is relevant at all. They wouldn't care.

    --
    Unity? Screw that: XFCE. Slashdot Beta? Screw that: SoylentNews. Australis? Screw that: Pale Moon. UX developers DIAF
  23. Re:Huh? by blueg3 · · Score: 4, Informative

    He wasn't making an analogy between how you find a hash collision and how you win the lottery -- only comparing the odds.

    Dropbox uses SHA-256 hashes. I'm assuming this is what they use for this feature, since it's what they use internally for file identification and deduplication. They actually hash 2 MB file chunks, which means that any file more than 2 MB produces multiple hashes (one per chunk, naturally).

    The "many chances of winning" you're referring to here is the birthday collision problem. A good, rough approximation is that for an N-bit hash, while the number of different hashes is 2^N, the number you can generate before risking a collision is about 2^(N/2). So, with SHA-256, we run no significant risk of collision until we've generated around 2^128 ~= 10^38 hashes.

    The total amount of data stored worldwide is on the order of 1 ZB. That's room enough for about 10^15 2-MB chunks. Of course, some of our files might be smaller than this 2 MB chunk size, enabling us to be more efficient with storage. We might be able to get somewhere around 10^20 different files in there.

    That's a strange and untenable use of all of the world's storage, and it still puts us about 18 orders of magnitude short of being able to risk a SHA-256 collision. If you had this giant set of a ton of different files, the probability of a collision existing is about 1 in 10^37.

    So, short of a flaw in SHA-256, you can assume that a hash collision will never happen. We know of no such flaws. (If we do, it will almost certainly be the case that the collision only occurs because one of the two files was specifically manipulated to produce the collision.)

    On the other hand, the odds of winning the lottery are rarely worse than 1 in 10^9.

  24. Re:Drop box .... Meh! by Overzeetop · · Score: 2

    Dropbox is not useful because of what it does - it's useful because of how it does it (seamless for a non-technical end user) and its integration into other, especially mobile, applications. Until you can roll-your-own references into commercial mobile apps, or make sharing a cloud file with a colleague with a different OS and no access to your private net available with a single click, whatever you hack together won't be Dropbox.

    --
    Is it just my observation, or are there way too many stupid people in the world?
  25. Much, Much Later by Jane+Q.+Public · · Score: 3, Insightful

    I refused to use Dropbox ever since its "end to end encryption" claim was shown to be false, and they were de-duping your files. (De-duping required access to the original files, which Dropbox tried to claim they didn't have.)

    Then they said they were changing that practice. But how far could you trust them, considering that they had already lied to everybody? Fool me once, and all that.

    NOW, apparently they're checking your files -- which back when they again claimed they weren't accessing -- for copyrighted content, which again requires access to your original files. (Even if you're just doing an MD5 hash or some such, you still need access to the original file to do it.)

    So, yeah. For all those who didn't drop Dropbox when I did, maybe it's time.

    1. Re:Much, Much Later by jimbo · · Score: 5, Interesting

      I've used EncFS and BoxCryptor with Dropbox from day one and 'd do that with any cloud storage solution, no matter what they claim it is irrelevant. It is my data, by choice I'm retaining the responsibility for it's safety/security.

      I'll continue to use Dropbox because I never trusted them and made sure I didn't have to.

    2. Re:Much, Much Later by Lord+Crc · · Score: 2

      I refused to use Dropbox ever since its "end to end encryption" claim was shown to be false, and they were de-duping your files.

      I simply never assumed my Dropbox files were private to begin with.

      While I don't share everything in my public folders, I don't put anything in Dropbox that I don't mind the whole world to see.

  26. Re:That's it by neonKow · · Score: 2

    You don't need public links for keeping your own account in sync.

  27. Re:That's it by david_thornley · · Score: 2

    That's how it used to be. Nowadays, copyright exists once a creative work is put into tangible form (this comment is creative enough to be covered, and is copyrighted as I type it, since computer memory is tangible). I believe this is the case in all Bern convention countries.

    That doesn't mean there aren't advantages to registering a copyright, but if you bother to look up the law, you'll find that this comment is copyrighted.

    --
    "When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes