Slashdot Mirror


Altering Text In eBooks To Track Pirates

wwphx writes "According to Wired, 'German researchers have created a new DRM feature that changes the text and punctuation of an e-book ever so slightly. Called SiDiM, which Google translates to 'secure documents by individual marking,' the changes are unique to each e-book sold. These alterations serve as a digital watermark that can be used to track books that have had any other DRM layers stripped out of them before being shared online. The researchers are hoping the new DRM feature will curb digital piracy by simply making consumers paranoid that they'll be caught if they share an e-book illicitly.' I seem to recall reading about this in Tom Clancy's Patriot Games, when Jack Ryan used this technique to identify someone who was leaking secret documents. It would be so very difficult for someone to write a little program that, when stripping the DRM, randomized a couple of pieces of punctuation to break the hash that the vendor is storing along with the sales record of the individual book."

12 of 467 comments (clear)

  1. So... by Impy+the+Impiuos+Imp · · Score: 5, Informative

    Normal book publishers have been doing this for decades, inserting the occasional misspelling here or there. Later, they inserted correct spellings, but of the wrong word, to get around auto-correction in scanner software.

    So...no, they can't patent it.

    --
    (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
    1. Re:So... by Shompol · · Score: 5, Insightful
      Yes but this is different because

      ... on a computer

      So yes, they can (and will)

    2. Re:So... by Z00L00K · · Score: 5, Insightful

      And if the publisher do change texts in different e-books anyone that wants to get around it would just need a few copies and use a statistical analysis to blank out the differences.

      This is similar to what steganography does, so if you mess up the punctuation inserted then it will be really hard to look up the perpetrator - or even that the wrong party will be pointed out.

      So now the Pandora's box is opened.

      --
      If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
    3. Re:So... by Opportunist · · Score: 5, Funny

      Uh... yes. When you find misspelled words in my messages here, it's just my new DRM. It's just that. It's not that I'm too dumb to use a spellchecker.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    4. Re:So... by Arrepiadd · · Score: 5, Informative

      There would be no need to reverse engineer a pristine copy of the work. Simply proofreading a single copy and correcting some of the existing errors, while at the same time, introducing a few new errors of the same type

      I didn't read the article because I had seen it earlier in another news source, so I don't if this is mentioned in the one mentioned here, but proofreading may not do it in this case. The source I read mentioned two specific types of change that do not introduce any typos (I'm choosing the exampled myself):
      - One of them was reordering of nouns when the order does not matter, e.g. "Peter and John went for lunch" vs "John and Peter went for lunch";
      - The other was playing with negatives: e.g. "something is unclear" vs "something is not clear"

      Since there are no actual typos, it's hard to spot the identifying bits. You'd have to change the text substantially, in order to have a good chance of being free from discovery. Adding your own typos may not serve any purpose, since the company selling can focus just on the changes they made, not looking for other changes introduced after.

      Of course, if there is a concerted effort to release documents, all pirates would need to do would be buying a few copies and diffing the documents. You may not get the original back, but if the changes are randomly put in a specific set of words, you certainly can end up with something close to the original than any of the sold copies and still free from pirate identification.

    5. Re: So... by Threni · · Score: 5, Funny

      And then "on a mobile device but with slightly rounded corners".

  2. Goddammit. by Chrontius · · Score: 5, Funny

    I catch all the typos in my books.

    They irritate me.

    I'd probably crack 'em, fix them all, and goddammit, that'd be "circumvention".

  3. That's not how traitor-tracing algorithms work by _Knots · · Score: 5, Informative

    They don't hash the whole shebang into one number. Rather, they take a (random) number and use that to generate a set of mutations and then probe for that set of mutations in the leaked document. So now, even if you alter the document further, you probably didn't undo the mutations in question. Even if you did, you probably didn't undo all of them and you almost certainly didn't produce a high-confidence result that it's somebody else's copy.

    --
    Anarchy$ dd if=/dev/random of=~/.signature bs=120 count=1
  4. Great trick to remove the watermark by Rosco+P.+Coltrane · · Score: 5, Funny

    - Scan/OCR book
    - Google translate into German
    - Google translate back into English
    - Print book

    Voila! No more watermark. You can share with confidence.

    --
    "A door is what a dog is perpetually on the wrong side of" - Ogden Nash
  5. strip by Tom · · Score: 5, Insightful

    It depends. If it's done well, it can be fairly resistant to any noise introduced into the system.

    As an author myself, I see a very different issue with this. I don't want some robot changing my text. Some of those words it might decide to change because they are similar I may have pained over and decided for a reason to use this one and not the other one. Granted, few authors pick every single word intentionally, but the software won't know which ones are carefully selected.

    Often times, there is subtle meaning. For example, I might decide to always use the same phrase in certain contexts, giving a very subtle hint to the reader which things are alike and which ones are different. One he might not even notice consciously.

    It also will cause all sorts of trouble to quoting. How will teachers handle this if a student quotes a text but the quote differs slightly from the version the teacher has read? One of the most important things we teach students is that quotes need to be exactly as they appear, with any omissions or changes clearly marked.

    That also extends to quotes within the text. If character A reports what character B said, I doubt the system will have enough text understanding to change both texts the same way, so the reader will be left wondering if it is intentional that there's a slight difference and what the author wants to hint at, when there's no such thing implied.

    --
    Assorted stuff I do sometimes: Lemuria.org
  6. Re:Learn by Macgrrl · · Score: 5, Funny

    That's not how dyeing industries work.

    You negative attitude is colouring your response.

    --
    Sara
    Designer, Gamer, Macgrrl in an XP World
  7. Re:Defeated in one... by Opportunist · · Score: 5, Funny

    No, I said it was music!

    --
    We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.