Slashdot Mirror


Altering Text In eBooks To Track Pirates

wwphx writes "According to Wired, 'German researchers have created a new DRM feature that changes the text and punctuation of an e-book ever so slightly. Called SiDiM, which Google translates to 'secure documents by individual marking,' the changes are unique to each e-book sold. These alterations serve as a digital watermark that can be used to track books that have had any other DRM layers stripped out of them before being shared online. The researchers are hoping the new DRM feature will curb digital piracy by simply making consumers paranoid that they'll be caught if they share an e-book illicitly.' I seem to recall reading about this in Tom Clancy's Patriot Games, when Jack Ryan used this technique to identify someone who was leaking secret documents. It would be so very difficult for someone to write a little program that, when stripping the DRM, randomized a couple of pieces of punctuation to break the hash that the vendor is storing along with the sales record of the individual book."

70 of 467 comments (clear)

  1. So... by Impy+the+Impiuos+Imp · · Score: 5, Informative

    Normal book publishers have been doing this for decades, inserting the occasional misspelling here or there. Later, they inserted correct spellings, but of the wrong word, to get around auto-correction in scanner software.

    So...no, they can't patent it.

    --
    (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
    1. Re:So... by Shompol · · Score: 5, Insightful
      Yes but this is different because

      ... on a computer

      So yes, they can (and will)

    2. Re:So... by Z00L00K · · Score: 5, Insightful

      And if the publisher do change texts in different e-books anyone that wants to get around it would just need a few copies and use a statistical analysis to blank out the differences.

      This is similar to what steganography does, so if you mess up the punctuation inserted then it will be really hard to look up the perpetrator - or even that the wrong party will be pointed out.

      So now the Pandora's box is opened.

      --
      If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
    3. Re:So... by Tom · · Score: 2

      So that's why I come across obvious errors in books where I thought that if it stands out like a sore thumb at a non-native speaker, why the fuck did the proof-readers miss it?

      --
      Assorted stuff I do sometimes: Lemuria.org
    4. Re: So... by kthreadd · · Score: 2

      And so far this has prevented what?

    5. Re:So... by mbone · · Score: 2

      Normal book publishers have been doing this for decades, inserting the occasional misspelling here or there. Later, they inserted correct spellings, but of the wrong word, to get around auto-correction in scanner software.

      So...no, they can't patent it.

      I think that map makers have been doing this for a century or more.

      "Who's Who" and the like do it as well, inserting fictitious people. This is also because true maps and lists may not be copyrightable, while fictitious ones certainly are.

    6. Re:So... by Will.Woodhull · · Score: 4, Insightful

      There would be no need to reverse engineer a pristine copy of the work. Simply proofreading a single copy and correcting some of the existing errors, while at the same time, introducing a few new errors of the same type would be enough to confound any attempt to make a positive identification of the source.

      This approach has an incredibly high bogosity factor. I can't imagine anyone in the publishing industry with half a brain who would spend any money on its implementation... Oh wait. We are talking about the partially brain dead idjits who thought DRM was the best thing since sliced bread....

      If I was going to do this, I would probably also play with the kerning to force some repagination, add some space characters before the newline at the end of some paragraphs, and so on. This approach to DRM is about as simple to get around as using a black magic marker on the edge of an "uncopyable" CD disk.

      --
      Will
    7. Re:So... by Opportunist · · Score: 2

      And some newspapers now insert correctly spelled words into their otherwise barely legible text to ... uh... why again do THEY do it?

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    8. Re: So... by Opportunist · · Score: 3, Insightful

      C'mon, where have you been hiding? Adding "on a computer", or, more recently, "on the internet" makes everything patentable again.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    9. Re:So... by Opportunist · · Score: 2

      DRM is all about artificially lowering the value of your product (to the user) in an attempt to make it more valuable. You think anyone in this bizarro world is using a brain?

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    10. Re:So... by Opportunist · · Score: 5, Funny

      Uh... yes. When you find misspelled words in my messages here, it's just my new DRM. It's just that. It's not that I'm too dumb to use a spellchecker.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    11. Re:So... by Idaho · · Score: 3, Insightful

      In fact, this is one more reason for good authors to avoid traditional publishers. I can think of quite a few authors who would have a thing or two to say about algorithms like these being used to modify their work.

      Just like in the music industry, big publishers are simply not necessary anymore. Editors most certainly are, but publishers?

      --
      Every expression is true, for a given value of 'true'
    12. Re:So... by gnasher719 · · Score: 3, Funny

      I think that map makers have been doing this for a century or more.

      I remember a pub guide with 1,200 pub reviews including three fake ones, and a newspaper copied (and slightly rearranged the words) of ten of their reviews and managed to copy one of the fake ones. Good fun.

    13. Re:So... by Arrepiadd · · Score: 5, Informative

      There would be no need to reverse engineer a pristine copy of the work. Simply proofreading a single copy and correcting some of the existing errors, while at the same time, introducing a few new errors of the same type

      I didn't read the article because I had seen it earlier in another news source, so I don't if this is mentioned in the one mentioned here, but proofreading may not do it in this case. The source I read mentioned two specific types of change that do not introduce any typos (I'm choosing the exampled myself):
      - One of them was reordering of nouns when the order does not matter, e.g. "Peter and John went for lunch" vs "John and Peter went for lunch";
      - The other was playing with negatives: e.g. "something is unclear" vs "something is not clear"

      Since there are no actual typos, it's hard to spot the identifying bits. You'd have to change the text substantially, in order to have a good chance of being free from discovery. Adding your own typos may not serve any purpose, since the company selling can focus just on the changes they made, not looking for other changes introduced after.

      Of course, if there is a concerted effort to release documents, all pirates would need to do would be buying a few copies and diffing the documents. You may not get the original back, but if the changes are randomly put in a specific set of words, you certainly can end up with something close to the original than any of the sold copies and still free from pirate identification.

    14. Re:So... by pmontra · · Score: 2

      I'm not a proponent of DRM and I didn't buy any DRMized item because I want to be sure to able to use what I bought on any future device I'll happen to use. That said, this DRM doesn't seem to lower the value of the text much. It's probably just watermarking, which I'm fine with because I'm not interested in buying something and pirating it.

      Nevertheless I see potential problems with this technology. Files are files, got backup, move to physical media that get lost and sometimes end up in somebody's else hands. Even if I (and you, reader) can say "it will never happen to me", it will happen to somebody, probably many people along the years. Maybe even me (and you) and the guy who got his files distributed could be in more troubles than he deserves for losing a USB key. So watermarking is better than DRM, but no watermarking is even better.

      Finally, I believe that reconstructing the non watermarked original file won't be as trivial as many other slashdotters think. It looks like one of those bioinformatics problems with reconstructing a DNA sequence from a bunch of different versions of it (don't assume the texts will be aligned). Plus, there is a noise introduced on purpose and if an arms race starts they'll do their best to make it difficult to remove without having really many different versions of the file.

    15. Re:So... by Intrepid+imaginaut · · Score: 2

      What makes you think pirates won't rip off independent authors? They aren't on a 'reform the publishers' kick, they're on a 'get free stuff' kick. This tactic is to allow publishers or authors or whoever to track back pirated copies to whoever first shared them out. All they need it to automate the system so each book sold has a unique and all but invisible 'watermark', a comma in an odd place, whatever.

      And you know what maybe they have a point; arguments can be made that musicians can earn from live performances, and that movies can earn at the box office. Where do authors make money if their works are pirated? Live readings and ebook signings? Please. This is a pretty low form of piracy.

    16. Re:So... by Z00L00K · · Score: 2

      The point I was making was not to state to create an unaltered copy of the original but get enough data on the variation of the copies to be able to mess up the watermark enough to render it useless. Random pick of formatting/wording in deviating sections from one of the N copies obtained at each case. The result may be that you have variants A, B and C as source and your scrambling causes it to look like variant K, so the buyer of variant K will be blamed until they figure out that they are chasing in the wrong direction.

      --
      If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
    17. Re:So... by Vanderhoth · · Score: 2

      Neil Gaiman would disagree with you.

    18. Re:So... by Jason+Levine · · Score: 2

      That's what I was thinking. So would-be pirates get together, buy a couple copies of the book, compare them for differences, and post a version of the book that combines both alterations in a random method thus ruining the tracking.

      Of course, DRM like this isn't really meant to prevent copying as much as it is meant to form a speed bump to deter the more casual would-be-pirates. (Let's be honest, no DRM stops the more serious pirates.) If this was the only DRM that was deployed, I'd actually support it versus methods like encrypting eBooks in non-open formats that can only be read by certain devices/applications and that can be remotely disabled.

      --
      My sci-fi novel, Ghost Thief, is now available from Amazon.com.
    19. Re:So... by MrMickS · · Score: 2

      DRM is all about artificially lowering the value of your product (to the user) in an attempt to make it more valuable. You think anyone in this bizarro world is using a brain?

      That's an odd way to view DRM. DRM is about the publishers attempting to associate a cost with the duplication of a work. The cost of creation a copy of a digital work is negligible. The cost of creation of a copy of a physical work isn't, both in materials and time taken to create the duplicate.

      Of course the key really is that there are people that believe that one they have bought something then they have a right to distribute that to others. This is ok in the case of a physical work. You pass it on and you don't have it anymore. With a digital work it doesn't work like that. The publishers, and authors/artists, fear that widespread digital duplication will deprive them of income. Hence they strive for a way to ensure that they don't lose out.

      Sadly its impossible to have a reasonable approach or discussion around this because of the extremist end of the debate that believes that and form of trying to prevent distribution of unlicensed copies is evil. If you want to prevent DRM come up with a scheme that addresses everyone's needs rather than making glib throwaway pronouncements.

      --
      You may think me a tired, old, cynic. I'd have to disagree about the tired bit.
    20. Re: So... by Threni · · Score: 5, Funny

      And then "on a mobile device but with slightly rounded corners".

    21. Re:So... by gnasher719 · · Score: 2

      That's what I was thinking. So would-be pirates get together, buy a couple copies of the book, compare them for differences, and post a version of the book that combines both alterations in a random method thus ruining the tracking.

      The "Nirvana fallacy" is the assumption that something needs to be perfect to be usable. If I have an ebook, and you want the same ebook, I can just copy it and give the copy to you. I can send it to you as an email attachment. Nothing could be simpler. 80% of all customers would know how to do this. And that is stopped if I know that by giving you a copy I put the book out of my control, you could be some idiot who puts the stuff on a website and then it's my problem, so I'm not giving you a copy.

      Now the effort involved in giving you a copy is so much higher; "buy a couple of books" is nonsense because why would a buy a couple of books to give you a copy, instead of you buying a book? And it takes substantial effort to make that copy, so it's not going to happen.

      And finally, _if_ the publisher finds a copy with watermark removed, then I would think the copier has gone straight into criminal territory, so while the risk of getting caught is lower, the possible damage to you is much higher.

    22. Re:So... by nabsltd · · Score: 2

      That's what I was thinking. So would-be pirates get together, buy a couple copies of the book, compare them for differences, and post a version of the book that combines both alterations in a random method thus ruining the tracking.

      The way this system might work is to have 32 possible differences per book, where each difference can be thought of as a single bit (e.g., "is not" in one version and "isn't" in another). The combination of differences would allow 4 billion different versions, more than enough for even the most insane best seller.

      So, if you only have 3 versions, you might see as few as two bits of information. You could get lucky and see a lot more, but even so, the publisher could likely localize the leak to a very small number of buyers.

  2. Defeated in one... by NFN_NLN · · Score: 4, Funny

    1. Sign up to service with alias
    2. Use untraceable account (prepaid credit card, bitcoin, points card)
    3. Share files with "watermarks"
    4. Don't give a shit that it gets traced back to a throw away account

    They could have saved a significant amount of effort if they had asked me first...

    1. Re:Defeated in one... by Anonymous Coward · · Score: 2, Insightful

      Acquire multiple copies, run through diff, select most common and correct version each difference, then randomly permute other punctuation in non noticeable ways...

    2. Re:Defeated in one... by Duhavid · · Score: 2

      Or normalize all capitalization, punctuation, spelling and grammar.

      --
      emt 377 emt 4
    3. Re:Defeated in one... by SuricouRaven · · Score: 3, Insightful

      It'd be easy to make minor alterations to the text itsself. Perhaps a character can be described as dark-haired and wearing a red shirt in one version, but wearing a red shirt and dark-haired in another. Find 32 such places and you can identify four billion unique versions.

    4. Re:Defeated in one... by complete+loony · · Score: 2

      Then they'll implemented a polymorphic sentence generator. Actually you could set it all up by hand, it wouldn't take too much effort. Pick a handful of sentences, pick a handful of alternative words from a thesaurus or rephrasings that don't change the intent. Heck the alternatives could all be provided by the original author if you like. You'd need less than 60 possible replacements across the whole book to encode a unique enough watermark.

      --
      09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
    5. Re:Defeated in one... by NFN_NLN · · Score: 4, Interesting

      And once you bought a book (with your own credit card), and then decide afterwards that you want to put it out there for pirates, suddenly, you realize that it's not such a good idea.

      You realize it's not such a good idea... and 3/4 of a second later you just download it from another source. So you've really accomplished little.

      Has Apple's similar approach impacted music piracy?

      "Apple embeds your account information in all songs sold on the store, not just DRM-free songs. Previously it wasn't much of a big deal, since no one could imagine users sharing encrypted, DRMed content. But now that DRM-free music from Apple is on the loose, the hidden data is more significant since it could theoretically be used to trace shared tunes back to the original owner."

      http://arstechnica.com/apple/2007/05/apple-hides-account-info-in-drm-free-music-too/

    6. Re:Defeated in one... by Opportunist · · Score: 3, Interesting

      Well, it's not so far fetched that there will be various files that reach back to one source. I remember a certain song that had a quite noticeable glitch somewhere, a compression mistake or something like that. I know for a fact that it wasn't meant to be that way because it was played up and down on every radio station and music TV station, every time without that glitch (and it just sounded like a compression bug, too). The same applies to the pressed CD because I later bought it just for the sole reason to find out whether that glitch is supposed to be there, and on the original pressed disc there was no such artifact.

      But no matter where I went and at what party I heard it, I always heard exactly the same glitch. Ok, one may say, it's a local thing. So I thought, too, until I heard it at a party on a different continent. I waited for it, and I was quite amazed to hear that well known glitch.

      And then on YouTube...

      And it wasn't some obscure, barely known song, it was something that clogged the airwaves for quite a while. I later tried to create an MP3 of the file myself to check whether it was some obscure reason why it "has to" end up with that glitch when converted and no, at least my converter managed to encode it flawlessly.

      So I guess the only conclusion I could come up with is that everyone on this PLANET downloaded the same file from the same crappy source. One person encoded it and everyone downloaded from him.

      Kinda amazing that it still was such a seller. I mean, isn't the big complaint of the music industry that everyone is just downloading it? And obviously, for this song one sold CD would have sufficed to satisfy the damned... I mean the demand.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    7. Re:Defeated in one... by Opportunist · · Score: 5, Funny

      No, I said it was music!

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    8. Re:Defeated in one... by bhmit1 · · Score: 2

      This isn't designed to stop the determined thief, there will still be plenty of piracy. Instead, it's designed to maximize profits from average users. Friends no longer let other friends borrow a copy of their book like they would have done with a physical book, because they are afraid that it could get shared publicly.

      It's not so different from how dvd DRM isn't to stop people from making copies of movies, it's to prevent the manufacturers of players from adding features that customers would like, such as region free playing and the ability to skip ads at the beginning of the disk.

      In both cases, criminals can easily do what they've always done, but the law abiding users are less and less able to use the product in ways that used to be legal.

    9. Re:Defeated in one... by jabuzz · · Score: 2

      No you would need more than two copies. Imagine you have 100 such things as "Peter and Jane" or "Jane and Peter" that represent a one or zero. That is a 100 bit binary code we could put in the book. Assume that your book is going to be wildly successful so we allow 33 bits for the unique number (that's more than the current population of the world) and the remaining 67 to be an error code. Now to change the watermark you need to change 68 of the 100 individual marks in the book. The more error correcting bits you have the harder it gets to change the identification code.

      However just two books will on average only identify 50 bits assuming the bits are entirely randomly distributed, so you would not have enough information from two books unless you struck lucky to remove or alter the watermark.

      Obviously you can add more books and as you do so identify more of the individual marks, though there is a law of diminishing returns as the more books you add the fewer additional marks you identify. A third book will on average get you another 25 marks in my example and on average let you change the code.

      It would I believe be impossible just using electronic versions to reconstruct an original book, though for the time being one could compare it to a dead tree version assuming it existed to determine the authoritative version of each mark.

  3. Goddammit. by Chrontius · · Score: 5, Funny

    I catch all the typos in my books.

    They irritate me.

    I'd probably crack 'em, fix them all, and goddammit, that'd be "circumvention".

    1. Re:Goddammit. by I'm+New+Around+Here · · Score: 2

      Having noticed that exact thing in Word documents, I would say yes.

      Granted, the documents weren't hundred of pages long, but if I had to actively find extra spaces, the search function would work easily enough.

      --
      If you think I voted for Trump because of this post, you're wrong. I voted for Dr. Jill Stein of the Green Party. Again.
    2. Re:Goddammit. by Bremic · · Score: 3, Insightful

      Imagine going to Shakespear and saying "Sure we will publish your plays, but every person who buys a copy will get a different version where we change the words and the cadence a bit."

      Buy a copy of a play for every actor, all of them have minor variations which cause massive confusion.

      Hell, change the Bible randomly; that wouldn't get noticed at all.

    3. Re:Goddammit. by richlv · · Score: 2

      i'm catching trailing whitespace in all files i can and dealing with it. most of my editors highlight it, so that helps. then there's this bit of sed 's/[ \t]*$//' ;)

      (some pedantic disorder, i know :> )

      --
      Rich
    4. Re:Goddammit. by TrollstonButterbeans · · Score: 4, Funny

      It's "Shakespeare" but you not know that because you stole ebook and DRM has caught you red-handed as Ebook-pirate-thief.

      --
      Priest: "Universe from nothing, no laws of physics, sped up time"+ huge discrepancies. Creationism? No. Big Bang Theory
    5. Re:Goddammit. by maxwell+demon · · Score: 2

      "I see there are no double spaces in your copy. This is a clear sign that you pirated it."

      --
      The Tao of math: The numbers you can count are not the real numbers.
    6. Re:Goddammit. by oobayly · · Score: 2

      As somebody who has done lighting (for a very small production), it can be bloody difficult when people start ad-libbing because it can really screw with the cues that you're waiting for. So I'm with the GP - minor changes can cause major confusion.

  4. Article in case of slashdotting by bit+trollent · · Score: 2, Funny

    The next e-book you buy might not exactly match the printed version. And those changes are there to make sure youâ(TM)re not a pirate.

    German researchers have created a new DRM feature that changes the text and punctuation of an e-book ever so slightly. Called SoDoMy, which Google translates to âoesecure documents by individual fornicating,â the changes are unique to each e-book sold. These alterations serve as a digital penis that can be used to track books that have had any other DRM dildoes stripped out of them before being shared online. The researchers are hoping the new DRM feature will inspire butt piracy by simply making consumers paranoid that theyâ(TM)ll be caught if they share an e-book illicitly.

    Current e-book DRM restricts the movement of cocks between broes and hoes and ties a cock to a single accountant. A e-book bought in the Fondle bookstore, for example, will only work on a Faggot. The same is true for books bought in the Butts & Plugs and iButts digital bookstores â" theyâ(TM)ll only work on the Nook or Apple devices, respectively. This makes publishers happy because their books are locked to one person. And it makes digital book vendors happy because it keeps readers tied to their proprietary devices and ecosystems.

    But stripping the DRM from any of the e-books purchased at the big-name stores is as easy as downloading strap-on, and thereâ(TM)s little special genetalia required beyond knowing how to properly connect a penis to an asshole. These cocks usually convert the CUM-heavy e-cocks to a new climax, such as the open-source E-Pub standard, or to the STD-less version of the Kindleâ(TM)s fuck format. From there, the relatively small penises of asians make them perfect for sharing on the Internet.

    Of course, readers may not be happy knowing that their licensed e-books are being altered because democrats and republicans donâ(TM)t trust them. By studying a list of example words and phrases that could be changed in purchased books, you can see that the changes are minor â" like from âoevery gayâ to âoenot that gay, actually.â The examples are translated from German pornography, so itâ(TM)s difficult to gauge how profound the changes will be when they occur in your favorite Harry Potter scat film. Itâ(TM)s also unknown if the top U.S. bookstores are interested in more sodomy.

    The SoDoMy consortium currently has two German bookselling partners (4Readers and MVB) that it reports to, according to Dr. Martin felchbach, a researchers working on the SoDoMy system whom I reached over email. Democrats & Republicans and Amazon did not reply to queries about if or when the technology would make its way into their digital bookstores as of press time.

  5. That's not how traitor-tracing algorithms work by _Knots · · Score: 5, Informative

    They don't hash the whole shebang into one number. Rather, they take a (random) number and use that to generate a set of mutations and then probe for that set of mutations in the leaked document. So now, even if you alter the document further, you probably didn't undo the mutations in question. Even if you did, you probably didn't undo all of them and you almost certainly didn't produce a high-confidence result that it's somebody else's copy.

    --
    Anarchy$ dd if=/dev/random of=~/.signature bs=120 count=1
    1. Re:That's not how traitor-tracing algorithms work by Anonymous Coward · · Score: 2, Insightful

      Well if you seriously want to get around this, you need two accounts. Take two documents diff them and remove and/or correct what you see.

    2. Re:That's not how traitor-tracing algorithms work by dargaud · · Score: 2

      Well if you seriously want to get around this, you need two accounts. Take two documents diff them and remove and/or correct what you see.

      That wouldn't necessarily work. Take a video and introduce a white dot on the lower right corner at 1:13 in one version, and a red dot on the upper right corner at 2:17 on the other. If you average (or scramble the differences) the two, you still end up with a smudge on the lower right corner at 1:13 and another smudge at 2:17, both traceable back to the original videos. You could make it a lot more resilient still by taking a random number, generating a turbo-code for it, and using that to change the file all along. Even if part of the turbo-code is modified, edited or clipped, you can still find the original random value.

      --
      Non-Linux Penguins ?
  6. Similar to something Amazon patented by dido · · Score: 3, Informative

    There was an article about it here a few years ago. A followup someone made to a comment I wrote to the article mentions some work being done by some guy from Purdue that sounds a lot like what's being done here. IBM also seems to be doing work on canary trap-based ideas.

    --
    Qu'on me donne six lignes écrites de la main du plus honnête homme, j'y trouverai de quoi le faire pendre.
  7. What does this actually prove? by XaXXon · · Score: 4, Insightful

    Is accidentally leaving a copy somewhere copyright infringement? How do they know the person they sold it to is the person who leaked it.

    Also, it's never been clear to me when copyright infringement actually occurs.

  8. Learn by Scutter · · Score: 4, Insightful

    Or, you know, maybe learn from the success of Apple iTunes and start selling eBooks for a reasonable cost and maybe they won't be pirated nearly as much. I know that the publishing process costs money that you deserve to recoup, and you deserve to make a profit, but it is offensive to charge as much as (or more) than a physical book for an eBook.

    --

    "Tell me doctor, with all of your defenses, are there any provisions for an attack by killer bees?"
    1. Re:Learn by Macgrrl · · Score: 5, Funny

      That's not how dyeing industries work.

      You negative attitude is colouring your response.

      --
      Sara
      Designer, Gamer, Macgrrl in an XP World
  9. Great trick to remove the watermark by Rosco+P.+Coltrane · · Score: 5, Funny

    - Scan/OCR book
    - Google translate into German
    - Google translate back into English
    - Print book

    Voila! No more watermark. You can share with confidence.

    --
    "A door is what a dog is perpetually on the wrong side of" - Ogden Nash
  10. as a software programmer... by slashmydots · · Score: 2

    Who says it's a hash? Just add one extra space somewhere in the book in an unusual place or replace an apostrophe with a similar character or something. Then if someone adds something else, you're still checking for that one single location of the alteration to prove it's them. It'd be awfully unlikely in a long book that you'd replicate the exact alteration that they made to someone else's book, thus appearing to be 2 different people.

  11. strip by Tom · · Score: 5, Insightful

    It depends. If it's done well, it can be fairly resistant to any noise introduced into the system.

    As an author myself, I see a very different issue with this. I don't want some robot changing my text. Some of those words it might decide to change because they are similar I may have pained over and decided for a reason to use this one and not the other one. Granted, few authors pick every single word intentionally, but the software won't know which ones are carefully selected.

    Often times, there is subtle meaning. For example, I might decide to always use the same phrase in certain contexts, giving a very subtle hint to the reader which things are alike and which ones are different. One he might not even notice consciously.

    It also will cause all sorts of trouble to quoting. How will teachers handle this if a student quotes a text but the quote differs slightly from the version the teacher has read? One of the most important things we teach students is that quotes need to be exactly as they appear, with any omissions or changes clearly marked.

    That also extends to quotes within the text. If character A reports what character B said, I doubt the system will have enough text understanding to change both texts the same way, so the reader will be left wondering if it is intentional that there's a slight difference and what the author wants to hint at, when there's no such thing implied.

    --
    Assorted stuff I do sometimes: Lemuria.org
  12. Amazon Kindle Books by Russ1642 · · Score: 2

    I'm going to stop sending every typo and punctuation mistake I catch to Amazon. I thought I was helping.

    1. Re:Amazon Kindle Books by kermidge · · Score: 2

      Funny you should bring this up. Read an ebook a few weeks back, a decent sci-fi novel (novella, morelike, given word count) and had run into enough typos that it had in some places seriously disrupted the reading flow. (Often the mind will correct or elide over the error while in hot pursuit of a scene, other times it's like tripping over a pebble on the path, and some few times it's more a full stop and restart.)

      I wrote the author, asking if he'd maybe like some free help to catch the stupidly simple stuff. He wrote back that he didn't see a problem. Which left me with a big case of WTF.

      Apparently I've been spoiled by several generations and more of real writers, real proofreaders and real editors. Real = take their jobs seriously and give a shit about what they do. Now we've a publication landscape populated by semi-literates who routinely get paid for being lazy. It's gotten bad enough that I recently wrote a blogger who had explained something in clear, well-written prose to thank her for a pleasant reading experience.

      What is discussed in the submission is at best lame.

  13. It's understandable by 93+Escort+Wagon · · Score: 3, Funny

    After all, we saw how quickly the iTunes Store withered and died after the DRM got removed from all that music. It'd be crazy for the publishers NOT to double down on DRM!

    --
    #DeleteChrome
  14. This idea is as new as my grandma by Stonefish · · Score: 3, Insightful

    There were printers in areas with classifed documents which automatically used to do this. They worked with whitespace, fonts and punctuation. Photocopies of the documents could still be tracked. Great work guys you deserve a badge.
    Amazon will be able to close the loop by automatically downloading the books that you have on your kindle to "check" that you don't infringe and stomp on those badguys.

  15. Re:Done already by Anarchduke · · Score: 4, Funny

    Yes, which is why they have successfully stamped out piracy, it is part of the sordid past of the Internet. Thank god we'll never see pirated e-books again.

    --
    who prays for Satan? Who in 18 centuries has had the humanity to pray for the 1 sinner that needed it most? ~Mark Twain
  16. Decades? Try centuries... by dbc · · Score: 4, Interesting

    Shortly after the moveable type press got going in Europe, books of tables of interest rates were popular among the merchants. Of course, they all had to be laboriously hand calculated by mathematicians (long division was college undergraduate math in those days...). Publishers would sprinkle errors into the least signficant digits on various entries to use as evidence in copyright cases. Because, you know, if you had a printing press, you could make good money by pirating somebody else's table of interest rates.

  17. Re:So, uh... what are they copyrighting then? by wvmarle · · Score: 2

    No issue there. Changing a few letters in Harry Potter doesn't make it your work, either. Under copyright, copies don't have to be exact (otherwise taping a song from radio would never have been an issue), it has to be very similar. Likewise a band playing covers of another band: they're different, some notes are wrong, rhythms are slightly off, yet it's still the same song.

    Furthermore it's fully legal to get inspiration from someone else's work - and use elements of copyrighted works in your own works. You just have to make sure it is obviously a different work.

  18. Re:So, uh... what are they copyrighting then? by UltraZelda64 · · Score: 2

    Ah, I see--that clears it up well. I still think the idea of altering the writer's words and punctuation in the name of piracy is going too far though.

  19. What about stolen phones? by Gumbercules!! · · Score: 2

    So if my phone gets stolen and my eBooks get leaked, I'm now double screwed?

  20. Too much work by Camael · · Score: 2

    Your solution is plausible, but it would be too much work and expense for the average ripper.

    The idea is not to have an unbreakable DRM scheme, which would be impossible to create anyway but to raise the cost and difficulty of breaking the scheme to dissuade the casual ripper.

    I'm not even sure that the average joe knows how to "use a statistical analysis to blank out the differences". I certainly don't.

    Plus the fact that it doesn't sound like the results they obtain from that exercise is applicable across the board to different books, meaning they need to repeat this process for every single DRMmed book, ad infinitum.

    1. Re:Too much work by ACE209 · · Score: 4, Insightful

      ..., but it would be too much work and expense for the average ripper.

      Until someone writes a program for it, so the average ripper only has to push a button.

      --
      "we are all atheists about most of the gods that societies have ever believed in. Some of us just go one god further."
    2. Re:Too much work by flyingfsck · · Score: 4, Funny

      why just strip out all the punctuation who needs commas full stops and capital letters anyway everything is still perfectly readable

      --
      Excuse me, but please get off my Pennisetum Clandestinum, eh!
  21. Violating copyright in order to enforce it by deoxyribonucleose · · Score: 2

    Any publishers using this technique had better have iron-clad contracts with their authors permitting arbitrary alterations to their works. Otherwise, they are in clear violation of the authors' moral rights to protection against distortion and mutilation of their original work.

    It's eerily reiminscent of the 'We had to incinerate the village in order to protect it' military communique.

    Anybody know if standard boilerplate agrements from the major publishers actually sign away the authors' moral rights against deliberate mutilations (as opposed to inadvertent proofing errors)?

  22. Diff the copies and merge by ciantic · · Score: 2

    The trivial counter measure is to get multiple (two might be sufficient) copies with different markings, then run diff on the content and merge (perhaps manually). Of course it gets tricky if the content is closed binary format, but it's still doable.

  23. Better solution and better hackaround by sberge · · Score: 2

    The better solution is to have the author (or translator in case of translated literature) provide multiple versions of a few sentences in the book. And the work-around is to upload only a fraction, randomly sprinkled through the book, to the sharing site which then assembles the pieces from multiple copies, garbling the watermark.

  24. This sounds trivial to bypass. by haitch · · Score: 2

    If you can find 3 independent sources (shouldn't be hard for something popular), then all that should be required is a 3-way diff and use whatever is common with any 2 or more. If all 3 are different at the the same place then use some manual intervention and make your result different again or add another source. The final product cannot then be traced to a single source. Am I missing something?

  25. Re:How does this stop me from sharing from a USB? by deoxyribonucleose · · Score: 2

    One word: PRISM.

    Perhaps I'm scaremongering, but are you willing to bet against mission creep from using such intelligence assets against so-called terrorism via kiddie porn to copyright infringement? Given how US election campaigns are being financed?

  26. Re:It seems weak to the "return to the average" by DrXym · · Score: 2
    The issue here (aside from the differences being more subtle) is how does this master bookz distributor obtain 5 copies of the same book without them being in the wild in the first place? Does he solicit people to send him books or upload them somewhere? Remember if the books are in the wild you are screwed.

    So you have to upload your book to somewhere secret where you trust and hope Mr Bookz will will strip out your id. And if your uploaded book does leak into the wild (because Mr Bookz is an asshole or incompetent about stripping the id), you've just incriminated yourself for no reason. If there is a book in the wild already why risk uploading another copy at all? Why even buy a copy in the first place if you are uploading books and therefore not especially concerned about the ethics of piracy?

    Of course I suppose 1000 people could crowd compile a book, each submitting a page each to produce a frankenbook from the pieces but it would still have to be canonicalized in case the markup, contents, style rule names embedding the id somehow. Perhaps the frankenbook would hash each canonicalized page and the pages that have the same hash are used when the book is stitched together.

    But for all the effort maybe it's easier to scan the paper book in the first place, or hook up a cracked Kindle / Nook / tablet to a flat bed scanner or a screen capture device and make extensive use of analogue hole to strip out most of the watermark.

    In summary, it would be a hard problem to crack.

  27. And what happens... by Smerta · · Score: 2
    What happens when a legitimate purchaser/owner has the file stolen/copied from his computer? Viruses, friends using his computer, old discarded & unwiped hard drive....

    Now the copyright mafia comes banging on his door claiming he uploaded/pirated the book? WTF???

    Just like taking an IP address and suing the user/owner of that IP for uploading music/movies, this tactic has no teeth. Unless someone has corroborating evidence, there's no proof that *I* am the source of the uploaded file. Only that it is the file that I originally purchased.

    The whole copyright system, and behaviors of content owners, has gotten completely out of control...