Slashdot Mirror


Altering Text In eBooks To Track Pirates

wwphx writes "According to Wired, 'German researchers have created a new DRM feature that changes the text and punctuation of an e-book ever so slightly. Called SiDiM, which Google translates to 'secure documents by individual marking,' the changes are unique to each e-book sold. These alterations serve as a digital watermark that can be used to track books that have had any other DRM layers stripped out of them before being shared online. The researchers are hoping the new DRM feature will curb digital piracy by simply making consumers paranoid that they'll be caught if they share an e-book illicitly.' I seem to recall reading about this in Tom Clancy's Patriot Games, when Jack Ryan used this technique to identify someone who was leaking secret documents. It would be so very difficult for someone to write a little program that, when stripping the DRM, randomized a couple of pieces of punctuation to break the hash that the vendor is storing along with the sales record of the individual book."

311 of 467 comments (clear)

  1. So... by Impy+the+Impiuos+Imp · · Score: 5, Informative

    Normal book publishers have been doing this for decades, inserting the occasional misspelling here or there. Later, they inserted correct spellings, but of the wrong word, to get around auto-correction in scanner software.

    So...no, they can't patent it.

    --
    (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
    1. Re:So... by Shompol · · Score: 5, Insightful
      Yes but this is different because

      ... on a computer

      So yes, they can (and will)

    2. Re:So... by Z00L00K · · Score: 5, Insightful

      And if the publisher do change texts in different e-books anyone that wants to get around it would just need a few copies and use a statistical analysis to blank out the differences.

      This is similar to what steganography does, so if you mess up the punctuation inserted then it will be really hard to look up the perpetrator - or even that the wrong party will be pointed out.

      So now the Pandora's box is opened.

      --
      If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
    3. Re:So... by Tom · · Score: 2

      So that's why I come across obvious errors in books where I thought that if it stands out like a sore thumb at a non-native speaker, why the fuck did the proof-readers miss it?

      --
      Assorted stuff I do sometimes: Lemuria.org
    4. Re:So... by wvmarle · · Score: 1

      Please let them patent it! Then no-one else will use anything like it for fear of patent infringement.

    5. Re: So... by kthreadd · · Score: 2

      And so far this has prevented what?

    6. Re:So... by mbone · · Score: 2

      Normal book publishers have been doing this for decades, inserting the occasional misspelling here or there. Later, they inserted correct spellings, but of the wrong word, to get around auto-correction in scanner software.

      So...no, they can't patent it.

      I think that map makers have been doing this for a century or more.

      "Who's Who" and the like do it as well, inserting fictitious people. This is also because true maps and lists may not be copyrightable, while fictitious ones certainly are.

    7. Re:So... by Anonymous Coward · · Score: 1

      They would still have the markers which are common to all versions used for the comparison. Suppose there are only 1000 marker bits and you compare three versions. Then each bit has a one in four chance of being the same in all three versions (000, 001, 010, 011, 100, 101, 110, 111), so you'd expect to get about 250 undetected marker bits, which is plenty to find the exact three copies which were used as the source for the "clean" copy.

    8. Re:So... by Will.Woodhull · · Score: 4, Insightful

      There would be no need to reverse engineer a pristine copy of the work. Simply proofreading a single copy and correcting some of the existing errors, while at the same time, introducing a few new errors of the same type would be enough to confound any attempt to make a positive identification of the source.

      This approach has an incredibly high bogosity factor. I can't imagine anyone in the publishing industry with half a brain who would spend any money on its implementation... Oh wait. We are talking about the partially brain dead idjits who thought DRM was the best thing since sliced bread....

      If I was going to do this, I would probably also play with the kerning to force some repagination, add some space characters before the newline at the end of some paragraphs, and so on. This approach to DRM is about as simple to get around as using a black magic marker on the edge of an "uncopyable" CD disk.

      --
      Will
    9. Re:So... by Opportunist · · Score: 2

      And some newspapers now insert correctly spelled words into their otherwise barely legible text to ... uh... why again do THEY do it?

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    10. Re: So... by Opportunist · · Score: 3, Insightful

      C'mon, where have you been hiding? Adding "on a computer", or, more recently, "on the internet" makes everything patentable again.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    11. Re:So... by Opportunist · · Score: 2

      DRM is all about artificially lowering the value of your product (to the user) in an attempt to make it more valuable. You think anyone in this bizarro world is using a brain?

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    12. Re:So... by Opportunist · · Score: 5, Funny

      Uh... yes. When you find misspelled words in my messages here, it's just my new DRM. It's just that. It's not that I'm too dumb to use a spellchecker.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    13. Re:So... by Will.Woodhull · · Score: 1

      I never thought about how errors needed to be inserted into maps so they could be copyrighted. Wow.

      I wonder if this is the reason why Google maps of Portland Oregon sometimes label the Banfield Expressway as "Soldiers Field Drive", which is in Boston Massachusetts. The errors seem to come and go, and seem to be limited to road names that are also identified by route numbers.

      --
      Will
    14. Re:So... by Idaho · · Score: 3, Insightful

      In fact, this is one more reason for good authors to avoid traditional publishers. I can think of quite a few authors who would have a thing or two to say about algorithms like these being used to modify their work.

      Just like in the music industry, big publishers are simply not necessary anymore. Editors most certainly are, but publishers?

      --
      Every expression is true, for a given value of 'true'
    15. Re:So... by Anonymous Coward · · Score: 1

      Apple argued that an icon with a green phone proved that Samsung was copying, even thought it had been used on physical buttons on handsets for years. The courts ruled in favour of Apple - that this was an example of "blatant" copying, so anything is possible.

      Never assume that lawyers interested in self-licking icecreams won't allow this tiny variation of changing styles/etc to be patented.

    16. Re:So... by gnasher719 · · Score: 3, Funny

      I think that map makers have been doing this for a century or more.

      I remember a pub guide with 1,200 pub reviews including three fake ones, and a newspaper copied (and slightly rearranged the words) of ten of their reviews and managed to copy one of the fake ones. Good fun.

    17. Re:So... by farrellj · · Score: 1

      Of course, ebooks being just text, (HTML zipped for the epub format, for example), I would not be surprised if a good spelling and grammar check software wouldn't render this attempt at DRM useless...or will they try to outlaw spelling/grammar check software as DRM circumvention tools?

      --
      CAN-CON 2019 - Ottawa's only book oriented Science Fiction Convention! October 18-20, Sheraton Hotel, Ottawa, Canada h
    18. Re:So... by Arrepiadd · · Score: 5, Informative

      There would be no need to reverse engineer a pristine copy of the work. Simply proofreading a single copy and correcting some of the existing errors, while at the same time, introducing a few new errors of the same type

      I didn't read the article because I had seen it earlier in another news source, so I don't if this is mentioned in the one mentioned here, but proofreading may not do it in this case. The source I read mentioned two specific types of change that do not introduce any typos (I'm choosing the exampled myself):
      - One of them was reordering of nouns when the order does not matter, e.g. "Peter and John went for lunch" vs "John and Peter went for lunch";
      - The other was playing with negatives: e.g. "something is unclear" vs "something is not clear"

      Since there are no actual typos, it's hard to spot the identifying bits. You'd have to change the text substantially, in order to have a good chance of being free from discovery. Adding your own typos may not serve any purpose, since the company selling can focus just on the changes they made, not looking for other changes introduced after.

      Of course, if there is a concerted effort to release documents, all pirates would need to do would be buying a few copies and diffing the documents. You may not get the original back, but if the changes are randomly put in a specific set of words, you certainly can end up with something close to the original than any of the sold copies and still free from pirate identification.

    19. Re:So... by pmontra · · Score: 2

      I'm not a proponent of DRM and I didn't buy any DRMized item because I want to be sure to able to use what I bought on any future device I'll happen to use. That said, this DRM doesn't seem to lower the value of the text much. It's probably just watermarking, which I'm fine with because I'm not interested in buying something and pirating it.

      Nevertheless I see potential problems with this technology. Files are files, got backup, move to physical media that get lost and sometimes end up in somebody's else hands. Even if I (and you, reader) can say "it will never happen to me", it will happen to somebody, probably many people along the years. Maybe even me (and you) and the guy who got his files distributed could be in more troubles than he deserves for losing a USB key. So watermarking is better than DRM, but no watermarking is even better.

      Finally, I believe that reconstructing the non watermarked original file won't be as trivial as many other slashdotters think. It looks like one of those bioinformatics problems with reconstructing a DNA sequence from a bunch of different versions of it (don't assume the texts will be aligned). Plus, there is a noise introduced on purpose and if an arms race starts they'll do their best to make it difficult to remove without having really many different versions of the file.

    20. Re:So... by Intrepid+imaginaut · · Score: 2

      What makes you think pirates won't rip off independent authors? They aren't on a 'reform the publishers' kick, they're on a 'get free stuff' kick. This tactic is to allow publishers or authors or whoever to track back pirated copies to whoever first shared them out. All they need it to automate the system so each book sold has a unique and all but invisible 'watermark', a comma in an odd place, whatever.

      And you know what maybe they have a point; arguments can be made that musicians can earn from live performances, and that movies can earn at the box office. Where do authors make money if their works are pirated? Live readings and ebook signings? Please. This is a pretty low form of piracy.

    21. Re:So... by mapkinase · · Score: 1

      In other words, publishers peeing against the wind.

      --
      I do not believe in karma. "Funny"=-6. Do good and forbid evil. Yours, Oft-Offtopic Flamebaiting Troll.
    22. Re:So... by Z00L00K · · Score: 2

      The point I was making was not to state to create an unaltered copy of the original but get enough data on the variation of the copies to be able to mess up the watermark enough to render it useless. Random pick of formatting/wording in deviating sections from one of the N copies obtained at each case. The result may be that you have variants A, B and C as source and your scrambling causes it to look like variant K, so the buyer of variant K will be blamed until they figure out that they are chasing in the wrong direction.

      --
      If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
    23. Re:So... by RabidReindeer · · Score: 1

      There would be no need to reverse engineer a pristine copy of the work. Simply proofreading a single copy and correcting some of the existing errors, while at the same time, introducing a few new errors of the same type would be enough to confound any attempt to make a positive identification of the source.

      This approach has an incredibly high bogosity factor. I can't imagine anyone in the publishing industry with half a brain who would spend any money on its implementation... Oh wait. We are talking about the partially brain dead idjits who thought DRM was the best thing since sliced bread....

      If I was going to do this, I would probably also play with the kerning to force some repagination, add some space characters before the newline at the end of some paragraphs, and so on. This approach to DRM is about as simple to get around as using a black magic marker on the edge of an "uncopyable" CD disk.

      We are talking about the brain dead idjits who want ebooks to "wear out" after being read 25 times.

      Actually, I was looking forward to having a one-of-a-kind copy of a book. Uniqueness makes them collectable, right?

    24. Re:So... by RabidReindeer · · Score: 1

      There would be no need to reverse engineer a pristine copy of the work. Simply proofreading a single copy and correcting some of the existing errors, while at the same time, introducing a few new errors of the same type

      I didn't read the article because I had seen it earlier in another news source, so I don't if this is mentioned in the one mentioned here, but proofreading may not do it in this case. The source I read mentioned two specific types of change that do not introduce any typos (I'm choosing the exampled myself):

      - One of them was reordering of nouns when the order does not matter, e.g. "Peter and John went for lunch" vs "John and Peter went for lunch";

      - The other was playing with negatives: e.g. "something is unclear" vs "something is not clear"

      Since there are no actual typos, it's hard to spot the identifying bits. You'd have to change the text substantially, in order to have a good chance of being free from discovery. Adding your own typos may not serve any purpose, since the company selling can focus just on the changes they made, not looking for other changes introduced after.

      Of course, if there is a concerted effort to release documents, all pirates would need to do would be buying a few copies and diffing the documents. You may not get the original back, but if the changes are randomly put in a specific set of words, you certainly can end up with something close to the original than any of the sold copies and still free from pirate identification.

      The ebooks I buy are encrypted using credentials that are unique to my account with the vendor. If I was to implement a watermarking scheme, I'd likely use these same credentials to power the watermarking process. In theory, that would make a pirated copy more traceable, and make gimmicked copies more obviously evidence of a deliberate attempt at piracy.

      There's also the fact that the current generation of at least some readers hide the book files in an area that is apparently physically inaccessible even to people who root the devices, so "accidental" leakage cannot be claimed. Although, for the record, next reader I but WILL have the books out in the open or I won't buy it. Not because I'm intending to pirate, but because I don't want my library evaporating if the bookstore does a "Borders" and turns off its support servers. Or because I changed booksellers.

      Or - and this is perhaps the most immediate concern - that I offend the Gods of Amazon in some way, such as letting my credit card expire on a non-book purchase and they shut my library down along with ever other service they offer. Which allegedly has been done to someone recently.

    25. Re:So... by RabidReindeer · · Score: 1

      So that's why I come across obvious errors in books where I thought that if it stands out like a sore thumb at a non-native speaker, why the fuck did the proof-readers miss it?

      Actually, one or two books I've bought have either had major-disruptive goofs (pages out of place, repeated paragraphs, etc.) in them or actual "black holes" where the reader software locks up.

      So far I've resisted the temptation to decrypt and repair them, but...

    26. Re:So... by Tom · · Score: 1

      Where do authors make money

      With or without "piracy", that is a real question, because the vast majority of authors don't.

      I'm not sure about the exact numbers as it's been a while since I read the articles on this, but roughly 90% of authors don't have an income worth mentioning from their books or articles. Of the remaining 10%, less than 1% makes their living entirely on their writing. The vast majority of authors have a "real" job in addition to writing.

      --
      Assorted stuff I do sometimes: Lemuria.org
    27. Re: So... by easyTree · · Score: 1

      Why not? too specific? :P

    28. Re:So... by dintech · · Score: 1

      "Peter and John went for lunch" vs "John and Peter went for lunch"

      If we had two copies of the book, would diff be enough to muddy the waters of who's copy was the naughty one?

    29. Re:So... by Vanderhoth · · Score: 2

      Neil Gaiman would disagree with you.

    30. Re:So... by mic0e · · Score: 1

      I still refuse to believe that piracy makes as much as a dent in sales.

    31. Re:So... by SuricouRaven · · Score: 1

      Alternatively, you could write a piece of malware that snatches the books from all those it infects and sends back to your piracy collective. You get the books, someone else gets the blame, at least until the publisher realises what is going on.

    32. Re:So... by SuricouRaven · · Score: 1

      So much like music, a tiny fraction of superstars gain fame and fortune while the vast majority toil away in relative obscurity.

    33. Re:So... by SuricouRaven · · Score: 1

      I doubt they'd go for something so obvious. I've only heard of one specific case (on a documentary TV program), and their example as a phantom cul-de-sac on the map that didn't actually exist. Cul-de-sacs are ideal, as no-one is going to plot a route that passes through one.

    34. Re: So... by geminidomino · · Score: 1

      No, because someone already has it. Duh. ;)

    35. Re: So... by domatic · · Score: 1

      And you can also add "on a mobile device" and yet again repatent everything.

    36. Re:So... by Vanderhoth · · Score: 1

      Yes, I agree.

      In terms of getting exposure, I like Neil's stance on the issue. I read Sandman because a friend of mine pirated it and gave me a copy. I read the first three volumes. Then one day when I was in a comic store with some friends I saw the series and bought the whole thing because I had already read some of it, and liked what I had read. I would have never bought any of them and never heard of Neil Gaiman if it hadn't been because a friend of mine pirated his books. Since then I've bought a few other series written and co-written by Neil.

    37. Re:So... by Jason+Levine · · Score: 2

      That's what I was thinking. So would-be pirates get together, buy a couple copies of the book, compare them for differences, and post a version of the book that combines both alterations in a random method thus ruining the tracking.

      Of course, DRM like this isn't really meant to prevent copying as much as it is meant to form a speed bump to deter the more casual would-be-pirates. (Let's be honest, no DRM stops the more serious pirates.) If this was the only DRM that was deployed, I'd actually support it versus methods like encrypting eBooks in non-open formats that can only be read by certain devices/applications and that can be remotely disabled.

      --
      My sci-fi novel, Ghost Thief, is now available from Amazon.com.
    38. Re:So... by MrMickS · · Score: 2

      DRM is all about artificially lowering the value of your product (to the user) in an attempt to make it more valuable. You think anyone in this bizarro world is using a brain?

      That's an odd way to view DRM. DRM is about the publishers attempting to associate a cost with the duplication of a work. The cost of creation a copy of a digital work is negligible. The cost of creation of a copy of a physical work isn't, both in materials and time taken to create the duplicate.

      Of course the key really is that there are people that believe that one they have bought something then they have a right to distribute that to others. This is ok in the case of a physical work. You pass it on and you don't have it anymore. With a digital work it doesn't work like that. The publishers, and authors/artists, fear that widespread digital duplication will deprive them of income. Hence they strive for a way to ensure that they don't lose out.

      Sadly its impossible to have a reasonable approach or discussion around this because of the extremist end of the debate that believes that and form of trying to prevent distribution of unlicensed copies is evil. If you want to prevent DRM come up with a scheme that addresses everyone's needs rather than making glib throwaway pronouncements.

      --
      You may think me a tired, old, cynic. I'd have to disagree about the tired bit.
    39. Re: So... by Threni · · Score: 5, Funny

      And then "on a mobile device but with slightly rounded corners".

    40. Re:So... by cdrudge · · Score: 1

      Soldiers Field Road is in Boston, not a drive. And it doesn't look like it's that case any more.

      Google had some trouble with identification of highway names along US-30 which Banfield Expressway was (still is?). For the longest time most of US-30 was also named Quebec Route 366.

    41. Re:So... by gnasher719 · · Score: 2

      That's what I was thinking. So would-be pirates get together, buy a couple copies of the book, compare them for differences, and post a version of the book that combines both alterations in a random method thus ruining the tracking.

      The "Nirvana fallacy" is the assumption that something needs to be perfect to be usable. If I have an ebook, and you want the same ebook, I can just copy it and give the copy to you. I can send it to you as an email attachment. Nothing could be simpler. 80% of all customers would know how to do this. And that is stopped if I know that by giving you a copy I put the book out of my control, you could be some idiot who puts the stuff on a website and then it's my problem, so I'm not giving you a copy.

      Now the effort involved in giving you a copy is so much higher; "buy a couple of books" is nonsense because why would a buy a couple of books to give you a copy, instead of you buying a book? And it takes substantial effort to make that copy, so it's not going to happen.

      And finally, _if_ the publisher finds a copy with watermark removed, then I would think the copier has gone straight into criminal territory, so while the risk of getting caught is lower, the possible damage to you is much higher.

    42. Re:So... by omnichad · · Score: 1

      Maybe those are meaningless changes in an academic text. But if I were an author of fiction, I'd self-publish rather than have the carefully chosen cadence of my words altered.

    43. Re:So... by home-electro.com · · Score: 1

      More importantly, what exactly does this prove? So my copy was ripped and shared, now what? In order to convict and fine me, one would need to prove that *I shared* the copy, not that my copy merely got shared. Good luck with that, a lot of things could've happen - my computer hacked, my phone lost, etc.

      On another note, how is that happened that punctuation is not important anymore?

    44. Re:So... by home-electro.com · · Score: 1

      So they traced, now what? It is not enough to penalize anybody.

    45. Re:So... by ceoyoyo · · Score: 1

      If I buy a book, crack the DRM, and send it to you, no publisher is going to have any idea I did it, watermark or not.

      If I'm even a moderately serious pirate, I'm not going to balk at buying (or acquiring) an extra copy to do a diff with.

      The only situation this addresses is the casual user who breaks the DRM and then uploads his copy for widespread distribution. That scenario seems to be very rare. If it wasn't, there'd be lots of copies of everything already available anyway.

    46. Re:So... by CohibaVancouver · · Score: 1

      However if it's noticable and annoying enough like a low quality OCR scan, a lot of readers will turn to other copies of the work that are cleaned up.

      I highly doubt it will be garbled text, or even noticeable.

      I highly doubt it will be garbled text - Or even noticeable.

      Frankly, I think it's a good idea. Ensures people will be less likely to share their books, without them being burdened with DRM.

    47. Re:So... by K.+S.+Kyosuke · · Score: 1

      The "Nirvana fallacy" is the assumption that something needs to be perfect to be usable.

      In the realm of DRM and encryption, that's basically how things work. Once you break it, the only ones being punished are the paying users of the "official" edition.

      --
      Ezekiel 23:20
    48. Re:So... by bzipitidoo · · Score: 1

      You could have sold your copy to a used bookstore. Could have ripped a copy you checked out from the library. Could have traded or given it to a friend.

      DRM proponents have suggestions for those problems. The used bookstore and library could be forced to collect names and track every purchase or loan. Not too hard with most credit cards, but what if customers use cash or an anonymous kind of card? Customers could give false information. But of course their preferred solution is to outlaw the used bookstore entirely, and kill the public library with budget cuts. As to trading with friends, their solution seems to be to tie copies to devices so you can't. But you could still trade devices. To deal with that, they'd like to tie the device to the person, setting it up so it will only work after something like a retina scan matching the original purchaser. Would have to check quite frequently. This is of course marketed as enhancing the customer's security, because, assuming it can't be jail broken, thieves would not be able to use the device either.

      It's all fantasy. People will not accept such patently absurd artificial limitations. Even if all that was accepted, it still wouldn't work because it takes only one crack to break out. Someone could use the 'analog hole", and simply type what they read right back into a general purpose computer, if necessary making alterations to throw off any watermarks and any Windows Vista style scanning for copyrighted material.

      --
      Intellectual Property is a monopolistic, selfish, and defective concept. It is "tyranny over the mind of man"
    49. Re:So... by Will.Woodhull · · Score: 1

      Thanks for clarifying. I was misremembering the Quebec Route 366 / Banfield Expressway goof and confusing it with another goof (not Google) of a photo caption that had put an approach to the Fremont Bridge (in Portland) on Soldiers Field Road (in Boston). Silly me.

      --
      Will
    50. Re:So... by Culture20 · · Score: 1

      Imagine a group of people (lets say 30, although it could be in the hundreds) who are all required to buy an expensive ebook for a college course. Even if just a study group of ten people got together from a large lecture, they could find vast savings by buying three ebooks and making a "pristine" copy then sharing that within the group. If there is a "toss of the coin" phrase (X&Y vs Y&X), then they could remove or completely rewrite those sections. If they're collectively saving thousands of dollars, it seems worth the effort, legality aside.

    51. Re:So... by JMJimmy · · Score: 1

      Or just take one copy, add a few other punctuation errors (or corrections) and presto - untrackable copy!

    52. Re:So... by wiredlogic · · Score: 1

      It would not be that simple. If there were 100 changes to a text to which you removed 50 and added 50 of your own there would still be 50 alterations remaining that could identify the original source. The only way to combat this is to diff multiple versions and untangle all the changes so as to remove any remaining ID information.

      --
      I am becoming gerund, destroyer of verbs.
    53. Re:So... by ganjadude · · Score: 1

      why add new errors? if you can correct the entire book, then you just release a clean version knowing they dont have a clean version out there.

      --
      have you seen my sig? there are many others like it but none that are the same
    54. Re:So... by ArcadeX · · Score: 1

      Instead of going through all that trouble, buy a pre-paid credit card with cash, create a fake email account, then buy all the books you want and release them on the interwebs. Your serious pirates were already doing this before, not much has changed...

      --
      An I.T. motto in the hands of an idiot is a dangerous thing...
    55. Re:So... by TemporalBeing · · Score: 1

      The point I was making was not to state to create an unaltered copy of the original but get enough data on the variation of the copies to be able to mess up the watermark enough to render it useless. Random pick of formatting/wording in deviating sections from one of the N copies obtained at each case. The result may be that you have variants A, B and C as source and your scrambling causes it to look like variant K, so the buyer of variant K will be blamed until they figure out that they are chasing in the wrong direction.

      Or by doing diffs on the text you might find the variations and be able to eliminate them, producing the original unvariated work and thus no watermark at all.

      --
      Truth is like the sun. You can shut it out for a time, but it ain't goin' away. - Elvis Presley (source: imdb.com)
    56. Re:So... by nabsltd · · Score: 2

      That's what I was thinking. So would-be pirates get together, buy a couple copies of the book, compare them for differences, and post a version of the book that combines both alterations in a random method thus ruining the tracking.

      The way this system might work is to have 32 possible differences per book, where each difference can be thought of as a single bit (e.g., "is not" in one version and "isn't" in another). The combination of differences would allow 4 billion different versions, more than enough for even the most insane best seller.

      So, if you only have 3 versions, you might see as few as two bits of information. You could get lucky and see a lot more, but even so, the publisher could likely localize the leak to a very small number of buyers.

    57. Re:So... by Marxist+Hacker+42 · · Score: 1

      Or, as GK Chesterton put it in _All Things Considered_:
       
       

      When I was a very young journalist I used to be irritated at a peculiar habit of printers, a habit which most persons of a tendency similar to mine have probably noticed also. It goes along with the fixed belief of printers that to be a Rationalist is the same thing as to be a Nationalist. I mean the printer's tendency to turn the word "cosmic" into the word "comic." It annoyed me at the time. But since then I have come to the conclusion that the printers were right. The democracy is always right. Whatever is cosmic is comic.

      --
      SJW: a person who perceives an injustice, and while correcting it, commits a greater injustice.
    58. Re:So... by danomac · · Score: 1

      Even if I (and you, reader) can say "it will never happen to me", it will happen to somebody, probably many people along the years.

      Yep, all it takes is dropping a computer off for service and the tech "helps" himself to your media. It's happened before...

    59. Re:So... by X0563511 · · Score: 1

      Why would you want to introduce new errors? Wouldn't that just give your corrected pirated copy it's own unique fingerprint?

      --
      For large sets, this will be our guide even unto death, for the LORD will work for each type of data it is applied to...
    60. Re:So... by suutar · · Score: 1

      They're not trying to attack serious pirates with this. They're trying to scare casual copiers, who they believe (correctly or not) generate orders of magnitude more "lost sales" (defined as 'person reads book without having purchased it') overall, and who are more likely to garner public sympathy (since they're typically not doing it for monetary profit).

    61. Re:So... by suutar · · Score: 1

      sideband attack wins again!

    62. Re:So... by kwaw · · Score: 1

      It's funny to know that such watermarking is available on the market as a service for years, including Germany (shown on Frankfurt book fair).
      For example from http://elib.pl/.

    63. Re:So... by Opportunist · · Score: 1

      Any action that lowers the accessibility, the usability, the reusability, the convenience of use or its resale value inherently reduces the value of the product for its consumer.

      It may not matter to you, but it is a reduction of objective value. If I cannot resell the product to recover some of the investment, I lose the associated cash value. If I cannot resell an object I bought for 50 for, say, 20, the product instantly became 20 more expensive since I cannot recover this by parting with the product. If I have to jump through some DRM hoops to use a product, the convenience of use is lower than with one where I do not have to jump them. Whether and how much this "costs" you depends on whether your time has a cash value. Mine does. I could either spend an hour struggling with your DRM or I could go and work overtime, the earning opportunities of which outmatches the value of your CD by some margin and then some.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    64. Re:So... by Opportunist · · Score: 1

      What's odd about it? If I cannot create a backup copy for when the copy I have breaks, I have to spend again to replace it. Either time&money to send for a replacement (hoping that I can get one in the first place, that is) or time&bandwidth (which may be time&money again, depending on your online plan). Unless of course I have to rebuy the content because I do not get a replacement at all.

      If I cannot resell my copy, I am directly losing money, i.e. the resale value of the object in question.

      And in this case, I may well have to take care of a huge liability, because when my ebook reader gets stolen I somehow have to prove that it wasn't me that distributed the content. Hope that this liability doesn't turn into a costly lawsuit.

      Still want to convince me DRM doesn't lower the value of the product to me?

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    65. Re:So... by fuzznutz · · Score: 1

      I can speak for fact that the "official" city map of my hometown has a street that doesn't exist on it. When I fist saw it as a kid, I wondered if it was a street that was planned and never built. Of course, now I recognize it as DRM as there is no room for a street to ever be there. It was fun to order a pizza now and then and have it delivered there.

    66. Re:So... by Will.Woodhull · · Score: 1

      Why would you want to introduce new errors? Wouldn't that just give your corrected pirated copy it's own unique fingerprint?

      Precisely right! Copies with unique fingerprints that do not match any of the publisher's records! Should help drive them further 'round the bend.

      On a more pragmatic note, it is likely that the proofreading pirate would miss some of the errors of the publisher's fingerprint, perhaps enough that the publisher's sniffer would still be able to identify who bought the copy that was copied. By introducing some more errors, the pirate introduces false leads that end up going nowhere. The sniffer will have a harder time identifying which copy was copied.

      End result: rather than a prosecutor being able to say "the fingerprint is conclusive evidence that the copy bought by J. Doe is the one the pirate used", all he would be able to say is "there is a 70% match between the pirated version and the copy that was sold to J. Doe", whereupon the Court would have to decide how much of an imperfect matchup was sufficient evidence?

      --
      Will
    67. Re:So... by Harik · · Score: 1

      And finally, _if_ the publisher finds a copy with watermark removed, then I would think the copier has gone straight into criminal territory, so while the risk of getting caught is lower, the possible damage to you is much higher.

      Right, it's finding the watermark removed that's the big red flag, not that they found it on a filesharing service. Do people think about what they type before prognosticating on /.?

    68. Re:So... by Harik · · Score: 1

      They could rewrite the entire book, keeping only some of the sections with deliberate watermark errors, and it'd still be tracked down to them.

      You miss out on the fact that they're not looking for errors - they're looking for specific errors in specific places. Think back to old detective novels with a piece of cardboard with little squares cut in it. Put it over the right page of what looks like a love letter and "we bust out of the back exercize yard at midnight" pops out.

      With sufficient redundancy in their data (Come on, people, QR codes, PAR2? ECC? How does a group of computer people not instantly comprehend the idea of redundancy?) you couldn't be sure that random selection of bit flips would be enough to obscure your trail.

    69. Re:So... by Opportunist · · Score: 1

      You know all hope is lost when you ask a user to check his grammar and he informs you that she died 5 years ago.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    70. Re:So... by niftymitch · · Score: 1

      Yes but this is different because

      ... on a computer

      So yes, they can (and will)

      Well this is stenography where the hidden message is a DRM ID/Code.
      Well this is stenography where the hidden message is a DRM ID/Code.
      Well this is stenography where the hidden message is a DRM ID/Code.
      Well this is stenography where the hidden message is a DRM ID/Code.
      Well this is stenography where the hidden message is a DRM ID/Code.
      Well this is stenography where the hidden message is a DRM ID/Code.

      This seems to be a crazy OMG nightmare for the DHS and friends.
      The risk for honest law enforcement is that all sorts of content (both public and private) can now serve for key distribution or message distribution... and if the DRM wankers do a lot of small space insertion and random application of kerning etc then ANY document becomes suspect.

      --
      Truth is stranger than fiction, but it is because Fiction is obliged to stick to possibilities; Truth isn't. Mark Twain.
    71. Re:So... by Meski · · Score: 1

      http://theory.stanford.edu/~aiken/publications/papers/sigmod03.pdf seems a similar type of thing, and I'm not sure that's original either. (2003)

    72. Re: So... by Meski · · Score: 1

      I take your 'slightly rounded corners' and raise you a 'series of straight lines'

  2. Defeated in one... by NFN_NLN · · Score: 4, Funny

    1. Sign up to service with alias
    2. Use untraceable account (prepaid credit card, bitcoin, points card)
    3. Share files with "watermarks"
    4. Don't give a shit that it gets traced back to a throw away account

    They could have saved a significant amount of effort if they had asked me first...

    1. Re:Defeated in one... by noh8rz10 · · Score: 1

      i think that's fine, but you're forgetting the second part of this - the person who downloads is at risk of being tracked too. similarly, if you download then upload again, then it's more exposure. lots of it is FUD, but imho very convincing fud.

    2. Re:Defeated in one... by Anonymous Coward · · Score: 2, Insightful

      Acquire multiple copies, run through diff, select most common and correct version each difference, then randomly permute other punctuation in non noticeable ways...

    3. Re:Defeated in one... by Anonymous Coward · · Score: 1

      Tor

    4. Re:Defeated in one... by brit74 · · Score: 1

      Yeah, if your goal from the very start is to buy a book so that you could put it online for other pirates. Most people aren't putting that much forethought into their crimes. And once you bought a book (with your own credit card), and then decide afterwards that you want to put it out there for pirates, suddenly, you realize that it's not such a good idea.

    5. Re:Defeated in one... by brit74 · · Score: 1

      Yeah, because a pirate wants to buy multiple copies of the same book now.

    6. Re:Defeated in one... by Duhavid · · Score: 2

      Or normalize all capitalization, punctuation, spelling and grammar.

      --
      emt 377 emt 4
    7. Re:Defeated in one... by AK+Marc · · Score: 1

      Some pirates do do that. But many end up shared from someone who was more careless than complicit.

    8. Re:Defeated in one... by Anonymous Coward · · Score: 1

      My pirated copy of Finegan's Wake would be rather odd.

      Normalizing "...all capitalization, punctuation, spelling and grammar." wouldn't really work for a majority (probably) of the serious fiction that is published. Authors like to play with the language.

    9. Re:Defeated in one... by richlv · · Score: 1

      ah, but in germany you have to provide passport or similar id just to get a simcard, thus they can trace you by the connection. just in germany, you say ? surely they will push for this to cover whole eu and then more...
      and germans still had the guts to lecture usa on the internet freedom, anonymity and privacy.

      on the other hand, i don't see how this prevents something like getting a usb stick with the book stolen. usb stick might even be re-found later.

      --
      Rich
    10. Re:Defeated in one... by SuricouRaven · · Score: 3, Insightful

      It'd be easy to make minor alterations to the text itsself. Perhaps a character can be described as dark-haired and wearing a red shirt in one version, but wearing a red shirt and dark-haired in another. Find 32 such places and you can identify four billion unique versions.

    11. Re:Defeated in one... by complete+loony · · Score: 2

      Then they'll implemented a polymorphic sentence generator. Actually you could set it all up by hand, it wouldn't take too much effort. Pick a handful of sentences, pick a handful of alternative words from a thesaurus or rephrasings that don't change the intent. Heck the alternatives could all be provided by the original author if you like. You'd need less than 60 possible replacements across the whole book to encode a unique enough watermark.

      --
      09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
    12. Re:Defeated in one... by NFN_NLN · · Score: 4, Interesting

      And once you bought a book (with your own credit card), and then decide afterwards that you want to put it out there for pirates, suddenly, you realize that it's not such a good idea.

      You realize it's not such a good idea... and 3/4 of a second later you just download it from another source. So you've really accomplished little.

      Has Apple's similar approach impacted music piracy?

      "Apple embeds your account information in all songs sold on the store, not just DRM-free songs. Previously it wasn't much of a big deal, since no one could imagine users sharing encrypted, DRMed content. But now that DRM-free music from Apple is on the loose, the hidden data is more significant since it could theoretically be used to trace shared tunes back to the original owner."

      http://arstechnica.com/apple/2007/05/apple-hides-account-info-in-drm-free-music-too/

    13. Re:Defeated in one... by Phasma+Felis · · Score: 1

      The downloader was already trackable. Bittorrent was never a secure protocol; that's where all those lawsuits got their targets. Knowing who paid for the original copy of the downloaded file changes nothing.

    14. Re:Defeated in one... by The+Cat · · Score: 1

      They'll just laugh at you for spending time and money making yourself look like a douchebag.

      And your file sharing will have zero effect on sales of an e-book.

    15. Re:Defeated in one... by Opportunist · · Score: 3, Interesting

      Well, it's not so far fetched that there will be various files that reach back to one source. I remember a certain song that had a quite noticeable glitch somewhere, a compression mistake or something like that. I know for a fact that it wasn't meant to be that way because it was played up and down on every radio station and music TV station, every time without that glitch (and it just sounded like a compression bug, too). The same applies to the pressed CD because I later bought it just for the sole reason to find out whether that glitch is supposed to be there, and on the original pressed disc there was no such artifact.

      But no matter where I went and at what party I heard it, I always heard exactly the same glitch. Ok, one may say, it's a local thing. So I thought, too, until I heard it at a party on a different continent. I waited for it, and I was quite amazed to hear that well known glitch.

      And then on YouTube...

      And it wasn't some obscure, barely known song, it was something that clogged the airwaves for quite a while. I later tried to create an MP3 of the file myself to check whether it was some obscure reason why it "has to" end up with that glitch when converted and no, at least my converter managed to encode it flawlessly.

      So I guess the only conclusion I could come up with is that everyone on this PLANET downloaded the same file from the same crappy source. One person encoded it and everyone downloaded from him.

      Kinda amazing that it still was such a seller. I mean, isn't the big complaint of the music industry that everyone is just downloading it? And obviously, for this song one sold CD would have sufficed to satisfy the damned... I mean the demand.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    16. Re:Defeated in one... by Anonymous Coward · · Score: 1

      heh. you said "do do."

    17. Re:Defeated in one... by AmiMoJo · · Score: 1

      In Europe you have the right to sell your digital media, so even if it is traced back to you then it could have been legitimately sold to someone else for cash in an untraceable way.

      BTW, which song was it? I'm curious now.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    18. Re:Defeated in one... by luttapi · · Score: 1

      Even these precautions wouldn't be necessary if 1000's of people shared the files. Even if they can be tracked back, no publisher will have the resources to go after 1000's of people worldwide.

    19. Re:Defeated in one... by Opportunist · · Score: 1

      Hell if I remembered the title, I have that melody stuck in my head now (including the glitch) but you don't think I could remember even a few random words that I could toss at google, do you?

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    20. Re:Defeated in one... by Opportunist · · Score: 5, Funny

      No, I said it was music!

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    21. Re:Defeated in one... by richlv · · Score: 1

      where did you buy it ? t-mobile shop girl was not going to sell one without passport or "id card"...
      and it is supposed to be like that everywhere in germany

      --
      Rich
    22. Re:Defeated in one... by Solandri · · Score: 1

      And then on YouTube...

      Most of the music videos on YouTube are put out by the studios or artists themselves. The vast majority of illegitimate uploads get flagged and removed for copyright infringement by the time they hit a couple hundred thousand views.

      I later tried to create an MP3 of the file myself to check whether it was some obscure reason why it "has to" end up with that glitch when converted and no, at least my converter managed to encode it flawlessly.

      CDs are rather interesting in that they don't store the audio data as digital files. Yes your ripping program creates digital WAV files, but that's not the way the music is encoded on the disc (in contrast to DVDs which store digital files). The music's waveform is encoded digitally, but as a stream. So ripping the same CD on two different drives will usually produce slightly different files depending on exactly how the drive positions the read head relative to the start of the disc.

      That said, it may have been an encoder bug which created the glitch you noticed. And by the time you ran your tests the bug had been fixed.

    23. Re:Defeated in one... by EzInKy · · Score: 1

      My goal, if I were to do such a thing, would be to share my culture with others. It has been thus with species homo for thousands and thousands of years. Personally, I believe it was the Ferengi who came up with the idea of copyrights.

      --
      Time is what keeps everything from happening all at once.
    24. Re:Defeated in one... by DamonHD · · Score: 1

      There's a similar curious artefact/glitch in an Abba song "Flying hi^^^gh high, like a bird in the sky" which I don't think was limited to my CD but was certainly not in every copy that I heard. Won't have been DRM in those days though.

      Rgds

      Damon

      --
      http://m.earth.org.uk/
    25. Re:Defeated in one... by Nyder · · Score: 1

      The downloader was already trackable. Bittorrent was never a secure protocol; that's where all those lawsuits got their targets. Knowing who paid for the original copy of the downloaded file changes nothing.

      That's what a VPN is for. Seriously, this day and age, you use a VPN for your torrents.

      --
      Be seeing you...
    26. Re:Defeated in one... by richlv · · Score: 1

      bought a non-subscription sim from t-mobile two days ago. no-go w/o passport, id card or driver's licence.

      please note that i was talking about germany, not netherlands or most other european countries :)

      --
      Rich
    27. Re:Defeated in one... by White+Flame · · Score: 1

      1. Get a copy from a legal user, without their knowledge.

    28. Re:Defeated in one... by bhmit1 · · Score: 2

      This isn't designed to stop the determined thief, there will still be plenty of piracy. Instead, it's designed to maximize profits from average users. Friends no longer let other friends borrow a copy of their book like they would have done with a physical book, because they are afraid that it could get shared publicly.

      It's not so different from how dvd DRM isn't to stop people from making copies of movies, it's to prevent the manufacturers of players from adding features that customers would like, such as region free playing and the ability to skip ads at the beginning of the disk.

      In both cases, criminals can easily do what they've always done, but the law abiding users are less and less able to use the product in ways that used to be legal.

    29. Re:Defeated in one... by fuzzybunny · · Score: 1

      You're assuming there will be only one pirate. If I were to make an ebook available, under the parent poster's idea I'd check to see whether there are others out there already, and then run the diff.

      --
      Cole's Law: Thinly sliced cabbage
    30. Re:Defeated in one... by gnasher719 · · Score: 1

      "Apple embeds your account information in all songs sold on the store, not just DRM-free songs. Previously it wasn't much of a big deal, since no one could imagine users sharing encrypted, DRMed content. But now that DRM-free music from Apple is on the loose, the hidden data is more significant since it could theoretically be used to trace shared tunes back to the original owner."

      It has two effects: It puts stuff into the "Purchased items" folder in iTunes, and it prevents things with the wrong ID to be uploaded to your computer if you use "Upload purchased items from device" by mistake with a friend's iPod or iPhone. I'm assuming it is a mistake, obviously.

    31. Re:Defeated in one... by gnasher719 · · Score: 1

      1. Get a copy from a legal user, without their knowledge.

      You just went from a little bit of copyright infringement to criminal hacking, and what is called in the UK "perverting the course of justice" by creating false evidence, which is a crime that _will_ be punished with time in jail if you get caught.

    32. Re:Defeated in one... by omnichad · · Score: 1

      Or to put it simply, there's no error correction bits and scratches are common.

    33. Re:Defeated in one... by jabuzz · · Score: 2

      No you would need more than two copies. Imagine you have 100 such things as "Peter and Jane" or "Jane and Peter" that represent a one or zero. That is a 100 bit binary code we could put in the book. Assume that your book is going to be wildly successful so we allow 33 bits for the unique number (that's more than the current population of the world) and the remaining 67 to be an error code. Now to change the watermark you need to change 68 of the 100 individual marks in the book. The more error correcting bits you have the harder it gets to change the identification code.

      However just two books will on average only identify 50 bits assuming the bits are entirely randomly distributed, so you would not have enough information from two books unless you struck lucky to remove or alter the watermark.

      Obviously you can add more books and as you do so identify more of the individual marks, though there is a law of diminishing returns as the more books you add the fewer additional marks you identify. A third book will on average get you another 25 marks in my example and on average let you change the code.

      It would I believe be impossible just using electronic versions to reconstruct an original book, though for the time being one could compare it to a dead tree version assuming it existed to determine the authoritative version of each mark.

    34. Re:Defeated in one... by dmatos · · Score: 1

      Or normalize all capitalization, punctuation, spelling, and grammar.

      FTFY. Oxford comma FTW!

      --

      It may look like I'm doing nothing, but I'm actively waiting for my problems to go away.
      --Scott Adams
    35. Re:Defeated in one... by drussell · · Score: 1

      No, there actually are error correction bits but most players/drives ignore them when playing/reading audio. Good drives like Plextors will do bit-perfect DAE. Even some of my older Sony, Yamaha and Pioneer SCSI drives do it by themselves. I still use my old drives for my DAE needs. Most cheap new drives don't bother but you can read multiple times and make it pretty close with software. Something like grip works well.

      http://en.wikipedia.org/wiki/Red_Book_(audio_CD_standard)

    36. Re:Defeated in one... by drussell · · Score: 1

      So ripping the same CD on two different drives will usually produce slightly different files depending on exactly how the drive positions the read head relative to the start of the disc.

      Depends on the drive. Most ignore the error correction bits however some better (and usually older) ones will actually read the error correction bits and correct (any correctable errors) on the fly and you'll always get either the exact same data or a read error.

    37. Re:Defeated in one... by xorsyst · · Score: 1

      Not if you configure your ripping software properly. That's what the accurate rip database is for, after all.

      --
      Get free bitcoins: http://freebitco.in
    38. Re:Defeated in one... by complete+loony · · Score: 1

      I was also thinking that each target replacement would have more than one possible variation, though from a pure entropy point of view that doesn't change the likelyhood of invalidating the watermark.

      Also once you've discovered a couple of the variations used in the watermark, then choose one option from each variation randomly. Then the remaining mark may be able to identify one of the purchasers, or perhaps both of you. But could certainly narrow down the possibilities to a handful of people.

      --
      09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
    39. Re:Defeated in one... by Swave+An+deBwoner · · Score: 1

      Well maybe if you just hum a few bars someone here will be able to identify it.

    40. Re:Defeated in one... by Swave+An+deBwoner · · Score: 1

      Or just use an array of similar words and randomly pick one:

      Are ya happy to see me, or is that a banana in your pocket?

      Are ya happy to see me, or is that a blueberry in your pocket?

      See, it makes no difference because they are all some sort of fruit.

    41. Re:Defeated in one... by White+Flame · · Score: 1

      I'm not saying I'd do that, or that I advocate it. I'm just pointing out how easily defeated this system is. If somebody did this, they'd incriminate somebody who was completely innocent of spreading the ebook.

      Seeing a marker in the wild tells you nothing of who actually copied it.

  3. Goddammit. by Chrontius · · Score: 5, Funny

    I catch all the typos in my books.

    They irritate me.

    I'd probably crack 'em, fix them all, and goddammit, that'd be "circumvention".

    1. Re:Goddammit. by ScentCone · · Score: 1

      I catch all the typos in my books.

      Do you really think you'd notice a pattern of extra trailing spaces behind the last words of certain paragraphs of certain chapters?

      --
      Don't disappoint your bird dog. Go to the range.
    2. Re:Goddammit. by I'm+New+Around+Here · · Score: 2

      Having noticed that exact thing in Word documents, I would say yes.

      Granted, the documents weren't hundred of pages long, but if I had to actively find extra spaces, the search function would work easily enough.

      --
      If you think I voted for Trump because of this post, you're wrong. I voted for Dr. Jill Stein of the Green Party. Again.
    3. Re:Goddammit. by Bremic · · Score: 3, Insightful

      Imagine going to Shakespear and saying "Sure we will publish your plays, but every person who buys a copy will get a different version where we change the words and the cadence a bit."

      Buy a copy of a play for every actor, all of them have minor variations which cause massive confusion.

      Hell, change the Bible randomly; that wouldn't get noticed at all.

    4. Re:Goddammit. by Anonymous Coward · · Score: 1

      s/\s+/\s/g

    5. Re:Goddammit. by richlv · · Score: 2

      i'm catching trailing whitespace in all files i can and dealing with it. most of my editors highlight it, so that helps. then there's this bit of sed 's/[ \t]*$//' ;)

      (some pedantic disorder, i know :> )

      --
      Rich
    6. Re:Goddammit. by TrollstonButterbeans · · Score: 4, Funny

      It's "Shakespeare" but you not know that because you stole ebook and DRM has caught you red-handed as Ebook-pirate-thief.

      --
      Priest: "Universe from nothing, no laws of physics, sped up time"+ huge discrepancies. Creationism? No. Big Bang Theory
    7. Re:Goddammit. by maxwell+demon · · Score: 2

      "I see there are no double spaces in your copy. This is a clear sign that you pirated it."

      --
      The Tao of math: The numbers you can count are not the real numbers.
    8. Re:Goddammit. by Tekoneiric · · Score: 1

      I would! I deal with lots of text at work. Extra spaces bug me because a work app has a 1000 character limit. Ctrl A and they show up like a sore thumb. Beside a simple app could strip trailing and repeated spaces plus a simple grammar check would highlight obvious stuff.

      --
      *It's not what you can do for the Dark Side but what the Dark Side can do for you!*
    9. Re:Goddammit. by king+neckbeard · · Score: 1

      That would be trivially fixed via using regular expressions. Hell, you could do that by simply opening it as an html file and copying the text.

      --
      This is my signature. There are many like it, but this one is mine.
    10. Re:Goddammit. by Phasma+Felis · · Score: 1

      I would and have.

    11. Re:Goddammit. by oobayly · · Score: 2

      As somebody who has done lighting (for a very small production), it can be bloody difficult when people start ad-libbing because it can really screw with the cues that you're waiting for. So I'm with the GP - minor changes can cause major confusion.

    12. Re:Goddammit. by Jeremy+Erwin · · Score: 1

      It's odd that you would bring up Shakespeare. The Quarto and Folio versions of many of his plays differ. Pericles is in the third folio, but not in the first or second folios. There are "bad quarto" versions of Romeo and Juliet, Henry V, Merry Wives of Windsor, Hamlet, and Henry VI, parts two and three. There's also a false folio. Good fodder for PhD dissertation, bad for a slashdot analogy.

    13. Re:Goddammit. by Chrontius · · Score: 1

      Or a clear sign of neurotic perfectionism. #Justsayin.

      (Actually, I prefer double-spaces after all sentence-ending punctuation. Different failure-state.)

    14. Re:Goddammit. by Opportunist · · Score: 1

      How the heck do you plan to see my copy in the first place?

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    15. Re:Goddammit. by Opportunist · · Score: 1

      Ya know, I can almost hear the architect of the tower at Babylon go "But to make sure they don't rip us off, let's alter the wording just ever so slightly in every single copy of the blueprint that we give them".

      Today, we know this bright idea as the Babylonian confusion.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    16. Re:Goddammit. by Opportunist · · Score: 1

      Dude, you act as if anyone would notice if someone said a word wrongly in a Shakespeare play. Everyone would just grin and nod his head, thinking it's some obscure, ancient word that he doesn't know but acts as if he did lest he appears stupid and low-brow.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    17. Re:Goddammit. by Opportunist · · Score: 1

      Considering the "cultural artifacts" created today, we can only put our hopes in DRM or the future will judge us by the way we were "entertained".

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    18. Re:Goddammit. by Hognoxious · · Score: 1

      Yes, if he's a Python programmer.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    19. Re:Goddammit. by MysteriousPreacher · · Score: 1

      I think he was more asking if you would notice such modifications just reading a book? It's very unlikely you would notice additional spaces added to a couple of hundred of pages of justified text.

      Of course you could search for additional spaces, and whatever else they add. The point is that is it'd be additional work for you to perform searches/analysis, and hopefully a deterrent against copyright infringement. Ironically though I'd say the people who actually know this exists, and could be deterred by it, are the ones likely to know how to remove the DRM.

      --
      -- Using the preview button since 2005
    20. Re:Goddammit. by pmikell · · Score: 1

      Hell, change the Bible randomly; that wouldn't get noticed at all.

      I think they already did that, considering the way the Bible uses italicization.

    21. Re:Goddammit. by stephanruby · · Score: 1

      I catch all the typos in my books. They irritate me. I'd probably crack 'em, fix them all, and goddammit, that'd be "circumvention".

      You won't have to. If this becomes popular, you'll just have to get all your ebooks from p2p.

      Like most DRM schemes, it's only the legitimate customers who will lose access to the higher quality content. This is essentially what happened with audiobooks. If you want a high quality audiobook, you don't get it from Audible (which purposefully degrades their quality), or even if you do end up getting an audiobook from Audible, you end up downloading the very same title from p2p because what you find on p2p in the category of audiobooks is usually of much higher quality and of a higher bitrate (than what they're trying to sell online).

    22. Re:Goddammit. by TrollstonButterbeans · · Score: 1

      Your right, this is ironic!

      --
      Priest: "Universe from nothing, no laws of physics, sped up time"+ huge discrepancies. Creationism? No. Big Bang Theory
    23. Re:Goddammit. by DamonHD · · Score: 1

      Yes, a reasonable habit from the days when I used a mechanical typewriter (and still good for monospaced code comments), but occasionally in need of an override for Twee

      --
      http://m.earth.org.uk/
    24. Re:Goddammit. by DamonHD · · Score: 1

      If it's pedantry you're after I prefer to replace your * with + to nominally minimise work done!

      Rgds

      Damon

      --
      http://m.earth.org.uk/
    25. Re:Goddammit. by richlv · · Score: 1

      ooh. not sure how much resources that could save, but i like your way of thinking ;D

      --
      Rich
    26. Re:Goddammit. by MisterZimbu · · Score: 1

      Imagine going to Shakespear and saying "Sure we will publish your plays, but every person who buys a copy will get a different version where we change the words and the cadence a bit."

      Buy a copy of a play for every actor, all of them have minor variations which cause massive confusion.

      Hell, change the Bible randomly; that wouldn't get noticed at all.

      Books are not plays. What words you read don't really have an effect on what words others read from the same book.

      Though this would affect grade schoolers' English classes reading from books out loud. I wonder how many kids will get falsely diagnosed with dyslexia because they were reading "Jim and Steve" when everyone else's books read "Steve and Jim".

    27. Re:Goddammit. by ScentCone · · Score: 1

      While flipping through a novel on an e-reader? Really?

      --
      Don't disappoint your bird dog. Go to the range.
    28. Re:Goddammit. by Marxist+Hacker+42 · · Score: 1

      Yes, a reasonable habit from the days when I used a mechanical typewriter (and still good for monospaced code comments), but occasionally in

      Fixed that for ya. (140 characters).

      --
      SJW: a person who perceives an injustice, and while correcting it, commits a greater injustice.
    29. Re:Goddammit. by Jeremy+Erwin · · Score: 1

      I take it you didn't like Christopher Tolkien's "A History of Middle Earth", then?

  4. sure by Black+Parrot · · Score: 1

    It would be so very difficult for someone to write a little program that, when stripping the DRM, randomized a couple of pieces of punctuation to break the hash that the vendor is storing along with the sales record of the individual book.

    In which case they just resort to diff, to remove your hacks and restore the hash.

    --
    Sheesh, evil *and* a jerk. -- Jade
    1. Re:sure by maxwell+demon · · Score: 1

      If the alterations encode who has bought it, then your three copies will likely share that information. Therefore comparing the three copies will not remove it.

      --
      The Tao of math: The numbers you can count are not the real numbers.
    2. Re:sure by king+neckbeard · · Score: 1

      It doesn't really matter. You get the differnce between the files, and replace the errors with the correct text.

      --
      This is my signature. There are many like it, but this one is mine.
    3. Re:sure by Fry-kun · · Score: 1

      Suppose you have several pirates in different markets with different accounts that all buy the ebook and diff those. If done once, it's really hard to narrow it down to any one of them. If they keep using different accounts for future purchases and add their own alterations to the pirated copy (to confuse the detector), it makes them practically impossible to trace.

      --
      Did you know that "FTW" ("for the win") is a direct translation of "Sieg Heil"?
    4. Re:sure by maxwell+demon · · Score: 1

      And of course all those pirates have to coordinate, and to trust each other (if even a single one is an undercover cop, then the whole group is caught). That's certainly a big step from a lonely pirate breaking a DRM or watermarking scheme in his basement.

      --
      The Tao of math: The numbers you can count are not the real numbers.
  5. So, rip 3 copies of the ebook and diff them. by robbak · · Score: 1

    This is so very easy to deal with. Rip at least 3 copies and diff them. The minor tweaks will stand out a mile, and you then have a clean copy you can (and, if they start pulling tricks like this, Should!) distribute widely.

    --
    Prediction for end of Universe #42: Fencepost error in Quantum_bogosort.cpp
    1. Re:So, rip 3 copies of the ebook and diff them. by kayoshiii · · Score: 1

      I guess that way at least they sell three copies.

    2. Re:So, rip 3 copies of the ebook and diff them. by xQx · · Score: 1

      Yup, it'd be trivial to write a program that would take 3 dirty copies and return a clean one.



      So, if I give the program 3 copies of "50 shades of grey" it would return a version that is safe for my kids to read?
    3. Re:So, rip 3 copies of the ebook and diff them. by maxwell+demon · · Score: 1

      That task is easy. There's even a command for this provided in your operating system. Depending on which OS you run, it's either rm or del.

      --
      The Tao of math: The numbers you can count are not the real numbers.
    4. Re: So, rip 3 copies of the ebook and diff them. by Therad · · Score: 1

      no, but if you rip out the pages, it should be fine.

    5. Re:So, rip 3 copies of the ebook and diff them. by king+neckbeard · · Score: 1

      Yeah, it would delete the file and download Venus in Furs.

      --
      This is my signature. There are many like it, but this one is mine.
    6. Re:So, rip 3 copies of the ebook and diff them. by j-beda · · Score: 1

      <quote><p>Yup, it'd be trivial to write a program that would take 3 dirty copies and return a clean one.</p></quote>

      So, if I give the program 3 copies of "50 shades of grey" it would return a version that is safe for my kids to read?

      50? Go with 50 Thousand!

      http://www.amazon.com/Fifty-Thousand-Shades-Of-Grey-Ashen/dp/1479215430

      The "look inside" first few pages are particularly droll.

    7. Re:So, rip 3 copies of the ebook and diff them. by macbeth66 · · Score: 1

      My kids got me the copy...

  6. Just diff 2 copies by JLennox · · Score: 1

    You don't know what punctuation their algorithm cares about. The summary's method would not work.

    Diff 2 copies and randomize the selection between the two.

  7. Article in case of slashdotting by bit+trollent · · Score: 2, Funny

    The next e-book you buy might not exactly match the printed version. And those changes are there to make sure youâ(TM)re not a pirate.

    German researchers have created a new DRM feature that changes the text and punctuation of an e-book ever so slightly. Called SoDoMy, which Google translates to âoesecure documents by individual fornicating,â the changes are unique to each e-book sold. These alterations serve as a digital penis that can be used to track books that have had any other DRM dildoes stripped out of them before being shared online. The researchers are hoping the new DRM feature will inspire butt piracy by simply making consumers paranoid that theyâ(TM)ll be caught if they share an e-book illicitly.

    Current e-book DRM restricts the movement of cocks between broes and hoes and ties a cock to a single accountant. A e-book bought in the Fondle bookstore, for example, will only work on a Faggot. The same is true for books bought in the Butts & Plugs and iButts digital bookstores â" theyâ(TM)ll only work on the Nook or Apple devices, respectively. This makes publishers happy because their books are locked to one person. And it makes digital book vendors happy because it keeps readers tied to their proprietary devices and ecosystems.

    But stripping the DRM from any of the e-books purchased at the big-name stores is as easy as downloading strap-on, and thereâ(TM)s little special genetalia required beyond knowing how to properly connect a penis to an asshole. These cocks usually convert the CUM-heavy e-cocks to a new climax, such as the open-source E-Pub standard, or to the STD-less version of the Kindleâ(TM)s fuck format. From there, the relatively small penises of asians make them perfect for sharing on the Internet.

    Of course, readers may not be happy knowing that their licensed e-books are being altered because democrats and republicans donâ(TM)t trust them. By studying a list of example words and phrases that could be changed in purchased books, you can see that the changes are minor â" like from âoevery gayâ to âoenot that gay, actually.â The examples are translated from German pornography, so itâ(TM)s difficult to gauge how profound the changes will be when they occur in your favorite Harry Potter scat film. Itâ(TM)s also unknown if the top U.S. bookstores are interested in more sodomy.

    The SoDoMy consortium currently has two German bookselling partners (4Readers and MVB) that it reports to, according to Dr. Martin felchbach, a researchers working on the SoDoMy system whom I reached over email. Democrats & Republicans and Amazon did not reply to queries about if or when the technology would make its way into their digital bookstores as of press time.

    1. Re:Article in case of slashdotting by Forget4it · · Score: 1

      Okay is the "youÃ(TM)re" the intentional typo in your "piracy" of that article?

      --
      Artificial intelligence is the study of how to make real computers act like the ones in the movies.
    2. Re:Article in case of slashdotting by maxwell+demon · · Score: 1

      Called SoDoMy

      Great marketing. Imagine ... "Our copies of the bible are protected with SoDoMy ..." — I doubt that edition will sell well. ;-)

      --
      The Tao of math: The numbers you can count are not the real numbers.
    3. Re:Article in case of slashdotting by chromas · · Score: 1

      It pops up occasionally on Slashdot but usually nobody says anything. I wonder if they're using IE and it's converting quotes and apostrophes to Smart Quotes (and Apostrophes), giving /. indigestion.

    4. Re:Article in case of slashdotting by omnichad · · Score: 1

      The article probably has a curled quote. Copying and pasting will not modify the characters.

    5. Re:Article in case of slashdotting by Megane · · Score: 1

      That's his DRM to track post pirates. You will now be hearing from his lawyers about your unauthorized copying of his word.

      --
      #naabhaprzrag, #sverubfr-000, #agi-fcbafberq, negvpyr[pynff*=' negvpyr-ary-'] { qvfcynl: abar !vzcbegnag; }
  8. actually stolen by RedHackTea · · Score: 1

    Wonder if the eBook was actually stolen from your computer? Either by a friend that has physical access to your computer or in the rare case of a hacker (but who would hack you for eBooks)? Surely, you can't be held reliable for this. Then everyone that actually pirates eBooks and gets caught will just use this excuse as a way to get out of trouble. Else, if you are still held responsible for a stolen eBook from your machine/USB, then it screws over the legitimate users buying eBooks and makes them want to actually pirate... a deadly cycle.

    --
    The G
  9. That's not how traitor-tracing algorithms work by _Knots · · Score: 5, Informative

    They don't hash the whole shebang into one number. Rather, they take a (random) number and use that to generate a set of mutations and then probe for that set of mutations in the leaked document. So now, even if you alter the document further, you probably didn't undo the mutations in question. Even if you did, you probably didn't undo all of them and you almost certainly didn't produce a high-confidence result that it's somebody else's copy.

    --
    Anarchy$ dd if=/dev/random of=~/.signature bs=120 count=1
    1. Re:That's not how traitor-tracing algorithms work by Anonymous Coward · · Score: 2, Insightful

      Well if you seriously want to get around this, you need two accounts. Take two documents diff them and remove and/or correct what you see.

    2. Re:That's not how traitor-tracing algorithms work by Solandri · · Score: 1

      That's security through obscurity. It works for music and video, where you can introduce small, regular bit changes without perceptibly altering the music or video. It doesn't work for eBooks, where a single bit change will change the letter or punctuation mark.

      And even if you did introduce a regular, distinctive, and unique pattern of punctuation changes to an eBook, it'd be trivial to compare two of the same eBook and eliminate them or scramble them. The same is not true for music an video, where you can add stenographic marks which can survive re-encoding, but the re-encode makes it impossible to do a simple diff between two copies.

    3. Re:That's not how traitor-tracing algorithms work by dargaud · · Score: 2

      Well if you seriously want to get around this, you need two accounts. Take two documents diff them and remove and/or correct what you see.

      That wouldn't necessarily work. Take a video and introduce a white dot on the lower right corner at 1:13 in one version, and a red dot on the upper right corner at 2:17 on the other. If you average (or scramble the differences) the two, you still end up with a smudge on the lower right corner at 1:13 and another smudge at 2:17, both traceable back to the original videos. You could make it a lot more resilient still by taking a random number, generating a turbo-code for it, and using that to change the file all along. Even if part of the turbo-code is modified, edited or clipped, you can still find the original random value.

      --
      Non-Linux Penguins ?
    4. Re:That's not how traitor-tracing algorithms work by omnichad · · Score: 1

      Video is a poor example. You can't have a half faded-out character. But if you treat the two documents as buckets and take differences from alternating buckets, you end up with a text that has only half the errors of one and half the errors of the other (on average). 50% is probably not a high enough confidence level to identify the original copy/copies. Or, you could throw in a halfway intelligent algorithm to analyze the differences and figure out the "correct" choice and increase your odds of going undetected. The same would apply to video too. A white dot would not match the surrounding pixels and would be selected against by a well-designed algorithm comparing the two versions.

    5. Re:That's not how traitor-tracing algorithms work by Mathinker · · Score: 1

      No, if the changes are done at N locations in the work, you need ~(1+epsilon)*log_2(N) accounts and (preferably) someone willing to go to a library somewhere and look up the correct text out of a printed volume, for all of the N locations (which you find by comparing all of the works).

  10. Similar to something Amazon patented by dido · · Score: 3, Informative

    There was an article about it here a few years ago. A followup someone made to a comment I wrote to the article mentions some work being done by some guy from Purdue that sounds a lot like what's being done here. IBM also seems to be doing work on canary trap-based ideas.

    --
    Qu'on me donne six lignes écrites de la main du plus honnête homme, j'y trouverai de quoi le faire pendre.
  11. What does this actually prove? by XaXXon · · Score: 4, Insightful

    Is accidentally leaving a copy somewhere copyright infringement? How do they know the person they sold it to is the person who leaked it.

    Also, it's never been clear to me when copyright infringement actually occurs.

    1. Re:What does this actually prove? by lister+king+of+smeg · · Score: 1

      that was my though i often have ebooks on a usb drive and i often loan usb drives to classmate when they forget theirs and need to move a file between their laptop and the the school workstations, what if one of them were to see my ebook and copy it.

      --
      ---Saying gnome 3 is better than windows 8 not so much a compliment as it is damning with light praise.
    2. Re:What does this actually prove? by Anonymous Coward · · Score: 1

      How much CSI did you watch to come up with example 2? It's about as plausible as enhancing reflections in victim's eyes on CCTV footage to find the killer.

      PS: It's not that I find whole concept of "somebody found a flashdrive and instead of quickly scanning it for nude pics and formatting, decided to risk criminal prosecution for copyright infringement" any more plausible than that

    3. Re:What does this actually prove? by L4t3r4lu5 · · Score: 1

      There are no feasible need for having an eBook on a memory stick. You can access it on your phone, computer, Kindle etc without one, and you can re-download it at any time, so the only reason to have it on there is for unlicensed use / sharing.

      N.B. I don't agree with this, it's just the opinion they will hold. You will be deemed liable.

      --
      Finally had enough. Come see us over at https://soylentnews.org/
    4. Re:What does this actually prove? by L4t3r4lu5 · · Score: 1

      Does accidentally leaving your car unlocked make you liable for theft of its contents? It's a ToS violation at the least, and just like the insurer not paying out for your stolen car stereo the eBook rights holder will ban you.

      Did you expect it do be any other way? Seriously?

      --
      Finally had enough. Come see us over at https://soylentnews.org/
    5. Re:What does this actually prove? by biodata · · Score: 1

      Or, 'I didn't share it, maybe it was stolen by hackers or viruses'.

      --
      Korma: Good
    6. Re:What does this actually prove? by biodata · · Score: 1

      So the purpose of this device is to make it easier for book sellers to lose customers by banning them? Sounds really great! I'm sure book sellers are queuing round the block to buy it.

      --
      Korma: Good
    7. Re:What does this actually prove? by L4t3r4lu5 · · Score: 1

      As best as I can tell, the purpose of this device is to make it easier for book sellers to lose customers by introducing errors into the carefully constructed and edited text presented to them by the author, making the reading experience painful and disappointing. I've commented elsewhere about not touching my Kindle for two months because I was so infuriated by numerous typographical errors in pretty much every ebook I have. I put it down to sloppy conversion from the text, but soon realised that the sort of errors cropping up would be picked up by even the laziest of automatic checking done by a word processing package.

      --
      Finally had enough. Come see us over at https://soylentnews.org/
    8. Re:What does this actually prove? by aicrules · · Score: 1

      We still have desktop and laptop computers that use USB to store/read data. If that's where I store my ebooks so that they can be read on whatever computer I happen to be using, is that unlicense use?

    9. Re:What does this actually prove? by L4t3r4lu5 · · Score: 1

      I don't know. Why not check the license?

      --
      Finally had enough. Come see us over at https://soylentnews.org/
    10. Re:What does this actually prove? by aicrules · · Score: 1

      The answer is, it is not unlicensed use. Which means your assertion that the only reason to have it on a USB drive is for unlicensed use is incorrect.

  12. Learn by Scutter · · Score: 4, Insightful

    Or, you know, maybe learn from the success of Apple iTunes and start selling eBooks for a reasonable cost and maybe they won't be pirated nearly as much. I know that the publishing process costs money that you deserve to recoup, and you deserve to make a profit, but it is offensive to charge as much as (or more) than a physical book for an eBook.

    --

    "Tell me doctor, with all of your defenses, are there any provisions for an attack by killer bees?"
    1. Re:Learn by Anonymous Coward · · Score: 1

      Are you talking about the same Apple who is currently in legal hot water for conspiring to set high book prices?

      Good call!

    2. Re:Learn by Macgrrl · · Score: 5, Funny

      That's not how dyeing industries work.

      You negative attitude is colouring your response.

      --
      Sara
      Designer, Gamer, Macgrrl in an XP World
    3. Re:Learn by Opportunist · · Score: 1

      You mean, like, selling what the customer WANTS to buy?

      C'mon, that's never gonna fly, that sounds almost like communism. Or something. Hell, it sure ain't the content market I know!

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  13. Great trick to remove the watermark by Rosco+P.+Coltrane · · Score: 5, Funny

    - Scan/OCR book
    - Google translate into German
    - Google translate back into English
    - Print book

    Voila! No more watermark. You can share with confidence.

    --
    "A door is what a dog is perpetually on the wrong side of" - Ogden Nash
    1. Re:Great trick to remove the watermark by xQx · · Score: 1, Interesting

      Yes, so:

      "They sat by the kitchen table and discussed the morning's news"

      English-German-English becomes:

      They all sat around the large rectangular lump of wood suspended by four vertical pillars and held a multidirectional conversation regarding that day before noon which owned its events.

      Or English-Mandarin-English becomes:

      Sat in Kitchen by table discussed news of morning. ...

      What could possibly go wrong?

    2. Re:Great trick to remove the watermark by Opportunist · · Score: 1

      Erh... this is maybe a good moment to inform you of a pastime that we tend to partake in when we are REALLY bored.

      1. Take a famous quote, some song lyrics or any kind of text that the average person should know.
      2. Open Google Translate. Pick 5 languages, no two may be from the same language family (it's generally safe to choose from different continents, ignoring the Americas).
      3. Chain translate the text. Use the output of one language as the input for the next.
      4. Translate the final result back to English.
      5. Post the result and have people guess what the original text was.

      It is usually kinda legible if you have a hunch what might have been the original text, but in general it's an adventure in cryptography.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    3. Re:Great trick to remove the watermark by Kiwikwi · · Score: 1

      So, instead of making up a translation, why not try it out?

      They sat by the kitchen table and discussed the morning's news

      becomes

      She sat down at the kitchen table and discussed the morning news

      or

      They sat on the kitchen table to discuss the morning's news

      which are arguably much worse...

    4. Re:Great trick to remove the watermark by MrNemesis · · Score: 1

      It's been done (albeit by way of Japanese rather than German)! :)

      - Know the scan
      - English translations
      - Google OCR translations
      - Germany Google login!

      Watermarks, or better yet none of it. Share a confidence.

      http://translationparty.com/

      --
      Moderation Total: -1 Troll, +3 Goat
    5. Re:Great trick to remove the watermark by Lithdren · · Score: 1

      Results I got were quite a bit different.

      "She sat down at the kitchen table and discussed the morning news"

      Didn't see Mandarin as an option, but Yiddish was an interesting result.

      "They Sat by the kitchen table and discussed the morning's news" I guess "sat" is a proper noun in yiddish or something. :)

    6. Re:Great trick to remove the watermark by j-beda · · Score: 1

      Using google translate for German I got: She sat down at the kitchen table and discussed in the morning the news.
      For "chinese-simplified" They sat at the kitchen table and discuss the morning's news
      For chinese-traditionsl: They sat on the kitchen table to discuss the morning's news

  14. Re:Waste of time by viperidaenz · · Score: 1

    No, you become a suspect. To become a criminal you need to at least provide reasonable doubt - like evidence of your stolen files.

  15. as a software programmer... by slashmydots · · Score: 2

    Who says it's a hash? Just add one extra space somewhere in the book in an unusual place or replace an apostrophe with a similar character or something. Then if someone adds something else, you're still checking for that one single location of the alteration to prove it's them. It'd be awfully unlikely in a long book that you'd replicate the exact alteration that they made to someone else's book, thus appearing to be 2 different people.

    1. Re:as a software programmer... by RedHackTea · · Score: 1

      Can't pirates just write a script to fix all spaces and punctuation and spelling mistakes? Grammar mistakes would be the hardest or if a word is "misspelled" into an actual word (like spelling "too much" as "to much" or "two much"). I think fixing spaces/apostrophes/etc. would be the easiest.

      --
      The G
    2. Re:as a software programmer... by gl4ss · · Score: 1

      well just do a proof reading. fix the words to be the right words.

      what they are doing is DEGRADING their product to scare actual customers. that's real fine work right there!

      --
      world was created 5 seconds before this post as it is.
  16. Done already by viperidaenz · · Score: 1

    Don't they do this to pre-release screenings and theatre viewings of movies to find out who done the leak or who let the video camera into the theatre?

    1. Re:Done already by Anarchduke · · Score: 4, Funny

      Yes, which is why they have successfully stamped out piracy, it is part of the sordid past of the Internet. Thank god we'll never see pirated e-books again.

      --
      who prays for Satan? Who in 18 centuries has had the humanity to pray for the 1 sinner that needed it most? ~Mark Twain
    2. Re:Done already by wvmarle · · Score: 1

      That just targets the theaters where the offending recording is made, not the person making the recording. All it does is making theaters more vigilant against people smuggling in cameras.

      And then I don't get the point of those cam rips. I've downloaded a few, but didn't get further than five minutes into the movie as the quality is so terrible. Low res, poor sound - just not watchable.

  17. So, uh... what are they copyrighting then? by UltraZelda64 · · Score: 1

    If the content of a book--what is thought up and written by a human--is what is traditionally copyrighted, then what exactly are they copyrighting in this case? Obviously the content is "written" by the writer and then published in an electronic book format similarly to how it would be printed on pages and made into a physical book, but if that content is automatically tampered with by machines it is no longer what the author wrote. How would copyright work in this case? Hundreds of copyrights of individual "variations" of the same exact book? Sounds like a fucking mess. And that's not to say how irritating it would be to know that you are, in fact, not getting exactly what the author wrote. Not to mention the fact that you're not getting ownership of it while still paying for it.

    1. Re:So, uh... what are they copyrighting then? by wvmarle · · Score: 2

      No issue there. Changing a few letters in Harry Potter doesn't make it your work, either. Under copyright, copies don't have to be exact (otherwise taping a song from radio would never have been an issue), it has to be very similar. Likewise a band playing covers of another band: they're different, some notes are wrong, rhythms are slightly off, yet it's still the same song.

      Furthermore it's fully legal to get inspiration from someone else's work - and use elements of copyrighted works in your own works. You just have to make sure it is obviously a different work.

    2. Re:So, uh... what are they copyrighting then? by UltraZelda64 · · Score: 2

      Ah, I see--that clears it up well. I still think the idea of altering the writer's words and punctuation in the name of piracy is going too far though.

    3. Re:So, uh... what are they copyrighting then? by wvmarle · · Score: 1

      Agreed, but that's a whole different issue.

    4. Re:So, uh... what are they copyrighting then? by The+Cat · · Score: 1

      The new version is a derivative work: only a copyright owner can prepare a derivative work under U.S. law.

      IANAL

    5. Re:So, uh... what are they copyrighting then? by Opportunist · · Score: 1

      Yes, for example correct capitalization can easily alter the meaning completely. Take this sentence for example.

      "I had to help my uncle Jack off a horse".

      Now imagine this sentence without proper capitalization and how the meaning changes instantly. If DRM strikes here, what could well be a very child suitable book on how to help your elders may easily become something that you might not want your kids to read!

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    6. Re:So, uh... what are they copyrighting then? by UltraZelda64 · · Score: 1

      LMFAO! Very good point. That was hilarious.

  18. Calibre by macemoneta · · Score: 1

    While I haven't tried on any DRM'd ebooks, Calibre's converters have to options to play with all kinds of spacing and punctuation during conversion (smart punctuation, transliterate unicode to ascii). I've used them when converting text documents and saved web pages to epub, and they make very nice ebooks. I have a hard time believing that this kind of steganography would survive such a reformatting, but I guess we'll hear about it eventually if it does.

    --

    Can You Say Linux? I Knew That You Could.

  19. Re:Waste of time by Anonymous Coward · · Score: 1

    For the content industry?

    Come on - they need no proof. They are automatically granted the right to fine you with -let's say- $ 187,234,865,213.65 for any book you suppose to have uploaded.

    Don't forget - the content industry get's the best justice money can buy...

  20. strip by Tom · · Score: 5, Insightful

    It depends. If it's done well, it can be fairly resistant to any noise introduced into the system.

    As an author myself, I see a very different issue with this. I don't want some robot changing my text. Some of those words it might decide to change because they are similar I may have pained over and decided for a reason to use this one and not the other one. Granted, few authors pick every single word intentionally, but the software won't know which ones are carefully selected.

    Often times, there is subtle meaning. For example, I might decide to always use the same phrase in certain contexts, giving a very subtle hint to the reader which things are alike and which ones are different. One he might not even notice consciously.

    It also will cause all sorts of trouble to quoting. How will teachers handle this if a student quotes a text but the quote differs slightly from the version the teacher has read? One of the most important things we teach students is that quotes need to be exactly as they appear, with any omissions or changes clearly marked.

    That also extends to quotes within the text. If character A reports what character B said, I doubt the system will have enough text understanding to change both texts the same way, so the reader will be left wondering if it is intentional that there's a slight difference and what the author wants to hint at, when there's no such thing implied.

    --
    Assorted stuff I do sometimes: Lemuria.org
    1. Re:strip by celle · · Score: 1

      "It also will cause all sorts of trouble to quoting. How will teachers handle this if a student quotes a text but the quote differs slightly from the version the teacher has read? One of the most important things we teach students is that quotes need to be exactly as they appear, with any omissions or changes clearly marked."

            What makes you think students haven't already gotten in trouble over this? I see lawsuits on the horizon for colleges who kicked out students that this crap caused. Something about false accusations.

    2. Re:strip by Tom · · Score: 1

      You don't need to actually change words, and they probably won't for all the reasons you cited.

      RTFA. Changing words and phrases is exactly what this is about.

      Whitespace is trivial to "fix". There's only one correct way to do it (one space), so a script to correct all of it (removing the watermark) would be an hour of work, tops.

      Changing meaningful whitespace, i.e. linebreaks into spaces or vice-versa runs into the same problem I outlined. As an author, I actually make a choice there and there's a reason for where I start a new line or paragraph.

      --
      Assorted stuff I do sometimes: Lemuria.org
    3. Re:strip by Opportunist · · Score: 1

      I can feel your pain, I, too, write (don't worry, my language of choice is not English). And I usually pick the words I use carefully and with reason. There are minimal but very important differences and my readers usually enjoy my choice of words because that choice by itself often has a meaning.

      In this particular case I don't think that the robot would change words for ones of similar meaning like a badly done thesaurus on steroids. Rather, I'd expect them to add blanks (like, say, putting down two or three spaces between words instead of the usual one), omit a period at the end of a paragraph (where the average reader would probably not even miss it) or introduce subtle spelling mistakes (which is more easily done in languages that use diacritics with minimal impact on word spelling). My bet would be on the introduction of trailing whitespace characters or similar "invisible" things that the average reader would most likely not even notice.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    4. Re:strip by locofungus · · Score: 1

      Reminds me of that Oscar Wilde joke.

      Wilde comes down to lunch.
      "Written another chapter of your book Oscar?"
      "No. I spent this morning putting in a comma"

      Wilde comes down to dinner.
      Sarcastically, "Put another comma into your book Oscar?"
      "No! I took out the comma I put in this morning."

      --
      God said, "div D = rho, div B = 0, curl E = -@B/@t, curl H = J + @D/@t," and there was light.
    5. Re:strip by Opportunist · · Score: 1

      Who said you may still quote in the future? A reference has to do, most likely.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    6. Re:strip by Opportunist · · Score: 1

      Ok, you're right. That's pretty insane if you ask me. Not every word is a suitable substitute of another one in every situation. I'd rather endure someone copying my book than someone altering it and claiming it's from me.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    7. Re:strip by Tom · · Score: 1

      I don't think that the robot would change words for ones of similar meaning like a badly done thesaurus on steroids.

      That is exactly what it does. The article with the examples is in german, which is my native language. There are examples where it changes linebreaks to spaces and such, but most of the examples actually do change words.

      --
      Assorted stuff I do sometimes: Lemuria.org
    8. Re:strip by Huge_UID · · Score: 1

      Often times, there is subtle meaning. For example, I might decide to always use the same phrase in certain contexts, giving a very subtle hint to the reader which things are alike and which ones are different. One he might not even notice consciously.

      "you are in a maze of twisty passages"?

  21. Amazon Kindle Books by Russ1642 · · Score: 2

    I'm going to stop sending every typo and punctuation mistake I catch to Amazon. I thought I was helping.

    1. Re:Amazon Kindle Books by L4t3r4lu5 · · Score: 1

      I've just realised why I've not touched my Kindle in over two months. I remember being too irritated by the poor quality of editing in eBooks, and put it down to Amazon farming out the work to minimum wage interns or leaving it to text-recognition software. I stopped buying eBooks and went back to paperbacks.

      There is nothing so sharp as the jolt back to reality when, midway through a tense scene, bad grammar or punctuation means I have to re-read a sentence several times. They might has well have stamped on my foot, or set off a fire alarm. In the back of my mind I keep thinking over about fucking stupid mistake, like a comma after a space, and how irritating it was. I have to stop reading, because I'm just not in the book anymore.

      If this is DRM, it's costing eBook publishers sales.

      --
      Finally had enough. Come see us over at https://soylentnews.org/
    2. Re:Amazon Kindle Books by kermidge · · Score: 2

      Funny you should bring this up. Read an ebook a few weeks back, a decent sci-fi novel (novella, morelike, given word count) and had run into enough typos that it had in some places seriously disrupted the reading flow. (Often the mind will correct or elide over the error while in hot pursuit of a scene, other times it's like tripping over a pebble on the path, and some few times it's more a full stop and restart.)

      I wrote the author, asking if he'd maybe like some free help to catch the stupidly simple stuff. He wrote back that he didn't see a problem. Which left me with a big case of WTF.

      Apparently I've been spoiled by several generations and more of real writers, real proofreaders and real editors. Real = take their jobs seriously and give a shit about what they do. Now we've a publication landscape populated by semi-literates who routinely get paid for being lazy. It's gotten bad enough that I recently wrote a blogger who had explained something in clear, well-written prose to thank her for a pleasant reading experience.

      What is discussed in the submission is at best lame.

  22. It's understandable by 93+Escort+Wagon · · Score: 3, Funny

    After all, we saw how quickly the iTunes Store withered and died after the DRM got removed from all that music. It'd be crazy for the publishers NOT to double down on DRM!

    --
    #DeleteChrome
  23. easy solution by period3 · · Score: 1

    So just remove all punctuation STOP Like old telegrams STOP Problem solved STOP

  24. Future Confrontation by sehlat · · Score: 1

    "You're under arrest for possession of a pirated copy of "Megasuper Blockbuster."

    "How do you know it's pirated?"

    "There are no spelling or punctuation errors in it!"

    1. Re:Future Confrontation by c0lo · · Score: 1

      Possession?

      The way I looked, it is the uploading and downloading that may enter under the incidence of copyright law (upload infringes on the "distribution rights" while download infringes the "reproduction rights"). In some countries, the download is actually legal (as long it's not for profit)

      Possession? I can imagine heaps of ways in which one can be in the possession of a digital copy without be necessarily responsible for copyright infringement.
      Imagine someone places a bootlegged copy on the shelves of a movie rental store or in a library. Are these institutions to be held liable for that?
      Assume one pays for a genuine fashion item but is delivered a good counterfeit (only an expert could tell the difference): is the buyer to be held liable for that?

      --
      Questions raise, answers kill. Raise questions to stay alive.
    2. Re:Future Confrontation by stdarg · · Score: 1

      Assume one pays for a genuine fashion item but is delivered a good counterfeit (only an expert could tell the difference): is the buyer to be held liable for that?

      Believe it or not I read about a similar law being proposed in New York City.

  25. Should be easy to defeat by AaronW · · Score: 1

    It should be fairly easy to defeat. All someone needs is several different copies of the book and do a comparison. It should be easy to spot what has changed and then undo them.

    --
    This post is encrypted twice with ROT-13. Documenting or attempting to crack this encryption is illegal.
  26. This idea is as new as my grandma by Stonefish · · Score: 3, Insightful

    There were printers in areas with classifed documents which automatically used to do this. They worked with whitespace, fonts and punctuation. Photocopies of the documents could still be tracked. Great work guys you deserve a badge.
    Amazon will be able to close the loop by automatically downloading the books that you have on your kindle to "check" that you don't infringe and stomp on those badguys.

  27. Re:Defeated in one...two... by Anonymous Coward · · Score: 1

    I thought that virus that I cleaned off my system seemed to make my internet access sluggish. Well what you know - it must have downloaded a copy of all my files!

  28. So obviously by Anarchduke · · Score: 1

    After you run a couple of copies through to strip this DRM, you need to add your own back in so their DRM verifier will translate it to, "I bet you thought this technique was clever, you fucking git."

    --
    who prays for Satan? Who in 18 centuries has had the humanity to pray for the 1 sinner that needed it most? ~Mark Twain
  29. You catch the buyer. by stanlyb · · Score: 1

    So what? What that does prove? That someone (maybe the one who bought the book, maybe not) took this book and shared it???
    I still don't see how based on such a funny "watermark" they this could stand in the court. Anyone? Can you prove me wrong?

    1. Re:You catch the buyer. by coId+fjord · · Score: 1

      You don't think these companies actually care whether or not the amount of evidence they have proves anything, do you?

      --
      Check UIDs. I'm COLD FJORD(826450). User COID FJORD(2949869) has impersonated me. Don't confuse us if he trolls you.
    2. Re:You catch the buyer. by turp182 · · Score: 1

      It wouldn't be criminal court, it would be civil court where the burden of proof is much lower. More than likely it would never reach a court room, there would be a settlement demand and then negotiations. This approach (massive settlement demand operations) started with satellite pirates, then eventually moved into music and movies, and, apparently now books.

      If the work was materially changed, in that characters or punctuation were altered, one could potentially argue that they didn't receive the original work. But probably not as their copy would still be over 99% the same as the original (or 99.999% the same for a Stephen King or George R. R. Martin novels given their length...).

      --
      BlameBillCosby.com
    3. Re:You catch the buyer. by gnasher719 · · Score: 1

      So what? What that does prove? That someone (maybe the one who bought the book, maybe not) took this book and shared it??? I still don't see how based on such a funny "watermark" they this could stand in the court. Anyone? Can you prove me wrong?

      Are you willing to try it? In a civil court, I'm quite sure that "this is the exact copy of the book that was sold to X" will stand up in court. And if you were found to have that copy, and you are not X, then this will be enough evidence that you have an illegal copy, unless you can get X to testify that they sold it to you.

      On the other hand, if you are X and your copy of the book was available for download on the internet, you'd also have to have some pretty good excuse. Saying "I gave only one copy to Y and he swore he would delete it after reading it and I'm really sorry" might be a good enough excuse and put the onus on the other person.

  30. chinese ebook pub qidian.com by Anonymous Coward · · Score: 1

    chinese ebook pub qidian.com had used this technique for pass few years without any success. The pirate just compares multiple version of some book and auto replace the differ words with their synonyms.

    1. Re:chinese ebook pub qidian.com by wvmarle · · Score: 1

      As you're talking about China, are you talking about pirates that are commercial businesses reselling those works? Or just individuals that like to share their stuff with friends and friends of friends?

  31. will be quickly cracked by Khashishi · · Score: 1

    Just two copies of a book are probably enough to learn how to break the system, and a few more to know how to rig the text to target a particular poor schmuk.

    1. Re:will be quickly cracked by coinreturn · · Score: 1

      Just two copies of a book are probably enough to learn how to break the system, and a few more to know how to rig the text to target a particular poor schmuk.

      At least all the pirates will be buying multiple copies!

  32. Decades? Try centuries... by dbc · · Score: 4, Interesting

    Shortly after the moveable type press got going in Europe, books of tables of interest rates were popular among the merchants. Of course, they all had to be laboriously hand calculated by mathematicians (long division was college undergraduate math in those days...). Publishers would sprinkle errors into the least signficant digits on various entries to use as evidence in copyright cases. Because, you know, if you had a printing press, you could make good money by pirating somebody else's table of interest rates.

    1. Re:Decades? Try centuries... by Opportunist · · Score: 1

      The difference is maybe that the same publishers that earlier benefited from piracy are now the ones that fight it.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  33. That explains it! by dbIII · · Score: 1

    So that explains why the paid for ebooks of older texts have a pile of annoying mistakes while the Project Gutenberg version doesn't. I'd thought it was just publishers being sloppy and having very little respect for their customers, but at least now I know it's because they have even less respect for their customers and think their customers are thieves that want to "steal" the older books the publishers are not paying any royalties on.

    Enough ranting at the big guys who are going for maximum dollar extraction from public domain stuff - anyone know how small publishers are coping with ebooks? Is it giving them more of a chance since distribution can be done on the net or are Amazon, Kobo etc locking them out? There were a lot of areas, such as non-US/UK science fiction, where publishers would have trouble finding more than half a dozen shops that would sell their stuff.

    1. Re:That explains it! by xenobyte · · Score: 1

      All the more reason to pirate them. Why pay for something intentionally made defective?

      --
      "For every complex problem, there is a solution that is simple, neat, and wrong." -- H.L. Mencken (1880-1956) --
  34. What about stolen phones? by Gumbercules!! · · Score: 2

    So if my phone gets stolen and my eBooks get leaked, I'm now double screwed?

  35. Re:Waste of time by SuricouRaven · · Score: 1

    It'd only be a criminal matter if someone tried to get the DMCA anti-circumvention measures or the NET act involved. Your basic copyright infringement is a civil matter, so the burden of proof is lower.

  36. Probably older than your great-grandma by petes_PoV · · Score: 1

    I'm told map makers have been doing this forever. They move symbols slightly, change the placing of text and introduce new, insignificant features. All to stop other publishers from copying their maps, or using them as the basis for maps they pass off as their own work

    --
    politicians are like babies' nappies: they should both be changed regularly and for the same reasons
  37. oh, let's grant them a patent! by stenvar · · Score: 1

    Of course, it's an idea that has been around for ages, even for electronic documents. Of course, it doesn't meet the criteria of patentability or even publishability.

    But, I say, let's give them a patent anyway. I think any dumb idea, and in particular any DRM method, deserves a patent granted exclusively to patent trolls. We should even let them get away with "renewing" it indefinitely by the usual dumb stunts.

  38. Re:The new black by maxwell+demon · · Score: 1

    Wel, ad leest wee nou ann exxcuse hav four ourr speling annd grammer erors. ;-)

    --
    The Tao of math: The numbers you can count are not the real numbers.
  39. Great by Fuzzums · · Score: 1

    We're not allowed any more to share or give away a book after reading it.

    --
    Privacy is terrorism.
    1. Re:Great by turp182 · · Score: 1

      If you only shared with friends and family you would never get caught (and even if you were I doubt they would pursue you). The issue is sharing to the public.

      I've shared my physical copy of All My Friends Are Dead with over 50 people (it takes 90 seconds to read, but it is fabulous). I'd have to work to share it with thousands of people (maybe open a library for it...). Online sharing makes it trivial to share to the masses.

      --
      BlameBillCosby.com
  40. Limp by The+Cat · · Score: 1

    If you are going to any trouble to pirate a 99 cent e-book, you need a job.

    1. Re:Limp by tepples · · Score: 1

      If you are going to any trouble to pirate a 99 cent e-book, you need a job.

      For one thing, it's just as illegal as copyright infringement for some people to even have a job.

  41. plus ca change by backwardMechanic · · Score: 1

    From what I've seen of the copy editing in ebooks, this has already been happening for some time. Or they're just badly checked. I can't wait to see the first science books coming through with random additions in the equations - that will be helpful.

    1. Re:plus ca change by Chrontius · · Score: 1

      You could try looking at my high school math book.

      Some of the equations in the physics chapter were giving results that were off by as much as 14% from the same equation presented in a physics book, or in Wikipedia.

      Naturally, I was the only guy taking both classes simultaneously, and spent six hours trying to figure out what the hell went wrong that night, skipped all the rest of my homework, and turned in a corrected version of my homework for a 2% score. (I got one question wrong enough, it approximated the answer key!)

  42. You just failed your exam by Chrisq · · Score: 1

    What makes you think Lincoln said "Fourscore and seven years ago our fathers brought forth on this continent a new country, dedicated to the proposition that all men are made equal, and conceived in liberty"? Can't you quote correctly?

  43. Too much work by Camael · · Score: 2

    Your solution is plausible, but it would be too much work and expense for the average ripper.

    The idea is not to have an unbreakable DRM scheme, which would be impossible to create anyway but to raise the cost and difficulty of breaking the scheme to dissuade the casual ripper.

    I'm not even sure that the average joe knows how to "use a statistical analysis to blank out the differences". I certainly don't.

    Plus the fact that it doesn't sound like the results they obtain from that exercise is applicable across the board to different books, meaning they need to repeat this process for every single DRMmed book, ad infinitum.

    1. Re:Too much work by ACE209 · · Score: 4, Insightful

      ..., but it would be too much work and expense for the average ripper.

      Until someone writes a program for it, so the average ripper only has to push a button.

      --
      "we are all atheists about most of the gods that societies have ever believed in. Some of us just go one god further."
    2. Re:Too much work by flyingfsck · · Score: 4, Funny

      why just strip out all the punctuation who needs commas full stops and capital letters anyway everything is still perfectly readable

      --
      Excuse me, but please get off my Pennisetum Clandestinum, eh!
    3. Re:Too much work by AmiMoJo · · Score: 1

      Actually you don't even need particles in English:

      "why just strip out all punctuation who needs commas full stops capital letters anyway everything still perfectly readable"

      English can be strange at times. Both verbose and wordy but simultaneously devoid of real meaning or even clarity.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    4. Re:Too much work by preflex · · Score: 1

      Actually you don't even need particles in English:

      "why just strip out all punctuation who needs commas full stops capital letters anyway everything still perfectly readable"

      You stripped a conjunction and a copula, but you left the particle in place.

      "why just strip all punctuation who needs commas full stops capital letters anyway everything still perfectly readable."

    5. Re:Too much work by K.+S.+Kyosuke · · Score: 1

      I'm not even sure that the average joe knows how to "use a statistical analysis to blank out the differences". I certainly don't.

      If you have, say, more than just a few different copies, it's all a matter of diffing and voting by concensus. If the distribution of the canary edits is any particular document version is random, any one of them is likely to be absent in a majority of the copies.

      --
      Ezekiel 23:20
  44. Useless but interesting by Yvanhoe · · Score: 1

    It means that to safely pirate ebooks, you will need to rephrase and repunctuate it to remove the watermarks. How much do you need to modify a text to make it a derived work? I wonder if this can be done without authors' approval, especially if they add mistakes (yes, punctuation can be erroneous)

    --
    The Wise adapts himself to the world. The Fool adapts the world to himself. Therefore, all progress depends on the Fool.
  45. Re:Some systems MUST be 100% fool-proof by The+Cat · · Score: 1

    Piracy has absolutely zero impact on e-book sales.

    a) 70% of the best-sellers are romance novels. Most pirates are neckbeard fatasses who are afraid of women.

    b) 70% of e-book readers are women. See above.

    You could pirate e-books naked in front of a police station. Nobody would give a shit.

  46. Re:OCR scan hardcopy by Chrontius · · Score: 1

    Scan, or if they're really dedicated, copy longhand.

  47. Re:Waste of time by Opportunist · · Score: 1

    Give it time. I don't think it'll take long 'til you're fully responsible for what happens with files that you pay for.

    --
    We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  48. Re:The new black by Jeremy+Erwin · · Score: 1

    Early printers used the letter 'e' to justify lines. Thus "Queen" and "Queene"

  49. Like the olden days by TheSeatOfMyPants · · Score: 1

    Years before ebooks were sold, there was a community of people that would buy old paperbacks by popular authors, unbind them, run the pages through a scanner, maybe OCR the result, then post as version 0.x; readers would then proofread (often using a physical copy as a reference), bump the version number to reflect how readable/correct it was, and resubmit the book.

    Given the condition of early releases back then and how many more people are into sharing & proofing ebooks, there's no way that a "DRM" scheme that consists of inserting errors will last for long. Especially given the existence of tools like Calibre plugins -- even if Calibre doesn't come with a relevant plugin, it won't take long for someone to develop one.

    --
    Now mostly at Usenet:comp.misc & SoylentNews.org (it's made of people!)
  50. Violating copyright in order to enforce it by deoxyribonucleose · · Score: 2

    Any publishers using this technique had better have iron-clad contracts with their authors permitting arbitrary alterations to their works. Otherwise, they are in clear violation of the authors' moral rights to protection against distortion and mutilation of their original work.

    It's eerily reiminscent of the 'We had to incinerate the village in order to protect it' military communique.

    Anybody know if standard boilerplate agrements from the major publishers actually sign away the authors' moral rights against deliberate mutilations (as opposed to inadvertent proofing errors)?

  51. So personalize them again by bytesex · · Score: 1

    Take the text - run it through your own 'correctifier', and re-distribute. The trail will never stop at you.

    --
    Religion is what happens when nature strikes and groupthink goes wrong.
  52. How does this stop me from sharing from a USB? by Angrywhiteshoes · · Score: 1

    Is there something I don't know about the reader software on my computer? Is it leaking info about what books I'm reading?

    On the other hand, how does anyone know if I put it in dropbox and share my dropbox folder with someone? Or rename the file or strip any identifying meta-data and just host it on a private website that requires password to view?

    There's lots of things that don't make sense to me about how this will actually thwart piracy by striking fear into people's hearts. But then again, I (and we) are probably not the intended target(s) of that fear.

    1. Re:How does this stop me from sharing from a USB? by deoxyribonucleose · · Score: 2

      One word: PRISM.

      Perhaps I'm scaremongering, but are you willing to bet against mission creep from using such intelligence assets against so-called terrorism via kiddie porn to copyright infringement? Given how US election campaigns are being financed?

  53. Re:Decreasing the value of the paid product again by Opportunist · · Score: 1

    This is actually becoming a problem with this. Aside of the usual devaluation of the product by adding DRM to it, any book with this DRM you buy becomes a liability. If, for some odd reason beyond your control, this somehow ends up in someone else's hands, you may suddenly become an unwitting "pirate". Your ebook reader gets stolen and suddenly your ebook gets shared all over the internet. And now prove that you didn't.

    Explain again why the heck I should touch any "legal" ebook with this landmine buried inside with a ten foot pole.

    --
    We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  54. Diff the copies and merge by ciantic · · Score: 2

    The trivial counter measure is to get multiple (two might be sufficient) copies with different markings, then run diff on the content and merge (perhaps manually). Of course it gets tricky if the content is closed binary format, but it's still doable.

    1. Re:Diff the copies and merge by Megane · · Score: 1

      If it's in a "closed binary format", the first step before re-distrubtion should be to remove it from the shackles of being encumbered by a proprietary format. It is a moral imperative. Text is text (especially when there are no illustrations); there is no excuse for proprietary formats.

      --
      #naabhaprzrag, #sverubfr-000, #agi-fcbafberq, negvpyr[pynff*=' negvpyr-ary-'] { qvfcynl: abar !vzcbegnag; }
  55. It seems weak to the "return to the average" by aepervius · · Score: 1

    Since those mutation are random, and spread over the whole text, you can just buy or take 6 or 10 text, then compare them all. The difference will be local. Return to the average and you can build a version which is safe.

    Example:
    The red poney, the traitor all, is dead !
    The red poney; the traitor all, is dead ?
    The red poney, The traitor all, is dead !
    The red poney, the traitor All, is dead !
    The red poney, the traitor All, is dead !

    You look at all the changes and find out :

    (The red poney)*5 ,*4;*1 t*4T*1 (he traitor )*5 a*3A*2 (ll, is dead )*5 !*4?*1

    You then compare all the frequency and take the highest. And you get :

    (The red poney)*5 ,*4 t*4 (he traitor )*5 a*3 (ll, is dead )*5 !*4

    In other word all mutation are stripped.

    --
    C. Sagan : A demon haunted world:
    http://www.amazon.com/gp/product/0345409469/
    visit randi.org
    1. Re:It seems weak to the "return to the average" by DrXym · · Score: 2
      The issue here (aside from the differences being more subtle) is how does this master bookz distributor obtain 5 copies of the same book without them being in the wild in the first place? Does he solicit people to send him books or upload them somewhere? Remember if the books are in the wild you are screwed.

      So you have to upload your book to somewhere secret where you trust and hope Mr Bookz will will strip out your id. And if your uploaded book does leak into the wild (because Mr Bookz is an asshole or incompetent about stripping the id), you've just incriminated yourself for no reason. If there is a book in the wild already why risk uploading another copy at all? Why even buy a copy in the first place if you are uploading books and therefore not especially concerned about the ethics of piracy?

      Of course I suppose 1000 people could crowd compile a book, each submitting a page each to produce a frankenbook from the pieces but it would still have to be canonicalized in case the markup, contents, style rule names embedding the id somehow. Perhaps the frankenbook would hash each canonicalized page and the pages that have the same hash are used when the book is stitched together.

      But for all the effort maybe it's easier to scan the paper book in the first place, or hook up a cracked Kindle / Nook / tablet to a flat bed scanner or a screen capture device and make extensive use of analogue hole to strip out most of the watermark.

      In summary, it would be a hard problem to crack.

  56. Better solution and better hackaround by sberge · · Score: 2

    The better solution is to have the author (or translator in case of translated literature) provide multiple versions of a few sentences in the book. And the work-around is to upload only a fraction, randomly sprinkled through the book, to the sharing site which then assembles the pieces from multiple copies, garbling the watermark.

  57. This sounds trivial to bypass. by haitch · · Score: 2

    If you can find 3 independent sources (shouldn't be hard for something popular), then all that should be required is a 3-way diff and use whatever is common with any 2 or more. If all 3 are different at the the same place then use some manual intervention and make your result different again or add another source. The final product cannot then be traced to a single source. Am I missing something?

    1. Re:This sounds trivial to bypass. by morgauxo · · Score: 1

      I'd run it through something similar to a code beautification script, something that would standardize the whitespace, correct mis-spellings, etc...

      Then.. your 3 sources idea might still be a good one in order to check for whole word changes.

  58. Hardly a novel idea by DrXym · · Score: 1
    Throw some extra spaces here or there where it doesn't matter, use similar characters like the different forms of apostrophe or hyphens, twiddle the underlying markup, add unused style rules or inconsequential differences in styling. You could easily find enough bits in there to to "watermark" a book without it being obvious. You'd practically have to canonicalize both books and do a comparison and keep working on the canonicalizer until the results were the same. Chances are by that point that the book would be mangled out of all recognition.

    Someone would have to possess an another copy of the same book (more or less defeating the point of sharing the their own and incurring a personal risk) in the same published form in order to even know that the differences were intentional. Even then it doesn't make them easy to remove, if for example style names or other marks in the book were randomized.

    Similar measures would have easily found the culprit of a mass leak of information like wikileaks. Every page could contain 1 bit of variation based on the user's id and the result page. Each bit you could glean from a page would cut the search space of culprits in half so you'd nail the perp in no time. Even if the document was canonicalized it cannot strip out all the ways that this bit of variation could be sent and wikileaks would be extremely unlikely to be in possession of two independent copies of the same document to even know what to look for.

  59. What are you talking about? by mha · · Score: 1

    The ONLY one they can potentially track at all is the original buyer. What use is it to track the NEXT uploader (*with this method*)? They can find out who he/she is anyway (trace IPs) - which they can already do.

    What they are interested in here specifically is the original buyer turned uploader, because with the current method of tracking IPs they can get an uploader - but they still don't know if he/she is the source.

    1. Re:What are you talking about? by biodata · · Score: 1

      >They can't find out who he/she is anyway (trace IPs). Fixed that. The idea that IP == person has been pretty comprehensively demolished in relevant legal jurisdictions.

      --
      Korma: Good
  60. Get two copies and diff? by LostMyBeaver · · Score: 1

    Seriously, how hard is this? Just download two copies and diff them, then "correct" the difference.

    This is just nonsense.

  61. I wondered what was wrong with those new books... by knarf · · Score: 1

    ...but I have not been able to put a finger on what is was.

    'To question not be or to be, the question that is'...
    'The needs of the one outweigh the needs of the few, or the many'
    'Ask not what you can do for your country but ask what your country can do for you'
    'To each according by their means, by each according to their needs'
    'It was a giant step for man, a small step for mankind'

    Something just did not seem right... now I know.

    --
    --frank[at]unternet.org
  62. And what happens... by Smerta · · Score: 2
    What happens when a legitimate purchaser/owner has the file stolen/copied from his computer? Viruses, friends using his computer, old discarded & unwiped hard drive....

    Now the copyright mafia comes banging on his door claiming he uploaded/pirated the book? WTF???

    Just like taking an IP address and suing the user/owner of that IP for uploading music/movies, this tactic has no teeth. Unless someone has corroborating evidence, there's no proof that *I* am the source of the uploaded file. Only that it is the file that I originally purchased.

    The whole copyright system, and behaviors of content owners, has gotten completely out of control...

  63. Better: Reader Bot by ThatsNotPudding · · Score: 1

    A very good LCD screen, a very good camera, and an autonomous image to text program; probably do War And Peace in a day.

  64. What self respecting Authror by wisnoskij · · Score: 1

    ... Would allow an algorithm to randomly change the punctuation and spelling of even a small portion of their novel?

    --
    Troll is not a replacement for I disagree.
    1. Re:What self respecting Authror by Swave+An+deBwoner · · Score: 1

      I'm not even an Authror and I know for sure that I woodn't.

  65. Re:Waste of time by biodata · · Score: 1

    This is wrong. You don't need to provide evidence of reasonable doubt to not be a criminal. The prosecution needs to provide evidence that you are a criminal. The presence online of a copy of a book you bought is not evidence that you are a criminal. The whole idea is full of fail at every level.

    --
    Korma: Good
  66. Sweet! by morgauxo · · Score: 1

    Everybody needs an English(or other language) textbook that has had some script randomly alter it's punctuation a bit!

  67. sue the publishers! by Sq · · Score: 1

    So, authors should excercise their moral rights (at least here in Europe, they're inalienable rights of authors enshrined in copyright law) and sue publishers for mutilating their works deliberately.

  68. Compare multiple copies by dhasenan · · Score: 1

    Get four or five copies of the book, compare them, and wherever they differ, go with the majority variant.

  69. Re:more randomizing by dhasenan · · Score: 1

    It's unlikely that you would alter the changes specific to your copy. It means they'll have to record the specific changes made for each ebook and compare, and sometimes with your algorithm multiple people's would match, but I imagine oftentimes they'd still be able to match it against a unique person.

  70. Get your money back? by m.alessandrini · · Score: 1

    What about reading it, then asking your money back because it's a defective book? When I buy a book, I want an exact copy of that book, not a randomly altered one.

  71. Pff... by AndrewX · · Score: 1

    diff -c copy1.txt copy2.txt

    sed 's/discrepancy1/correction1/;s/discrepancy2/correction2/;s/etc/etc/' copy1.txt > newcopy.txt

    well, that was hard.

  72. Re:Fred Saberhagen covered this already by RustyTheCat · · Score: 1

    See the short story "The Annihilation of Angkor Apeiron" in the collection "THE ULTIMATE ENEMY, THE BERSERKER SERIES" By Fred Saberhagen

  73. Is this why Harry Potter 1 had different titles? by Aristos+Mazer · · Score: 1

    In Britain: Harry Potter and the Philosopher's Stone
    In USA: Harry Potter and the Sorcerer's Stone

    The publisher could've just said, "No, we're not changing it for Americans who we think will be scared off by the word 'philosophy'. It's just one of our DRM changes that happened to end up in a particularly visible location." :-)

  74. Re:Cordwainer Smith covered this already by Hartree · · Score: 1

    Cordwainer Smith got there many years earlier with Mother Hitton's Littul Kittons.

    In the story, looking up the words Littul Kittons in an electronic encyclopedia acted as a tripwire for a planet's security services.

  75. off v. (slang) to kill by tepples · · Score: 1

    "I had to help my uncle Jack off a horse".

    Does this mean help Jack dismount from the horse, or help Jack put the horse out of its misery?

  76. Re:If I were to pirate e-books ... by X0563511 · · Score: 1

    So, you are saying you want to give the copy you just sourced a detectable fingerprint? How do you not see that as a terrible idea?

    --
    For large sets, this will be our guide even unto death, for the LORD will work for each type of data it is applied to...
  77. Re:Waste of time by gmuslera · · Score: 1

    I'm mostly ok with the watermarking of the ebook if there is no DRM involved, as it don't limit how/when/where i will use what i bought. But using the watermark to label criminal to someone (and WILL be used that way, knowing the trend even could put you in jail for decades) is dangerous, suddently not having perfect security or privacy becomes a liability, a stolen phone or tablet, a rooted/botnetted/shared pc, not understanding privacy on social networks or just rogue NSA agents could turn into a nightmare ever buying an ebook.

  78. Re:more randomizing by Swave+An+deBwoner · · Score: 1

    You may not be able to "buy it used" like a dead tree book.

    I know for example that Barnes and Noble sells ebooks that are tied to the buyer's account and some of them can be loaned to a friend for a limited period of time. (And that's a feature that B&N advertises, you silly person who thought that you can give away or sell what you bought with your own money.)

    I'm pretty sure that Amazon does the same with the ebooks they sell.

  79. this has been done since 1994 in UK by eionmac · · Score: 1

    1 this was routine in all secure embargo-ed documents issued in UK by certain government departments and other organizations (e.g with Politicians on distribution list) so that tracing a 'leaked document' was possible. 2. Sometimes we changed complete paragraphs ( without changing the sense). 3. It caught a few folks.
    so very 'prior art' no patent possible.

    --
    Regards Eion MacDonald
  80. Re:Waste of time by viperidaenz · · Score: 1

    Yes I got my sentence a bit mixed up. You got the point.

    If I rob your house, and the police find your stuff in my living room - and they know its your stuff because of the serial numbers, is that not evidence to prove I robbed you?

  81. Re:Waste of time by viperidaenz · · Score: 1

    Wilful and for profit copyright violations can be prosecuted.
    Its been that way in USA since the 1897.

  82. Re:If I were to pirate e-books ... by doccus · · Score: 1

    So, you are saying you want to give the copy you just sourced a detectable fingerprint? How do you not see that as a terrible idea?

    I'd tend to agree with you.. since it is apparent that most anti-piracy approaches appear to simply be altered versions of the old communist techniques used to spy on and control ones' own citizens.. ie.. if you know what someone reads, where they obtained it, and what their reading habits are, or.. more recently, if you know what someone watches, what they share, who they share it with, or, what they listen to where they bought it, or obtained it etc.. you can either stop them from doing it, *control" what they read, listen to, or watch, such as "Pravda" did, collect information on them, or , as they did in Soviet Union, do all three, all the while without any signs that this indeed being done. In order to use these techniques to protect copyright, only difference is that they would have to show their hand (blow their cover. so to speak). Copyright laws are a windfall for KGB and Stasi-like organizations.. such as CIA and DHS...

  83. Re:Waste of time by SuricouRaven · · Score: 1

    The NET act made it even easier. It makes all commercial electronic copyright infringement criminal - but defines commercial as including an expectation of receiving infringing works in return. So basically, p2p.

  84. Re:Waste of time by biodata · · Score: 1

    Well... assuming that your stuff isn't in your living room anymore, and that the serial numbers are unique, then it seems I have a case to answer. This case seems more like your stuff is still in your living room, and there is also a photo of your stuff in my garden, I think the onus would be on you to prove that I took a photo of your stuff and left it in my garden.

    --
    Korma: Good
  85. Re:Waste of time by viperidaenz · · Score: 1

    If I prove you took a photo of my stuff that I created and you distributed that photo, if you're in the USA you've just committed copyright infringement.
    If I can prove you done so with the expectation of profiting from the photo or the distribution of it, you're up for a criminal prosecution.