Slashdot Mirror


Google To Resume Scanning Books

SenseOfHumor writes "The Wall Street Journal is reporting that Google will resume scanning copyrighted books from Stanford and Univ of Michigan libraries. Let the battle resume!" From the article: "It isn't known just what percentage of library holdings fall into the category of being in copyright but out of print. About 18% of the books held by the libraries working with Google were printed prior to 1923 and are therefore in the public domain, according to an analysis by the Online Computer Library Center, a Dublin, Ohio, nonprofit library cooperative. An unknown percentage of the rest still are protected by copyright, depending on whether it was renewed. Google's resumption of its scanning of copyrighted works comes amid heated debate in the library community over participation in the program."

57 of 257 comments (clear)

  1. How exactly are they doing this? by DroopyStonx · · Score: 5, Interesting

    Is there some machine they have that separates all the pages and scans each one?

    How do they verify that the items being scanned are being scanned properly?

    --
    We have secretly replaced these Slashdot mods' sense of humor with a rusty nail. Let's see if they notice!!
    1. Re:How exactly are they doing this? by GoodOmens · · Score: 3, Informative
      Here is one solution:

      http://www.rod-neep.co.uk/books/production/scan/ scanning.htm

      If you notice that it requires someone to turn the pages. While tedious it would protect some of the much older books google will be scanning. If there is a automated soltion I do not know ....

    2. Re:How exactly are they doing this? by GoodOmens · · Score: 2, Informative
      I'm sorry here is a better link to the acctual manufacture of the scanner.

      http://www.imageware.de/

    3. Re:How exactly are they doing this? by Xerotope · · Score: 5, Interesting

      Here's a paper that describes a system at CMU they built for such purposes.

    4. Re:How exactly are they doing this? by Fordiman · · Score: 2, Informative

      Check this out.
      http://kirtas-tech.com/

      --
      110100 1101000 1101000 1100110 0 1101111 1101000 1100011 1
  2. Out of print - fair game by Tim+Ward · · Score: 5, Interesting

    I'm not usually one to support the knee-jerk slashweenie response, that all intellectual property is theft, but I do object to commercial enterprises refusing to sell me something and insisting that I can't copy it.

    It might be their intellectual property but it's my culture, dammit. If they won't keep it in print and sell me a copy, which I'm willing to pay for, then they should keep their mouths shut when I go and find one for myself.

    Anybody got a DVD of Dance of the Vampires they can let me copy then?

    1. Re:Out of print - fair game by dascandy · · Score: 2, Informative

      Nope, but I DO have a full-copy of a book that is out of print for 10 years, only 25 years old in IT. It's one of the few books in IT where age doesn't matter, but which doesn't sell well but is still very useful. So, you can only buy second-hand books at a greatly increased rate (original book between 30 and 50 dollars, second hand cheapest is 150 dollars).

      F*** that, I'll copy.

    2. Re:Out of print - fair game by Kelson · · Score: 2, Insightful

      Yeah, that's one of those cases where the legality is clear, but the ethics are debatable. You're certainly not damaging their ability to "leverage their assets" since they aren't doing anything to leverage them, and if they'd just sell you a copy, you'd be quite happy to buy it. (Of course, this argument would fall apart laughably with physical goods.)

      The saddedst stories are the ones about silent-era films left practically rotting in the studio vaults. If the studio doesn't think they'll get any money for it, they have no reason to maintain/restore the old film, but film historians, who have every reason to want the film preserved, can't get at it.

      On the other hand, you occasionally hear singers, authors, etc. talking about that early album or novel that they wish hadn't been published, and joking that they want to buy up every surviving copy and burn it.

      And then there's the question of material that has never been published in the first place...

    3. Re:Out of print - fair game by Scrameustache · · Score: 2, Insightful

      I do object to commercial enterprises refusing to sell me something and insisting that I can't copy it.
      It might be their intellectual property but it's my culture, dammit.


      Ah, but the owners of the Disney Vault disagree... and it so happens that they also own some key lawmakers.
      They like creating artificial scarcity in order to raise the asking price. It pleases them. The law of supply and demand is a lot of fun when you controll the supply.

      --

      You can't take the sky from me...

    4. Re:Out of print - fair game by shawn(at)fsu · · Score: 2, Interesting

      You're certainly not damaging their ability to "leverage their assets" since they aren't doing anything to leverage them, and if they'd just sell you a copy, you'd be quite happy to buy it.

      Disney does this or rather did do this with it's movies, they would go on moratorium for a while and you couldn't buy it. Snow White, sleeping beauty etc. I only know this from my experience with working at a video rental store, If someone left the tape in the back of their car in sunny Fl it would get all melted/warped and we needed to replace it.

      Not that I am defending the copyright holders or anything I'm just playing devils advocate. I know you said this falls in to the realm of laughable when you talk about physical products, but that's what books are right and you also mentioned silent films.

      Again I'm not defending the copyright thing here, but giving you a counter argument.

      --
      500 dollar reward for tip(s) leading to the arrest of the person(s) who stole my sig.
    5. Re:Out of print - fair game by neoform · · Score: 2, Informative

      Ever heard of the "Disney Vault" as they call it?

      They remove their movies from the market for 10 years in order to create artificial demand.

      --
      MABASPLOOM!
    6. Re:Out of print - fair game by Mechanik · · Score: 4, Funny

      Anybody got a DVD of Dance of the Vampires they can let me copy then?

      Actually, since according to IMDB this movie is just the Fearless Vampire Hunters by another name, you are in luck as it was recently released to DVD

      Mechanik

    7. Re:Out of print - fair game by natophonic · · Score: 2, Insightful

      I'm not usually one to support the knee-jerk slashweenie response, that all intellectual property is theft, but I do object to commercial enterprises refusing to sell me something and insisting that I can't copy it.

      It might be their intellectual property but it's my culture, dammit.


      I think I've figured out to how win the intellectual property wars... recast the battle as a culture war and you've got the support of knee-jerk conservative bush-apologists everywhere! ;)

    8. Re:Out of print - fair game by Simonetta · · Score: 2, Informative

      I support this 100%. By bribing politicians to extend the copyright period, the global media corporations have stolen the public domain. For that reason, they have forfeited any claim to any intellectual property that they believe that 'own'.

            Suppose that you have bought a car and are making regular monthly payments on it. After five years you have one more payment to make and the ownership of the car is completely yours. The finance company bribes a politician, who puts a rider into a huge bill that can't be prevented from passing into law, and suddenly all the monthly payment schedules for cars are doubled. You won't own the car for another five years and you must continue to make monthly payments on it in order to prevent it from being repossessed. It's great for them, bad for you.

          The copyright laws work the same way. For a set period, everyone must pay to consume the copyrighted material. After that period is over, the consumption of the material is free. By extending the period that the public must pay in order to consume this material, the public has had their free access stolen from them.
      This theft vastly exceeds any financial loss from people ignoring copyrights. And the copyright period has been extended every time that it has been set to expire. Which is a consistent pattern of criminal behavior. The global media companies, under the RICO act of the USA, are criminal organizations and have no legal right to demand people be placed in prison or be fined for accessing what they claim is copyrighted material.

          The USA courts think nothing about stealing a person's house or car if they are found with a few pennies worth of cannabis on their person. This is 'asset forfeiture' and is a sweet little legalized theft business for the police and their associates.

          When we make copies of materials that a supposedly 'owned' by the global media corporations, we are making a citizen's 'asset forfeiture' of the property acquired by these criminal organizations through illegal means, through the theft of the public domain.

          There is no such thing as 'theft' of intellectual property. There is only the use of violence to either remove or protect the world's culture from the use by the people of the world. Everything else is just a legal smokescreen.

        Nobody has any intellectual property rights to anything anymore in the information age. If you understand this basic concept, you will be able to make intelligent and balanced decisions concerning your own use of cultural resources and 'intellectual property'.

  3. I can not wait by w.p.richardson · · Score: 4, Insightful

    until this is available! I can't count the number of times that I have been flipping thru a book and wished I could use the search function, only to realize that it doesn't work in meatspace. Searching is up to me, using old fashioned tools, like the table of contents or the index!

    --

    Curb CO2 emissions: Kill yourself today!

  4. Good. by Gothic_Walrus · · Score: 4, Insightful
    Books are fragile. They need to be preserved somehow.

    Would Google be allowed to store scanned copies of books even if the authors opt out? Someday, those print copies are going to be destroyed or deteriorate to the point of uselessness, which means that Google could be archiving works that might otherwise be lost forever.

    I still don't get the uproar over the scanning, because it's not like the entire book is made available for free. The search is so crippled that it makes me think the people who are upset have never used it before.

    --
    Goo goo g'joob.
    1. Re:Good. by daniil · · Score: 2, Insightful

      Books are fragile. They need to be preserved somehow /.../ which means that Google could be archiving works that might otherwise be lost forever.

      Nonsense. Books are far less fragile than any of this digital crap. Drop a book, and nothing bad happens to it (unless you drop it into water). Drop a hard drive, and it's dead. All in all, it is more likely that digitally stored information will be lost forever than a book.

      --
      Man is a slave because freedom is difficult, whereas slavery is easy.
    2. Re:Good. by MindStalker · · Score: 2, Insightful

      Yes, and no. Digital information while easily lost can be copied indefinatly. Joe shmo making a scanned copy of a book is more likly to lose their data before that book is destroyed. But we are talking about a consistantly archieved raided to the max system of storage. Googles storage of the huge amount of material from website caches to searching index is googles product. There is litterly billions of dollars invested in Googles data. If they wern't being discustingly obnoxious in their backup system, their investors would have a cow.

    3. Re:Good. by Kelson · · Score: 2, Informative

      The big advantage of paper is that all you need to view it are your eyes. (Or your fingers, if it's written in Braille.)

      On the other hand, anyone who has visited the US National Archives can see the efforts needed to preserve the original US Constitution and Declaration of Independence. We have zillions of copies, but the originals are kept in sealed vaults with limited lighting. Exposure to air damages the paper. Exposure to light fades the ink. They've struck a careful balance to make sure that people can still see the physical artefact without destroying it.

      Both paper and digital need to be copied to new physical media from time to time. It's a lot easier to make that digital-to-digital copy!

    4. Re:Good. by oliverthered · · Score: 2, Informative

      You know nothing about paper do you?

      Most modern books will be printed on acidic wood pulp paper (as apposed to acid free hemp, cloth (cotton based paper) or more expensive acid free wood pulp). Over a period of time the acid will erode the paper until between 20-60 years later all your left with is crumbs.

      Modern print is crippled by paper tech that means you'll never have a copy survive until it's out of copyright, just like modern digital is crippled by DRM and can never be released into the public domain by someone who owns a copy.

      --
      thank God the internet isn't a human right.
  5. Funny how things change... by conJunk · · Score: 5, Interesting

    A lot of folks are going mental about the "copyright implictions" of google books, and I'm just laughing. On my bookshelf is a first-edition colleciton of George Bernard Shaw plays, printed in the UK in 1911. There's a legend on the inside cover that is a reference to the U.S.'s lack of copyright laws at the time: (paraphrasing from memory:)

    Please note that unauthorized editions of the book are available in the United States, and that neither the author nor the publisher has received any payment for them.
  6. Hmm. by daniil · · Score: 5, Insightful

    FTA: "I feel that this is a potential disaster on several levels," said Michael Gorman, president of the American Library Association and university librarian at California State University, Fresno. "They are reducing scholarly texts to paragraphs. The point of a scholarly text is they are written to be read sequentially from beginning to end, making an argument and engaging you in dialogue."

    The sad thing is, scholarly texts are so abundant nowadays that it's neigh impossible to keep oneself current with all the new things published. Already there are magazines that only (or mostly) contain abstracts or reviews of new dissertations and articles. I fail to see how Google Print is a greater disaster than this. If anything, it'll only improve the situation.

    --
    Man is a slave because freedom is difficult, whereas slavery is easy.
    1. Re:Hmm. by joelpt · · Score: 2, Insightful

      Well, this librarian just doesn't get it. He thinks that by making the texts searchable, that readers will consequently read just the 1-2 sentence snippets and that will be the end of it.

      In reality, readers will actually just locate the texts pertaining to their search, and then read those texts in their entirety outside of the search results page -- inasmuch as they choose to. Presumably, scholarly readers would choose to read the entire texts more often than others.

      Following his reasoning, we would all have incomplete understandings of many paperback novels which we've flipped through in a bookstore. Which may in fact be true, if we didn't read the whole book -- but we read all that we *wanted* to. And we will continue to do so with the emergence of Google Print.

  7. Did I miss something? by Descalzo · · Score: 5, Insightful
    Assuming Google won't make these scanned copies available to everyone to just read on their Palm Pilot, what is the big deal? I would think that it would be to the copyright owner's benefit to have their text in a database to be searched. If Google has my book in its database, and someone searches for text that happens to be included in my book, doesn't that make it more likely someone will buy it?

    Seriously, though, I feel like I'm missing something here. What is it?

    --
    I cried real tears when Li Mu Bai died.
    1. Re:Did I miss something? by slavemowgli · · Score: 2

      It's all about control - the publishers don't like the fact that Google didn't grovel at their feet and offer to pay for the priviledge of exercising its fair use rights.

      --
      quidquid latine dictum sit altum videtur.
  8. Intellectual Property Taxes by Safe+Sex+Goddess · · Score: 5, Interesting
    There is nothing new created in the world that is not built upon the foundations of human history. What is created today in science, technology, and the arts, is built on the foundations of those who created before them.

    As a recognition of this debt to society, intellectual property that is not in the public domain should be taxed. Just as our other physical property is taxed, why not intellectual property?

    And the taxes can be used to invest in new science, technology, and the arts.

    This has the added benefit of also moving a bunch of stuff into the public domain.

    If the taxes aren't paid within two years, then the item moves into the public domain. If you aren't sure on the status of an item, see if it has had IP taxes paid in the last two years. If not, then it's free!

    --
    Abstinence is a government conspiracy. www.SafeSexZone.co
  9. Index! by Karma_fucker_sucker · · Score: 3, Informative

    There are actually books that do not have an index. And boy is it a pain in the ass! I can understand why. From what I've heard from authors, indexing a book is the most boring and tedious thing to do.

    --
    Evil people don't think they're evil. - George Lucas, Making of Ep III
  10. Book Tablets by GoodOmens · · Score: 4, Insightful
    This will make those book tablets that never really sold a neat idea.

    1.Connect to google.

    2.Download a rare book only found in a handfull of libraries.

    3.Go read it....

  11. 1923 - 1990: the gap years by G4from128k · · Score: 3, Insightful
    The move to digitize out-of-copyright works combined with the natural movement of new materials to web creates a kind of internet information gap. Many things appearing after 1990 or so are now on the net in some form or other. And things that occurred before 1923 are in material now lapsed from copyright and will be on the net courtesy of Google. Things in between are too old to be on the net directly and too new to be out of copyright.

    In 75 years, give or take, the gap will close for oldest years on. But for a while the internet will not have as much on a wide array content on pre-digital topics.

    --
    Two wrongs don't make a right, but three lefts do.
  12. It's a shame... by coupland · · Score: 2, Interesting

    It's a shame to see many libraries taking such a regressive approach to this. I happen to think libraries and access to information are the cornerstone of a free society -- they level the playing field and provide education and knowledge to people who otherwise couldn't afford it. However as a result of the digital age, our current standard of living and draconian copyright laws, the importance of libraries has diminished of late, as people turn to the web for information, buy their books at Barnes & Noble instead of borrowing them, or are merely incapable of borrowing copyrighted material because of digital or legal barriers created by content owners. What Google is doing here is progressive and in the spirit of free access to information. They're not giving away books, here. They're merely adding the ability to search for and view specific pieces of information. If anything this will only increase demand for the books they're scanning. And on top of that, it will provide access to information for a new generation who might not otherwise be able to afford it. Really now, I thought most librarians were more hip and with-it than this...

    1. Re:It's a shame... by BoRegardless · · Score: 2, Interesting

      Plagarism becomes a lot harder when wide searches can be done, in addition to other benefits of scanned books.

      Ultimately, a "whole earth" attitude toward information and learning is going to shift increasingly away from paper, so it is a matter of time before it is going to be more efficient for libraries to hold server banks in "the stacks", than actual paper copies, of which they can only have a very very limited small % of the works out there.

      Time to get in with the new era, and actually, these recalcitrant authors, libraries, and publishers are similar to the music publishers, in that they are not taking the long and wide view of what digital libraries means to the widest range of interests of the sectors of societies.

      If my out-of-print book were digitized and someone wanted a copy of the 'last chapter', why shouldn't I put a price on various digital copies of my work, and let Google give me a dime for each chapter and a dollar for the whole book?

      Sheesh

  13. I love the fact by gregbains · · Score: 4, Insightful

    Its great that a lot of the people opting out think this will save them, when all you need to do is visit local library and get entire book to read for free. Surely this is worse?

  14. Correction by dq5+studios · · Score: 3, Interesting
    Google's resumption of its scanning of copyrighted works comes amid heated debate in the library community over participation in the program."
    It's my understanding that the library community and most authors support this. It's the publishing industry that's having a hissy fit.
  15. Lessig's Tough Call by loggia · · Score: 5, Interesting

    My colleague Jamie wrote the following letter to Wired yesterday regarding Lawrence Lessig's column supporting Google Print.

    I think she makes some compelling points about the problems with Google's plan...

    -------------

    Lessig's Tough Call

    In defending Google Print ("Google's Tough Call," issue 13.11), Lawrence Lessig and others overlook one thing. If the publishers and authors have no rights to prevent this, what rights does Google have to protect its own extensive efforts in creating this database? By their own arguments, the answer must be: none. Google does not own the raw data. In almost talking point fashion, Google, Lessig and others describe this as nothing more than a "card catalog." This description could come back to haunt Google, as the only thing they own is their original presentation of the data itself. And the image of a card catalog does not bring to mind "originality."

    If the Google DRM is broken and I create my own "Jamie Print" index on the web... without Google's ads... what basis would Google have to argue? Google can scan a million books and by Lessig's arguments, that investment is irrelevant. If I find a way to download those million books from Google, store the data and use my own search engine, Google's supposed benevolence in creating this project will be hard to swallow amidst a flurry of lawsuits against my superior ad-free index. Google would have little basis to sue except under the DMCA, a statute whose very existence is vilified by Lessig and the very people defending Google Print as progress (and I don't care for it either).

    If Google's investment in the project cannot be protected, they may have little incentive to create this and other projects. Isn't this much the same for the publishers and authors seeking protection for the right to control their work? Lessig defends Google Print in the name of progress, but progress is a careful balance of reward and public benefit. Google might not create Google Print if it cannot profit from the ads it inserts and publishers may lose out if they cannot choose how to profit from their properties.

    It is almost inevitable that Google Print will be subverted and Google will seek the very same protections that it claims the publishers should not have.

    Jamie Cole
    New York, NY

    1. Re:Lessig's Tough Call by judmarc · · Score: 3, Insightful

      Why would Google care? You can make "Jamie Web Search" right now and Google has no right to stop you. Go ahead, index the Web. MS and Yahoo have, and Google isn't suing them. It isn't the *data* that's the secret sauce, it's the *search algorithm(s)*. The very same is true of Google Print.

  16. spread information by dingDaShan · · Score: 5, Insightful

    I currently attend the University of Michigan, and have encountered much frustration in tracking down books. The university has several libraries and finding some books is nearly impossible. Additionally, the University has old collections and manuscripts that are barely indexed in the University's system. For the purposes of research, scanning the books is a dream come true. Searching for keywords, the ability to quickly find books, and the ability to view old manuscripts that one would normally need to be present at the library (and under supervision) to view. The copyright issue is important, but the books that are in public domain (primary sources especially) should definitely be scanned. As for the copyrighted books themselves, Google does not allow the full book to be viewed. If anything, Google advertises for these books. For a student such as myself, I would not buy the book as it is, so what is the harm?

  17. Digital information is just too volatile by October_30th · · Score: 2, Insightful
    Huh?

    Have you ever thought about how much more effort it takes to destroy a book in comparison to the effort it takes to destroy its digital copy?

    It's the same thing with all digital data: in a few centuries this era will be called the dark ages of information - most of the historical data (text, images, sound) will be lost because it was stored on media that just couldn't hack it. People are just too eager to store precious data in a digital form just because it is convenient.

    --
    The owls are not what they seem
  18. Increase your Google Page Rank! by mOdQuArK! · · Score: 3, Funny

    You've got to wonder if there will be books published where the contents are designed to increase the book's Google Page Rank.

  19. Google[black]mail by matt+me · · Score: 2, Insightful

    Though I personally believe what Google are doing is not ethically/morally wrong, they are most probably 'breaking' our unjust (injust?) copyright laws. The only reason they are 'getting away' with it is because they are the most powerful domain on the net. No-one dares mess with Google.

    A law suit against Google is very bad publicity, and they could subtly drop your page rank and you'd never notice until the visitors stopped coming.. or even remove you completely.

    1. Re:Google[black]mail by 99BottlesOfBeerInMyF · · Score: 3, Informative

      hough I personally believe what Google are doing is not ethically/morally wrong, they are most probably 'breaking' our unjust (injust?) copyright laws.

      My research into the subject suggests the opposite. Although the laws are somewhat vague, Google appears to meet all four criteria for fair use and every single district has filed supporting briefs supporting a case with significant precedent, except the district in which the case against google has been filed. I suspect this is because the lawyers involved know they will be unlikely to prevail in the end, but are hoping to win the initial case and force the issue to the supreme court, possibly with an injunction in place. This is because they hope to delay and possible temporarily stop Google's actions while they try to get laws pushed through the courts to make what Google is doing illegal.

      The only reason they are 'getting away' with it is because they are the most powerful domain on the net. No-one dares mess with Google.

      I think you are overstating Google's influence by a lot. First, the people suing Google don't care if they are findable by Google as they are not a consumer facing body. Second, they are a bunch of middle men, what do they care about publicity? Will you stop buying books from those publishers and hurt the authors (who mostly support Google's actions)? No I think you have this backwards. Google is legally going to prevail, and these publishers are just delaying while trying to pass some laws to avoid the future possibility of being cut out of the deal. They fear for their position as middle men and are fighting hard to stop anything that might be progress.

    2. Re:Google[black]mail by dennison_uy · · Score: 2, Interesting

      Though I personally believe what Google are doing is not ethically/morally wrong, they are most probably 'breaking' our unjust (injust?) copyright laws. The only reason they are 'getting away' with it is because they are the most powerful domain on the net. No-one dares mess with Google.

      About 18% of the books held by the libraries working with Google were printed prior to 1923 and are therefore in the public domain

      From what I gather, these books printed prior to 1923 are considered public domain and hence, no longer in the scope of copyright laws. Lawsuits are still bad press for Google as is for any publicly-listed company, and I seriously doubt no-one would "dare mess" with Google simply because they are "the most powerful domain" on the net. If anything else, Google has money and can definitely pay large sums of money to settle any lawsuits hurled against them, which makes it a good move as long as you are at the side of the law. Take a look at guys like RIAA and Microsoft -- they may be powerful both as an entity and economically-speaking, however, it does not render them safe from lawsuits.

      A law suit against Google is very bad publicity, and they could subtly drop your page rank and you'd never notice until the visitors stopped coming.. or even remove you completely.

      Google drops your page rank or removes you? Fine, guess I'll just fire another lawsuit against them. I'm sure there's a law being broken somewhere for this.

      --
      Take off every 'sig'!
      All your 'sig' are belong to us!
  20. People always forget by mcc · · Score: 5, Interesting
    It might be their intellectual property but it's my culture, dammit. If they won't keep it in print and sell me a copy, which I'm willing to pay for, then they should keep their mouths shut when I go and find one for myself.

    What I find funniest about the entire copyright debate is how so few people are actually aware of what a flimsy basis copyright rests on.
    The Congress shall have power ... To promote the progress of science and useful arts, by securing for limited times to authors and inventors the exclusive right to their respective writings and discoveries -- The U.S. Constitution
    Intellectual property rights are not property, nor rights. They're grants (a decidedly un-libertarian form of state monopoly), given by the government, with the explicit intent of promoting the public good. Copyright holders are created for the good of society, not the other way around. The way I see it, once copyright starts being used to limit the creation and propagation of information and culture rather than encourage it, copyright might as well just not exist
    1. Re:People always forget by codyk · · Score: 2, Informative

      At the time, the common usage of those words referred to something else - "useful arts" was essentially what we would consider science and engineering today, while "science" essentially meant human knowledge in any field, including music or visual art.

  21. Out-of-print books by dillon_rinker · · Score: 4, Interesting

    The fact that out-of-print books have copyright protection is further proof that Congress is more interested in hewing to the corporate line than adhering to Constitutional principles. How does preventing any further publication of a work for nearly 100 years promote the useful arts and sciences?

    I would make copyright dependent upon making the copyrighted material available for the duration of the copyright. If it falls out of publication for a year and a day, then the copyright lapses. Making the material available online would be a cheap and easy way to maintain your copyright. Those that don't like that notion are free to publish and warehouse physical copies. In order to close an obvious loophole, I would further require that the copyrighted material be available at no more than the original cost, adjusted each year for inflation.

    1. Re:Out-of-print books by jrboatright · · Score: 2, Insightful

      So, Joe Autor writes a cool new book.

      He sells publication rights to Acme Giant Publishing for oh say $3000.

      Acme Giant publishing prints 1000 books, sells some of them, and pulps the rest.

      One year and a day later, the book falls out of copyright, and Acme prints 10,000 copies and sells them, and Joe Author gets nothing.

      What is wrong with this picture?

  22. Flaunt/Flout--From TFA by adavies42 · · Score: 3, Insightful
    Mr. Gorman, who said the American Library Association doesn't have an official position on the subject, described Google's argument that Web users will be able to look at several snippets and then decide whether they want to buy or read the book as "ridiculous." Further, he noted that as a published author, he opposes Google's intention to build an enormous database that includes copyrighted texts. "It's a flaunting of my intellectual property rights," he said. [emphasis added]
    If the president of the American Library Association doesn't know the different between flaunt and flout, I think civilization is doomed.
    --
    Media that can be recorded and distributed can be recorded and distributed.
    -kfg
  23. What you missed by reptilicus · · Score: 3, Interesting

    What you missed here is that Google intends to make a lot of money from selling ads on the pages where the search results are shown. The publishers see this as someone else taking their intellectual property and using it to make a profit, without asking their permission, and more importantly, without offering them a share of the profits. This is not about fears of people downloading books or not buying the books because they can get a 3 sentence answer from Google (anyone whose question can be answered in 3 sentences wasn't going to buy the book anyway). It's clear that even the publishers involved in the lawsuits realize that this service would result in higher booksales.

    It's a question of how far fair use rights extend. Fair use does not prohibit you from making a profit from exercising your rights (the NY Times Review of Books can excerpt content in a review and profit from selling ads on that page). The publishers argue that Google's use of their material goes beyond fair use, particularly because they're copying the entire work and not just an excerpt.

    It will be up to a court to weigh the many factors involved and come up with a ruling. The obvious question is, if Google loses the lawsuits, what then happens to web search engines that spider and cache copyrighted text without the author's explicit permission? Will they be held to the same standard, and will plain old Google, Yahoo!, MSN Search and their ilk become "opt in" only search engines?

  24. Gatekeepers by Toloran · · Score: 4, Interesting

    This has probably been mentioned in previous articles but oh well.

    Many librarians I have met (not all, or even most, but some) have this weird mentality of "I am the gatekeeper of knowledge, you must have my leave to access the wisdom of the ages." The basically believe that knowledge is so sacred (it is) that only they are fit to gaurd it and distribute it (very not true).

    When I was younger (elementry school, early-mid 90s) and you needed to research something you had to goto the library (either your schools or the public one), use a computer to look up a book (if you knew what it was) or (more often) ask a librarian to help you find books that would be useful for your topic. This gave the librarians great power because it allows them to deturmine all the information you are going to be using. When you learn and retain something, it becomes a part of you, by deciding what you learn they are in essense chaning you.

    Now (for me, ever since middle school), you want to know more about ancient egyptian art? Google it and find 100s of pages of information (well, realistically you will only likely use about 10 of those pages but you get the idea). Want to know more about the 2000 US election? Google it. Before, if you wanted to find out information about certain topics (primarily recent or highly specific) then you were out of luck because often the libraries didn't have it. However, with things such as google and wikipedia, you now have access to almost any information you want from anywhere you have a computer with an internet connection.

    (Beware, point soon approaching. Be prepared to duck)

    Taking all this into account, it is not suprising that many librarians are reacting so harshly to this. They are all for making information more accesible but not if it doesn't go through them. Its like a company with a monopoly that it has had for ages: They've become used to the power and don't want to give it up.

    The world has been slowly changing. It has become more and more difficult to control information. And as the cliche goes: Knowledge is power.

    --
    Speaking is NOT communication
  25. flouting, not flaunting says diction police of ALA by Anonymous Coward · · Score: 2, Funny

    Mr. Gorman, who said the American Library Association doesn't have an official position on the subject, described Google's argument that Web users will be able to look at several snippets and then decide whether they want to buy or read the book as "ridiculous." Further, he noted that as a published author, he opposes Google's intention to build an enormous database that includes copyrighted texts. "It's a flaunting of my intellectual property rights," he said.

    Shouldn't that be "flouting"?

  26. The Kelly v. ArribaSoft argument by Anonymous Coward · · Score: 3, Insightful
    One of Google's arguments is that a marvelous Ninth Circuit decision of a few years ago, one of the few to deal with Internet issues, gives them the right to do what they're doing. But Kelly v. ArribaSoft is a weak reed to rest Google Print's case on. True, there are parallels. The thumbnails of art and photos that Arriba was indexing and posting as the case went to trial are like the short excerpts that Google Print will use and the helpful Ariba quickly complied with opt-out requests, including those by the ill-tempered Kelly who sued.

    Both the district and appeals courts stressed the service that Arriba was providing to everyone by linking to web sites where an artist/photographer had made his art available online. The thumbnail itself, the court noted, was of such poor quality as to be of no value, something that isn't always true of a quote. (And one of the worst ways to treat an author is to quote them out-of-context, a practice Google's scheme will encourage.) The artist/photographer had also chosen to post his work online, thus putting it on the market. The Arriba link took interested parties to a site where they could pay the artist/photographer for the rights to a usable image. Arriba was creating a win/win situation for everyone and, once the image was thumbnailed, the full version no longer existed at Arriba's web site. All those factors taken together were sufficient to make what ArribaSoft was doing legal.

    But the Ariba/Google parallel only exists for books that are in print, being marketed online, and paying royalties to the copyright holder. Out of print, books that are only available used or through libraries do not parallel the AribaSoft case. What Google is much more like sending someone with a digital camera to art galleries and museums, ignoring any wishes of the artist. Owning a copy isn't owning the copyright. That's why the "approval" Google has from the libraries is so silly, as are its claims that it is simply making a 'really big' card catalog.

    The reader may benefit. Google and whoever profits from Google's linking may benefit, but no royalties flow to the author due to the linking, nor has the author chosen (present tense) to place the book online or in the marketplace. He may, in fact, consider the earlier work so dreadful, he intends to use copyright laws to their full extent to keep down his embarassment. And despite the squawks of some posting here, we have no legal right to get easy access to what someone else has published. A copyright bestows the right to say, "No more copies will be published." That's why, for instance, an author can prevent anyone from making a movie derivative.

    Arriba is a marvelous case for defending unauthorized linking and for indexing the web itself, as Google does. And as a Ninth Circuit appeals decision for someone living in the Ninth Circuit, it was "controlling" in my successful battle with Tolkien estate lawyers over whether my chronology of a fictional work was fair use--the law in that matter having been corrupted by some dreadful Second Circuit court actions in 1998.

    But Arriba is weak precisely where Google is being challenged most strongly by authors and publishers--Google's right to scan and index the entire text of books that are in library collections but are, for the most part, are out of print. For those books, Google cannot link to a website where the purchase of the book will result in income for the copyright holder. That's the key issue. Google Print may be a winning situation for Google and readers, but the copyright holder doesn't get a cent. Indeed, the very point of Google's action is to blast ahead, not bothering to even look for authors because that would be too much trouble. Authors, on the other hand, are expected to go to the enormous trouble of tracking down every instance of the use of their material by Google and a thousand Google-clones, and opt-out of each individually.

    And I might add that I say this as a "one Mac mini" author/editor/publisher who's placed virtually every

  27. "Scanning" is done with a camera and cradle by dananderson · · Score: 4, Informative
    Scanning of old books isn't done with a consumer-grade (or even business grade) flat-bed scanner. That's too expensive and too damaging to old books.

    "Scanning" of old books is typically done with a camera photographing a book lying in a cradle (to not split the binding). One image is taken of each page or every two pages (the latter is faster, but has focus problems).

    Once photographed, OCR software grinds away. There are errors. Some projects proof-read the errors (this is very expensive), but with Google's volume they cannot. Even when not proof-read, however, the OCR'ed text has high value in search engines.

    For examples of the resulting product, see U of Michigan's Making of America or the Library of Congress American Memory.

    New, in-print books can be scanned destructively. That is, saw off the binding and feed into a sheet feed scanner. This works with publishers who have extra copies they can expend.

  28. Corrected, live link by dananderson · · Score: 2, Informative
  29. Re:Compilation works have mixed copyrights by Enzo+the+Baker · · Score: 3, Interesting

    No, they would have to go back and find a public-domain edition that contains the story. For instance, if you search for "Moby Dick", you will see several recent editions that have the copyright restrictions on them. However, Google could find a pre-1923 edition of Moby Dick, scan it, and put the whole thing online.

    --
    I may twist orthodoxy to partly justify a tyrant. But I can easily make up a German philosophy to justify him entirely.
  30. Google actually preserves these books by dennison_uy · · Score: 2, Insightful

    I had this idea once that the only way I can make sure my important document withstands the test of time is to make multiple copies and distribute the copies across different locations (i.e. home, office, gf's house, etc). This is the same reason why I make multiple copies of an important document and post it in multiple locations over the internet.

    Making a book available online is just about the same thing -- it only serves to preserve it, and what better way to do so other than making something like Google, with its many layers of protection against data corruption, its backup? It's the best thing to preserve your most treasured works other than spreading it via P2P.

    How many times have I tried to open an old link, only to discover that it is already gone or most of the content has already changed? Well, good thing there's Google cache and the Wayback Machine as form of "backup".

    --
    Take off every 'sig'!
    All your 'sig' are belong to us!
  31. Unfortunately this is the way the system works! by DaoudaW · · Score: 4, Interesting

    There is really no precedent for what Google is doing, so it has become a test-case for the limits of fair-use. We may all agree that it seems obvious that it is fair-use, in fact many lawyers have suggested just that, but until a court of law deems it fair-use Google will be challenged. It will probably go to the Supreme Court within a couple of years and we can only hope that the conservative justices being appointed by Bush will allow it under fair-use. Fortunately, Google has fairly deep pockets so may be able to win the case.

  32. Re:You missed the truck she drove thru Google stan by kebes · · Score: 2, Insightful

    The owners of the copyrighted works cannot be forced into depending on the discretion of a third party to protect their works, regardless of Google's assurances, or whether the owner ever heard of the ability to opt-out or not.

    Yes, owners of copyrighted works CAN be forced into depending on the discretion of a third party to protect their works. That's life. If copyright holders are really so scared of their works being copied, they can lock them all up in a vault and never sell them to anyone. Then they are really protected.

    A library has a bunch of books. They protect those books. I can go to a library, steal the books, make copies, and sell the copies. This is illegal. I don't think any court would honestly hold the library responsible. In fact, I don't have to steal the books. I can borrow them, take them home, make copies and violate copyright and the library is STILL not responsible. In fact, I can use the photocopiers INSIDE the library to do my dirty work. That's life. I broke the law, the library did not.

    Now I admit that the Google database is a little bit different. But as long as each copy they are making is fair use, then they are allowed to hold the database. Copyright holders can't say "but what if someone steals it and makes copies!!??" If that happens, you can sue the thief/copyright-violator, but not Google (barring any obvious negligence etc.).

    Now, is it fair in the first place for Google to make those copies, and let people search (but not view) them? That's a separate issue that the courts are looking into. I personally find that it advances society without compromising the copyright-holder's monopoly. Therefore, I think it's legal. I also happen to think it's the "right" (ethical, etc.) thing to do, for society.

    if Google loses control of the data that they do not own, they have very little legal basis to protect it.

    Indeed, if Google loses control somehow, it will be up to the actual copyright holders to pursue legal action and so on. That's life. Libraries are not responsible (unless they willfully encourage people to break the law), and so neither should Google be responsible (unless they willfully encourage people to break the law).

  33. Intellectual Property is Neither by Phoenix666 · · Score: 2

    It is simply greed. Limiting culture through copyright is the same thing to a society as putting a copyright on the use of the letter "E" would be to language. Language is how individuals communicate with each other. Culture (inclusive of music, literature, etc.) is how a society communicates with itself about its values, aspirations, and innovations. If you freeze culture in the straitjacket of copyright, then you necessarily freeze the ability of a society to dream, improve, and innovate.

    There is a reason why the Founding Fathers fought about copyright. In a limited fashion, it could promote innovation, as creators would have the economic incentive to create. Unrestricted, it would stifle innovation as surely as unrestricted reproduction would. We now find ourselves proving out the naysayers in that original debate.

    Children of the RIAA will argue that copyright has been a boon to the U.S. But if we had honored it from the first the American Industrial Revolution would have been long delayed because the engineers and entrepreneurs of the Republic would not have been able to copy British innovations. If we had honored it from the first then the evolution of an American voice in literature would have been delayed because many early novelists borrowed liberally from Imperial British sources. And, most famously of all, especially here on Slashdot, this country would not even have had a national anthem if Francis Scott Key had not lifted the melody of the Star-Spangled Banner wholesale from a British comic opera.

    So consider that carefully, children of the RIAA. Much of what you associate with American strength would not now exist if you had ruled then. Many of the things, the rights, the freedoms, the material comforts, the power, and the influence would not now be yours to cast away if you sat in a council of influence then. It simply would not have come to be.

    Human societies and cultures live and die on innovation, which is wholly dependant on the free flow of information.

    --
    Do what you can, with what you have, where you are.