Amazon Plan Would Allow Text Search Of Books
emmastory writes "The New York Times is running a story (free registration required) about a new development at Amazon - they plan to assemble "a searchable online archive with the texts of tens of thousands of books of nonfiction." Users would only be able to read a certain portion of the text from any one book, but it sounds promising nonetheless. The Times article suggests that this is part of a larger strategy to compete with Google and Yahoo by making Amazon an authoritative source of information on everything book-related."
If this happens, maybe we'll finally be able to find books based on their actual content instead of the (usually pretty crappy) writups that Amazon does on them.
Shouldn't somebody patent this process before Bezos does??
Have you noticed that they now offer web searching as well, and are also generating third-party ads based upon what you're looking for?
This development may bite them back - when I look for something on Amazon now, I often find in their ads that other people have the item cheaper. Amazon may get a nickel or quarter for the referral, but they lose the dollars from the markup.
Get off my launchpad!
... someone writes a distributed bot to query targeting a specific book and sections to finally retrieve the entire book. If it's a distributed app, then it would be tougher for Amazon to block. You could even have it only go after certain parts of the books at different times to make it tougher. Now not to say that this is a good use of effort, but that never stopped anyone from doing such a thing before :)
I remember when doing a search on Amazon for "Database Admin" returned the number 1 response of "The fine art of vaginal fisting" and the reviews that it prompted ... pushing this book up into the top 100 bestsellers. Now what would the ability to read some text from books do ;-)
I always find it annoying when reading a paper boo when I can't Ctrl-F to find a certain segment.
Now I can just hop online to amazon, do the search, it will tell me what page it's on, and I can go read it!
no comment
And minimum wage laborers in 3rd world countries find themselves scanning books into computers and correcting the text using crappy OCR technology for 12 hours a day. This is one job I'd be happy to export to India.
This is my sig. There are many like it but this one is mine.
Would this be like OReilly's Safari online books on steroids? Safari is my favorite bookstore for a while now.
---- join dshield.org Distributed Intrusion Detec
Looks like they'll be going with a proprietary solution. Even though the article seems to indicate that Amazon is launching this new service as a response to Google's "Froogle" shopping search product, wouldn't partnering with Google make more sense for them?
and if you look for "TEH", will you be redirected to Salshdot ?
Trolling using another account since 2005.
Any returns of C or C++ code might get SCO's law team on your ass..
Trolling is a art,
doesn't this infringe on basically every copyright that the publishing industry has?
I write code.
Isn't this a violation of the privacy of all the people who have biographies for sale at amazon? John Ashcroft could search the text and find out anything they want about Abraham Lincoln! This article should be listed under "Your Rights Online".
This would be awesome for students. I've always wished I could just execute a search function through a book to find what I was looking for. It can be a p.i.t.a. to use indexes and thumb around until you find what you need.
The real issue is that Amazon's system doesn't do moderation very well, and as a result the reviews get spammed with people who really really like something.
Or, you get situations where teachers apparently tell their classes to submit reviews on Amazon for a book, and you have 30 reviews that say nothing.
And, of course, being a bookseller, there is a strong motivation for them to bias things so that positive reviews outweigh negative ones.
Jon Acheson
All opinions expressed herein are my own, and not those of my employers, who are appalled.
This will then prompt publishers to include several pages at the beginning of every book with nothing but "sex sex sex sex sex sex..."
The NIH has a good start with something of this nature. The NCBI (part of the National Library of Medicine) has a fully-searchable set of about 20 books. The books are generally cover biology topics, but represent some of the standard texts used in college courses. They call the project Bookshelf and it is entirely free. Several books contain direct links to gene sequences, etc.
"Of course this *could* be great for college paper researchers, looking for a quote or two to stick in a research paper. Depends on how much meat you can really get at."
College is great in this respect. No matter how crazy, ill-conceived, or outlandish your premise is, there are a thousand nut-jobs out there with nice quotations to support it. This would make it even easier to back that dribble up. Especially late the night before it's due, when you need to support that last flimsy claim in order for your paper to make sense.
GeekNights!
Late Night Radio for Geeks!
This sounds like a good project that they could get some gov't funding for.
Besides the obvious copywrite problems, if the gov't was to get involved and Amazon (or whoever) was allowed to permit searching an entire book for concepts / keywords but not be able to view the entire book without paying for it this would both increase sales and usefulness.
If this was the origional model for online music, think of all the problems that would have been avoided. Perhaps a second look at this type of archiving will help the movie industry as bandwidth increases.
rejected (19) accepted (0)
Is there a psychological term related to getting your stories rejected on slashdot?
I'm surprised nobodys mentioned Project Gutenberg - I mean, they've been OCRing public doman books for a long time now, and there are thousands of texts available... not in some crappy interface that Amazon will use, but in wonderful, sweet, ascii text format. Couple this with some good regular expressions and you're in business... want to see how many times Sherlock Holmes talked about using cocaine? It's elementary!
How authors will react is another question.
Isn't this what happens in the RealWorld? You walk into a bookstore, open it up, read a few pages and make a decision on whether or not you want to buy it?
I think publishers and authors would be rather short-sighted to not allow potential customers shop online the same way they shop in brick and mortar stores.
Ryan O'Rourke
search a little, store a little. Search a little store a little more.
Pretty soon you'll have the entire book.
They'll have an app out to search the pieces out and stich them together into one complete book..
Yeah, this will work, thanks for the free ebooks Amazon..
> Are Amazon obtaining each and every rights owners' permission to perform this duplication? I doubt it
Why do you doubt it? You do realise that Amazon has a direct business relationship with every publisher whose books it sells already, don't you? They don't buy their books from Barnes & Noble...
Amazon's book buyers will offer this facility to publishers (whose salespeople they already work with directly - many publishers will employ one person whose entire job is selling books to Amazon) as a marketing benefit - and charge them for the privilege, no doubt - just as they do today with their 'look inside' feature. In order to keep competitive, publishers will prepare and supply the text in the format Amazon wants. It's really not hard for Amazon to do this at all.
*Accessing http://www.amazon.com/search*
Enter your search criteria:______________
*Enter search "Moby Dick"*
Search Complete:
Moby Dick
by: Herman Melville
Call me...
Would You Like to Read More? This title can be purchased for $14.95 through our...
*Back Button*
Enter your search criteria_____________
*Enter search "Tale of Two Cities"*
Search Complete:
A Tale of Two Cities
by: Charles Dickens
It was the best of times, it was the...
Would You Like to Read More? This title can be purchased for $29.95 through our...
*Back Button-Back Button-Back Button-Close*
Just imagine if Amazon did some deal with the Library of Congress that allowed them to scan in nearly every book published in the United States. Once the information is digitally stored, it could be utilized in other ways as well:
- Libraries around the country could offer consoles on which you could read any book through a secure connection of some type, preventing unauthorized copying, which would prevent book publishers from agreeing to this. You could essentially read any book, even if the library doesn't have it.
- Bookstores, schools and other organizations might get in on this network and offer the same service.
This service doesn't even have to be free. I'd pay a subscription fee to have access to this information, as would the bookstores and whatnot.