Boiling Down Books, Algorithmically
destinyland writes "A year ago, Aaron Stanton harangued Google over his new project, a web site analyzing patterns in books to generate infallible recommendations. In March he finally finished a prototype which he showed to Google, Yahoo, and Amazon, and he's just announced that he's finally received a big contract which 'gives us a great deal of potential data to work with.' The 25-year-old's original prototype examined over 200 books, plotting 729,000 data points across 30,293 scenes — but its universe of analyzed novels is about to become much, much bigger."
The difference between now and 100 years ago becomes more apparent each day. Then, owning books was a sign of affluence, of intelligence. Now? Everything is up to question, and should be. Analyzing books and other public material is just another step in putting intelligence out there for everyone, not just those that can afford it. I applaud it, and all the dangers it brings. Such hurdles are necessary, but we must assault them to overcome barriers that should no longer exist.
Support NYCountryLawyer RIAA vs People
I love how the prototype version in the link gives a 98% match between George Orwell's '1984' and the text of the USA Patriot Act!
when i come to think of it, building a tool that rates the level of novelty in an idea would be good one. it would make the job of large companies or opportunity hunters easier as well..
cheers,
mbilgi
I, for one, welcome our new book analyzing overlords!
The only infallible book recommendation has existed for nearly 2000 years now.
Don't want to read it? Heretic! But "translations" do exist for public convenience.
...and if you do not read, you won't want this.
I am skeptical that analyzing the content of the books can lead to good recommendations, let alone "infallible". Two books can be very similar in subject matter and writing style and yet one can be great and the other one awful. The difference is just too subtle for an algorithm to figure out, though I hope I am wrong and it turns out that it works, it would be very useful. Same applies to movies and music as well. I always found "Customers who purchased this book also purchased...." section on amazon to be more valuable than my personalized recommendations
Negative moral value of force outweighs the positive value of good intentions.
This is just another pointless project that's going to waste the time and skull-sweat of a good but unrealistic programmer. All he's going to have when he's done is the solution to a problem that doesn't, for all practical purposes, exist. Good writers won't need it because they know what to do and how to do it, so they won't use it. It will only be used by poor writers, who won't know how to put the suggestions into effect properly. It may, possibly, tell a writer where their book needs work, or where it's not interesting enough, but I doubt it. Most likely, all it will do is tell it where it's not like other successful books because it won't be able to recognize or take into account any originality. Even if its recommendations are right, a poor writer is highly unlikely to profit from them, because by definition a poor writer won't know which suggestions are good or the skills to take advantage of them properly. No, what a poor writer who wants to get better needs is either a good critique group or some friends who will act as beta-readers, telling him not only what doesn't work but why (Something, I might add, that I find it hard to believe this program could ever do.) and discuss things with the author until they understand each other. Mechanical criticism of literature can only result in mechanical literature, not good writing.
Good, inexpensive web hosting
how long before someone figures how to fool the algorithm, and we all start reading books about enlarging our genetalia, but in a classy way?
...considering the quantity of "classic" tripe that I had to read in high school and college. Who needs an algorithm when you have English teachers who follow flawed formulae?
Laughter is the Spackle of the Soul.
What I wonder is: What happens if there's a different style of writing that's not accounted for? I hope they're not just marked down. What will it consider a good book, what's truly interesting and insightful or books that are made to sell like The Da Vinci Code? I can see how much easier it would be to identify which books well sell well but I fear that this will be its only use, and the less said about doing the same for movies the better.
This computer should do fine, assimilating every book ever written. We'll just need to hire someone to periodically delete every Agatha Christie novel from its database.
"Nine times out of ten, starting a fire is not the best way to solve the problem." - my wife
What really hits a nerve with me is why the scientific community hasn't opened up all their journals for others to read.
Do you know what you're saying? Do you really want to release possible Weapons of Intellectual Destruction on the world?
I look at the titles in the archives of
http://www.misq.org/
and I'm thinking that some of this stuff is best kept locked in the ivory tower.
Get thee glass eyes, and, like a scurvy politician, seem to see things thou dost not.--King Lear
Is there any thing whereof it may be said, See, this is new? it hath been already of old time, which was before us.
Ecclesiastes 1:10
Get thee glass eyes, and, like a scurvy politician, seem to see things thou dost not.--King Lear
What's the big deal, except that Google has Google books?
Anyone could do this. There's plenty of narrative analysis software: the government's outpouring of our tax dollars to "protect us" since 9/11 has triggered every halfwit software development firm in the country to develop
and sell them to the local militia.
There is one persistent son of a bitch on their forum, Joe, who seems to be their nemesis. I wonder what his angle is.
Other than that, I like their approach - involve the community *really* early on.
Apart from Joe.
Open Source Drum Kit, LPLC deve board - mjhdesigns.com
His prototype sounds in a way like Netflix's suggestion system for movies, where you vote your favorites and it'll suggest other ones based on your liking. But books are much more complicated, so I can see how his detailed analysis tool can really be the ultimate suggestion tool. I wonder if people will use this to discover copyright infringement on a new level. Hmm... my book and your book are a 99.5% match. Gee where did the .5% discrepancy occur. My character is a 19 yr old hobo, so is yours. My story is about him eventually becoming a successful company executive by pimping himself out to different high-powered women. My character's name is Matt, yours is Mike. Aha.
Whoever modded this to 'troll' never took the English classes I had. Yo.
Laughter is the Spackle of the Soul.
Other old journals will likewise have a lot of valuable information in them. Archaeologists discover a lot through searching their own journals, discovering lost and forgotten reports of discoveries. Mathematicians routinely publish in arcane and super-obscure journals, making what is known far more extensive than what is known to be known.
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
Comment removed based on user account deletion
How about Project Gutenberg? They've got lots of books that have already been scanned.
Nothing for 6-digit uids?
http://www.baetzler.de/cgi-bin/country.pl
http://www.outofservice.com/country/
I don't read your sig. Why are you reading mine?
The best books I've read are the ones that broke away from what I had read before. The ones that gave me a new experience, and new view on things. I really really don't want to read copies of books I've already read. I want something out of the ordinary. I'll stick to my old methods if you, Mr. Algorithm, don't mind.
I'm an English Lit mayor, and before I became a programmer I use to work in a publishing house in NYC. And let me just say, that no good writer is not even going to give two cents to this project.
They already know their stuff. This portal is more geared towards writers with insecurities: a fourteen year old teenage boy trying to woo his girlfriend, as a plagirizer: "'Come, live with me and be my love,' ... and what next... let's see here... 'You are a phantom of delight.' Ingenious, I'm chortled in my joy."
Then again, I might be for a student who's trying to complete an essay without 'thinking.'
It's funny about digital books, I just don't read them. I love reading bound-paper books, always have, and it's not like I dislike the principle of digital books, but for whatever reason (tactile, olfactory?) I just can't read more than a couple pages. And it's funny too, browsing through people's digital-book folders, how often there's this type of person with a "philosphy" folder containing plenty of neitzsche, and maybe a couple other people who, after conversing a short bit, I find they are really looking forward to reading someday.*
* probably confusing disinterested grammar which could be used against me in a dashing rebuttal
Anything that analyzes anything in a way that is useful for people is a step in the right direction for AI.
So much of human reaction is related to story telling and shared references where a behavior, event, activity or other interaction can be connected to something we've read or heard or done. It isn't only memory of such things, it is the internal summary we've made as well. If this software can create just such a summary which can be easily cross-referenced with other summaries... then it can help AIs create internal dialogue so that NEW experiences can immediately be summarized in relation to past learned information and categorized accordingly.
An example.... say your AI knows about Auto Mechanics, it then is exposed to a scenario where it needs to diagnose a boat engine problem... it can look at the components, compare it to the knowledge it already has and see that the boat engine is similar in design to an Auto engine. This will narrow down the set of knowledge it needs to utilize to move forward with a diagnosis.
What will be really cool is when they find similarities in patterns that we Humans just don't see.... due to our own close mindedness and need to compartmentalize. As in they will find biological equivalents to mechanical processes and vice versa....
A fool throws a stone into a well and a thousand sages can not remove it.
Print's dead.
RTFA
just burn them instead...at least that way you can use them as a fuel source instead of using fuel to turn them into mush.
Down With Slashdot BETA!!! I've been around the corner and seen the oliphant; you can only abuse me from your perspecti
I'm delighted that the advertisers now know my taste in fiction. I'm pretty much the last person you should consult on that one.
"We entertain you, and you ARE entertained."