Slashdot Mirror


Algorithm Aims To Predict Fiction Bestsellers

benonemusic writes "Three computer scientists at Stony Brook University in New York believe they have found some rules through a computer program that might predict which fiction books will be successful. Their algorithm had as much as an 84 percent accuracy rate when applied to already published manuscripts in Project Gutenberg and other sources. Among their findings was that more successful books relied on verbs describing thought processes rather than actions and emotions. However, some disagree with the findings. Author Ron Hansen said style is not the key, but instead readers' interest in the topics in the book." There has been work done already on finding the formula for a hit song, and using analytics to craft a blockbuster movie.

146 comments

  1. And modern life ... by Anonymous Coward · · Score: 1

    ... becomes more cold.

    1. Re:And modern life ... by Garridan · · Score: 1

      Next, in finance: Local main aims to make millions by purchasing a lottery ticket!

      Seriously. This is news if they succeed. Chumps.

    2. Re:And modern life ... by Anonymous Coward · · Score: 0

      Sadly, maybe triumphantly, many successful movies are created with formulas. The only real problem is when they try to make a dozen blockbusters in one summer. Transformers was all formula. - Girl should show off navel here. Next. - It was successful as far as producers were concerned.

      Seriously, this will be new if they don't succeed. For various definitions of succeed.

    3. Re:And modern life ... by Anonymous Coward · · Score: 0

      That reminds me of a lottery TV ad quite some time ago in Germany.

      They (humorously) interviewed a "luck researcher" about his great discovery. The interview was roughly as follows:

      Interviewer: So you've researched luck and came to a stunning result.
      Researcher: Yes. My research indicates that there are two types of people. There are people who never win anything. Never. They just don't win. Not a single time. And there are other people, who do sometimes win. One can say, they have a real chance.
      Interviewer: That's indeed an interesting result.
      Researcher: Indeed, but most importantly, I've found that you can influence to which group you belong.
      Interviewer: Really?
      Researcher: Yes. My research clearly indicates that all people who won had a lottery ticket!

  2. topic by deodiaus2 · · Score: 1

    Sex , drugs, and rock 'n roll.

    1. Re:topic by mcgrew · · Score: 1

      Damn, so that's why Nobots is selling so poorly... I forgot to put rock and roll in it! Damn... didn't you tell me before I wrote it?

    2. Re:topic by Quirkz · · Score: 1

      Ah, mine has all three, and it's still selling poorly. I suspect it's the puns and spoonerisms keeping people away. I mean, how many bestsellers have spoonerisms? Other than Lolita, which is probably the exception that proves the rule.

    3. Re:topic by Anonymous Coward · · Score: 0

      Katie Price's novels are written by a published author who sells damn all when writing under her own name. This also happened with Ivana Trump.

  3. If I had a penny by Anonymous Coward · · Score: 0

    Oh, if I had a penny for every time an algorithm aimed to do something...

    1. Re:If I had a penny by Chrisq · · Score: 4, Funny

      Oh, if I had a penny for every time an algorithm aimed to do something...

      on (anyAlgorithmProposed) {
      give yourself a penny
      }

    2. Re:If I had a penny by behrooz0az · · Score: 0

      penny_t* get_penny(const algorithm_t *algorithm) {
      return new penny_t(algorithm);
      }

      FTFY

      --
      Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion. -- Spazmania (174582)
    3. Re:If I had a penny by RabidReindeer · · Score: 3, Funny

      Add friendly vampires. If that doesn't work, add werewolves. Alternate version: zombies.

    4. Re:If I had a penny by Anonymous Coward · · Score: 3, Insightful

      So, a love triangle with a vampire, a werewolf, and a girl with the emotional depth of a zombie?

    5. Re:If I had a penny by Anonymous Coward · · Score: 0

      If all else fails, add more sex.

    6. Re:If I had a penny by Quirkz · · Score: 1

      I keep thinking it's time for some of the other undead to break out. Where's the best-selling ghoul story? Maybe a skeleton romance (shame the title "Lovely Bones" is already taken). The Mummy had one and a half good movies out of three, so there's probably more potential there. Wistful wights, wandering wraiths, maybe a misunderstood banshee - you can be friends as long as you leave your earplugs in.

      Nearly two decades ago I read a comment by an author (sadly can't remember who at the moment) who asked for some originality. "Where's the lamia story?" she said, and I thought that was a good idea. I still haven't seen one of them, even though I've seen multiple cycles of vampire and werewolf boom and bust since then.

    7. Re:If I had a penny by mcgrew · · Score: 1

      I keep thinking it's time for some of the other undead to break out. Where's the best-selling ghoul story?

      I couldn't write a story like that. Well, I could but it would suck because my heart wouldn't be in it. It takes a special kind of ghoul to hack out nonsense that people will pay for that the author has no interest whatever in.

    8. Re:If I had a penny by as.kdjrfh+sxcjvs · · Score: 1

      Lamia story -- I think Tim Powers wrote one. Set in Venice, maybe? and excellent.

    9. Re:If I had a penny by RabidReindeer · · Score: 1

      Lamia story -- I think Tim Powers wrote one. Set in Venice, maybe? and excellent.

      Partially. Had Vampires, too. In the unique way that only Tim Powers can do such things.

  4. There is so much money by LookIntoTheFuture · · Score: 1

    to be made from suckers. No fancy computer program is going to replace actual talent.

    --
    Brave Sir Robin ran away. ("No!") Bravely ran away away. ("I didn't!")
    1. Re:There is so much money by symbolset · · Score: 3, Interesting

      So you haven't been to the movies or read a bestselling book lately? There is no talent to replace.

      --
      Help stamp out iliturcy.
    2. Re:There is so much money by mcgrew · · Score: 2

      No fancy computer program is going to replace actual talent.

      I don't think there's any correlation between talent and success whatever. Wikipedia quotes Stephen King as saying that James Patterson "is a terrible writer, but very successful." I read Patterson's "When the Wind Blows" and wasn't very impressed with his writing, either, especially the switching back and forth between 1st and 3rd person. But almost every time I see a woman with a book it's one of his.

      Asimov's Hugo-winning Foundation trilogy didn't earn him a dime for ten years, until Doubleday bought the rights from the original publisher.

      Meanwhile I know a lot of incredibly talented musicians who play in bars because the labels offered them ridiculous contracts.

      Anyone remember Milli Vanilli?

      Marketing is king, talent is a dime a dozen.

    3. Re:There is so much money by RabidReindeer · · Score: 1

      So you haven't been to the movies or read a bestselling book lately? There is no talent to replace.

      Lately? Sturgeon's Law is 50 years old or more.

    4. Re:There is so much money by RabidReindeer · · Score: 1

      Funny, I have the same opinion of Steven King.

      Well, maybe not "terrible", but there have been some pretty bad moments. And not enough good ones.

    5. Re:There is so much money by ebno-10db · · Score: 1

      I don't think there's any correlation between talent and success whatever. Wikipedia quotes Stephen King as saying that James Patterson "is a terrible writer, but very successful."

      "Terrible writer" is subjective. While I'm sure that initial luck and subsequent promotion have something to do with it, he obviously writes stories that a lot of people like. I think Harry Turtledove is a complete hack of a writer, but I read his alternative history stuff because I like the subject so much.

      wasn't very impressed with his writing, either, especially the switching back and forth between 1st and 3rd person

      You may not like switching back and forth between 1st and 3rd person, but it's not an unusual technique. I like it when it's done well (never read Patterson so I couldn't say if he does it well).

    6. Re:There is so much money by bluefoxlucid · · Score: 1

      The Black Swan will explain why this research is so ludicrously stupid.

    7. Re:There is so much money by smallfries · · Score: 1

      Unlikely. Their comparison is the outcome of a popularity contest, which in the terminology that Taleb used is an inhabitant of Mediocrastan. The distribution is relatively smooth as it involves the average opinion of a large population.

      --
      Slashdot: where don knuth is an idiot because he cant grasp the awesome power of php
    8. Re:There is so much money by smooth+wombat · · Score: 1

      I forget which King story I read, it involved a woman going to a cabin and being tied to a bed by her lover who dies on top of her. While she's there she starts to have images of things that might happen to her.

      I don't think I finished it, the writing and droning was so bad. I only read it because I had never read King before and someone handed it to me.

      Reminds me of the Family Guy scene where King is in front of his publisher who is asking what book King is working on. King grabs the lamp and says it's about a lamp which comes to life and attacks people, to which the publisher remarks, "You're not even trying any more."

      --
      We will bankrupt ourselves in the vain search for absolute security. -- Dwight D. Eisenhower
    9. Re:There is so much money by bluefoxlucid · · Score: 1

      Taleb's point was that you can see patterns in past behavior that don't necessarily indicate future performance. He even used literary work as one of his first examples.

      Here's the completely predictable: One day, a small movie studio will start pushing its own movies where they explicitly try to make "golden-age" movies that aren't formulaic. They'll become wicked-popular. How do we know this? All of the push-back against formulaic crap!

      Here's the unpredictable: Someone made a formula for movies based on the kind of analysis we're seeing, and all movies became this garbage. That it's spreading to books and music is predictable since it happened once and is being repeated--it's an idea that's in marketers' heads, we know how that works. We couldn't have predicted that the idea would come up until someone came up with it.

    10. Re:There is so much money by boristdog · · Score: 1

      Long ago I determined that having talent was a ticket to poverty.

      I know dozens of highly talented musicians, writers, artists, etc. who make jack from their talent. They play lots of gigs, get art shows, get books published but they make very little money from it. All but one have menial jobs to support their dream of someday succeeding with their talents. The only one who doesn't have a real job is very good at marketing her art and builds sculptures for local businesses. But she only makes about $40K/yr doing it and she works far harder than I ever have.

      Meanwhile, talentless fucks like myself sit in a comfy office and write lame software for the man ( I admit that I really am a half-ass programmer, and I spend half my day surfing the web) and pull in $120K+/yr doing it.

      Even less talented people than me earn far more money than I ever will. So if you have talent, keep it a secret if you want to make money.

    11. Re:There is so much money by Tipa · · Score: 2

      Huh? Asimov originally serialized the Foundation series in Astounding Magazine, for which he was paid quite well.

      Those Golden Age SF pros didn't write a word if they weren't going to be paid for that word. This was their livelihood.

    12. Re:There is so much money by mcgrew · · Score: 1

      Funny, I have the same opinion of Steven King. Well, maybe not "terrible", but there have been some pretty bad moments.

      Well, I never cared for his genre (horror), but I don't see how anyone could say The Green Mile isn't some great writing. I only read it because I'd seen the movie and a friend had a copy I could borrow (six very skinny volumes). It really sucked me in. Patterson? I write better than him, King kicks my ass..

      Of course, since I wasn't a literature or English major, my opinion of Patterson and King are pretty worthless, unlike King's (and maybe yours) since he was, in fact, an English major.

    13. Re:There is so much money by mcgrew · · Score: 1

      "Terrible writer" is subjective.

      Well, it is to me but if you're a literature or English major I'd say your opinion carries quite a bit of weight since wikipedia says "From 1966, King studied English at the University of Maine, graduating in 1970 with a Bachelor of Arts in English." So I'd say his opinion (terrible) carries far more weight than mine (not that good).

      he obviously writes stories that a lot of people like.

      Yes, he writes murder mysteries with sex. Women eat those up, he panders to them. Actually, the book I read had a pretty good story that wasn't told badly, but the writing didn't transport me into the book's world. If a writer is any good at all I'll lose myself.

      You may not like switching back and forth between 1st and 3rd person, but it's not an unusual technique.

      Perhaps you're right, I've surely seen it before but never noticed it. With Patterson's book, the first several chapters are written from the female protagonist's point of view, then there's a chapter where the protagonist is absent and is 3rd person, which kind of worked. But then it went back for several chapters to 1st person, then switched mid-chapter from first to third person in the sex scene. It just seemed really clumsy, contrived, and artificial to me. I was disappointed, I thought since he was so popular he must be good.

    14. Re:There is so much money by hey! · · Score: 1

      I don't think there's any correlation between talent and success whatever. Wikipedia quotes Stephen King as saying that James Patterson "is a terrible writer, but very successful."

      I think you are confusing *craft* with *talent*. Craft, talent and taste are all distinct things. So a talented author can write a sloppy and vulgar book. Likewise an author of little talent can write a tasteful and and technically admirable book. I see this in my writer's group all the time, diligently crafted and thoughtful manuscripts that nobody but their author will ever love. The world of unpublished manuscripts is full of irredeemable garbage, but there are plenty of ambitious, clever, and disciplined writers out there who unfortunately have very little talent. You read their manuscripts and while you see much to be admired, but the most they achieve in overall impression is "not bad".

      Craft is important to make the most of whatever talent you have, but I think in the end talent is indispensable but craft, unfortunately, is not.

      The idea that you can manufacture a hit out of complete swill is a myth rooted in the cognitive bias called the halo effect. When you like something, you tend not to find *any* faults whatsoever. When you hate something, it's hard not to find any virtues. There's no doubt that marketing can be a key ingredient in a hit, but it's not true you can manufacture success with total swill.

      For example, the first "manufactured" hit band was 60s TV show group THE MONKEES. The songs were written by highly successful songwriters like Neil Diamond and Carole King and performed by accomplished pro studio musicians. The TV producers spared no effort in hiring top drawer pop music talent, it just wasn't in the people whose faces were on the album. To achieve success, you don't have to get everything right, you have to get some things right enough that the halo effect covers your shortcomings.

      As for books, the same applies. There are a lot of successful books by lousy writers, but not any I can think of that were written by untalented writers. Take Dan Browne or Stephanie Meyer. These are both dreadful writers from a technical standpoint, but if you read their work with a non-judgmental stance you'll see they aren't untalented writers -- Meyer especially. Browne is something of a one trick pony, but there are actually several stretches of very good writing in TWILIGHT. And Meyer sets up her story to be appealing *to her audience* with uncanny instinct, which is surely a talent. The main fault of TWILIGHT is dreadful, meandering, pointless dialog. This actually works for her in her target audience, but makes the novel unbearable for anyone else.

      Speaking of one trick ponies, that describes Tom Clancy's work, although it was a good trick that could be revived with each new generation of weaponry and every historical twist of the geopolitical kaleidoscope. His characterization and dialog were laughably bad, although he had an unholy knack for military jargon. He was very good at scenarios though. In that he's very much of a piece with Stephanie Meyer as a writer. You just don't notice his faults if you're in his target audience, any more than Meyer's audience notices her writing faults.

      --
      Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
    15. Re:There is so much money by smallfries · · Score: 1

      Taleb's point was that you can see patterns in past behavior that don't necessarily indicate future performance. He even used literary work as one of his first examples.

      No, it really wasn't. His point was that the phenomena that we encounter are modeled by two very different types of distribution. In one kind the past is a good predictor of the future because deviations from the norm don't happen. In the other kind the past is a poor predictor of the future because although deviations from the norm don't happen regularly, when they do the impact of the deviation is immense.

      Popularity contests are of the former kind.

      --
      Slashdot: where don knuth is an idiot because he cant grasp the awesome power of php
    16. Re:There is so much money by nintendoeats · · Score: 1

      He who does not see with his eyes is blind. Case1: Person see's with eyes Person does not see with mind Person is blind Case 2: Person sees with eyes Person sees with mind Person is not blind Case 3: Person does not see with his eyes Person does not see with his mind Person is blind Case 4: Person does not see with his eyes Person sees with his eyes Person is blind Ergo: Seeing with your eyes is a prerequisite to not being blind.

    17. Re:There is so much money by bluefoxlucid · · Score: 1

      Except "groundbreaking authors" come out of nowhere, and literature experiences sudden extreme changes in what's stylistically popular all the time. They're trying to use this week's popularity contest to predict the next eternity's type of fluff to write; what they're going to do is produce pulp that doesn't sell for very long.

    18. Re:There is so much money by smallfries · · Score: 1

      True (to the first part).

      But they are not trying to predict the success of authors, they are trying to predict the success of works. Predicting the output of any author would be difficulty, modelling human creativity and all that jazz. But predicting the success of a work is simple(-ish) machine learning. Build a learning bias for style-features in the text and throw an optimisation at it.

      For the second part - when do styles of literature experience sudden extreme changes in popularity? I've seen slow changes that peak suddenly due to shifts in demographics, but never sudden changes. Could you provide an example?

      --
      Slashdot: where don knuth is an idiot because he cant grasp the awesome power of php
    19. Re:There is so much money by mcgrew · · Score: 1

      One does indeed need both talent and craft for a work to be good. However,

      it's not true you can manufacture success with total swill.

      Pet rocks, mood rings, milli vanilli... all you need is money.

      For example, the first "manufactured" hit band was 60s TV show group THE MONKEES.

      I'm not sure they were the first, but at any rate it was never a secret that they didn't perform their own music; they weren't designed to be a real band, but fiction about a fictional band.

      As to Clancy, I only read one of his books (Red October) but if IIRC (It's been years since I've taken it off the shelf) the dialogue fit the situation. I wouldn't call him either a bad writer or a great writer. He did transport me on to that sub. My idea of a bad writer is one that bores me or makes me cringe. Pohl used to be good, but his last book was drivel. I'd borrowed it from the library and returned it only half read; it was just pointless and boring. He seemed to have lost his talent in his extreme old age.

      I'm sure the same will happen to Pratchett, since he has early onset Alzheimer's. That fellow has a hell of an imagination and a magical way with words (he knew he wasn't the sharpest knife in the drawer and was afraid that he was a spoon).

    20. Re:There is so much money by mcgrew · · Score: 1

      Yes, the magazine paid him, but when it was published as books (I forgot the name of the publisher) it didn't sell and he received no royalties for the books; the publisher just didn't have the marketing muscle that Doubleday did. Asimov recounts this story in one of his books, I don't remember which one.

  5. New Coke/New Waists/New Privacy Invastions/New ETC by Anonymous Coward · · Score: 1

    Bias and other flaws in the design and statistical analysis.

    Suffering increases every day from the ever increasing Marketing Research and its derivations and accompanying costs. Keep in mind, there are more to costs then just money.

  6. Automated response by ackthpt · · Score: 5, Funny

    Is for the enjoyment like article much very.

    Posted by Comment Bot v1.0, Universe Algorithms, division 9 Sirius Cybernetics Corporation.

    --

    A feeling of having made the same mistake before: Deja Foobar
    1. Re:Automated response by abies · · Score: 0

      Well, this sounds a lot more like Offshore Division rather than Algorithms Division...

    2. Re:Automated response by plover · · Score: 1

      Actually, it's syntactically perfect PostScript. :-)

      --
      John
    3. Re:Automated response by Anonymous Coward · · Score: 0

      maybe if they out sourced it to Shiba Inu.

  7. So does this explain... by Virtucon · · Score: 1

    How Jackie Collins sells so many books? She uses too many verbs? I thought it was about the overly dripping romance themes that women seem to like?!?!

    --
    Harrison's Postulate - "For every action there is an equal and opposite criticism"
    1. Re:So does this explain... by icebike · · Score: 1

      Don't forget: Successful books relied on:
         

      verbs describing

      .

      All this time I thought adjectives described. Silly me. No wonder my great novel failed.

      --
      Sig Battery depleted. Reverting to safe mode.
    2. Re:So does this explain... by ArbitraryName · · Score: 1
    3. Re:So does this explain... by Anonymous Coward · · Score: 3, Informative

      Don't forget: Successful books relied on:
         

      verbs describing

      .

      All this time I thought adjectives described. Silly me. No wonder my great novel failed.

      If that's what you thought then yes, that's probably one of your problems. Compare the following sentences:
      "He pitched the ball."
      "He hurled the ball."
      "He tossed the ball."
      "He lobbed the ball."
      "He chucked the ball."

      Where's the adjective to describe the manner in which the ball moved? There isn't one. The verb gives you the description of HOW the ball moved.
      In direct contradiction to this "algorithm", stronger writers tend to rely more on descriptive verbs, weaker writers tend to rely on less descriptive words which need to be padded with adjectives or adverbs.

    4. Re:So does this explain... by ebno-10db · · Score: 1

      All this time I thought adjectives described.

      Never mention grammar on Slashdot. It'll bring out more responses than a programming language flame war.

      P.S. That's why I always got a laugh out of the stereotype that engineers and programmers are semi-literate. My experience is that many are sticklers for the language, and that's not just limited to grammar.

    5. Re:So does this explain... by CrimsonAvenger · · Score: 1

      My experience is that many are sticklers for the language, and that's not just limited to grammar.

      Many are.

      And others can't tell the difference between "lose" and "loose", or "they're" and "their" and "there", or "where" and "wear", or "your" and "you're".

      Those aren't exactly uncommon mistakes on /.

      --

      "I do not agree with what you say, but I will defend to the death your right to say it"
    6. Re:So does this explain... by fazig · · Score: 1

      Most creative writing classes on descriptive writing will tell you that adjectives and adverbs very often are lazy and sloppy descriptions. They teach to use strong verbs and nouns, and not to rely on adjectives. It is taught that a good writer uses adjectives and adverbs to support verbs and nouns, but never as a description.

    7. Re:So does this explain... by ebno-10db · · Score: 1

      I always try to be careful about such things, but those differences are strictly about the stupidity of spelling in the English language. I think bad spellers are mostly people who believed their teacher's claims that English is more than vaguely phonetic. I also think some "rebels" should get together, decide on a single spelling for each set of homophones, and tell everybody else to go screw themselves. No, I haven't had the guts to do it myself yet.

    8. Re:So does this explain... by Anonymous Coward · · Score: 0

      I always try to be careful about such things, but those differences are strictly about the stupidity of spelling in the English language.

      Bah. "Then" and "than" aren't homophones; people can't get those right, either.

    9. Re:So does this explain... by Anonymous Coward · · Score: 0

      And others can't tell the difference between "lose" and "loose", or "they're" and "their" and "there", or "where" and "wear", or "your" and "you're".

      Indeed, when I see those my reaction is usually "syntax error: I hope you're not a programmer."

    10. Re:So does this explain... by Anonymous Coward · · Score: 0

      Yep. Best advice for new writers is to shun adverbs, and be sparing with adjectives. Right up there with avoiding synonyms for "said" in dialog tags (where you usually want the verb to be almost invisible).

      Mind, if you're Clive Cussler or equivalent, you might write the above sentence with some crazy analogy: "Dirk hurled the ball like a trebuchet launching a piano." (Okay, that sounds more like Douglas Adams.)

    11. Re:So does this explain... by Anonymous Coward · · Score: 0

      Oh, English is phonetic ... it just has so many ways of spelling each phoneme. This is largely as a result of borrowing from so many other languages.

      But really, "loose" for "lose"? The latter is the first syllable of "loser" and the former rhymes with "goose", "noose", "caboose" ... it's not that difficult.

      (Personally, since I probably read and write more than I talk, I like English spelling; it disambiguates many homophones, making understanding easier.)

    12. Re:So does this explain... by icebike · · Score: 1

      That is ridiculous.

      --
      Sig Battery depleted. Reverting to safe mode.
    13. Re:So does this explain... by fazig · · Score: 1

      That statement makes me want to burst out in laughter or tears.

  8. Reading Level by TubeSteak · · Score: 4, Informative

    They began their research with Project Gutenberg, a database of 44,500 books in the public domain. A book was considered successful when it was critically acclaimed and had a high download count. The books chosen for analysis represented all genres of literature, from science fiction to poetry.

    Then, they added some books not in the Gutenberg database, including Charles Dickens' "Tale of Two Cities," and Ernest Hemingway's "The Old Man and the Sea." They also added Dan Brown's latest novel, "The Lost Symbol," and books that have won the Pulitzer Prize, the National Book Award, and other awards.

    Nowadays, marketing and signalling has as much to do with sales as anything else.
    I imagine that if some publisher could make the kind of advertising push that Bill O'Reilley does,
    they could put anything onto the NYTimes best seller list too.

    --
    [Fuck Beta]
    o0t!
    1. Re:Reading Level by Charliemopps · · Score: 2

      All books written by politically active people like O'Reilley are nothing more than slush funds to funnel money towards a particular party or candidate. The Clintons have done it, Sarah Palins a master of it... Your donors buy up your books, giving you fame, getting the press to talk about you... and then "donate" them to fund-raisers who "Give" them away to donors. It looks like you sold lots of books, your all over the news because of it but no-ones reading the book, not even the anchors claiming to interview you about it. God I hate marketing.

    2. Re:Reading Level by retchdog · · Score: 3, Interesting

      It's not just legit donors, either. One of the games these people play is to charge institutions speaking fees for a public appearance, part of which charge is the required purchase of, say, 5,000 books for their library or for "promotional purposes". The institution plays along, sending 90%+ of the books to be pulped the next day, and the speaker's sales stats get bumped. Ridiculous.

      --
      "They were pure niggers." – Noam Chomsky
    3. Re:Reading Level by westlake · · Score: 1

      They began their research with Project Gutenberg, a database of 44,500 books in the public domain.
      Then, they added some books not in the Gutenberg database, including Charles Dickens' "Tale of Two Cities," and Ernest Hemingway's "The Old Man and the Sea." They also added...books that have won the Pulitzer Prize, the National Book Award, and other awards.

      How does Project Gutenberg select its texts?

      A book was considered successful when it was critically acclaimed and had a high download count.

      "Critically acclaimed" by who and when?

      How many of the most downloaded titles are on academically "required" or "recommended" reading lists?

      The prize-winner can sometimes tell you more about the internal and external dynamics of the judging than the quality of the book,

    4. Re:Reading Level by aslashdotaccount · · Score: 1

      And don't you hate it when you're berated for not having read the crud that everyone's talking about in the office cafeteria? I've succumbed to this peer pressure a number of times and read these so-called masterpieces and wondered at the end just what insights the authors had that I didn't.

    5. Re:Reading Level by cffrost · · Score: 1

      [...] Sarah Palins a master of it...

      Sarah Palin's handler(s)/management (team), more likely. We're talking about a person who thought the 2003 invasion of Iraq was (to paraphrase) "revenge for 9/11," or some such nonsense. In other words, I "betcha" there's little acumen of any utility rattling around in that skull of hers.

      God I hate marketing.

      I hope for all exposed beings to possess the wherewithal to resist for-profit and political propaganda in all of its forms, and manipulation therefrom, particularly anything shat out by the United States' six-headed corporate "news" media (i.e., corporate and government press releases, backed by unchallenging commentary).

      I find it interesting that amongst our nation-wide, free-press establishment, PBS (a government- and corporate-backed entity) seems to be the only major source for investigative journalism (particularly via Frontline). Fortunately, contributions from the David H. Koch Foundation seem to have been used only to suppress broadcasting of investigation into the Koch brothers, rather than to steer PBS's agenda.

      --
      Thank you, Edward Snowden.

      "Arguments from authority are worthless." —Carl Sagan
    6. Re:Reading Level by RabidReindeer · · Score: 1

      Tale of Two Cities is in Gutenberg. That's where I read it from.

      Marketing never hurts, but the advent of minimal-cost publishing via ebooks also has helped some authors. There are several best-selling authors who started out as "dollar discounts" from one of the e-publishers.

    7. Re:Reading Level by ebno-10db · · Score: 1

      Tale of Two Cities is in Gutenberg. That's where I read it from.

      Charles Dickens, Mark Twain and others were heavily marketed in the 19th century. It's not a 20th century invention. Speaking of Mark Twain, you'll find satire about advertising in "A Connecticut Yankee in King Arthur's Court". Thanks to the protagonist, there were knights running around with advertisements for toothpaste on their suits of armor.

    8. Re:Reading Level by psithurism · · Score: 1

      And books on Project Guttenberg have more to do with which are on high school reading lists than anything else. I'd say 90% of the reading I've done of public domain books/peoms was done for assignments.

    9. Re:Reading Level by Anonymous Coward · · Score: 0

      Obama was too stupid to have written his own books. "Let someone else do it" as he learned from Homer Simpson.

  9. Stagnation by aslashdotaccount · · Score: 2

    I was about to say that this speaks poorly of the breadth of the current generation's literary interests, and then I recalled books like Little Women and Lord of the Files, or even Arthur C. Clarke's Childhood's End (although the Rama series might be more about descriptions than emotional exposes). Still, it's a little disheartening that technical manuals don't hit the bestseller lists. On the upside, Noam Chomsky will be overjoyed by this development; soon software systems will be developed to 'generate' hit books. Someone get Angelina (Mike Cook's, not Pitt's).

    1. Re:Stagnation by noh8rz10 · · Score: 4, Insightful

      On the upside, Noam Chomsky will be overjoyed by this development; soon software systems will be developed to 'generate' hit books. Someone get Angelina (Mike Cook's, not Pitt's).

      I see, so Angelina Jolie used to be an academy-award-winning actress, but now she's just Mrs. Pitt?

    2. Re:Stagnation by aslashdotaccount · · Score: 1

      Had Mike Cook named the program 'Pitt' I'd have said "Jolie's"

    3. Re:Stagnation by noh8rz10 · · Score: 0

      oh, i see. now academy-award-nominated brad pitt is just Mr. Jolie? Protip - one human being can never own another human being.

    4. Re:Stagnation by Anonymous Coward · · Score: 0

      Protip for your algorithm:
      Human beings often identify other human beings by their association with yet other human beings.

    5. Re:Stagnation by jollyreaper · · Score: 1

      I see, so Angelina Jolie used to be an academy-award-winning actress, but now she's just Mrs. Pitt?

      She's an aging sack of bad plastic surgery who's been in too many terrible movies. A pretty good match for her hubby at that.

      --
      Kwisatz Haderach
      Sell the spice to CHOAM
      This Mahdi took Shaddam's Throne
    6. Re:Stagnation by Impy+the+Impiuos+Imp · · Score: 1

      Protip for your algorithm:

      Human beings often identify other human beings by their association with yet other human beings.

      You mean tip for nerds.

      --
      (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
    7. Re:Stagnation by noh8rz10 · · Score: 1

      I see, so Angelina Jolie used to be an academy-award-winning actress, but now she's just Mrs. Pitt?

      She's an aging sack of bad plastic surgery who's been in too many terrible movies. A pretty good match for her hubby at that.

      That's why they didn't take each other's names :)

      But seriously son she's the mother of six children, so stop being a douche

    8. Re:Stagnation by SleazyRidr · · Score: 1

      Sure they can. It's been happening for millennia. Have you been living under a rock?

    9. Re:Stagnation by Anonymous Coward · · Score: 0

      She is an independent women who needs no men...

  10. What about marketing? by sd4f · · Score: 1

    We all know advertising and product placement can make a big difference and return on investment, so what about including paid for marketing and tv show plugs into the modelling? Nothing can be successful if no one has heard of it.

  11. Authors fail to understand ... by MacTO · · Score: 2

    Two quotes stand out for me:

    "It's very difficult to quantify decisions that are often made by intuition and relationships."

    The study claims that at least some of those decisions are quantifiable, which pretty much contradicts Hamilburg's point.

    "Of stylistic characteristics, the scientists are flying in the face of most teaching of creative writing when they emphasize nouns over verbs. Verbs are the engine of fiction and quality writing is often measured by their variety, precision, and force,"

    Hansen appears to have missed the point of the study: it is about what sells, rather than what's taught or what makes quality writing.

    1. Re:Authors fail to understand ... by plover · · Score: 4, Interesting

      However, the sample's study makes exactly the same mistake. They used Project Gutenberg as the source, and download counts as a substitute for sales. Sales has one measure: the number of dollars in the cash box at the end of the day. They should be measuring books on the NY Times bestseller list, or the Amazon Top 10 list, which have actually sold for money and are actually popular (fraudulently placed books aside.) And they should be comparing them against books from their own genres, or at least books that had similar attributes.

      I think what they'd really find is that "books that sell well are those that are marketed well", regardless of the words they contain.

      Maybe they could focus on a specific key reviewer: what does Oprah like and not like? Maybe when they cross compile the data from all the books, they will find they've only discovered Oprah's tastes. Which isn't a bad outcome, if they are ultimately trying to discover what kinds of books will be better positioned to make the author money. But I don't think they've come close to predicting fiction "best-sellers" yet.

      --
      John
    2. Re:Authors fail to understand ... by RabidReindeer · · Score: 3, Interesting

      Success comes in two flavors.

      Gutenberg is stacked with classics. Stuff that has been successful over a long period of time. Some classics were flops when they were first published and some go periodically in and out of favor.

      The NYT bestseller list, Oprah, et. al. focus on what's popular today. Relatively few books that make those lists will be popular in a century just as many of the bestsellers from Dickens' day would only be known to literary historians. And missing from Gutenberg.

    3. Re:Authors fail to understand ... by plover · · Score: 2

      I was commenting based on the title of the articles discussing the study: "Algorithm aims to predict fiction bestsellers"; and "Computer Algorithm Seeks to Crack Code of Fiction Bestsellers". The strong implications are that the algorithm is designed to unlock the secret of making money by writing books that contain certain words or linguistic structures. I'm arguing that a book's financial success has much less to do with any ephemeral "bestsellerness" quality, and has a much stronger association with "marketing campaigns".

      Now, that may or may not be the basis for why the researchers performed their study, or even what they hoped to learn, but it's how their study is being perceived by the media. Which is ironically making my point: it isn't the facts or the content of the study that's important, it's the coverage of the story that's put the slant on what they found. If the [marketing|reporting] for this study had instead said "Researchers develop algorithmic approach to search for linguistic commonalities in Project Gutenberg texts", it probably wouldn't even have merited notice on Slashdot.

      Tl;dr: marketing wins.

      --
      John
    4. Re:Authors fail to understand ... by AthanasiusKircher · · Score: 2

      Gutenberg is stacked with classics. Stuff that has been successful over a long period of time. Some classics were flops when they were first published and some go periodically in and out of favor.

      Or, in other words, what counts as a "classic" right now is simply what's popular today. I think the trends can be better seen in music history. Take, for example, Pachelbel's Canon in D, that piece which seemingly shows up everywhere as "classical music." Johann Pachelbel, however, was a master composer, well-known in his lifetime for all sorts of compositions. Today he has one stupid piece played at thousands of weddings and other occasions every year, just because of some whims of audiences in the late 1960s who got interested in it.

      Take Antonio Vivaldi, who was hugely popular in his lifetime, then almost completely forgotten for centuries (he died a pauper, so his fame was as short-lived as many pop artists today), until some Italian archivists dug up his thousand-or-so compositions in the 1920s, and these pieces were then deliberately promoted as part of Italian cultural history beginning in the 1930s.

      Or, heck, for a recent example, look at Thomas Tallis's Spem in alium, a Renaissance motet that was pretty obscure until the past couple of years after it appeared in the novel Fifty Shades of Grey. Suddenly, recordings of the piece bounded up to the top of the charts, and it has led to a new interest in Renaissance music and certain early music performance groups.

      I'm not saying that these pieces or composers don't have great value or that they shouldn't be "classics." But I do think that interest in particular "classics" is driven almost as much by current culture as actual current art/literature/music is. Measuring downloads from Project Gutenberg is giving us a particular snapshot into what is considered "classic" literature for the past few years. Fifty years ago, or a hundred years ago, I can guarantee you that the lists would be different -- and not just because of works written since then.

    5. Re:Authors fail to understand ... by AthanasiusKircher · · Score: 1

      Tl;dr: marketing wins.

      Well, I think it would be more fair to say marketing and current fads win. A bestselling author may not need to do any marketing at all, other than mentioning, "By the way, I'm coming out with another book," and it will probably still sell well. Books about famous people or written by celebrities will also often sell, regardless of whether they are marketed heavily. Similarly, books about current fads (diets, financial advice, etc.) may also sell pretty well -- the first book regarding a fad may need some marketing to get going, but subsequent books often just follow on that.

      On the other hand, there are PLENTY of examples of things that were marketed like crazy and still fail.

      My general point is that marketing is most critical when something isn't yet well-known or associated with something well-known. Ultimately, fame wins... particularly when we're talking about bestsellers. That, and random fads.

    6. Re:Authors fail to understand ... by plover · · Score: 1

      I just read the first few pages of the study, and it seems the authors tried to control for the "fame of the author" aspect as much as they could, with things like excluding a second text by the same author in the same genre, that sort of approach. And as suspected, the study is much more modest than the article titles suggest. They are looking for "success" as defined by their own criteria, not "money" or "bestsellers".

      But it was the marketing hype that got me to read a study by some random researchers. That's got to say something.

      --
      John
  12. Hypnosis by Neo-Rio-101 · · Score: 1

    Hypnosis is nothing other than an elegant description of a process.

    I think it's interesting that as you find yourself looking at this screen, and focusing in clearly to these words you are reading.... you remember a time when you felt very VERY tired... maybe after a long work meeting or after staying up late working on a paper.... What you become to notice is that you are slowly find your eyes relaxing deeply,..... and as you become aware that your mind is slowing down and your mouth widens and begins to yawn... you feel your eyes are closing as you drift off to sleep....

    --
    READY.
    PRINT ""+-0
    1. Re:Hypnosis by Anonymous Coward · · Score: 0

      I think it's interesting that your offtopic tripe was modded up.

  13. can it explain... by able1234au · · Score: 3, Interesting

    Perhaps they can explain why Fifty Shades did well despite being badly written.

    There is a danger in this process that we end up with a "Save the cat" problem where everything has to follow a formula
    http://www.slate.com/articles/arts/culturebox/2013/07/hollywood_and_blake_snyder_s_screenwriting_book_save_the_cat.html

    1. Re:can it explain... by bob_super · · Score: 5, Insightful

      50 shades is a textbook example of a perfect marketing campaign. It cannot fit an algorithm, it's a total outlier.

      They sent out press releases to all the agencies about the new phenomena of women using the wonderful anonymity of e-readers/tablets to read Mommy porn, like that "50 shades" thing.
      Journalists just repeated the press releases, over and over again, almost exactly word for word, on various networks, because that's a topic that draws viewer attention.

      And suddenly everyone knew that apparently a lot of people were reading that "50 shades" book, and that reading it was both cool and risqué. Jackpot.

      I read one page of the book that was published on a website. It was worse than the transcript of a reality TV show. it wasn't just bad literature, it was barely passable English.
      But the marketing was absolutely brilliant.

    2. Re:can it explain... by able1234au · · Score: 1

      That's classic. I would prefer to read the book on the marketing campaign. It is original, brilliantly executed and delivered results. Forget the original book.

    3. Re:can it explain... by techno-vampire · · Score: 1

      If you think Fifty Shades was bad you need to read Naked Came the Stranger, a best seller with absolutely no literary or social merit that became even more popular when it was revealed to be a hoax.

      --
      Good, inexpensive web hosting
    4. Re:can it explain... by clickclickdrone · · Score: 1

      Or Harry Potter which as Rowling was constantly told, had none of the right tick boxes ticked and many of the 'avoid' ones ticked. Didn't end up doing too badly as I recall after the first few dozen rejections. To be fair, that probably falls into the ~15% 'got it wrong' region the story mentions.

      --
      I want a list of atrocities done in your name - Recoil
    5. Re:can it explain... by Anonymous Coward · · Score: 0

      Original? Faking success to acquire success and preying on humans' pack animal instincts are very old marketing techniques. Publishers will have authors buy large amounts of their own book to get on the bestseller list.

    6. Re:can it explain... by hey! · · Score: 2

      I read Snyder's book because he was a friend of a friend. First off, it's not about *everything*. It's about movie scripts. Secondly it's a bit naive to blame the lack of creativity of modern movies on his book; that's a trend that predates 2005.

      In any case screenwriters are nothing like the olympian figures playwrights are in theater. The main creative force in a movie is the director, and writers are relatively minor figures in the enterprise. In the theater the script is gospel. In the movies a director routinely adds to, deletes from or reorganizes a script as he sees fit. It's important to realize that screenwriters structure screenplays, but directors structure the movies. A screenplay isn't the movie story; it's a guideline that helps the director imagine the story HE will tell. Thus things like the page count for each story beat *are for the benefit of the director*, and don't have much if any relationship to the pace of the story as seen by the moviegoers.

      What Snyder did for screenwriting was analogous to what agile programming advocates did for programming. He codified the practices from successful projects. What the linked article does is at best intellectually sloppy or at worst, disingenuous. Mr Suderman applies the 15 beat structure to recent movies, but fails to note that the same can be done for nearly every commercially successful movie in the last 80 years.

      As for 50 SHADES, it's possible the formula *might* explain why it is more successful than other readily available erotica. And marketing helps too, but remember this was initially a self-published book that took off by word of mouth.

      Ultimately, when you become a discerning reader, you realize that practically every novel is flawed in some way or another. And while all things being equal a better written novel is more likely to be successful, all things are most definitely NOT equal. You cannot craft your way to success with readers, you have to speak to something in them. It's more important that a story does something right, than it does everything, or even most things right.

      --
      Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
    7. Re:can it explain... by martin-boundary · · Score: 2
      Porn works like that. Have you ever landed on a porn video site? Most of the videos have no story, and show even less acting ability or camera skills.

      But you know what? Nobody cares! It's the same with 50 shades, people don't read it because it's art. Women read it to get ideas and phantasies. And to be honest most porn sites don't cater to women, so they have a limited choice in the matter.

    8. Re:can it explain... by dkleinsc · · Score: 1

      Perhaps they can explain why Fifty Shades did well despite being badly written.

      Because people like reading about sex. That's also why romance novels routinely feature good-looking half-naked people on the cover.

      Think of it this way: If the movie is about sex, we'll put up with inane dialog, completely predictable plots, and wooden acting, just to watch a couple of people we'll never meet get it on. Why would you expect books to be much different?

      --
      I am officially gone from /. Long live http://www.soylentnews.com/
    9. Re:can it explain... by wisnoskij · · Score: 1

      The algorithm would be trying to guess how well the book would do on the market, not how well it was written.
      How well a book is written has little to do with how many copies you can sell of it.

      --
      Troll is not a replacement for I disagree.
    10. Re:can it explain... by wisnoskij · · Score: 1

      I am really surprised at this. I really like the series, but I would never consider it anything other than a somewhat bland very easy read. I think they need to review their formula, because I think HP is a text book example of a mass marketable, guilty pleasure/easy read, that everyone can enjoy.

      --
      Troll is not a replacement for I disagree.
    11. Re:can it explain... by umafuckit · · Score: 1

      50 shades is a textbook example of a perfect marketing campaign. It cannot fit an algorithm, it's a total outlier.

      I suspect that, almost by definition, many best-sellers are outliers. They owe their popularity to marketing, the whims of the book-buying public, what's currently trendy, etc. Like 50 shades of grey, they likely won't succumb to an algorithm.

    12. Re:can it explain... by sandytaru · · Score: 1

      There was nothing original about that book. You can find 500,000 similar things on Fanfiction.net.

      --
      Occasionally living proof of the Ballmer peak.
    13. Re:can it explain... by clickclickdrone · · Score: 1

      Reason's she had were it was far too long, kids books don't make money etc. It only got published as a favour after the publisher's 8yo daughter got sight of the manuscript and pestered him for more. The initial print run was 500 copies, they really weren't expecting it to sell.

      --
      I want a list of atrocities done in your name - Recoil
    14. Re:can it explain... by Impy+the+Impiuos+Imp · · Score: 1

      Well, porn can't follow that formula because it *must* save its "all is lost moment" for the final seconds.

      --
      (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
    15. Re:can it explain... by the+phantom · · Score: 1

      Um... the OP was suggesting that the marketing campaign was original, not the book.

    16. Re:can it explain... by Anonymous Coward · · Score: 0

      Wasn't that the way Music worked until recently. Labels pushing radio stations to repeat a song over and over and over again until it sticks.

      I'm guessing publishers could have done the same.

    17. Re:can it explain... by able1234au · · Score: 1

      What Snyder did for screenplays was fine. He helped writers understand a structure. The problem was management being risk averse and insisting that all movies follow the Cat even in some cases to the exact minute. Management not knowing their industry and so constraining script writers from doing what they do best. It is the same problem where one successfully movie comes out and suddenly there are lots of copies in the same genre simply because that is lower risk. None of this is Snyder's fault. One reason i do not watch many Hollywood movies. I find foreign language movies and indie movies much more interesting, unpredictable and refreshing.

    18. Re:can it explain... by able1234au · · Score: 1

      I get that, but 50 shades is badly written sex. There is no shortage of better written books that will steam up your glasses.

  14. Re:Algorithm Aims To Predict Fiction Bestsellers by MobSwatter · · Score: 1

    71.4% of algorithms agree.

  15. Quantum Literature by edibobb · · Score: 1

    Look for modern fiction to adjust to fit the parameters of the application, degrading to a common level and uniform format. The literature cannot be observed without being altered. It will be lot like the mandatory movie formula. The content itself is irrelevant.

  16. What a stupid idea by OhANameWhatName · · Score: 2

    1. Read the algorithm
    2. Write a book
    3. Profit!!!

    I just wrote an algorithm that predicts that no book detailing the death of creativity at the hands of science will ever be written.

    1. Re:What a stupid idea by Anonymous Coward · · Score: 0

      I just wrote an algorithm that predicts that no book detailing the death of creativity at the hands of science will ever be written.

      Well, perhaps not book length, but that theme has been done in several SF stories.

  17. Uck by speedplane · · Score: 2

    Does this article make everyone else as sick as it makes me?

    --
    Fast Federal Court and I.T.C. updates
    1. Re:Uck by symbolset · · Score: 4, Funny

      Nowhere does it mention the one weird trick that effortlessly melts away the pounds in six minutes while you sleep - that the government doesn't want you to know because it creates instant wealth for the few who know this secret.

      --
      Help stamp out iliturcy.
    2. Re:Uck by Anonymous Coward · · Score: 0

      Your ideas are intriguing to me, and I wish to subscribe to your newsletter.

    3. Re:Uck by Anonymous Coward · · Score: 0

      Does this article make everyone else as sick as it makes me?

      I don't know. Maybe I should write an algorithm that tells me.

    4. Re:Uck by psithurism · · Score: 1

      Does this article make everyone else as sick as it makes me?

      Nope, I got no idea what you are talking about. In fact, I found it pleasant.

      Acknowledging large shortcomings of their study, the one thing they seem to find was that if you want your fiction book to remain popular with a broad audience, you should take my middle school English teacher's advice and show don't tell.

      They came up with no magic: "save the cat" formulas to make hits and the industry expert says that this study won't help him much, stories still too complex to predict best sellers.

      Further, they point out that finding the magic rules for broad-audience success books still won't ruin the industry since topics are so important; as a hyperbolic example: I'm currently enjoying "More effective STL." The plot is pretty bland, but it's one of the best books I've read in a year, and I've highly recommended it to certain friends. However, I doubt it will outsell "The Lost Symbol."

  18. Re:New Coke/New Waists/New Privacy Invastions/New by Anonymous Coward · · Score: 1

    Keep in mind, there are more to costs then just money.

    Well, if you get "just money" after the costs it shouldn't be too hard to deal with them.

  19. while (!book.isBestSeller()) book.text = random(); by Anonymous Coward · · Score: 0

    If this is anything like the song version, expect the text equivalent of dubstep and oh-ahs.

  20. Well... by Anonymous Coward · · Score: 0

    Never heard of 'Fitcoin', is that a descendant/parent of 'Bitcoin'? Please advise :)

    1. Re:Well... by Anonymous Coward · · Score: 0

      It's a new currency where you have to provide proof of physical work, so you work on your fitness while mining.

  21. Waste of time by Karmashock · · Score: 1

    These things don't actually work. They're curiosities and nothing more.

    When they finally develop strong AI... then you might have something. But a non-intelligent system is not going to figure these things out.

    --
    I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
    1. Re:Waste of time by Anonymous Coward · · Score: 0

      The real waste of time is all this "fiction" to begin with. If you really want something to read, why not start with Feynman. Or you could just go back to this

  22. The illusion of choice made real. by VortexCortex · · Score: 1

    Preceding any great scientific advancement or discovery it is no accident that you will find a surge in the fiction and cultural themes surrounding it.

    The New World, Forensics, Avionics, Electronic Computing, Nuclear Reaction, Rocketry, Robotics.

    The cultural mind thinks as you do. Its subconscious boils with the direction it will soon take. Ask yourself: What is seen much more now in your culture? What makes you think you have any choice but to latch onto any thoughts but those which come to mind from within? What makes you think society can choose from among the roiling themes anything other than what pattern is most apt? What makes you think?

    Cybernetics.

    1. Re:The illusion of choice made real. by rivercityrandom · · Score: 1

      Ask yourself: What is seen much more now in your culture? What makes you think you have any choice but to latch onto any thoughts but those which come to mind from within? What makes you think society can choose from among the roiling themes anything other than what pattern is most apt?

      Judging from the large amount of post-apocalyptic movies and books that are currently popular, I'd say the end of the world is fast approaching.

  23. here's the article by martin-boundary · · Score: 1
    http://bit.ly/1dgDo7d . Come on slashdot editors, do the legwork and link the article directly! Otherwise people will post a link in the comments, and who's to say it's not a goatse?

    Anyway, I'm a little worried about the methodology. If you train on PG, and test on PG your generalization error will suffer. This is especially easy to get wrong when both the train and test set are constructed repeatedly with various thresholding rules, and the classifier features are (presumably) optimized during the research being conducted.

    1. Re:here's the article by magic+maverick+ · · Score: 1

      You're a cunt for linking to a URL shortener instead of the article directly. You're as bad as the shithead editors. If you're worried about goatse or similar, don't use some shit URL shortener.

      http://aclweb.org/anthology/D/D13/D13-1181.pdf

      There. Easy. Why couldn't you have done that you high-horsed cunt?

      --
      HELP MY ACCOUNT HAS BEEN HACKED BY AN ILLIBERAL ART STUDENT SET TO DESTROY THE INTERWEBZ!
    2. Re:here's the article by mcgrew · · Score: 1

      http://bit.ly/1dgDo7d . Come on slashdot editors, do the legwork and link the article directly!

      Come on, martin, do the legwork and link it directly. This isn't twitter and most folks are wary of shortened links; trolls love hiding their goatse and tubgirl links. I only clicked it because your UID is relatively low and you hadn't (yet) been modded down.

  24. Already done - albeit in fiction by jimicus · · Score: 1

    It's already been done - though only in fiction.

    Roald Dahl wrote about a machine called the Great Automatic Grammatizator. A machine that you plug in various parameters - such as type of book, characters, proportions of violence/sex/humour - and it churns out something that's pretty much guaranteed to be a bestseller according to those parameters in fifteen minutes flat. Being a writer himself - and a somewhat dark one at that - the end result was a dystopian universe in which writers were forced to give up writing and just license their name to the man with the machine, simply because the machine brought the cost of production down so much that this was the only way to earn a living as a writer.

  25. algorithm by argStyopa · · Score: 1

    Remember, there's a HUGE difference between successful and "good".

    "Successful" means appealing to the dozen or so big publishers' editors, such that they are willing to pimp your book and market it. They can - and have, obviously - taken utter crapola to the top of the "bestseller" lists.

    I entirely understand that the algorithm favors deep internal monologues, because those editors clearly love them.

    --
    -Styopa
    1. Re:algorithm by ebno-10db · · Score: 1

      there's a HUGE difference between successful and "good"

      "Good" is subjective. It's some sort of consensus amongst people who are, for whatever reason, considered literary experts. Consider the "classics". Some are good and some suck. I tried to read "Moby Dick" and found the perfect cure for insomnia. People said "just get past all the boring and extraneous stuff". Sorry, but if a book is full of boring and extraneous stuff, then it's not a good book. Maybe it would have been if Melville had had a good editor. OTOH some classics are great. I just read "All Quiet on the Western Front", and it's one of the best books I've ever read. It's not self-consciously literate, which is part of why it's good. It was also a best-seller in it's day.

      BTW, a contemporary author who is "literate", but also thinks novels should have a plot and be enjoyable to read, is Michael Chabon. He even promotes and has written what's usually dismissed by literary types as genre fiction.

  26. Hindsight is 20/20 by tomhath · · Score: 1

    Their algorithm had as much as an 84 percent accuracy rate when applied to already published manuscripts

    I could write an algorithm that's 100% accurate selecting yesterday's lottery numbers.

    1. Re:Hindsight is 20/20 by umafuckit · · Score: 1

      I could write an algorithm that's 100% accurate selecting yesterday's lottery numbers.

      That's why data analyst's cross-validate their models. Granted, cross-validation doesn't cure everything (e.g. If the question is already overly specific, or if the analyst double dips in some other way) but it will stop over-fitting and performing at 100%. I downloaded the paper and did a quick search: the authors used a support vector machine for the classification (which effectively allows for fitting of very non-linear boundaries) and they tested it with 5-fold cross-validation. So they given that they did the latter, they got the basics right.

  27. A block buster? by ai4px · · Score: 2

    A blockbuster movie? Space, cowboys, roughnecks, scenes of things blowing up, impending doom saved at the last minute and a guy who doesn't make it home and leaves behind a beautiful girl. Oh and crazy Russians. Perfect formula. A blockbuster song? repeating lyrics which drone on and a drum machine. The public just seems to love it this way!

    1. Re:A block buster? by turning+in+circles · · Score: 1

      We know the formula for a blockbuster movie, it's been published in a book called Save the Cat (and reviewed here).

      The great thing about books, especially now, is that there is very low cost to publishing them compared with movies so you can be more experimental. Hopefully, no one will figure out the complete secret to their success in the same way movies are known.

      --
      Might as well face it I'm addicted to data.
  28. This is nothing new - or troublesome. by ebno-10db · · Score: 1

    What the algorithm looked at was writing style. That's hardly new. Teachers have been recommending this or that writing style, probably since the preferred medium was stone tablets. Slavish devotion to such recommendations is obviously undesirable, and a few outliers and experiments are necessary if you don't want writing styles to become stultified. But taking some advice about it is nothing new or undesirable. This study said nothing about structure (for which there are also standard recommendations) or subject.

    All creative endeavors require a certain amount of less creative craftsmanship to be done well.

  29. 50 Shades of Marketing by mariox19 · · Score: 1

    It was time to sell the client on the proposed campaign, and I came to the meeting with trepidation. Mr. House could be cruel. That was his reputation. And I wondered, would he like my proposal? Would he like me? Or would I find myself on the receiving end of one of his legendary smack-downs? Would I leave, hurt? I knocked at the door of the meeting room and heard his stern answer. "Come in," he said, menacingly. I entered, knowing my life might never be the same. [...]

    Trust me, you don't want to read about the marketing campaign.

    --

    quiquid id est, timeo puellas et oscula dantes.

  30. Extreme selection bias. by Jartan · · Score: 1

    If you read the article they're not really examining best sellers at all. A site like Gutenberg has no correlation with modern best sellers.

    Film, TV and Internet have all had drastic effects on the market as well. Thus old books aren't really representative.

  31. Obviously by avandesande · · Score: 1

    Writers aren't going to spend effort to create a well written book about subjects people aren't interested in.

    --
    love is just extroverted narcissism
  32. This is *really* a BAD idea by whitroth · · Score: 1

    Get something that, krufted up, will work... and the publishers will use it, rather than have readers decide what should be published. You like the crap packaged as "music" from the members of the RIAA? You'll see that in books, too....

                      mark

  33. Next Step by Anon,+Not+Coward+D · · Score: 1

    Making an algorithm to WRITE the next fiction bestseller

    --
    Sometimes it's better not having signature
  34. How does Project Gutenberg select its texts? by as.kdjrfh+sxcjvs · · Score: 1

    http://www.gutenberg.org/wiki/Gutenberg:General_FAQ#G.13._How_does_Project_Gutenberg_choose_books_to_publish.3F

    All-volunteer; what people scan and proofread is what's there, after a copyright check. Some things that were popular and are therefore common; some things that were always rare and therefore an enthusiast scanned a copy; some things people sought out to fill out a subject heading. There's *lots* of old light fiction, adventure stories and social comedies, that no-one's cared about for a century. (I find it fascinating what changed, and what didn't, and what changed *first*. I love old B-side books.)

  35. Predicting the past? by dpbsmith · · Score: 1

    "[they believe they have found an algorithm that might] predict which fiction books will be successful. Their algorithm had as much as an 84 percent accuracy rate when applied to already published manuscripts in Project Gutenberg and other sources."

    I can predict the success rate of already published books with 100% accuracy.

    Backtesting is usually bogus because it means nothing unless the experimenter can precisely enumerate the total number of rules that were formulated and discarded--including those formulated and discarded intuitively--before arriving at the one that tested well. If you consider 100 possible systems, the chances that at least one of them will test with results significant at the 1% level is 63%.

    Also, "A Tale of Two Cities" IS in the Project Gutenberg database, right here, which doesn't give me much confidence in anything else they say...

  36. Changes the Agency and Publishing game by rhyous · · Score: 1

    This is actually going to change the publishing game. It may take time to roll this out, but soon, novels will have to be uploaded digitally to agents and publishing houses electronically. They will buy this software and let it "proofread" the authors work. Only if the book has all the qualities of a best seller, will it be read by the agent or publishing house (or one of their lackeys). Then an online site will rise that allows authors to check their work before they submit it.