Slashdot Mirror


Netflix Prize May Have Been Achieved

MadAnalyst writes "The long-running $1,000,000 competition to improve on the Netflix Cinematch recommendation system by 10% (in terms of the RMSE) may have finally been won. Recent results show a 10.05% improvement from the team called BellKor's Pragmatic Chaos, a merger between some of the teams who were getting close to the contest's goal. We've discussed this competition in the past."

83 comments

  1. No info about the Netflix prize by Daimanta · · Score: 2, Insightful

    C'mon, the Netflix prize isn't THAT well known. At least you could have given some basic info about it.

    --
    Knowledge is power. Knowledge shared is power lost.
    1. Re:No info about the Netflix prize by Reikk · · Score: 5, Informative

      Background: The Netflix Prize is an ongoing open competition for the best collaborative filtering algorithm that predicts user ratings for films, based on previous ratings. The competition is held by Netflix, an online DVD-rental service, and is opened for anyone (with some exceptions). The grand prize of $1,000,000 is reserved for the entry which bests Netflix's own algorithm for predicting ratings by 10%.

    2. Re:No info about the Netflix prize by circletimessquare · · Score: 1

      the new york times had a great story about it in november

      slashdot story below includes links to that nyt article, and other slashdot stories about the netflix prize:

      http://science.slashdot.org/article.pl?sid=08/11/22/0526216

      --
      intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
    3. Re:No info about the Netflix prize by MrMista_B · · Score: 4, Informative

      What, you didn't even read the /summary/?

      I know, this is Slashdot, but 'some basic info about it' is /right there/.

    4. Re:No info about the Netflix prize by bogjobber · · Score: 4, Informative

      If the first sentence didn't explain it enough, perhaps you could RTFA.

    5. Re:No info about the Netflix prize by Abstrackt · · Score: 1

      So RTFS means read the first sentence eh? ;)

      --
      They say a little knowledge is a dangerous thing, but it's not one half so bad as a lot of ignorance. - Terry Pratchett
    6. Re:No info about the Netflix prize by Korin43 · · Score: 2, Informative

      Except it doesn't mention what an improvement of 10% means (unless you know what RMSE means, which I don't).

    7. Re:No info about the Netflix prize by Anonymous Coward · · Score: 0

      "The long-running $1,000,000 competition to improve on the Netflix Cinematch recommendation system by 10%..."

      It's right there in the opening sentence. Easy to miss.

    8. Re:No info about the Netflix prize by JohnnyBGod · · Score: 2, Informative
    9. Re:No info about the Netflix prize by romcabrera · · Score: 1
  2. 1 Million split 7 ways by basementman · · Score: 4, Funny

    Let's see, $1,000,000 split 7 ways gives us $142,857.14 each. Let's say taxes take half, now you are down to only $71,428.57 each. Unless one of them kills all of their partners like in The Dark Knight that ain't much of a prize.

    1. Re:1 Million split 7 ways by quanticle · · Score: 5, Insightful

      Well, just like the Ansari X Prize didn't cover the costs of developing and launching a suborbital rocket, the Netflix Prize isn't really meant to be a large enough prize to fully fund the development of a new recommendation algorithm. The purpose of the prize is to stimulate interest and get people started. The real reward will come when they turn their algorithm into commercialized software - the rewards from making such a thing applicable outside of Netflix could be large indeed.

      --
      We all know what to do, but we don't know how to get re-elected once we have done it
    2. Re:1 Million split 7 ways by neokushan · · Score: 5, Insightful

      Pretty sure having it on their CV means they can effectively write their own pay cheque in terms of job opportunities.

      --
      +1 IDisagreeSoHeMustBeATrollOrAnAstroturferOrAShill
    3. Re:1 Million split 7 ways by Anonymous Coward · · Score: 0

      That's one perspective on it, and a bit of a leap that this is will see much use outside of Netflix. On the other side, Netflix got tons of people to do something for much less than it would have cost them in-house. So: lots of losers, a few people who did okay and have a small chance of getting good money from it, and Netflix getting labor on the cheap. If the people who entered the competition did it out of a genuine labor of love then bully for them, but if someone entered that competition for the money then it was a bad decision.

    4. Re:1 Million split 7 ways by QuantumG · · Score: 1

      The X-Prize was designed to encourage the creation of a vehicle that would demonstrate the feasibility of a new market. It was backed up with a whole lot of market research which showed that people would happily sign up for a flight on such a vehicle. The anticipated business plan that it was trying to encourage was:

      1. Build a vehicle that is very reusable and can put passengers into space.
      2. Win the prize and get the PR.
      3. Take bookings and reservation fees to fund the next flight.
      4. Fly the first passengers in your winning vehicle.
      5. Repeat 3, 4 until Profit!!

      Unfortunately they didn't anticipate that some big personality like Burt Rutan would come along and do:

      1. Convince an investor to invest *twice* the price to build a vehicle.
      2. Win the prize and get the PR.
      3. Use the PR to get more venture capital.
      4. Took bookings and reservation fees for a new vehicle that hasn't even been built yet.
      5. Never fly the winning vehicle again.. hang it in the Smithsonian like a freakin' monument or something.
      6. Watch your schedule slip, try to catch up by cutting corners, have a fatal accident.
      7. Announce that your vehicle will be delayed still more as you work on making it a "platform" for doing shit that wasn't in the original spec (can you say space shuttle?)
      8. Demonstrate a fly over of your carrier aircraft, after having to delay the fly over due to malfunctions.
      9. ?????? We're still waiting.
      10. Profit?

      --
      How we know is more important than what we know.
    5. Re:1 Million split 7 ways by Anonymous Coward · · Score: 0

      Man, I don't know about you (obviously), but $71k would go a pretty long way towards making my life a lot better.

      And yes, I know money doesn't buy happiness, but I would be content with rent being paid and food on the table without worry.

    6. Re:1 Million split 7 ways by denzacar · · Score: 1

      Unless one of them kills all of their partners like in The Dark Knight that ain't much of a prize.

      Yeah, but if they deliver him to the competition in a body bag they get another 500 hundred grand.
      A million if alive, so they can teach him some manners first.

      --
      Mit der Dummheit kämpfen Götter selbst vergebens
    7. Re:1 Million split 7 ways by Anonymous Coward · · Score: 0

      Well, I have already killed two of the other guys. Hope fully this doesn't tip off the others.

    8. Re:1 Million split 7 ways by Anonymous Coward · · Score: 0

      "Let's say" that taxes don't take half. There's pretty much nowhere in the country where someone earning 143k would be in the 50% tax bracket.

    9. Re:1 Million split 7 ways by GuruBuckaroo · · Score: 1

      only $71,428.57 each ... that ain't much of a prize.

      That'd pay off my house, with a little to spare for new windows. I call that much of a prize.

      --
      Poor means hoping the toothache goes away.
    10. Re:1 Million split 7 ways by WinstonWolfIT · · Score: 0

      First burn 70 large for laffs. Then say the prize isn't impressive. In that order.

    11. Re:1 Million split 7 ways by SunTzuWarmaster · · Score: 1

      If this was your only job, you would have to pay rent/utilities on the building where you work in addition to rent/utilities on the building where you live. You also have to pay your own health insurance, dental/vision, and retirement/401K.

      The above should cut it down to around a bit above (if not below when you have an office) the average starting salary of an Engineer (55K at last I checked).

      $1M ain't what it used to be.

    12. Re:1 Million split 7 ways by Anonymous Coward · · Score: 0

      Resume item worth much more.

    13. Re:1 Million split 7 ways by Anonymous Coward · · Score: 0

      what the hell are you rambling about? you just did that to invoke an overused meme, right? commercialization of rutan's winning craft is well underway.

    14. Re:1 Million split 7 ways by QuantumG · · Score: 1

      And will continue to be "well underway" for the next 5 years with no-one having flown in it is my bet..

      --
      How we know is more important than what we know.
    15. Re:1 Million split 7 ways by Pollardito · · Score: 1

      When they write their own paycheck, they should put Blockbuster in the account name

  3. Do they keep the prize money? by whoever57 · · Score: 1

    Assuming no-one turns up a better score in the next 30 days, do the team members who work for Yahoo and AT&T get to keep their share of the prize money?

    --
    The real "Libtards" are the Libertarians!
    1. Re:Do they keep the prize money? by lee1026 · · Score: 3, Informative

      AT&T have committed to giving all money to charity. The person at yahoo developed his entry while working at AT&T, so I will be surprised if yahoo gets any of it.

  4. Well done! by Slurpee · · Score: 4, Informative

    Well done Bellkor.

    But now the real race begins.

    Now that the 10% barrier has been reached, people have 30 days to submit their final results. At the end of the 30 days, whoever has the best result wins.

    This is going to be a great month!

    1. Re:Well done! by nmb3000 · · Score: 2, Informative

      Now that the 10% barrier has been reached, people have 30 days to submit their final results. At the end of the 30 days, whoever has the best result wins.

      That's true, but like the story title indicates, the prize may have been achieved. From the contest rules:

      The RMSE for the first "quiz" subset will be reported publicly on the Site; the RMSE for the second "test" subset will not be reported publicly but will be employed to qualify a submission as described below. The reported RMSE scores on the quiz subset provide a public announcement that a potential qualifying score has been reached and provide feedback to Participants on both their absolute and relative performance.

      So the publicly available submission beat the 10% mark, but only by a narrow margin of 0.05%. The private submission must also have surpassed 10% for them to be considered a preliminary winner, otherwise the contest goes on (at least, this is my understanding). In any case, I agree that the outcome should be interesting.

      --
      "What do you despise? By this are you truly known." --Princess Irulan, Manual of Muad'Dib
      /)
    2. Re:Well done! by Wildclaw · · Score: 2, Interesting

      Actually, this email has been sent out

      "As of the submission by team "BellKor's Pragmatic Chaos" on June 26, 2009 18:42:37 UTC, the Netflix Prize competition entered the "last call" period for the Grand Prize. In accord with the Rules, teams have thirty (30) days, until July 26, 2009 18:42:37 UTC, to make submissions that will be considered for this Prize. Good luck and thank you for participating!"

  5. they were able to get the extra 0.5% over the top by circletimessquare · · Score: 4, Funny

    by simply ignoring data from anyone who ever rented SuperBabies: Baby Geniuses 2, Gigli, From Justin to Kelly, Disaster Movie, any movie by Uwe Boll and any movie starring Paris Hilton

    suddenly, everything made sense

    --
    intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
  6. Wonderful. by Anonymous Coward · · Score: 0

    Now will somebody please fix that goddamned Silverlight player?

  7. most of them did it by circletimessquare · · Score: 4, Insightful

    for simple intellectual satisfaction, like a giant puzzle or a game of chess

    money is not the motivation for everything in this world

    --
    intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
    1. Re:most of them did it by basementman · · Score: 1

      Jesus guys I thought the Dark Knight comment would tip you off, I was trying to be funny.

    2. Re:most of them did it by morgan_greywolf · · Score: 2, Insightful

      Well, it was for AT&T. No, they don't want the prize money; they're donating it charity. But what they do have now is an algorithm that can be turned into a commercial product or service. The individual researchers may not have had money as their primary motivator, but their employer sure has hell did.

    3. Re:most of them did it by Saba · · Score: 1

      money is not the motivation for everything in this world

      But it's waaaay ahead of what's in second place.

    4. Re:most of them did it by ouimetch · · Score: 1

      That may be true, but $1,000,000 is still some pretty freaking sweet motivation...

    5. Re:most of them did it by hydromike2 · · Score: 0

      right, money is just what pays for the giant puzzle and the food consumed solving it

    6. Re:most of them did it by OeLeWaPpErKe · · Score: 0, Flamebait

      Heh, it's more than I make in a year. Much more I might add.

      And they could always pay it to a charity. You know "Society for the improvement of John Q. Myself", headquartered in Bermuda, to "cheat" taxes. The whole Obama administration are tax dodgers, or at least, one could be forgiven for thinking so, meaning it can't be that wrong. If "tax-cheaters anonymous" is good enough a government for the american people, surely they can forgive an enterprising tax dodging geek, right ?

    7. Re:most of them did it by Anonymous Coward · · Score: 0

      money is not the motivation for everything in this world

      True... Some men just want to watch the world burn...

  8. Interesting by coaxial · · Score: 5, Informative

    I published a paper using Netflix data. (Yeah, that group.)

    It's certainly cool that they beat the 10% improvement, and it's a hell of a deal for Netflix, since it would have cost them more than a prize money paid out to hire the researchers, the interesting thing is whether or not this really advances the the field of recommendation systems.

    The initial work definitely did, but I wonder how much of the quest for the 10% threshold moved the science, as opposed to just tweaking an application. Recommender systems still don't bring up rare items, and they still have problems with diversity. None of the Netflix Prize work address any of these problems.

    Still, I look forward to their paper.

    1. Re:Interesting by Anonymous Coward · · Score: 1

      Here's the problem with the terms "rare" and "diversity":

      1.) Rare could also be defined as unpopular. Trying to recommend unpopular movies is problematic. Is the computer program going to be able to discern under-rated (Glengarry Glen Ross) or just crap (Ishtar)

      2.) Suggesting "My Little Pony" when I usually rent black and white Samurai movies could be called diversity. Do you want a program that recommends things that are different or things that are similar?

    2. Re:Interesting by Anonymous Coward · · Score: 0

      Cost? Try a genetic algorithm, a decent machine, and some free time. Hell, I wouldn't be surprised if it was a genetic algorithm that combined the other teams' findings into a fitter solution.
      The 'cost' of that would be literally time. Can't get much better than that.

    3. Re:Interesting by Anonymous Coward · · Score: 1, Interesting

      You think using a Generic Algorithm could help?! are you kidding??!! :-)

      The search space is far too great and what would you actually be searching for with this technique? (just curious)

      I would hope to see techniques evolved from an energy variant of Markov Decision Processes! now that 'would' be a nice direction.

    4. Re:Interesting by Trepidity · · Score: 3, Insightful

      Trying to recommend unpopular movies is problematic. Is the computer program going to be able to discern under-rated (Glengarry Glen Ross) or just crap (Ishtar)

      That is indeed an interesting question, and I think it's what the grandparent meant when he pointed out Netflix's contest didn't really address it. The performance measure Netflix used was root-mean squared error, so every prediction counts equally in determining your error. Since the vast majority of predictions in the data set are for frequently-watched films, effectively the prize was focused primarily on optimizing the common case: correctly predict whether someone will like or not like one of the very popular films. Of course, getting the unpopular films right too helps, but all else being equal, it's better to make even tiny improvements to your predictions of films that appear tons of times in the data set, than to make considerable improvements to less popular films' predictions, because the importance of getting a prediction right is in effect weighted by the film's popularity.

      You could look at error from a movie-centric perspective, though, asking something like, "how good are your recommender algorithm's predictions for the average film?" That causes you to focus on different things, if an error of 1 star on Obscure Film predictions and an error of 1 star on Titanic predictions count the same.

    5. Re:Interesting by failedlogic · · Score: 1

      Awesome combo. I'd pay $100 for a Zatoichi vs My Little Pony movie.

    6. Re:Interesting by Anonymous Coward · · Score: 0

      The initial work definitely did, but I wonder how much of the quest for the 10% threshold moved the science, as opposed to just tweaking an application.

      Exactly. The first big "ah-ha" moment was that matrix factorization is the way to go for datasets like this. The second big "ah-ha" moment was that, hey, we need to be using temporal data, too! It's been my opinion that pretty much everything else has been the operators manually training their programs and blends against the quiz set ("tweaking," to put it gently). Put another way, I seriously doubt Netflix could take all the same code, run it over their up-to-date database, and still see a 10% improvement; I don't know if they'd even see a 9% improvement. Personally, I don't think there's even "10%" more signal there.

    7. Re:Interesting by Wildclaw · · Score: 1

      so every prediction counts equally in determining your error

      True, but not true. If we had talked about plain mean error I would have agreed. But as it stands, not every error counts the same. Making a serious error (guessing 1 instead of 5) costs 16 times what making a small error (guessing 4 instead of 5) costs.

      With popular movies you usually have enough data to come decently close with guesses. Sure, you can optimize them more to get even closer with the guesses. But it is a minor profit. On the other hand, you don't have much data on the less popular movies, so the guesses will be far less accurate, and it will pay off far more to make those guesses more accurate.

      If anything, you could say that the competition focused on minimizing the amount of serious errors.

    8. Re:Interesting by Trepidity · · Score: 2, Insightful

      That's true, but since there's not a huge range in ratings, that root-squaring doesn't have nearly as big an effect as the many orders of magnitude difference in popularity. I don't recall the exact numbers offhand, but I think the top-10 movies, out of 17,500, account for fully half the weight.

    9. Re:Interesting by coaxial · · Score: 2, Insightful

      1.) Rare could also be defined as unpopular. Trying to recommend unpopular movies is problematic. Is the computer program going to be able to discern under-rated (Glengarry Glen Ross) or just crap (Ishtar)

      You know what. I actually like Ishtar. I really do. The blind camel, and the line "We're not singers! We're songwriters!" gets me every time.

      So really, the even harder problem is to know when to buck your your friends and go with the the outlier. It's hard, because kNN methods work pretty well, and they're all about going with the consensus of whatever cluster you're in.

  9. alas, basementman by circletimessquare · · Score: 1

    batman even has a better sense of humor than you ;-)

    --
    intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
    1. Re:alas, basementman by Anonymous Coward · · Score: 1, Funny

      I'm BatCoward.

  10. i assume you're joking, but.. by spiffmastercow · · Score: 1

    That actually makes a lot of sense... Remove the wildcards and you'll be able to get a much more accurate result for everyone else. You might suffer a bit when you're reccomending to people who like movies that are absolutely terrible, but you'd make up for it my not factoring that into the equation at all. This is, of course, assuming that only a few people actually watched those movies.

  11. "recommendations" by Gothmolly · · Score: 2

    Who listens to these sort of things anyway?

    --
    I want to delete my account but Slashdot doesn't allow it.
    1. Re:"recommendations" by Anonymous Coward · · Score: 0

      Who listens to these sort of things anyway?

      Anyone who listens to pandora.

    2. Re:"recommendations" by Anonymous Coward · · Score: 0

      Who listens to these sort of things anyway?

      Anyone who listens to pandora.

      And exactly who would listen to someone who released all evils upon mankind?

  12. i was joking, however by circletimessquare · · Score: 5, Interesting

    from the excellent nyt article about the competition in november:

    http://science.slashdot.org/article.pl?sid=08/11/22/0526216

    it isn't bad movies that are the problem, taste in bad movies can still be uniform

    the real problem is extremely controversial movies, most notably Napoleon Dynamite

    http://www.imdb.com/title/tt0374900/

    not controversial in terms of dealing with abortion or gun control, but controversial in terms of some people really found the movie totally stupid, while some people really found the movie to be really funny

    movies like napolean dynamite are genre edge conditions, and people who apparently agree on everything else about movies in general encounter movies like this one and suddenly dramatically differ on their opinion of it, in completely unpredictable ways

    --
    intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
    1. Re:i was joking, however by bill_mcgonigle · · Score: 3, Interesting

      Yeah, all the recommendation systems where I've bought or rented movies, and most of my friends all said I needed to see 'Fight Club', so I did, and ... meh.

      Consider this list of movies I've bought/rated highly:

        12 Monkeys
        V for Vendetta
        Lost in Translation
        Donnie Darko
        A Beautiful Mind
        Dogma

      I might be grouped with folks who enjoy flicks about identity, man vs. man, those who aren't easily offended, etc. But there doesn't seem to be as clear a way to find a group of people who find aggression offensive, which is basically the driving theme of Fight Club. Perhaps given enough negative ratings it could be possible, but even though I've clicked 'Not Interested' on all the Disney movies, they keep suggesting I want their latest direct-to-DVD crapfest, so I'm left to assume they're rating mostly based on positive ratings.

      --
      My God, it's Full of Source!
      OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
    2. Re:i was joking, however by Anonymous Coward · · Score: 0

      at first i found the movie totally stupid but then i smoked pot and watched it again and found it totally funny and if i had to rate the movie now i would pick funny. the real problem is in determining who these "fringe reefer addicted" movie-people are, and through logic and deduction of a person's movie review library (for: harold & kumar 1 & 2, how high, don't be a menace in south central while drinking your gin and juice, cheech & chong, etc. et. all) to calculate a "stoner" ratio system which corrects for the controversial quotient. ya dig?

    3. Re:i was joking, however by moosesocks · · Score: 1

      Examining the standard deviation of the ratings received by each film should provide a reasonable (albeit imperfect) indicator of the "controversialness" of any given film.

      Of course, there are a few ways to improve upon this, and I'm sure that the winning teams have taken this into account.

      --
      -- If you try to fail and succeed, which have you done? - Uli's moose
    4. Re:i was joking, however by vuffi_raa · · Score: 1

      it isn't bad movies that are the problem, taste in bad movies can still be uniform

      also there are people like my wife and I who share the netflix since we watch most of the movies together and have different tastes in movies: for one I HATE '30s and '40s dramas and she loves them so she will get them sometimes and I love B grade sci fi and horror movies and she isn't the biggest fan of them

    5. Re:i was joking, however by Pollardito · · Score: 3, Interesting

      Perhaps given enough negative ratings it could be possible, but even though I've clicked 'Not Interested' on all the Disney movies, they keep suggesting I want their latest direct-to-DVD crapfest, so I'm left to assume they're rating mostly based on positive ratings.

      A co-worker gets almost no recommendations at all from Netflix, and customer service told him that they generate recommendations based on ratings of 4 or 5 (though you'd think that the recommendations that they do generate would have to filter through similar movies that you've rated at 0). He was told to rate the movies that he likes higher in order to fix it, but that's never really accomplished anything as he has several hundred movies in the 4-to-5 range and maybe a dozen recommendations total.

      I'm pretty sure that the Disney/children's movie recommendation flood that most everyone seems to be getting is driven by parents who don't actually love those movies, but are rating those movies on behalf of their children. That causes a weird connection to movies that they themselves enjoy, and it makes it seem like the same audience is enjoying both types of movie. They need to have an "I'm a parent" flag somewhere to help them sort that out

    6. Re:i was joking, however by bill_mcgonigle · · Score: 1

      Useful, thanks.

      > They need to have an "I'm a parent" flag somewhere to help them sort that out

      I'd love to have family member tags in the queue. Associating an age with them would be fine. They have a thing where you can have separate queues for different family members, but it's a separate login IIRC, and a real pain to manage cohesively. I'd much rather just have one queue, and have it send discs in a round-robin for thing1, thing2, and the folks.

      --
      My God, it's Full of Source!
      OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
  13. Re:they were able to get the extra 0.5% over the t by interkin3tic · · Score: 2, Funny

    by simply ignoring data from anyone who ever rented SuperBabies: Baby Geniuses 2, Gigli, From Justin to Kelly, Disaster Movie, any movie by Uwe Boll and any movie starring Paris Hilton

    Hey, I (along with the rest of my frat, our school hockey team, and most of the town) was in a movie starring Paris Hilton, you insensitive clod!

  14. Re:they were able to get the extra 0.5% over the t by Hurricane78 · · Score: 1

    You can generalize that to any movie with an action hero, and a baby. Any movie with a pseudo-star (Hilton, Spears, Madonna, etc.). And any movie with Uwe Boll or similar people.

    On a more serious note: I think the best way to improve recommendations, is to first relate the IMDB rating to the IQ of the rater. I found that more intelligent people do not like movies with a simple plot, because it bores them, and less intelligent people do not like movies with a complex, subtle plot, because they don't get it. You can further separate this into the EQ and the IQ, which will considerably improve the experience for emotional people (like most women).
    I found that the best way to quickly and realistically do that, is to check the length of the sentences and the percentage of the questions in the comments of that user.

    This will give you a 3D space with a rating, a EQ and a IQ axis.
    Now apply the values of the user that wants to get recommendations as coordinates, and order by the distance from that point.
    Then apply the traditional recommendation system with a blending factor.

    I bet I could code that in less than an hour in Haskell.

    P.S.: If you want to patent this, mind you that I will seriously kick your ass for doing so. I do not know any laws when it comes to giving credit. You can probably sue me afterwards, but it won't fix what I did to you. ^^ (Be fair to me, and I will be the nicest man you ever met.)

    --
    Any sufficiently advanced intelligence is indistinguishable from stupidity.
  15. I've been doing behavioral data analysis... by Anonymous Coward · · Score: 0

    this is my career, doing behavioral data analysis.

    I was really excited when they announced the project, and began tinkering with it. Unfortunately, they stripped so much data out of the exercise that it became an academic statistical exercise, rather than an insightful behavioral modeling exercise. It rewarded an approach where the training population was continually wildly segmented, with different model parameters on each segment.

    I wish they had opened up more data, there could have been lots of cool stuff in there. I'm not saying it was bad or against it, just not *my* cup of tea. I rather prefer statistically simple models driven off unique customer segments.

  16. real world by TheSHAD0W · · Score: 1

    So... What does this mean in real-world analysis? What does the score represent? Since the score shown seems to be smaller-is-better, does this mean that 85+% of the movies recommended won't be attractive to the target, and less than 15% would be found interesting?

    That doesn't seem very accurate...

    1. Re:real world by Anonymous Coward · · Score: 0

      Roughly speaking, it means that, on average, the recommender will guess your actual rating for a movie to within ~0.85 points (on a five-point scale). Granted, that doesn't sound much better than being within ~0.95 points (which is what they had). Supposedly, though, it makes a big difference in the way the entire movie list would be sorted for you in order of "most likely to like".

    2. Re:real world by Wildclaw · · Score: 2, Interesting

      ~0.85 points (on a five-point scale)

      Actually the scale is not 0-1-2-3-4 but 0-1-4-9-16 as they use Root-Mean-Square. Just thought it was worth pointing out.

  17. Re:they were able to get the extra 0.5% over the t by Shikaku · · Score: 1

    Hey, I (along with the rest of my frat, our school hockey team, and most of the town) was in Paris Hilton, you insensitive clod!

    Fixed

  18. Re:they were able to get the extra 0.5% over the t by interkin3tic · · Score: 1

    Kind of what I was implying...

  19. Re:they were able to get the extra 0.5% over the t by Anonymous Coward · · Score: 1, Insightful

    You're being elitist and silly. Perhaps you could do some research, but I have seen no evidence that interest in simple vs. complex plots has anything to do with intelligence. Certainly the type of plot one likes in a movie is something reasonable to consider. But assuming a relationship to IQ or EQ from that is silly.

  20. I'm confused... by Anonymous Coward · · Score: 0

    Recent results show a 10.05% improvement

    How many library of congresses is that?

  21. Re:they were able to get the extra 0.5% over the t by thunderclap · · Score: 1

    by simply ignoring data from anyone who ever rented SuperBabies: Baby Geniuses 2, Gigli, From Justin to Kelly, Disaster Movie, any movie by Uwe Boll and any movie starring Paris Hilton

    suddenly, everything made sense

    Ok, From Justin to Kelly wasn't really that bad. Now, Ishtar...and Battlefield Earth; those were baaaad.

  22. Film recommendations by michuk · · Score: 3, Interesting

    Does anyone find Netflix recommendations any good anyway? I used http://criticker.com/ for quite a while and was very happy about the recommended stuff. Recently switched to http://filmaster.com/ (which is a free service) and it's equally good, even though both probably use a pretty simple algorithm compared to Nextflix.

    --
    Polish your GNU/Linux! http://polishlinux.org
    1. Re:Film recommendations by jfengel · · Score: 2, Interesting

      Reasonably good, actually. I often add 4 star movies to my queue, and rarely regret it.

      The problem is the bell curve. There aren't a lot of 5 star movies out there, and I've seen them. There are a lot of 3 star films, but my life is short and I don't want to spend a lot of time on movies I merely "like".

      In fact, it's not really a bell curve. I rarely provide 1-star or 2-star ratings simply because it's not at all difficult for me to identify a film I'm going to truly hate. I don't have to waste two hours of my life to find out whether I'd merely dislike the new Transformers movie or whether it will fill my soul with disgust.

      The left side of the curve is actually quite fat with movies that simply won't interest me at all. The existing algorithm is actually fairly good at telling me I won't like them. The hard part is picking out the very few movies that ARE worth my time.

      They do show both the average and expected rating for each film. What I'd really like to see is a list sorted by the difference: where do I stand out from the crowd? Such movies are likely to have extra appeal.

      So the 10% difference isn't completely worthless, but the real problem is that they're pursuing the wrong goal. There's a lot of information they're dropping on the floor.

    2. Re:Film recommendations by michuk · · Score: 1

      To me it's essential to read a few reviews (even very short ones like "too sentimental" or "powerful and original") by people I know to have a similar movie taste to me. That's why I like the two websites I mentioned above -- they exactly tell you whose opinions you should care for and whose you can safely ignore. On the other hand, http://jinni.com/ and http://clerkdogs.com/ have some pretty cool ideas as well - they suggest you movies based on your mood and your taste. Still in beta though, so it does not always work as expected, but worth trying anyways.

      --
      Polish your GNU/Linux! http://polishlinux.org
    3. Re:Film recommendations by jfengel · · Score: 1

      The key advantage Netflix has over other services is that it's right there. They know what you watch and you don't have to go searching.

      Of course you still have to go though the back catalog and talk about all the things you've already seen, which runs in the hundreds for somebody who likes movies. That pain is essentially the same with any service.

      But going forward, Netflix can present you the opportunity to rate a movie more easily. It's a small user-interface thing, but significant.

    4. Re:Film recommendations by coaxial · · Score: 3, Informative

      I believe that Netflix is still using Cinematch. You could look into movielens. It's from the GroupLens group at U Minn.

      [E]ven though both probably use a pretty simple algorithm compared to Nextflix.

      You do know that Netflix said on the outset "You're competing with 15 years of really smart people banging away at the problem." and it was beat in less than week.

      That's not to meant as a knock against Netflix's engineers, but more about that they didn't really build a state of the art recommender system. Simple SVD (aka latent semantic indexing outperfomed them as well.) They did something a bit more than straight up kNN clustering, but that was pretty much it.