Slashdot Mirror


Build a Better Netflix, Win a Million Dollars?

An anonymous reader writes "In a quest to better movie recommendations, Netflix is opening their database (nytimes, registration and first child required) to users to try to craft a better recommendation technology. The problem is not easy. Says one researcher: 'You're competing with 15 years of really smart people banging away at the problem.'" Recommender systems are really an interesting problem, and that is likely very interesting data to play with.

197 comments

  1. database? by Franio · · Score: 0

    "Nothing for you to see here. Please move along."

    Have they now?

    1. Re:database? by NoTheory · · Score: 1

      I saw this last night in the wee hours of the morning (EST), and still can not find a link to the contest on the netflix site for the life of me.

      Anyone else have better luck?

      --
      There are lives at stake here!
    2. Re:database? by thatnerdguy · · Score: 4, Informative
      --
      I saw the Sign, and it opened up my eyes
    3. Re:database? by curtisk · · Score: 1

      Not even a mention in their press releases section....hmmmmmm?

      --

      Sehr geehrter Toilettenbenutzer!

    4. Re:database? by mkosma · · Score: 1
      http://www.netflixprize.com/

      I submitted this to /. with a link, but apparently wasn't the first.

    5. Re:database? by Anonymous Coward · · Score: 1, Informative

      No-registration-required article:
      http://www.foxnews.com/story/0,2933,217021,00.html

      Posting as AC because:
      Linking to foxnews equals automatic Troll/Flamebait/Offtopic.

  2. Seems like a free gift for Netflix to me... by garcia · · Score: 2, Insightful

    If no one wins within a year, Netflix will award $50,000 to whoever makes the most progress above a 1 percent improvement, and will award the same amount each year until someone wins the grand prize.

    But if someone does win within a year they will still have the ability to use others' code, free of charge, as part of their product.

    The article doesn't say but how will you know if your code is making choices better than their existing system? I wouldn't be submitting my code unless I was sure I was going to win. Then again I'm not a gambler or a coder ;)

    1. Re:Seems like a free gift for Netflix to me... by wc_paladin · · Score: 1

      The contest site has a rules page that tells what accuracy you have to beat. You have to train your system with the dataset they provide and then test against what those people actually liked for the year after the dataset ends.

      I could not find anything about whether you get to keep the rights to your code if you are not a winner.

    2. Re:Seems like a free gift for Netflix to me... by buswolley · · Score: 1
      How about this: Instead of using a one dimensional rating system you ask them to rank on several aspects of the movie: such as:

      Would you rent this movie again? 1 2 3 4 5

      Would you recommend this to a friend? 1 2 3 4 5

      Then choose the choices of why you liked or disliked it by of 5-6 choices, such as: Acting, plot, cinematography, moral values, hotness

      --

      A Good Troll is better than a Bad Human.

    3. Re:Seems like a free gift for Netflix to me... by Sparr0 · · Score: 2, Insightful

      Because each option you add cuts the number of responses in half. A vast majority of users use the one-rating system. Almost no one would fill out a 20-question survey about every movie they watch.

    4. Re:Seems like a free gift for Netflix to me... by curunir · · Score: 2, Informative

      From the rules, it looks like your submission isn't code, it's a processed dataset. It's only in the terms for winning are that you have to explain your method to them (so that they don't get bitten by a horribly obfuscated entry) and have to non-exclusively license your submission to Netflix (it looks like you retain copyright and can license it to others if you so choose).

      But that seems pretty reasonable...you only have to hand over your code if you win, otherwise you're only submitting the results of your program.

      --
      "Don't blame me, I voted for Kodos!"
    5. Re:Seems like a free gift for Netflix to me... by stonecypher · · Score: 2, Informative

      If you read NetFlix' prize site, you'll find that they give clear cut statistical requirements for winning that are well defined. It's actually quite impressive the detail into which they go; it's clear that they want real engineers on this, and that they're willing to get seriously specific in order to make sure people know what's what.

      --
      StoneCypher is Full of BS
    6. Re:Seems like a free gift for Netflix to me... by buswolley · · Score: 1

      Granted. So lets say you have a particular set of questions to ask.. Say about 5 different question types. Well just ask them one question per movie, randomly chosen.

      --

      A Good Troll is better than a Bad Human.

    7. Re:Seems like a free gift for Netflix to me... by dustman · · Score: 1

      The article doesn't say but how will you know if your code is making choices better than their existing system? I wouldn't be submitting my code unless I was sure I was going to win. Then again I'm not a gambler or a coder ;)

      I laughed a bit (at myself?) after reading this comment. I *am* a gambler (poker) and a coder, and I've already downloaded the dataset and started work!

  3. Challenge Accepted by bigattichouse · · Score: 1

    I officially announce I will be entering BigAtticHouse's Vectorspace Database into the melee. At least to see what might come of it.

    --
    meh
  4. So, we can then conclude by jimstapleton · · Score: 3, Funny
    Says one researcher: "You're competing with 15 years of really smart people banging away at the problem."


    So, the professionals have been working at it for a long time. Is it safe to assume some teenage to early college hacker will find a success within two weeks.
    --
    34486853790
    Connection too slow for X forwarding? Try "ssh -CX user@host"
    1. Re:So, we can then conclude by no_pets · · Score: 1

      Who knows? But one thing is for sure, the teen or college hacker won't be worried about continued employment.

      --
      "A government is a body of people, usually notably ungoverned." - Shepard Book Quoting Malcolm Reynolds
    2. Re:So, we can then conclude by bobcat7677 · · Score: 1

      I am neither teenage or a hacker...but I still find the "challenge" somewhat unchallenging.

      Simply filter their existing result set to exclude titles that are in a genre that the user has NEVER rented anything from and that would be a huge improvement!

  5. Simple by Anonymous Coward · · Score: 5, Funny

    if(user.getGender()==Person.MALE)
    recomendation=MovieGenre.PORN;
    else
    recomendation=MovieGenre.CHICKFLICK;

    And of course, slashdot must have sensed my post as my image word is "pervert"

    1. Re:Simple by Anonymous Coward · · Score: 0

      And of course, slashdot must have sensed my post as my image word is "pervert"

      That, or maybe it sensed you are a former Florida Congressman.

    2. Re:Simple by kelzer · · Score: 4, Funny

      Old Version:

      if(user.getGender()==Person.MALE)
      recomendation=MovieGenre.PORN;
      else
      recomendation=MovieGenre.CHICKFLICK;

      New Version, sure to win the million bucks:

      if(user.getGender()==Person.MALE && user.getOrientation()==Person.STRAIGHT)
      recomendation=MovieGenre.PORN;
      else
      recomendation=MovieGenre.CHICKFLICK;

      --

      ---------------------------------------------
      SERENITY NOW!!!!!!!!!!!!!!!!
    3. Re:Simple by gambit3 · · Score: 1

      ahh.. but what if the user is gay?

      Gay porn? or chick flick?

      --
      Go where Web Thinkers gather

    4. Re:Simple by karnal · · Score: 1

      Hey, even gay people like porn. Have a friend who has more porn than me - and he's definitely not straight.

      --
      Karnal
    5. Re:Simple by Anonymous Coward · · Score: 1, Funny

      Hey, I am a straight man, it's just that my boyfriend is gay?

    6. Re:Simple by antifoidulus · · Score: 1

      Wait, is there a difference?

    7. Re:Simple by Plutonite · · Score: 1

      Yes but they like chick flix, we don't!

      ++Insightful!!

  6. Bah by Anonymous Coward · · Score: 0

    A lot of ppl is going to waste mucho time on this one. I recommend watching the movie Syriana instead.

  7. I had a thought like this a while back... by AceCaseOR · · Score: 4, Interesting

    ..except, instead of making it open to the community (which is not a bad idea, I must say) I thought of having Google do it. This is, perhaps, IMHO, a much better idea. Now, what we really need is a Movie Genome Project, much like the Music Genome project that lead to Pandora.

    --
    Zagreus sits inside your head, Zagreus lives among the dead, Zagreus sees you in your bed and eats you in your sleep.
    1. Re:I had a thought like this a while back... by Boone^ · · Score: 1

      Pandora's recommendations are really spot on. I rarely have to give one the thumbs down.

    2. Re:I had a thought like this a while back... by jestill · · Score: 1

      I am listening to Pandora now. I would love to see a movie genome project get underway. It is quite possible to build one as an open source community collaboration project. I am surprised that no one has done this yet.

      --
      "Asleep at the switch? I wasn't asleep, I was drunk!" -- Homer
  8. go see porn sites by LiquidCoooled · · Score: 3, Interesting

    They have decent tech for building similar/recommended alternative pages.
    Especially the newer blogish type pages where theres a gallery and a small selection underneath.

    Not that I would know of course.

    --
    liqbase :: faster than paper
    1. Re:go see porn sites by MustardMan · · Score: 1

      What kind of horrible person are you, to make a statement like this and not link to an example of the tech in action... you know, for illustrative purposes.

  9. Suggestion by 99BottlesOfBeerInMyF · · Score: 5, Insightful

    As a NetFlix user I have one suggestion for their recommendation system that can make it much better. Make it aware of the connection between series. That is to say, If you rent season 1 of something, suggest season 2, not season 4 (even if season 4 has better review ratings). If I mark season 1 of something as "not interested" instead of giving it a user rating, don't suggest every other season of that same show at the top of my recommendations. I mean how many times do I have to tell you I don't want to see any season of "Friends" ever, even if you pay me?

    1. Re:Suggestion by Xentor · · Score: 1

      Just a nitpick... If I mark, say, season 1 of series X as Not Interested, maybe it means I already own it and have no need to rent it, but still might want to see season 2.

      Of course, if I marked it as 1-star (Which I assume means "Utter crap"), then as you said, it should shut the hell up about the rest of the series.

      --
      "The amount of intelligence on this planet is a constant. The population is growing." -Cole's Axiom
    2. Re:Suggestion by 99BottlesOfBeerInMyF · · Score: 1

      Just a nitpick... If I mark, say, season 1 of series X as Not Interested, maybe it means I already own it and have no need to rent it, but still might want to see season 2. Of course, if I marked it as 1-star (Which I assume means "Utter crap"), then as you said, it should shut the hell up about the rest of the series.

      I disagree. If you have it, you presumably have watched it and should give it a rating. You do have interest in it, or you would not have bought it. So things you mark as 1 star should probably reduce the chances that you want to see later seasons, but marking it is not interested should reduce it even further.

    3. Re:Suggestion by 955301 · · Score: 1, Funny

      I didn't like Star Wars:Episode I very much. Episode 4 was great though.

      --
      You are checking your backups, aren't you?
    4. Re:Suggestion by Xentor · · Score: 3, Insightful

      Hmm, I see your point.

      I was about to mention that I mark things as Not Interested when I own them, to avoid being reccommended the rest (Usually because I prefer to buy series I like, and rent actual movies), but then I realized that fits into what you said perfectly.

      Point conceded.

      --
      "The amount of intelligence on this planet is a constant. The population is growing." -Cole's Axiom
    5. Re:Suggestion by rthille · · Score: 1

      Right, the way I think of it is that I had to be interested enough to watch the movie to see that it was a 1-star. 'Not interested' means that just based on what I heard about it, I knew I didn't want to see it. I could be a fantastically done movie, say 'Remains of the Day' or something, but it's not something I'm interested in. It's more indicitative of the kind of movies I'm interested in than a 1-star rating. After all, it could be that the premise/plot-line of movie that I wanted to see, but the writing/direction/acting sucked. That's less likely to match up between two movies than premise/plot-line.

      --
      Awesome furniture, accessories and cabinetry in Santa Rosa, CA: http://humanity-home.com/
    6. Re:Suggestion by Anonymous Coward · · Score: 0

      I would be happy if they didn't ask me to rate movies that I returned as physically defective without asking for it to be sent again

      "I never saw the movie...you want me to rate it?"

    7. Re:Suggestion by 99BottlesOfBeerInMyF · · Score: 1

      I didn't like Star Wars:Episode I very much. Episode 4 was great though.

      Right, so you might mark episode I, (technically number 4 by release order and prequels generally suck so I think this should be the ordering mechanism) as 2 stars or even 1. You wouldn't mark it as not interested, since from your comment you were interested enough to watch it. If, however, you were so disinterested in episode I so as to mark it as not interested (meaning you did not watch it and don't ever want to) then the chances are you probably would not be very interested in seeing episode 4 either.

    8. Re:Suggestion by gEvil+(beta) · · Score: 1

      Well, their records indicate that you rented it twice, so you clearly enjoyed it a lot. : p

      --
      This guy's the limit!
    9. Re:Suggestion by truthsearch · · Score: 4, Funny

      Point conceded.

      For the record, this is a turning point in slashdot history. I'll forever remember where I was when I first saw those words in a slashdot comment. (Which of course is at work, sitting through a boring meeting.)

    10. Re:Suggestion by LordNimon · · Score: 1

      Then the real problem is that "not interested" means two completely different things. Like you said, one way around that is to check whether the TV series was voted on, but I have a different angle. If I say I'm not interested, but I rated it, that means that I've seen it. If I haven't rated it at all, it means I'm really not interested in it.

      --
      And the men who hold high places must be the ones who start
      To mold a new reality... closer to the heart
    11. Re:Suggestion by borgboy · · Score: 1

      I for one welcome our civilized /.-posting Overlords.

      --
      meh.
    12. Re:Suggestion by mmalove · · Score: 1

      Funny, Star Wars was the first thing to pop in my head too.

      Damn prequels.

      I think this is an awesome idea for development - let the whole world offer solutions and spend only a million dollars. Most companies would spend that much just on a consultant team to tell them their current system sucks.

      --
      You can get 15 minutes of fame, but you can go down in history for infamy.
    13. Re:Suggestion by blugu64 · · Score: 1

      Kinda reminds me of that moment in 2001 when the ape threw the bone into the sky....

      --
      "Personal ownership is a hallmark of conservative capitalism. And I don't believe I am entitled to anything that I did n
    14. Re:Suggestion by tourvil · · Score: 1

      I wish Netflix would group an entire TV series as a recommendation instead of each individual season. I rented a couple DVDs of Dr. Who and marked them as 5 stars. Now my recommendation page is flooded with Dr. Who. I've already seen some of the series, so I already know I'll probably like the other seasons. I don't need Netflix to tell me that because it's obvious. I'd rather the recommendation system suggest DVDs that I might not have ever heard of or haven't thought about.

    15. Re:Suggestion by patrixmyth · · Score: 1

      There is a very simple question that has to be asked at the outset of building a system like this. Is the point to find programs that the audience will LIKE or WANT TO SEE? You may not have liked SW Ep. 1, but I would very much suspect that you wanted to see it very badly. (And see it very badly, you did, after all.) I think that this is possibly a fundamental problem with these algorithms. Most of the movies I've really loved, have been movies I didn't particularly care to see. Little Miss Sunshine being the most recently relevant example. Most recommendation systems do a decent job of finding things I want to see, excluding the fact that they often find things I've seen already, of course. That's what Netflix has in mind, I'm sure. Call me when someone comes up with a system that can do a decent job of finding the movies I wouldn't pick, but will love. I suspect that it would require a personality profile, EKG monitoring and selective application of hallucenigens, but maybe someone will surprise me.

      --
      "Don't you know you're going to shock the monkey?"- Peter Gabriel
    16. Re:Suggestion by mbrod · · Score: 1

      That would be my biggest complaint as well. Allow users to state they own it, and they don't want to be offered it. Even though they may want to rank it that they really liked it.

    17. Re:Suggestion by twistedsymphony · · Score: 1

      Naming conventions can be screwy... considering they might name it "One" "Two" Three" "Four" going by alphanumeric order wouldn't be very helpful when determining the next one in a series, instead you'd be better off going by the release date to determine the order, then prequels will appear in the proper place.

      Besides any logical cataloging system would mark the SW prequels as a completely different series then the original.

      Personally I think a good place to start is by director/writer... So if I liked Fight Club then maybe it could suggest Panic Room and The Game. Similarly if I liked Dogma then maybe it would suggest Clerks and Jersey Girl. This is usually how I find new stuff for myself, I've bought many DVDs without knowing anything about the movie simply because I like some of the other things that director has done 9 times out of 10 I am more then happy with my purchase. Most suggestion systems will find movies with similar genres/plots and that's useless IMO because that genre/plot will get tired after a while. Most directors will try to keep their movies fresh for the sake of their own boredom and I think most people are attracted to the style and soul of a film more so then the genre and plot type.

      Of course I think the best thing to do would be to take a look at how the current system works and follow through looking at where it goes wrong and why.

    18. Re:Suggestion by finkployd · · Score: 1

      On behalf of civilized /.-posting Overlords, I'd just like to say "screw you". Why? Because one thing we love is a paradox.

    19. Re:Suggestion by Wandering+Idiot · · Score: 1

      Amazon gets around this very simply, by having two options, one for "I own it", one for "not interested". Netflix should do the same.
      And I heartily agree with knocking the entire series out of recommendations for one genuine "not interested". I ordered the first few Battle Angel Alita mangas a while and then lost interest in it, and now my Amazon recommendations are filled with the entire 20-something volume series.

    20. Re:Suggestion by LoverOfJoy · · Score: 1

      I can see where you're coming from...but what if you weren't aware that season X of series Y has already been released on DVD? It'd be kinda nice to be notified when the latest one became available.

    21. Re:Suggestion by Autonomous+Crowhard · · Score: 1

      And consider various versions of movies to be the same movie. If I'm not interested in E.T., I'm not going to be interested in any of the quinannual release, the special edition, the extra special edition, the extreme edition, or the super secret special extreme mega edition which actually has guns.

    22. Re:Suggestion by 955301 · · Score: 1

      but who says I watched it using netflix? Why can't I watch it in all it's crapiness in a theatre while out of town visiting friends, then come home and mark it as not interested, but still see episode 4?

      The point being, Murphy's Law: as soon as you start making arbitrary constraints on a system scenarios come along which invalide the constraints.

      --
      You are checking your backups, aren't you?
    23. Re:Suggestion by swb · · Score: 1

      I think this is annoying, too, although some of the super-enhanced-ultra-editions have enough added footage or other changes to be meaningfully different. A 4:3 Apocalypse Now is different than the full director's cut version in anamorphic widescreen.

      They should add a preference setting that says something like "Rate all versions of a title the same" so that they auto-assign whatever rating you give to a movie to every version of it.

      It would also help if there was a way to have more complex ratings -- I may actually like some movie X, but hate the DVD production values (bad transfer, mono audio, etc).

  10. Privacy issues? by Vultan · · Score: 3, Interesting

    How will they handle privacy issues? Don't the same issues appear here that appeared with the AOL data this summer? With enough ratings you can narrow down to a specific person, and then find out about all the pr0n that this person has been getting as well.

    1. Re:Privacy issues? by Violet+Null · · Score: 1

      The AOL search ratings were different because the searches could include things like cities, proper names, phone numbers, and other such pieces of identifiable information. The movie ratings have none of that. You might be able to dig through the list and find the person who rated "Goat donkey pr0n" highly and laugh at them, but there's no information there that'll tell you who it was.

    2. Re:Privacy issues? by Anonymous Coward · · Score: 0

      No porn from netflix. I'm wondering about your concern too though. Even though they might not publish the indentifyable information AOL has, it still makes me wonder if rating movies is such a good idea in light of this.

    3. Re:Privacy issues? by Cruise_WD · · Score: 3, Informative

      From http://www.netflixprize.com/ :

      To prevent certain inferences being drawn about the Netflix customer base, some of the rating data for some customers in the training and qualifying sets have been deliberately perturbed in one or more of the following ways: deleting ratings; inserting alternative ratings and dates; and modifying rating dates.

      Plus all the usual replacing of IDs and such you'd expect. Looks like they're trying to avoid a repeat of the AOL debacle at least.

      --
      [ cruise / casual-tempest.net / xenogamous.com / transference.org / quantam sufficit ]
    4. Re:Privacy issues? by Shihar · · Score: 2, Insightful

      The AOL search was an issue because you could look at search requests for places and figure out where someone was very quickly. If I use Google to plot the rout to the nearest IKEA or porn store, it is a pretty simple matter to trace back who someone is. Short of some serious stupidity, I couldn't imagine Netflix giving away any valuable information in identity theft. A list of movies is highly unlikely to lead to anyone's address or identity.

    5. Re:Privacy issues? by bahbar · · Score: 1

      From the netflix rules:
      "To protect customer privacy, all personal information identifying individual customers has been removed and all customer ids have been replaced by randomly-assigned ids. The date of each rating and the title and year of release for each movie are provided. No other customer or movie information is provided."
      Sounds familiar...

      And later:
      "To prevent certain inferences being drawn about the Netflix customer base, some of the rating data for some customers in the training and qualifying sets have been deliberately perturbed in one or more of the following ways: deleting ratings; inserting alternative ratings and dates; and modifying rating dates. However, the Cinematch RMSE measured on the final, perturbed dataset does not differ significantly from the RMSE measured on the unperturbed dataset for the purposes of Grand or Progress Prize qualification described below. The RMSE values reported below represent the RMSE measured on both the perturbed and unperturbed datasets to the precision specified above."

      Not sure how much that actually protects end users, but they tried a bit more than AOL.

  11. RSSTimes by eldavojohn · · Score: 4, Insightful
    In a quest to better movie recommendations, Netflix is opening their database (nytimes, registration and first child required)...
    Not quite, you can find it here (or the minimalist version for anyone sick of ads).

    Why is it that the Slashdot editors are just too damn lazy to look up the RSS feed links to these pages?

    The problem is not easy. Says one researcher: "You're competing with 15 years of really smart people banging away at the problem."
    While this may be true, I wouldn't let it deter you. Collaborative filtering is a field that is far from dead. The interesting thing about collaborative filtering is that on the surface, it seems pretty straight forward but once you dig into the mechanics of it, there is actually a lot of playing you can do. Ironically, the way you display the data to the end user is often what determines how well of a job you did.

    Allow me to take a naïve approach at this topic and say we generate a movie index of each person. I would have A Clockwork Orange and Koyaanisqatsi at 5 while The Ring 2 would be at the very low end. My friend might have similar movies. If he has A Clockwork Orange up there, you might be able to compute a Euclidean distance between us. However, this approach falls apart because no one has seen Koyaanisqatsi and of the 20 movies I've ranked highly, they are hard to find.

    You don't have to stop there, however. You could also database the movies I marked as "uninterested" or the movies that were presented to me but I didn't vote on. Like if I had seen the offer to mark J-Lo's latest flop but didn't, wouldn't that tell you something about me?

    So these caveats present themselves all along the way and, at the end computation, you have many different strategies for this data. For example, while you might not be able to link my friend an I through movies, how far apart are we on a nod network? What I mean is, if you plotted every user in their own dimension depending on the movies they ranked and attempted to compute as good a distance as possible between all users, how far would I be away from my friend by hopping on these nodes? There's a lot of information to be gleaned in this sort of friend-of-a-friend collaborative approach.

    Now you need to present this information to the user. Do you just up and recommend him a movie? Do you take Amazon's approach and say "Other people did this -- so should you."? Or do you give them some sort of three dimensional flash plotting of you versus the people nearest to you? Do you allow the user to contact those closest to them? Those farthest away?

    My point is that while 15 years of research has been done, it doesn't mean there's been 15 years of testing and implementation which, in the end of creating products, is where most of the importance lies.
    --
    My work here is dung.
    1. Re:RSSTimes by Scutter · · Score: 1

      However, this approach falls apart because no one has seen Koyaanisqatsi

      Well...almost no one. ;)

      --

      "Tell me doctor, with all of your defenses, are there any provisions for an attack by killer bees?"
    2. Re:RSSTimes by cei · · Score: 2, Informative

      Correction: No one has stayed awake through Koyannisqatsi.

      (FWIW, Powaqqatsi was a better flick, IMHO)

      --
      This sig intentionally left justified.
    3. Re:RSSTimes by Scutter · · Score: 1

      Correction: No one has stayed awake through Koyannisqatsi.

      (FWIW, Powaqqatsi was a better flick, IMHO)


      See, I liked the first one better, and I didn't have any trouble staying awake. I like Philip Glass and I like Francis Ford Coppola, though.

      --

      "Tell me doctor, with all of your defenses, are there any provisions for an attack by killer bees?"
  12. I know...credit reports! by xxxJonBoyxxx · · Score: 1

    Link everyone's credit report into their movie preferences; I'll bet your complete credit history would give them a 5-10% better chance of picking your movies. But seriously...why isn't this just a regression exercise?

    1. Re:I know...credit reports! by Anonymous Coward · · Score: 1, Informative

      Because the problem is not always linearly separable. This is not to say that a linear classifier wouldn't do a decent job (given appropriate slack variables or underlying probability distribution), but to do a really good job - one where you wouldn't be laughing at the recommendations - rule-based techniques (such as association rules, RIPPER, etc) seem to do better. They aren't perfect, of course, and that's why it's an open problem.

  13. You could win smaller amounts too by gambit3 · · Score: 0, Redundant

    According to the article, if no one wins within a year, Netflix will award $50,000 to whoever makes the most progress above a 1 percent improvement, every year, until someone wins the grand prize.

    --
    Go where the Web Thinkers gather

  14. Easy by Anonymous Coward · · Score: 0

    Constructing a perfect recommendation system is easy.

    I have discovered a truly marvelous demonstration of this proposition that this post is too narrow to contain.

  15. Copy the Music Genome Project by Zaphod-AVA · · Score: 5, Interesting

    The problem with recommendation systems is that they use too little information to catagorize their subject.

    What they need to do is copy the methods of the Music Genome Project (www.pandora.com), and list a larger set of attributes for the films. This way it can recommend films by checking many more characteristics, such as director, tone, writer, or subject.

    1. Re:Copy the Music Genome Project by vontrotsky · · Score: 4, Informative

      The problem with recommendation systems is that they use too little information to catagorize their subject.

      What they need to do is copy the methods of the Music Genome Project (www.pandora.com), and list a larger set of attributes for the films. This way it can recommend films by checking many more characteristics, such as director, tone, writer, or subject.


      In this contest, you run your own code and submit the results to NetFlix to be scored. This means that you can use any other data (e.g. A Movie Genome projct) you can compile to enhance your rankings. Netflix apparently specifically designed the contest to allow this.

    2. Re:Copy the Music Genome Project by Anonymous Coward · · Score: 0

      Just be careful about Planet of the Apes recommendations ...

      http://yro.slashdot.org/article.pl?sid=06/01/07/09 42205

    3. Re:Copy the Music Genome Project by philipmather · · Score: 0

      The other factor is quality of categorisation, SHAMELESS PLUG ALERT AHEAD, I work for a company that's just finished witting a search engine for the British Film Institute (http://ftvdb.bfi.org.uk/searches.php) and I've had the opportunity of trawling through their reasonably extensive database. It's not just the quantity of data that makes this database interesting it's also the quality, their looking at implementing a feedback system to accept corrections and additions (only a very simple system has been implemented to start with) and they've taken the wise decision to to build researcher review into the plans from the start. The BFI has taken on board the lessons that others have had to learn in the past.

      --
      Regards, Phil
    4. Re:Copy the Music Genome Project by R2.0 · · Score: 1

      "list a larger set of attributes for the films."

      Why larger? Only a few matter.

      Tits?
      Guns?
      Cars?
      Explosives?

      Each one counts as 25%.

      Can I collect my $1M now?

      --
      "As God is my witness, I thought turkeys could fly." A. Carlson
  16. Who will win the $$$$??? by pfz · · Score: 1

    Recommend Death to Smoochy and Date Movie, please! I hope Richard Stallman enters the contest and wins. I bet he would make good use of the one million dollar prize.

    ALTERNATIVE FREEDOM
    A documentary about the invisible war on culture.
    Features RMS, Danger Mouse (of Gnarls Barkley and the Grey Album), Lawrence Lessig, and more...

    http://alternativefreedom.org/

    1. Re:Who will win the $$$$??? by fotbr · · Score: 1

      Yeah, another 1,000,000 dollars worth of Stallman doing damage to open source adoption by making a fool out of himself and insisting everyone follow his ideology.

      Thanks, but I'm hoping the winner is a couple of smart college guys or girls, and not Stallman

    2. Re:Who will win the $$$$??? by pfz · · Score: 1

      True, it would be nice to see a youngster win the grand prize.

      Stallman is no fool. He cares not about "open source adoption" -- and why should he?

  17. only a million? by StandardDeviant · · Score: 2, Interesting

    If you can beat "15 years of really smart people", then your work product probably has more than a million dollars in value if you were to license it out to places like Amazon, eBay, Netflix, etc. Even a 1% improvement in revenues from a 1% improvement in recommendation accuracy is probably worth more than 50K, if sold to the major e-tailers. On the other hand, if you just want an interesting problem to screw around with in your spare time and don't want to go through the bother of forming a company in order to monetize that work, this is a pretty cool opportunity.

    1. Re:only a million? by illegalcortex · · Score: 2, Insightful

      To win and take home either prize, your qualifying submissions must have the largest accuracy improvement verified by the Contest judges, you must share your method with (and non-exclusively license it to) Netflix, and you must describe to the world how you did it and why it works.

      So, you could take the money from Netflix, use it to start your business, then license it to the other players, too.

    2. Re:only a million? by That's+Unpossible! · · Score: 1

      That's probably why they don't require you to exclusively license it to them. You are free to sell your ideas to others, in fact they encourage it. Read the FAQ.

      --
      Ironically, the word ironically is often used incorrectly.
  18. Fix the problems with what they send me first by Jimmy+King · · Score: 5, Interesting

    I wish they'd fix the problems in the logic determining what they actually send me from my queue before fixing problems with what they recommend to me. If I've got season 1 of a show in my queue prior to season 2, don't start sending me season 2 because some disc of season 1 is unavailable (which has happened to me multiple with both netflix and blockbuster online), send me something else completely. They've got the tech to keep one season of a tv show in order, it can't possibly be that difficult to extend that to keeping multiple seasons of a show in order.

    On top of that, don't show me that it's available in my queue but send me something else instead. While I haven't asked netflix about this, I have asked blockbuster online, and I imagine they are both doing the same thing. The disc is "available" just not at the warehouse used to ship to me personally. Instead of basing one piece of information off of total stock and one off of local stock, base them both on the stock at the warehouse shipping to me.

    1. Re:Fix the problems with what they send me first by nine-times · · Score: 2, Funny
      They've got the tech to keep one season of a tv show in order, it can't possibly be that difficult to extend that to keeping multiple seasons of a show in order.

      I thought Netflix users just ripped the movies to their hard drive for later viewing anyway?

    2. Re:Fix the problems with what they send me first by Anonymous Coward · · Score: 0

      Simple solution.

      I put all my 'series' sets at the very bottom. I only move one to the top of the queue when I send back the prior one. With ~100 movies on my queue, that keeps #5 of a series from shipping before I have seen #4. #5 languishes around position 96 on the queue or so until I send back #4, then I pop #5 up to the top of my queue.

    3. Re:Fix the problems with what they send me first by Moofie · · Score: 1

      There is a fix for that. It's called http://greencine.com./ I find the service is a little slower than Netflix at its fastest, and I think it would get less good the farther you get from their base in San Francisco, but after getting burned by Netflix I find these guys to be a very adequate replacement.

      And you can hard-code the order in which you'd like to receive certain discs.

      --
      Why yes, I AM a rocket scientist!
    4. Re:Fix the problems with what they send me first by kaizokunami · · Score: 1

      We use Greencine as well, and we've been with them about four years now. We used to live in New York and it would take about three to four days each way for discs from Greencine; now that we're in Seattle, it's only a day or two. It was actually not a bad time delay because we'd watch things on the weekend, drop them in the mail Monday morning, and often our new discs would arrive in Saturday's mail.

      That said, it should be noted that Greencine caters more toward independant and niche films, and doesn't carry as much of the mainstream stuff as Netflix, so YMMV depending on your film tastes.

    5. Re:Fix the problems with what they send me first by Anonymous Coward · · Score: 0

      There's a way to avoid this at least on Blockbuster. I'm not signed up anymore and I don't remember exactly how to do it, but I remember that whenever I wanted a whole series/season I used an option to add them all at once, then it would only send the next disc in the series when you returned the current one, so you'd always get them in order.

      here

      though I'm not positive it works for seperate seasons, it may only work for multiple discs within a season/series.

    6. Re:Fix the problems with what they send me first by Jimmy+King · · Score: 1

      Yeah, that only works for discs within one season and most other things that come as a boxed set. For seperate seasons, which are sepearate sets, it doesn't work. I've worked on a large scale content management and distribution platform and unless they did something really stupid it should be relatively trivial for them to make it work that way, though. They should be able to just link disc 1 of season to to disc 6 of season 1 the same way disc 6 of seaon one is linked to disc 5, if nothing else. Maybe add a flag in the properties to show that while linked, they are seperate seasons, that way adding season 1 doesn't add ALL seasons and removing some disc from season 4 doesn't force you to remove or break the set for the rest of the seasons.

      The blockbuster rep's solution was "Don't add season 2 to the queue until season 1 is fully watched". Kind of defeats the purpose of having a queue, eh?

      Even within a season when queued together like that, it can still do some stupid stuff. Blockbuster never did this to me, but Netflix has. Send disc one of, say, Alias season 3 which is #1 in the queue with discs 2-6 next, all queued together as one thing. Even though the rest of the discs in the series say available, I then get all of 24 season 4 which were like numbers 18-24 or whatever in my queue. What the hell logic is going on there? It isn't nearly as bad as sending different seasons of the same show out of order as has also happened to me, but is still somewhat of an annoyance, especially when the website says the discs they aren't sendig me are available. Crap like this is caused by the second thing I mentioned, they use their overall company wide stock to show if something is available or not on the website but their local warehouse stock to determine what they actually send you. While fixing that wouldn't actually change the order things get sent, it still doesn't make any sense for it to be inaccurate, it should be a fairly simple change to use the local warehouse to show if a movie is available for you or not.

    7. Re:Fix the problems with what they send me first by Jimmy+King · · Score: 1

      I'm definitely going to check this greencine out. I'm on the east coast and not sure I have the patience for the shipping time and tend to watch most of the big budget hollywood movies plus every terrible B horror and sci-fi flick I can get my hands on, so their stock may or may not work for me, but I'll definitely check them out.

  19. About no-login links on /. by Ilgaz · · Score: 2, Insightful

    You can trick the NY Times personally but you can't do it from a front page of a widely popular commercial site.

    I think it is the reason.

    Slashdot can't send thousands of users with a fake referrer to NY Times. That link you provided is for people using RSS readers and subscribed to NY Times RSS feed.

    I think they should talk with NY Times web team to allow slashdot readers with referrer=slashdot without needing login. They can arrange it for sure, this isn't a "no name" site.

    It would be nice for NY Times for statistics too. I bet they currently have to tweak the statistics for "fake" RSS links from Slashdot.

    About "no ads" version: It would be like NY Times mentioning Slashdot and sending people to some other domain (slashdot sux? I forgot) which doesn't have Slashdot ads which makes this site work/pay for the costs. That also means hundreds of thousands users.

    I am not apologising for NY Times or trying to start a discussion about advertising, I just say my end user point of view and plain guesses.

  20. Hmm... by severdia · · Score: 1

    If they wanted users to rent more DVDs, they should stop throttling them first...

    http://www.msnbc.msn.com/id/11262292/

    Now some 12-year old will come up with a great systems and NetFlix will rape them for theor code and call it a day...

  21. Remove Artificial Supply Limitations by dduardo · · Score: 2, Insightful

    If Netflix doesn't have the movie in stock it should burn the movie on demand.

    1. Re:Remove Artificial Supply Limitations by (A)*(B)!0_- · · Score: 1
      Also, with this system, they'd only have to buy one copy of each movie! Brilliant! Think of the savings!

      Yeah, I don't see the MPAA complaining about that...

    2. Re:Remove Artificial Supply Limitations by dduardo · · Score: 1

      Netflix and the MPAA should agree on a percentage of sales instead of a flat fee per copy.

    3. Re:Remove Artificial Supply Limitations by LordKronos · · Score: 1

      That would be wonderful for the many people whose DVD players don't play burned discs.

    4. Re:Remove Artificial Supply Limitations by Anonymous Coward · · Score: 0

      That's a good idea, but the licensing would be hell.

      Would they want to pay licenses for a huge stack of discs they burned to cover a one-time fluke demand spike? They would be able to satisfy demand, but it wouldn't be cost effective. Could they negotiate a short term license to rent duplicated discs with the MPAA? Not likely. Or, they could do what they do, and make people wait a week for their movie. I'm pretty sure that if they have enough people with Cowboy Bebop in their que, they *will* buy more copies.

      Oh, and their burned on demand disc would also have to be screen printed on demand, or the home user would say, "Hey, this is a copy anyway; I think I'll copy it too."

    5. Re:Remove Artificial Supply Limitations by stonecypher · · Score: 1

      Unfortunately, under current US law, that's not up to NetFlix, but rather the company which holds the license on the movie. Currently, they do not allow any such behavior. The first content producer to allow something like that will make a small fortune.

      --
      StoneCypher is Full of BS
    6. Re:Remove Artificial Supply Limitations by lonecrow · · Score: 1

      I think the point of all the DVD by mail companies is to grab as many subscribers as they can before downloading movies is commonplace. If they can cover costs while increaing their market share and loyalty, then when distribution costs evaporate the value of each loyal customer will shot through the roof. Sow today reap tomorrow.

  22. First born? by qmaqdk · · Score: 1

    > ...Netflix is opening their database (nytimes, registration and first child required)...

    In order to view the article sacrifice your first born on the AJAX altar to the right. Use drag-n-drop pentagrams as necessary.

    --
    My UID is prime. Hah!
  23. Difficulties on the data-gathering end by jfengel · · Score: 4, Interesting

    Any marketer will tell you that what people tell you they want and what people actually want are very different things. Even if people answer honestly, the data you gather is often unreliable: people simply don't have as good a handle on what they want as they think they do.

    Not that marketers have a better handle, but simply that people will swear up and down that they would buy a peanut-butter-filled hot dog, that they loved the one they tried, and then don't actually buy any.

    Don't believe me? Go see Snakes on a Plane. Nobody else did. (Sure, $33 million seems like a lot, but that's chump change for a major studio release these days.)

    The best improvements will come from insights gained between the lines. You may have rated The English Patient eleventeen stars, but if your next seven rentals were all episodes of The Girls Next Door, which you only rated 3 stars, it certainly looks like you want more Hugh Hefner and less Ralph Fiennes.

    The best data is the data that the subject doesn't realize he's giving you. Once you start imposing conscious choice on the ratings, you get only what they say they like, not what they really like.

    1. Re:Difficulties on the data-gathering end by hoggoth · · Score: 1

      > what people tell you they want and what people actually want are very different things

      It's the difference between scientists and engineers trying to decide what activities seem more dangerous, and actuarians using real historical data to rate activities. Guess which method insurance companies use when their money is on the line...

      --
      - For the complete works of Shakespeare: cat /dev/random (may take some time)
    2. Re:Difficulties on the data-gathering end by speculatrix · · Score: 1
      this is a little bit off-topic, but...

      an excellent book which covers, amongst many other things, how people do behave over how they say they'll behave is Freakonomics.

      for example, they cover how people behave about race and dating, whilst people SAY they have little preference, analysis from dating agencies shows the opposite. Even some game show stats are used to prove people are prejudiced.

    3. Re:Difficulties on the data-gathering end by nasch · · Score: 1
      Once you start imposing conscious choice on the ratings, you get only what they say they like, not what they really like.
      That's a different problem though. It's pretty well known that what people say they will do is different from what they will do. However, this isn't asking them which movies they plan to rent, just how well they like a movie. It's not even possible to determine how accurate this information is, since you cannot tell objectively how well someone likes a movie. All you can do is ask them how well they liked it. When they can answer with the expectation of privacy, I would expect that to be pretty good data.
    4. Re:Difficulties on the data-gathering end by metamatic · · Score: 1
      Don't believe me? Go see Snakes on a Plane. Nobody else did.

      Did anyone say they wanted to see it, though? I saw a lot of people mocking it, but I didn't see any non-ironic expressions of enthusiasm.

      --
      GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
    5. Re:Difficulties on the data-gathering end by daft_one · · Score: 1

      The one that takes into account the increased level of carelessness during activities perceived as low-risk?

    6. Re:Difficulties on the data-gathering end by hoggoth · · Score: 1

      > The one that takes into account the increased level of carelessness during activities perceived as low-risk?

      If the actual data supports that then yes. Otherwise no.
      Using actuarial statistics, we don't care what the explanations are, just the results.
      If the data suprises us with a high level of deaths from nose-picking, then that would be a high risk activity.

      --
      - For the complete works of Shakespeare: cat /dev/random (may take some time)
  24. Intractable problem - liking the movie, not genre by OakDragon · · Score: 3, Interesting

    I stopped rating movies after I found that I got recommended a lot of crap. Say I rent a slasher movie that, for its genre, is artfully done. I rate it high. Now I have recommendations for a bunch of worthless, straight-to-video stuff that I really don't want to see.

    This is the real nut to crack, IMO. How do come up with an algorithm that rates 'quality,' an elusive concept that means different things to different people?

    Not to mention, I'm fickle.

  25. 5 star rating is flawed by BMonger · · Score: 3, Insightful

    I personally weigh movies on a number of different factors. I might give 3 stars to a movie because it has 4 of my favorite actors in it even if I didn't care for the plot. I might give 3 stars to a different movie with horrible acting but interesting camera angles (From Dusk Til Dawn 2). I tend to average out my ratings dependent on many things a movie has to offer.

    The problem is is that that is my rating system. It works for me. But it does little good to anybody else because they are rating based purely on something else.

    I think they need to implement the ability to rate more aspects of the movie. I'm sure some people out there rate the movie poorly if their disc is scratched or the transfer quality is poor even. A simple 1 to 5 system doesn't cut it. People rate things that aren't "Was the (romance) plot good?", "Do you like this director?", "Do you like these actors?". People rate things that aren't on the box.

    1. Re:5 star rating is flawed by SamSpectre · · Score: 1

      You've probably answered your own question. The rating system needs to be revamped to make the ratings more specific. They should have a toggle for advanced ratings, and let people rate (1 to 5 is fine) on cinematography, directing, music, storyline, acting, suspense, etc. Not sure if people would want to rate 20 items per movie, but if it meant getting better recommendations...

    2. Re:5 star rating is flawed by AceCaseOR · · Score: 1

      Personally, I wouldn't mind if NetFlix added a 1/2-star rating to their user ratings. They do half-star ratings for their reccomendations, after all.

      --
      Zagreus sits inside your head, Zagreus lives among the dead, Zagreus sees you in your bed and eats you in your sleep.
    3. Re:5 star rating is flawed by Anonymous Coward · · Score: 0

      1> You stated that he answered his own question. Note: he did not ask a question.
      2> You basically repackaged exactly what he wrote using different words. Bzzzt! Try again.

    4. Re:5 star rating is flawed by kthejoker · · Score: 1

      The predominant problem with any movie ratings system is not the number of stars or the granularity of the rating.

      The major problem is, simply put, movies are not treated as a disposable commodity.

      I worked at a radio station for several years, and one of my jobs was to review CDs for library status. Listening to an entire CD takes anywhere from 30 to 60 minutes - simply impossible. So, you resort to fast-forwarding, previewing, and generally getting a feel for an album. Quickly you separate the wheat from the chaff (and conversely, quickly dump the worst works without a second glance.)

      But then what happened? You evolve into an even more nuanced shorthand. Poorly Photoshopped covers became a quick excuse to pop a CD in, listen to 15 seconds of generic (but not awful) prog metal or power pop, eject it and trash it without a second thought. Albums on your favorite labels were given equally short treatment, then placed in the most coveted "Hot Trax" section. Only the things in the unknown middle received any kind of discerning analysis.

      We could no doubt do the same with movies. In fact, any movie rating system I would implement would do the same:

      A small screen would pop up. Then I'd show you 5 minutes of the movie, in 10 30 second clips. Perhaps these clips would be "pre-clipped" by a staff. It doesn't matter, really. The point is you essentially get to see what the movie is about, in terms of look and feel, acting, pacing, etc in enough to rate it.

      Underneath the screen would be the "vitals" of the movie: year made, genre, director, production company, chief stars, maybe a brief plot synopsis. This would prep you for what you're about to watch. In addition, it would show you how you've rated similar things: so if you really liked this director or company, you'd know.

      This is a win-win for everyone. You get to preview the actual movie, instead of just reading recommendations. It's free advertising for the movie companies (maybe even let them pick the clips - but no trailers, we want actual movie footage, not psychologically manipulating fluff.) And it turns you into a movie connisseur much more quickly. You'll evolve a shorthand. I like this director, give it a 5 star (or whatever) and move on to the next movie. You can start judging a movie in 30 seconds, and be more right than wrong.

      And yeah, you might miss your favorite film through a bit of oversight or callousness. But it's not likely - most people err on the side of caution when reviewing things, so you'll only trash the trashiest things without hesitation. The best films (for you) will shine quickly, and the rest - well, the unknown middle is a wide playing field indeed.

  26. Why bother with the prize? by frosty_tsm · · Score: 1

    If I had a way of greatly improving the current Netflix system, I wouldn't bother with the prize; it'd start up a competitor. If it's that much better than Netflix, it'll be worth way more than $1 million.

    1. Re:Why bother with the prize? by Quiet_Desperation · · Score: 1

      I think I'd take the million and run. Better designs don't always win (reference: Microsoft).

    2. Re:Why bother with the prize? by Anonymous Coward · · Score: 0

      I have a patent on a pill that cures AIDS and also turns water into beer. I am offering a $1000 prize for anybody
      who can implement it, but remember, it was my "idea".

      Yes, it's because (and I'm pulling this out of my ass here, but it's more than likely) Netflix have a patent on "movie reccomendation systems" or somesuch nonsense.

      What they don't actually have is an implementation.

      They want you to provide that for them.

      Just another reason why the so called "patent" system is pure bullshit and should not be recognised, acknowledged or
      entertained by anybody with a modecum of intelligence.

  27. Just add racing stripes! by capitalj · · Score: 2, Funny

    And some macaroni pieces.

  28. Not just the movie characteristics. by khasim · · Score: 1

    Yes, they need more characteristics of movies.

    But they also need ways to identify the characteristics of people's choices. Right now, one NetFlix account can be used by a whole family. So instead of getting 1 person's characteristic choices (teenage emo goth girl), you get those combined with the other family members (Dad's action films, Mom's chick flicks, Jr's teenage sex comedies).

    Eventually, you'd end up with a movie genome cross indexed to a sub-culture.

    1. Re:Not just the movie characteristics. by LordKronos · · Score: 1

      Netflix already lets you set up queues for each person in the house. However, what they need to do to make this more useful is to have a round robin system for shipping movies. Right now they make you assign a set number of discs to each queue. If you are on the 1 movie at a time plan, you can only setup 1 queu at a time to be receiving discs. When my wife and I had netflix, I'd have to log in each time I sent back a disc and reassign the 1 available disc to the other queue. I could see this being annoying enough that some people just decide to share a single queue.

  29. Define "better" by EatHam · · Score: 1

    SELECT TOP 10 title
    FROM tblMovies as m, tblAdvertisers as a
    WHERE m.studio = a.studio
    ORDER BY a.adRevenue DESC

    I win.

  30. How about stuff I already bought? by HepCatA · · Score: 1

    Just a simple suggestion. A lot of the DVD's that NetFlix recommends I already own. So I can't put "not interested" because it'll filter out those types of movies for me. How about something like "I own it" as an option, and it can pick recommendations based on those as well?

    1. Re:How about stuff I already bought? by popeye44 · · Score: 1

      I am with you. I guess I didn't understand how they were sorting them. I put not interested on almost every thing I already own. Which is my case is quite a bit. Fortunately I keep a fairly accurate account of what I like and know and already have. I rarely go looking for their recommendations on movies. I go there looking for a specific actor/acresses's work,or a specific genre. I do use the fact they will add more movies with the same person into recommendations. My wife just puts up every kids movie ever made so I don't have to worry about that part of the queue!

      --
      Inane Comments are Generously Disregarded
    2. Re:How about stuff I already bought? by LMacG · · Score: 1

      There's a discussion about this in another comment thread, but if you already own something, you should rate it appropriately. Why would own something that you are "not interested" in?

      --
      Slightly disreputable, albeit gregarious
    3. Re:How about stuff I already bought? by nasch · · Score: 1

      They rate it Not Interested because they're not interested in renting it. It's perfectly reasonable, and more than one person here has said it's tripped them up.

    4. Re:How about stuff I already bought? by Anonymous Coward · · Score: 0

      It's not perfectly reasonable. If I rated a movie 3 stars, that means I've seen the movie. Why would they suggest I rent the movie again? The reasonable conclusion is that a rated movie wouldn't be recommended. Thus "Not Interested" means I haven't seen it, so I can't rate it, nor am I interested in seeing it.

    5. Re:How about stuff I already bought? by Kusand · · Score: 1

      So just rate it yourself. You can rate recommendations and they go away. Since you own it, ostensibly you already know how you would rate it, right? Or do you just buy DVDs and not watch them, in which case your problem has no solution until you watch them?

    6. Re:How about stuff I already bought? by HepCatA · · Score: 1

      Wellll, think of it this way. If I own a DVD that I like, yes I can rate it vs. putting in "not interested". But if I own the DVD, they can stop recommending it to me in the future, and any other assorted recommendations can be based on that one DVD. If I go as far as buying a particular title, wouldn't you think I would want something very similar (vs. remotely similar) as my next rental? Or better yet, have them stop recommending DVD's that I have at home?

      It may seem a bit silly, but if something like this helps get the recommendations for stuff that interests the renter, it could mean more rentals!

  31. Re:Intractable problem - liking the movie, not gen by EatHam · · Score: 1
    How do come up with an algorithm that rates 'quality,'
    Phaedrus did this ages ago.
  32. how to enter the contest? by Kinlan · · Score: 1

    Does anyone know how to enter the contest?

    --
    As cunning as a fox, which has just been appointed professor of cunning at Oxford University. http://www.kinlan.co
    1. Re:how to enter the contest? by Anonymous Coward · · Score: 0

      Yeah.. there is no link in the article or on their website. WTF kind of contest is it if there are no details except some trumped up articles?

    2. Re:how to enter the contest? by hutteman · · Score: 2, Informative
      From their press release:
      Complete details for registering and competing for the Netflix Prize are available at www.netflixprize.com
  33. Movie folksonomy by RomulusNR · · Score: 1

    Encourage end-user tagging, compare on popular tags for matching a la LJ "people who have the most in common" search.

    Or leech off of IMDB's recommendation system, which seems to be quite good.

    --
    Terrorists can attack freedom, but only Congress can destroy it.
  34. Adapting their business model by kernel_pat · · Score: 1

    Yahoo has just done it and now netflix is, these companies are not hiring programmers anymore but asking hobbyists to write them code in their spare time and the hobbyist with the best code wins. Which really eliminates the need for programmers.

    I rekkon this will bite us all in the ass in the future because if everyone does this there will be no jobs left and therefore people will not learn because there is no money involved and then the companies will have no hobby programmers left.

    1. Re:Adapting their business model by arachnoprobe · · Score: 2, Insightful

      I don't think this is a "programmers problem". From thinking about it, and reading the approaches discussed here, it looks more like a mathematical problem. Finding a good strategy for linking the data and making suggestions seems far more important than hacking a good (my)SQL-query.

  35. Here's how to improve the system by alohatiger · · Score: 1

    - Collect more data from the person: age, region ("The South" "Northeast" etc.), education level, etc. As far as privacy goes, if you don't want to enter it that's fine--but your recommendations will suffer

    - Analyze rental activity with ratings. If I've rented a movie 3 times, I probably like it more than the movie I've never rented but gave 5 stars

    - Analyze queue transactions. What movies did I add to my queue together? What did I move up or down? What movies did I delete from the queue (and ask why: saw it already, changed my mind, etc.)

    - Analyze how long I hold movies versus what I rate them. Upon return ask if I watched the movie or not.

    - Find more ways to group movies together (genre, subgenre, actor, theme, director, writer, etc.). Figure out which actors I love/hate

    --
    Bigtime Consulting - "We're the best because we cost the most"
    1. Re:Here's how to improve the system by alohatiger · · Score: 1

      Also: Enlist celebrities to publish their queues. Lots of people will rent movies just because their favorite actor/athelete/whoever just rented it.

      --
      Bigtime Consulting - "We're the best because we cost the most"
  36. whew!! by bjk002 · · Score: 1

    "The problem is is that that is my rating system."

    I re-read this 5 times and then gave up!!

    --
    Opinion:=TMyOpinion.Create(Me);
    1. Re:whew!! by penguinwhoflew · · Score: 0

      I can't even say "is is that that is" out loud when reading the whole sentence... my tongue just gives up...

    2. Re:whew!! by 0xABADC0DA · · Score: 2, Funny

      It depends on what the definition of the phhrase "is is" is.

    3. Re:whew!! by BMonger · · Score: 2, Funny

      Like I I read read what I write... sheesh!

  37. And the winner is .... by argoff · · Score: 1


    BitTorrent!!! :)

  38. Same as all jobs by suggsjc · · Score: 1

    All jobs work this way (or at least they should). Return on Investment. In order for a company to make money, they will pay you a wage. Hopefully you will produce work that is at minimum equivalent to the amount that they pay you. If not, then they will be losing money employing you and if they have decent reporting/management will probably fire you.

    Many companies do offer incentive programs (more likely for upper level positions), but is still just a percentage of the actual "value" that you created, not all of it.

    Sidenote. We all like to complain about how overpaid execs are. They usually don't have earth shattering ideas. However, if they find a way to increase efficiency by 10% but it is for a $100 million dollar project, then they essentially create $10 million worth of "value." If they only get a 10% cut, then they get a paycheck of $1 million, even if they didn't do any of the work to actually realize the improved efficiency.

    --
    When I have a kid, I want to put him in one of those strollers for twins and then run around the mall looking frantic.
    1. Re:Same as all jobs by nasch · · Score: 1
      However, if they find a way to increase efficiency by 10% but it is for a $100 million dollar project, then they essentially create $10 million worth of "value." If they only get a 10% cut, then they get a paycheck of $1 million, even if they didn't do any of the work to actually realize the improved efficiency.
      No problem there. The annoying ones are when he miserably fails to meet the conditions set out for the bonus, but the board (under the direction of the chairman and CEO) votes to give the CEO the bonus anyway. If you get millions of dollars for failing, why bother to succeed?
    2. Re:Same as all jobs by Khashishi · · Score: 1

      The whole problem with this argument is that it is the worst execs are overpaid as much as the best. http://www.dolmatconnell.com/resources/2006DCPTech 100Study.pdf

    3. Re:Same as all jobs by suggsjc · · Score: 1

      Completely agree. Probably one of the biggest "problems" with large corporations.

      I know I'm sounding like I'm promoting "execs" but for the most part they put up with a lot. Granted some truly are heartless, but there are also nice people that have to fire hundreds (if not more) people that they may even be friends with. If it truly were a walk in the park and everyone could (or even would want to) do it, then ummm...things would be...different. I'm not sure how, but it would just be different. In another sense, I doubt many of the /.'ers would be good CEOs for the same reason that most CEOs wouldn't be good programmers/sysadmins/whatever.

      --
      When I have a kid, I want to put him in one of those strollers for twins and then run around the mall looking frantic.
    4. Re:Same as all jobs by nasch · · Score: 1
      If it truly were a walk in the park and everyone could (or even would want to) do it, then ummm...things would be...different. I'm not sure how, but it would just be different. In another sense, I doubt many of the /.'ers would be good CEOs for the same reason that most CEOs wouldn't be good programmers/sysadmins/whatever.
      There are definitely skills required to be head of a large corporation. But too often (not a majority case, just too often) the skills needed to make the business successful are not needed to get rich running the company. Compounding the problem is the possibility (I don't have any evidence but I think it's true) that the same people possessing the skills to get into such a position are more likely to be less ethical, and you get CEOs running companies with an eye to their own personal riches rather than the good of the company or even the stockholders. When things go south they get a huge severance package and move on to do the same thing at another company. Again, not necessarily a majority case, just too frequent for my taste.
    5. Re:Same as all jobs by DuckDodgers · · Score: 1

      Warren Buffet, one of the most successful investors in the US, CEO of mega-firm Berkshire Hathaway, makes good arguments in several of the Berkshire Hathaway annual reports that CEOs are dramatically overpaid.

  39. Here's a problem to solve with much larger impact by Yogs · · Score: 3, Interesting

    Disclaimer: I subscribe to the same sort of service, except through blockbuster... maybe Netflix does have this feature. My wife and I share a queue... I imagine many, many of these queues are shared. We have very, very different tastes in movies. Instead of getting recommendations that suit us both (which is next to impossible), the recommendations just get very, very confused. If I could just keep my and her recommendations from tangling, we would both have an easier time.

  40. Common data by Mike+Hicks · · Score: 2, Informative

    I see that the NYT article linked to just about everything except MovieLens. I've used the site, and folks might like to try it out. It looks simple, but it's fairly nice, having some of those fun dynamic pages that are all the rage these days. One neat thing in comparison to Netflix is that it will give a projected star rating for you, rather than simply saying "Recommended".

    Of course, I'm biased since I had John Riedl as a professor in a few easy classes. I think he tried to spin off this research as a new company, but I'm not sure if it ever got off the ground.

    One thing I'd really like to see has little to do with the quality of ratings, though. I'd like to be able to keep a common database of my ratings across multiple sites. At the moment, I've rated a number of movies at Netflix, MovieLens, and IMDb, but they aren't entirely consistent. Unfortunately, two of the sites use a ten-point system (IMDb has a ten-point scale, MovieLens goes up to 5 stars, but in half-star increments), while the other uses a five-point one (maybe six if you say "Not Interested"..).

    Well, I'll have to poke around a bit with this stuff. I wouldn't be able to do much, though, since my level of knowledge in this arena is very limited...

    1. Re:Common data by nasch · · Score: 1
      One neat thing in comparison to Netflix is that it will give a projected star rating for you, rather than simply saying "Recommended".
      Netflix does that.
    2. Re:Common data by will-el · · Score: 1

      >One neat thing in comparison to Netflix is that it will give a projected star rating for you, rather >than simply saying "Recommended".

      The stars that you see *are* the projected star rating for you. If you scoll down in a movie
      listing, you can see the projected and average:

      Average of people who rate like you: 4.4 stars
      Average of 28,372 ratings: 3.2 stars

  41. Re:Here's a problem to solve with much larger impa by LMacG · · Score: 1

    Netflix allows you to have up to five profiles, each with separate queues. According to the site, "Each profile has its own ratings, recommendations, Friends and MPAA levels."

    --
    Slightly disreputable, albeit gregarious
  42. Once again... by NetRoll · · Score: 1

    Residents of the province of Quebec in Canada are ineligible to participate. :(

    1. Re:Once again... by Anonymous Coward · · Score: 0

      STFU with the whining and go drink some Labatt's and Maple Syrup you fucking Canuck!

  43. The Gamble of Lifetime by Admin_Jason · · Score: 1

    So now developers should gamble their knowledge and skills at this in hopes of the payoff and publicity instead of getting a contract for 1/100th of that to accomplish the same objective over likely the same time frame? Hmmm....let me consider that one for a while...

    --
    Just another nameless binary in a crowd of 1's and 0's
  44. Job Application by Shihar · · Score: 1

    If you truly and honestly believe that the winner of these contests are not then promptly offered jobs, you are missing the point fo the contests. These contests are generally more glorified job interviews then anything else. I doubt they even expect anyone to win. What they DO expect is for people to send in some innovative solutions. They will then go out and try like hell to hire the people with the best submissions.

    If you are looking for a job, I wouldn't view this as a competition to make you obsolete. This is a competition to find a new employee and offer him a sweet sign on bonus.

  45. Re:Here's a problem to solve with much larger impa by The+Darkness · · Score: 2, Informative

    Disclaimer: I subscribe to the same sort of service, except through blockbuster... maybe Netflix does have this feature. My wife and I share a queue... I imagine many, many of these queues are shared. We have very, very different tastes in movies. Instead of getting recommendations that suit us both (which is next to impossible), the recommendations just get very, very confused. If I could just keep my and her recommendations from tangling, we would both have an easier time.

    This problem is already solved.

    With Netflix you can have multiple queues (up to one per disc-at-a-time out) and reassign the "number of discs out per queue" from 0-#out as long as the total isn't greater than #out. It also handles reassignment with discs outstanding well.

    The result in my family is that we end up with independant queues and independant recommendations. If Blockbuster offers the same feature you could split your queue up and get what you want. On top of that you won't have to keep organizing your queue to get the correct movie next if someone takes time to get around to watching their movie.

    --
    There are two kinds of people: 1) those that need closure
  46. goldcorp now worth billions with similar contest by Anonymous Coward · · Score: 0

    They had a large property where they just couldn't find gold. They made all of their geology data public and then held a $575k contest for the best suggestions. Using this Open (Gold)Source technique, they found numerous deposits and dramatically increased their market capitalization.

    http://www.intelligententerprise.com/showArticle.j html?articleID=159907864

  47. Why ratings? by gravyface · · Score: 1

    Maybe the problem with Netflix is good old classification. For some reason, the movie industry feels compelled to call everything "Action" "Drama" or "Comedy" -- add some tagging, use the Dewey Decimal System, I don't care, but make an effort at some descriptive categorization and see if that increases repeat buys.

    I just can't trust a user rating without reading their anecdotal description -- I want to know why something was rated only 2 stars before I pass judgement. Maybe Sally was a first-time buyer and decided to rate everything she rented as 1-star because they double-billed her credit card. Who knows? And Personally, I never rate stuff. I just don't care and can't be bothered; I'm sure I'm not alone either.

    The software security experts always say, "never trust user data" -- maybe this applies to recommendations as much as SQL injections.

    --
    body massage!
  48. Netflix Stats by in2mind · · Score: 1
    # Netflix has more than 65,000 titles and more than 42 million DVDs total.

    # Netflix has more than 1 billion movie ratings from customers. The average subscriber has rated more than 200 movies.

    # Netflix members select approximately 60 percent of their movies based on movie recommendations tailored to their individual tastes.

    # Netflix's members rent more than 95 percent of all titles in the Netflix library each quarter.


    http://web.netflix.com/MediaCenter?id=1005&hnjr=8

  49. 1324 movies rated, 0 recommendations by Sinistar2k · · Score: 1

    Here's hoping somebody can fix the current system, because NetFlix and I are at a stalemate. With 1,324 movies rated, NetFlix gives me 0 recommendations because it has become quite confused about my taste in films.

    Once, I rented and liked a Devo DVD, so it recommended every band with a concert movie, but I don't like every band and started marking things "Not interested". Then, I added a Sarah Silverman disc to my queue, which NetFlix took to mean that I love all stand-up comics, especially those on the Blue Collar Comedy Tour. But I'm not really interested in all comics, so I had to mark them "Not interested". Then I checked out a few documentaries, sending NetFlix into an excited state where it recommended all kinds of uninteresting docudramas. "Not interested".

    At this point, NetFlix thinks I'm not interested in anything even though I have a number of films rated 3 stars or greater.

    But maybe its recommendations are right... Maybe Sarah Silverman and Bill Engvall really are for the same crowd.

  50. i have a suggestion to make netflix better by Anonymous Coward · · Score: 0

    how about instead of sending out DVDs that have been scratched to death and unplayable you actually inspect the discs when people send them back in and charge them accordingly if they have damaged the disk to the point that it needs to be replaced?

  51. Get recommended from the producers by zogger · · Score: 1

    The movie industry and netflix have an obvious solution to increase sales and interest and push more disks and also cut down piracy, and they have always had it, use advanced tecnology as it becomes available. DROP the price down to the rental fee (around there, very close anyway) and let the movie going public have the disk. Let them keep the thing. They could do it too, stamped disks are cheap, any number of "alternative" businessmen who operate their own "courtesy disk distribution channels" show what stamped disks cost in bulk with some data bits on them. A buck or two max beyond what those gents charge would be a good thing for all concerned. Volume sales, a time honored concept to "make profit". We have the technology to do this, this is one place where we have sci-fi coming true-we have cheap "replicators", I sugggest we adopt that technology across the board.

        For netflix that would mean one way shipping, a significant cost savings, for the outside shipping fees and halving the labor handling.. For theater owners,this a great way to get people to watch the movie the first time at the big screen theater and buy their lard brand popcorn and ~cola~, a complimentary disk of the movie the people just paid cash to see on the way out.

    The next step is drop the number of movies produced and do some more work on screenplays and plot, etc. Just make better movies, but less of them, right now, just too many out there, dilutes the pool a lot. Then people might actually go and *enjoy**** the movies, and tell-a-friend about them.

    *****this still leaves annoying cellphone ringtones in the movies. Not that I would suggest anything illegal, immoral or fattening, but if doofuses who don't turn their phones off got a 5 gallon or so ~cola~ and lard brand popcorn shower from their surrounding movie watching neighbors in the theater every time a ringtone was heard, the problem might become self correcting. IANAL, check local vigilante regulations first ;)

    1. Re:Get recommended from the producers by nasch · · Score: 1

      You really think they'll make more charging $5 instead of $20 (ie sell more than 4 times as many DVDs)? And of course they would also be ensuring that the millions of people who watched the movie in the theater, prime candidates for buying the DVD, wouldn't buy it. So cut those people out of the equation too. Then add to that fewer movies coming out. I'm not sure the studio execs even understand the concept of a good movie, outside of how much money it makes, so that argument probably won't get far. Don't get me wrong, I would love fewer better movies, and cheaper DVDs. But it's not happening.

  52. A wrench by pestario · · Score: 0

    I share a netflix account with 4 others in my family. We all have very different tastes in movies. I like the horror and foreign film genre, whereas my brother prefers oldies, and comedies, while others are 'Friends' fans. An algorithm would need to keep this scenario into consideration as well, as I am sure this is not uncommon.

    --
    :n
  53. uh huh by kahrytan · · Score: 1


      Did anyone realize that Netflix is releasing 100 million of anonymous customer data? I thought /. was all for privacy. This just prooves that when it's $1 million dollars at stake, /. users don't care about privacy.

    Netflix also needs to introduce a dvd iso download service. In an age where most people own dvd burners, Why the hell not open dvd iso download service?

    --
    \
    1. Re:uh huh by Anonymous Coward · · Score: 1, Insightful

      Releasing customer movie preferences is completely different from releasing customer search queries.

      The problem with "anonymous" search queries is that when aggregated, they provide data that can expose the searcher's identity. Taken as a whole, searches like "Tupelo Mississipi", "STD symptoms", and "Dr. Smith" reveal quite a bit of information about the searcher.

      However, the knowledge that user #1234567 rated "Beach Babes From Beyond" 5 stars, and yet gave "The Godfather" a paltry 2, while interesting, does nothing to help me determine who user #1234567 actually is.

  54. Re:Intractable problem - liking the movie, not gen by kthejoker · · Score: 1

    Well, if you think about it objectively, you find that genre is a terrible way to recommend movies. Consider:

    Movie A is an artful horror film (we'll even give it a bonus point for originality.) Movie B is a low budget straight to video rehash of Movie A.

    What are the differences between the two?

    A) Budget / production values.
    B) Production company.
    C) Actors.
    D) Crew, particularly director / writer / producer.
    E) Originality of script. (This is kind of subjective, but surely a remake of a movie is less original than the original.)

    These are good criteria for picking out films. It's up for you (and every other user) to tell you what the values of these mean.

    Genre is terrible for choosing a movie; it is much better suited for discriminating out other movies. Someone can filter out all romantic comedies, fine. But saying all romantic comedies are alike because they are romantic comedies is wrongheaded.

  55. here's the problem with one-dimensional recommend. by MORTAR_COMBAT! · · Score: 1

    If I both liked Shrek II and The Motorcycle Diaries and Maria Full of Grace, and you liked Shrek II, does this mean you'll like the other 2? Absolutely not necessarily. The problem with the recommendation system is the limitation in -expression- of rating. A 1 to 5 star scale for every movie? It will never work.

    Combine tagging with rating and you'll find a much better recommendation system.

    i.e.: "Shrek II: cartoon, comedy, satire, Mike Myers; 4.5 stars" vs. "Shrek II: 4.5 stars" and "The Motorcycle Diaries: documentary-like, Che Guevara, cinematography-driven, story-driven, soundtrack-driven; 5 stars" vs. "The Motorcycle Diaries: 5 stars".

    Then you start to do more than correllate which films are well-liked and which ones aren't. You start to correllate sense of humor, style, etc.

    On iTunes, one of the better features for me is the iMix. Let's say I like some Mason Jennings song. It shows up on a few iMixes, so I check out those iMixes and find songs I like, be it Jose Gonzalez, Teitur, Alexi Murdoch, Patrick Park, whomever. It beats the crap out of Apple's "Just for You" recommendations, which are skewed to hell because I've bought some Madonna songs (and maybe a few others) that are simply -too popular- and completely mess up the bulk of the recommendations because of the sheer number of connections those few songs have.

    --
    MORTAR COMBAT!
  56. Re:Intractable problem - liking the movie, not gen by Anonymous Coward · · Score: 0

    I agree. Similarly, if I rent something to watch w/ someone else, namely some chickflick my girlfriend begged me to watch w/ her, I am now forever getting chickflicks recommended to me. I see this a good deal more w/ Amazon- I buy people gifts off of there, and now Amazon thinks I am really into cooking. I bought a service manual for my car, and now Amazon thinks I am a car buff. Similarly, just because I didn't like "stupid cash-in sequel" 3 of a series, doesn't mean that I dislike the entire series, which is often how it seems to work.

    This in particular seems fairly easy to fix though- Only recommend movies based on trends, not single data points. If I start renting tons of chickflicks, well then... I have what is coming to me.

  57. Curb my enthusiasm? by iion_tichy · · Score: 1

    I am seriously excited about this, but it sounds too good to be true? Please point out any traps in this competition?

    The winning conditions seem fair to me, as far as I understand, I even get to keep the rights to my algorithm, I "only" have to give a free license to Netflix? Or are there any traps I don't understand?

  58. Re:Intractable problem - liking the movie, not gen by pregister · · Score: 1

    And look where it got him. In the nuthouse.

    Followed by a multi-million dollar publishing career. Ok. Good point.

    -p

  59. My problem by Brandybuck · · Score: 1

    My problem is that I rate the movies I watch very high. Why? Because if they were crappy movies I wouldn't have rented them anyway! I usually know something about a movie before I rent it, even if it's just the viewer reviews on NetFlix. My ratings are 3, 4 or 5, only very rarely do I give out a 1 or 2.

    Consequently NetFlix thinks I like everything. While the system is smart enough to not recommend Martin Lawrence movies, it usually gives me movies I'm simply not interested in. Or at the other extreme, it gives me excellent movies that I've already seen, just not rented at NetFlix. For example, right now it is recommending:

    Anne of Green Gables - Ugh
    Eddie Izzard: Dress to Kill - WTF?
    Eleanor & Franklin: The Early Years - I'm falling asleep...
    The 39 Steps - Good choice, too bad I've seen it 39 times

    --
    Don't blame me, I didn't vote for either of them!
    1. Re:My problem by Guysmiley777 · · Score: 1
      Eddie Izzard: Dress to Kill - WTF?

      Hilarious. Seriously, I thought "oh, HELL no" when it was chosen as "the movie we are going to watch tonight", but I laughed my ass off. Le singe est dans l'arbre.
      --
      Coding with assembly is like playing with Legos. Coding an application in assembly is like building a car with Legos.
    2. Re:My problem by mkosma · · Score: 1

      Interesting. One possible solution in your case might be to rate movies that you (a) have seen but not rented from Netflix, or (b) that you have not seen and would not see (i.e., Not Interested). As for (a), I wonder if it is possible from the provided data for the algorithm to treat differently the ratings for movies rented through Netflix, and the ratings entered without an accompanying rental? Might be that the second category of ratings should count for more. Of course, you can argue that they should count for less, too.

  60. No Sweat by Tiger4 · · Score: 1

    This is an easy problem. Simply write a predictor algorithm to compare the affinity characteristics of a given film to the affintiy characterisitcs of a given subscriber. Based on the goodness of fit, probabilities of acceptance can be assinged and recommandations made. "You like Romances more than Comedies, and both of them more than Westerns? OK, based on the weights you've given, High Noon isn't for you, but Young Frankenstein might be better."

    One problem, how many dimensions ARE there to human affinity (eHarmony thinks they know)? And how do they interact? I like Romances, and Comedies, but NOT Romantic Comedies, unles they ARE in a Western, in which case they are about the same as an Action - War - Drama for me.

    And by the way, can you infer these things from my watching habits, or do you have to ask me, and what if I lie (or just don't know/realize I'll like something? Or maybe I just rented Silverado (Western Action Romance Drama) because I happended to know one of the producers, and I can't stand Westerns otherwise?

    And of course it all goes out the window when it turns out I like Samurai films, and Magnificent Seven is a remake of the Seven Samurai.

    Piece of cake. How hard can it be?

    --
    Behold, this dreamer cometh. Come now, and let us slay him... and we shall see what will become of his dreams.
  61. I hope Amazon... by GigG · · Score: 2, Interesting

    I certainly hope Amazon chooses to license anything that comes out of this. I've been a Amazon customer for about 10 years and have bought a couple of hundred books from them. For the first 2 or 3 years they gave me pretty good recomendations and I found a number of new authors that I probably wouldn't have started reading. Over the last few years they never catch a new author and suggest them.

    --
    Is buying a Harley Davidson as your first motorcycle since you were 16 at age 49 a midlife crisis issue?
  62. Re:Intractable problem - liking the movie, not gen by FoXDie · · Score: 1

    What about falling back on good ol' human minds? When someone rates a movie the site should ask them why they rated it this way to narrow down certain characteristics. For example, if it is an action film and the user rates it poorly the questions could be:
    "Was there too much violence in this film?" Y/N
    "Did the plot seem too far fetched?" Y/N
    "Was the overall production quality of this film (acting, special effects, editing) not to your liking?" Y/N
    "Is this genre of film one you don't normally care for?" Y/N
    Etc...

    As long as the questions are yes/no and there are 10 questions or less, I'm sure users would be willing to answer them most of the time if they were aware that it was going to enhance the quality of movie selection for them.

  63. Solution... by SnowDog74 · · Score: 0, Offtopic

    How do you make Netflix better? Well, I think the answer is obvious... replace Netflix with internet-based distribution where a user can download movies to their computer and then stream them to their TV.

    Hey, I'm a freakin genius! I'm going to be rich!

    *reads Apple Expo Paris press release*

    Oh, snap.

  64. Wheres the url by codeHorse · · Score: 1

    What is the url at netflix with the information about this database and access and submissions and rules, etc?

  65. Re:Intractable problem - liking the movie, not gen by Anonymous+Custard · · Score: 1

    "But saying all romantic comedies are alike because they are romantic comedies is wrongheaded."

    It all depends on how dedicated to the genre you are. If you liked everything from Event Horizon, to Evil Dead, to Bram Stoker's Dracula, then there probably aren't too many horror movies wouldn't at least mildly enjoy.

    But if you only ever rent Action movies if they star Tom Cruise, then your high War of the Worlds rating wouldn't necessarily imply you'd also enjoy Armageddon.

  66. What's wrong with Quebec? by aduthie · · Score: 1

    From the rules at http://www.netflixprize.com/rules : "Residents of the province of Quebec in Canada are ineligible to participate." They go on to list the remaining Axis of Evil and some other countries the U.S. doesn't much care for.

    But what's wrong with Quebec? I would presume that they passed some sort of IP law that would make it problematic for NetFlix if the winner were based there.

    1. Re:What's wrong with Quebec? by Better.Safe.Than.Sor · · Score: 1

      Wrong? With Quebec? The one in/near/joined to Canada? I believe Provincial laws keep Quebecers safe from the evils of this type of thing.

      --
      It's all history, man. -anon
    2. Re:What's wrong with Quebec? by triso · · Score: 1
      ...But what's wrong with Quebec?...

      There is nothing wrong with Quebec. They simply disallow all forms of lotteries, gambling and contests as being sinful and a temptation of the devil.
  67. Re:Intractable problem - liking the movie, not gen by Anonymous Coward · · Score: 0

    Exactly the problem with Pandora's recommendations.

  68. the new business model by zogger · · Score: 1

    The people in the theater would be buying the DVD, all of them, just bump up the ticket price a dollar or two, which is more than enough to cover the disk once they stamp them out by the millions. It's also a loss leader type deal to get them back into the theaters, all of the theaters have been complaining of less and less crowds, a disk with the ticket is a nice inducement, a very nice one. If the real counterfeiters-the pirates-can make dupes and profit at two or three dollars, so can the movie producers. They just don't want to..they want the same cash they used to get for more complex and expensive to produce VHS tapes-they haven't dropped prices in a long time now. I'll keep saying it-volume sales work and we have the technology now to make that happen. And people will have less and less incentive to pirate and file share when they can get legit copies on stamped disks for cheap from a variety of venues. And then they wouldn't have to try and keep coming up with weirder and weirder DRM schemes, again, pissing of their customers and just making things stupider than what they have to be. Humans just don't like being gouged or taken advantage of, that is half the reason no one cares about any ethics when it comes to file sharing, they see it as the producers charge such a rip off price and have been so recalcitrant about it that they lost any ethical business high ground. They have been operating as a blatant price fixing cartel for a long time now, they are angrily getting inj the face of every customer out there with their bogus warnings in front of the movie, and if they just stopped that and dropped prices to a much more reasonable level, and introduce some incentives that make owning disks conveninet for the consumers-like getting to keep the disk instead of turning it in with a rental, the free or near free ones at the movies, etc, they would sell more disks, a lot more,and I bet they would make more net profit in the long run. That's just an opinion, but I bet it would happen.

    1. Re:the new business model by nasch · · Score: 1
      Yes, stamping a disc is cheap, but that doesn't mean getting it to you is. There's advertising, packaging and distribution as well, which probably add up to more than the cost of producing the disc. Another problem with this scheme is that if they price DVDs at $2, then people will believe that they are worth only $2. If the experiment fails, it's doubtful the public would ever again pay $20 for a DVD. Right now people believe they're worth $20 for a new release and $10 for an older one. I don't have numbers in front of me but it seems to me they're selling in the millions every year. Why risk that revenue on a gamble that could possibly make more money, but could also sink the whole industry? From the other side, I already almost never go to the theater, partly because it costs $10. If Hollywood wants to get me back, they need to *drop* prices so it's worth taking a chance on seeing a movie in the theater, not raise them and promise to give me the DVD of the movie that I don't yet know if it's a pile of crap.

      I think what you're suggesting would be great for consumers, but a questionable move at best for the studios and it certainly isn't going to happen. It's possible it would work out in the long run, but 1) big companies are famously risk-averse and 2) American companies are famously bad at long-term thinking.

  69. Wow... I've seen this movie before! by ElitistWhiner · · Score: 1

    Deja vu Netflix!

    The First one is the age olde "frame problem". This is IT taking a perfectly good database and expecting it to be an even better recommendation system.

    Airlines hit the same wall decades ago. They had databases of flights, seats and routes - all excellent. But they really wanted a reservation system based upon ticketing against that database. They finally recognized that nothing less than a mainframe was needed.

    The answers you get is all in how you frame the question. Starting with "database and PC" is not going to get to recommendation system without abandoning inherent limits within IT's reference frame.

    The second one is CCC Trucking Co solution. Sam owned Crete Carrier Co trucking. Sam owned a significant segment of his market ~50% but wanted to know *how* to grow to 80% of the market for his company. Sam's soluton was simple... he grew 1% at a time, buying smaller truckline competitors. 1% here, 3% there and soon CCC had real marketshare.

    Netflix need only implement 1 and 2% solutions. Pretty soon they have a real 10% solution.

  70. Psychology, extended data by inKubus · · Score: 1

    The problem with recommendations is that it can only determine what to recommend based on what you've rented, what you've marked you like and what other people who rented the same stuff liked also.

    They are leaving out a whole aspect of psychology. The problem is that they have a 2 day lead time to get the content to you. So, it's not just a matter of them asking what your mood is and then providing you with the movie. Instead, they have to predict how you will be feeling 2 days from now and send you the movies.

    They should start a voluntary rating program, wherein you sign up and they send you random movies, which you watch and rate (like a critic). Based on your responses to stuff you NEVER would have seen, they have a whole new set of data.

    Then there's other stuff, like location. Where in the US are you? Maybe New Yorkers are more likely to like an art movie while a movie about the South is more liked there. What about the time of year? Obviously during holidays you have an easy selection. But certain movies rent more in the Spring and Summer, etc. If you say you're a college student then it knows that you want college flicks in the fall.

    Weather? The system looks at the weather forcast for regions (doesn't have to be complex, just a statement like "Region A" is going to be rainy next week") and then queue up more videos with rain in them (or sun, if they are a person who rents the opposite of the weather).

    See, you have to play off the cues that make people actually want to watch a movie, not about what they already watched. That's the totally WRONG way. If you just got done watching Ep1-6 of star wars, why the hell would you want another space movie?! After 12 hours of space movie, you're probably looking for a good comedy. In this case, the past has no bearing on the future.

    This system CAN work for a place like Amazon. It works well for non-fiction. Say I have a land project and I order some developer books, it can reasonably assume I'll want the more advanced reference eventually.

    Would you watch it again? It should ask you about movies you've returned if you're likely to watch it again in the next month, quarter, year, etc. Then it can determine if you want to watch something like it in the near future.

    I agree with the TV show thing, keeping episodes in order is a good idea.

    Anyway, it's going to need to go out and get other information besides just what people watch. Picking a movie has so much more to it.

    --
    Cool! Amazing Toys.
  71. Official site by Anonymous Coward · · Score: 0

    Press release: http://www.netflix.com/MediaCenter?id=5368

    Offcial registration and competition information for the Prize: http://www.netflixprize.com/

  72. Dataset 'purturbed' by Anonymous Coward · · Score: 0

    From the Rules:

    To prevent certain inferences being drawn about the Netflix customer base, some of the rating data for some customers in the training and qualifying sets have been deliberately perturbed in one or more of the following ways: deleting ratings; inserting alternative ratings and dates; and modifying rating dates. However, the Cinematch RMSE measured on the final, perturbed dataset does not differ significantly from the RMSE measured on the unperturbed dataset for the purposes of Grand or Progress Prize qualification described below. The RMSE values reported below represent the RMSE measured on both the perturbed and unperturbed datasets to the precision specified above.

    Netflix assumes that because Cinematch's RMSE on the perturbed data does not differ significantly from its RMSE on the unperturbed data, the same will be true for the algorithms entered in the contest. This is probably valid if the contestant's algorithms are similar to their own. But they may not be similar. It is possible that Cinematch is a near-optimal refinement of a certain approach to the problem, and only a radically different approach will be able to improve significantly on its performance. But that approach, unlike Cinematch's, may have its performance significantly degraded by the perturbation of the data. Thus the perturbation of the data may prevent a win.

  73. Never-ending competition by Dan+East · · Score: 1

    From the rules:

    Contest begins October 2, 2006 and continues through at least October 2, 2011.

    your submitted predictions that year must be less than or equal to the accuracy value established by the judges the preceding year.

    Okay, so the contest doesn't have an ending date, and the judges can modify the winning criteria as they see fit. I hope whoever spends time on this enjoys having others profit financially from their free labor.

    Dan East

    --
    Better known as 318230.
    1. Re:Never-ending competition by Marlow+the+Irelander · · Score: 1

      You only submit an indexed dataset, not your code. Until they tell you you've won and pay up, they'll never see your code, and they'll never profit.

  74. So much data.. yet so little. by MistaBell · · Score: 1

    This seemed like a fun little project, so I registered on netflix and downloaded the file (which decompresses to almost 4gb worth of numbers.) They give you a list of movie IDs, user IDs, and ratings. Your job, if you choose to accept it, is to guess the correct ratings (1-5) for a random slice off a server log after the ratings have been cut out. If you can guess the missing ratings for various movies better than the system can.. you collect a cool $1 mil. The problem is that 4gb worth of user ratings is worthless without more data. You can average the ratings or match users up with others that rated movies similarly.. but that's about it. Having users fill out a small questionaire, pairing them up with others with similar tastes, and using their purchase histories would be infinitely more useful and less complicated than doing it with 4GB worth of numbers 1 through 5 alone. I really hope they use more data than this for their current method of recommendations. (and if they do, shouldn't they release that data if they expect people to come up with a better algorithm.. privacy concerns aside?)

  75. Re:Here's a problem to solve with much larger impa by dargaud · · Score: 1

    Yup, and there's the related problem of outliers: when you order something out of your ordinary habits. Example, I ordered a bunch of baby books as a gift for some friends on amazon.com. And although I have zilch interest in the little critters, Amazon now keeps recommanding whole ranges of baby stuff.

    --
    Non-Linux Penguins ?
  76. You need more data... by moorley · · Score: 1

    So use connotative association.

    The current recommendation systems (if I understand them) are trying to suss out info by comparing the differences between the preferences of people. The problem is that each movie is a very complex mix of themes and connotations.

    You need more information on the objects you are recommending to each person. Rather than just compare the preferences you want more information about the object than a rating of 1 to 5. You can do this with genre's, to a degree, but even genre's are imprecise. Is it SciFi or Romance? Is it more scifi than romance?

    Think of an object that has width and breadth in a multitude of different dimensions. A Venn diagram meets a hypercube. But where do you get that information?

    Why the same place/source that artists and designers do, DUDE!

    Oddly enough, to me, the answer comes from a design class: Connotative association. When we are working on a design for a project we start sifting through media. Songs, books, images, ideas, anything that give us a "hit", Makes us think of that project. Think of it as brainstorming but you are sifting through contemporary cultural artifacts and your own psyche to "find" a commonality. Once you have all the media you start to sort it. Does this image work with this sound, does this clothing choice work with this color. As you winnow down those choices you start to get coherent choice of color, mood, and theme that you may not have known existed but thanks to your subconscious you have found a workable choices that should relate to a contemporary audience.

    If artistic design uses that method to make new harmonious or understandable art to a culture as a whole than hooking into that same process will allow for better understanding, definition and recommendations.

    So you construct a series of images, sounds, media and you use that to classify movies. Popular songs to classic folk hits. Artwork masterpieces to new artwork. All forms of media and periods. Hitchcock is going to "hit" with a black and white pinstriped suit and art deco design elements as will "Hudsucker Proxy" which are both movies that I would enjoy. If you delve into my connotative associations you will find others that are similar and start to generate a mathematical "map" of what I might enjoy based on those associations. You may find that different genre's and movies have similar associations that make them worth recommending. You can also pull from the mappings of other folks connotative associations to see how what they've seen might be of interest to me.

    All in all you are just defining a new space. The idea, for me, is easy but I'm not a mathemetician or a database designer.

    The downside is getting folks to go through a series of "warshack-esque" media to classify a film. I don't even like filling out the number of stars let alone listening/viewing/reading 5 media bits to relate them to a movie I've seen. But I think it's a viable way to go. The current recommendation system can approximate the above after comparing thousands of films/preferences but without relating it to other media and moods it won't be able to achieve more resolution or granularity.

    In the end all art is merely a conversation of a time between the folks who lived in that time based on what has gone before. You are simply trying to better mathematically understand the deeper underpinnings of that conversation to make recommendations. Easy... Right? ;-)

    --
    "Don't fear death... fear not living..." -me :)
  77. Back in my day... by edraven · · Score: 1

    We used to call this MovieCritic. It used a system called LikeMinds to basically mine their database of users' movie preferences in order to define relationships between the tastes of various users. The theory being that if you feel pretty much the same way about a lot of movies as particular other users do, you're pretty likely to enjoy the movies they liked but which you haven't yet seen. I never had it recommend a movie I didn't enjoy, and I was pretty surprised by some of its recommendations. I saw movies based on MovieCritic's suggestions that I never would have bothered with on my own, and really enjoyed them.

    MovieCritic shut down in 2002, and the LikeMinds technology was apparently acquired by Adobe, who as far as I can tell are using it for toilet paper.