Netflix Prize Contest Ends, Down To the Wire
suraj.sun updates us on the Netflix Prize now that the competition has officially closed. We discussed the new leader with one day to go in the contest: The Ensemble, taking the lead from long-time leader BellKor's Pragmatic Chaos, the first contestant to submit an entry that broke the 10% barrier. In the contest's final day, BellKor re-took the lead with 20 minutes to go, then The Ensemble apparently pulled a Michael Phelps with 4 minutes to go, squeaking ahead by 0.01%. At least so the leaderboard claims — but those numbers are posted by the competing teams. The NY Times reports that an official winner will not be named until September — Netflix needs that much time to pore through the complex entries and read the code. Netflix contacted BellKor on Sunday to tell them the team remained in first place; The Ensemble has had no such notification.
They realized that all movies starring Matthew McConaughey and Kate Hudson were actually the same movie. The compression on that alone was enough.
What they need to start is a contest to improve their incredibly lousy on-demand service, the Silverlight player is beyond terrible. All this effort (and money) over getting 10% more accurate guesses that the same guy who liked "Terminator" will like "Terminator 2" is nice and all, but it's a bit of a time waster don't you think?
I hope Saki and Ikeda kick the Kodomo's ass next episode. What a spoilt little bitch.
BAWWW, I can't play Mahjong so I'll use hax to get my way.
team a makes algorithm improvement b
team c takes algorithm improvement b and makes algorithm improvement b(+d)
team e takes algorithm improvement b(+d) and makes algorithm improvement b(+d)->f
the guy who squeaked out the extra 0.01% did that on top of someone else's code that eked out 0.05%, etc., ad nauseum
so how do you ascertain who won? all the teams won
they should take the final prize money and try to fractionate each incremental improvement in the algorithm and proportionally dole out the money that aways. anything else is unfair
intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
Perhaps The Ensemble broke the rules?
"Eligible algorithms must provide predictions for all the withheld ratings for each customer/movie id pair in the qualifying set. Each submitted prediction set must be uploaded to the Site in the format specified in the dataset. Submissions that fail to follow these requirements will be rejected. A day must elapse between each teamâ(TM)s submissions."
So The Ensemble ripped bong with 4 minutes to go which gave them the creativity to squeak ahead by %0.01?
A Magic the Gathering Article and Forum Aggregator
I got it from the comments, I guess, but it would be nice if the summary would mention what the contest is about.
They're having a contest to improve Netflix's algorithm?
"There is no time, sir, at which ties do not matter," Jeeves, (Jeeves and the Impending Doom)
Netflix calculates the score shown on the leaderboard from a set of rating predictions submitted by a team. The team does not, and will not, know the correct answers. For testing their algorithms, the teams use another dataset. The two datasets, part of the package made available to the competitors, are known as "qualifying" and "probe".
Go Bellkor!!! Sorry, I'm biased :)
The reason BellKor is still first is that the published scores are irrelevant. The scores that matter for the prize are based on an unpublished data set known only to Netflix (to prevent people submitting answers that are optimized for the challenge data and work poorly on everything else). On this secret data set, BellKor's algorithm apparently performs better than The Ensemble's.
main(c,r){for(r=32;r;) printf(++c>31?c=!r--,"\n":c<r?" ":~c&r?" `":" #");}
It was Jason Lezak, not Phelps, who anchored the 4x100 Olympic relay and closed what appeared to be insurmountable lead by the French team.
I don't use netflix, i'm a blockbuster guy cause we happen to have one close to our house. But What is 10% of zero? in all seriousness, what is their accuracy now? How is it determined?
How much is your data worth? Back it up now.
...I'm sitting here wondering how stable these algorithms are over long periods of time. I'm assuming that the "practice" data set and the "test" data set are equal in terms of time distribution (date of movie release; date of review). But 10 years from now, 20 years from now, I see the RMSE numbers slowly drifting upwards as the algorithm was optimized to the 2000-2009 data set, not the 2000-2020 data set or tahe 2000-2030 data set. But this is not my area of expertise so I'm wondering what others have to say on this topic.
No fair! You changed the outcome by measuring it!
Comment removed based on user account deletion
Am I the only one on slashdot who refuses to ever install silverlight?
Yes, I'm an anti MS fanboy, but after the things that happened during the OOXML ISO approval I just cannot support that company anymore. In any way whatsoever.
If you too (speaking to the general audience here) feel this way about MS, I hope that you do not support them by installing silverlight or in any other way really.
subject says it all. when i'm distracting myself off this way, it's more efficient to run a VM fullscreen for the movie display. then do work on the host OS. lame but works well and i need that VM up most of the time anyway.
If ((MovieReleaseDate > 2000) OR (MovieProducer = "Bay, Michael")) AND (CustomerAge > 20) Then .5
Score = Score *
If (MovieFABRating = "Nudity") AND (MovieLeadActor != "Cohen, Sasha Baron")
Score = Score * 10
http://www.netflixprize.com/community/viewtopic.php?id=1498
"Thanks. In fact, this is a very happy day for us - our team is top contender for winning the Grand Prize, as we have a better Test score than The Ensemble. (Probably this is the first post revealing this in the forum smile)"
Also, Yehuda Koren is at Yahoo now, not AT&T.
---
Information Retrieval Feed @ Feed Distiller
I always wondered why squaring is always used for distance measurements. It seems it would over-magnify the influence of non-matches and outliers compared to what the "real world" would want. I realize it makes the math easier, but having easier math and having the best answer may not be the same thing. I've asked math experts about this, but it seems it's an under-studied question.
Table-ized A.I.