New Leader In Netflix Prize Race With One Day To Go
brajesh writes "The Netflix Prize, an algorithm competition to improve the Netflix Cinematch recommendation system by more than 10%, has a new leader — The Ensemble — just one day before the competition ends. The 30-day race to the end was kicked off after BellKor's Pragmatic Chaos submitted the first entry to break the 10% barrier, with the results showing a 10.08% improvement. The Ensemble, made up of three teams who chose to join forces ('Grand Prize Team,' 'Opera Solutions' and 'Vandelay United), has managed to overtake BellKor with a score of 10.09% — an improvement of .01% over the former leaders. From the article on Techcrunch: 'The competition will end [today], so teams still have a little bit of time left to make their last-second submissions, but things are looking good for The Ensemble. This has to be absolutely brutal for team BellKor.'"
that other websites should do this as well.
Slashdot, for instance, could have a contest to unbreak their fucking code by 10%.
What did they do, make sure that all of Uve Boll's movies never came up as a "Recommended for you" movie?
If our elected representatives no longer represent us, do we still live in a Democracy?
Back when I first began using Amazon.com, I never bought a book based on the recommended items. I felt the recommendations were trite, ill-advised, and typically only peripherally related to the item I was buying.
Then the recommendations got better. Much better. I started to find myself buying things right out of the recommended section, and the product combination deals also became very tempting.
If Netflix can turn their recommendation engine into something similar, they will be sitting on a goldmine. As they say, people hate get sold to but they love buying.
Why not wait another day before submitting the improvement? All they did now was giving the other team one day to respond, and if they succeed, I doubt they will be able to submit yet another improvement. So why not simply wait until an hour or so before the deadline, or am I missing something about the rules, e.g. any submitted improvements prolong the deadline by one day?
rather than declaring your best result early, the Belkor team should have employed a bit of strategy and only declared a lesser result (if any). That would give the other teams something to aim at, without giving away their best results. These would be held back right up until the last minute and then submitted, so that other teams would not have time to make any further improvements (in fact, maybe this IS what they're doing). It's been a successful bidding strategy on eBay for years, so why wouldn't it translate into other competitive areas too?
politicians are like babies' nappies: they should both be changed regularly and for the same reasons
Uwe Boll. It only sounds like a v because he's German.
I'm actually surprised that this hasn't been done before. You can prove that using multiple models will on average produce better results than using any single model in isolation. For example, each netflix system will make different errors; using multiple systems will tend to average-out these errors and the consensus decision is most likely to be correct.
I thought Vandelay was into manufacturing latex.
My question is whether there will be any winner at all other than netflix? One of the rules for the competition was that you could not form multiple teams. This was to prevent people from gaining multiple submissions per day. Otherwise a five person group could create 30 teams and thus be able to submit 30 attempts per day. I believe both teams that have exceeded the 10% threshold and thus are eligible for the grand prize are composed of members from other teams and could be disqualified.
'The tyrant will always find pretext for his tyranny.' - Aesop's Fables
Call me crazy, but if you actually *read* the rules it says the contest is going until at least October 2nd, 2001.
Netflix Prize Rules
Terms and Conditions in a Nutshell
* Contest begins October 2, 2006 and continues through at least October 2, 2011.
They could improve the predictive value immensely if they allowed me and my wife to each rank the movies we watch together separately. With the current system, some movies are rated by just me, some by just her, and some have a consensus rating. It leads to a dataset full of garbage.
Screw the code. All they have to do is stop hiding the new releases. They purposely bury recent releases so they dont have to keep up with demand. Which is total bullshit. 10% improvement of what? Getting the same garbage they offer out there even faster? *sigh*
Amazon could really benefit from a similar contest for its laughable Gold Box picks, but increasing the accuracy of recommendations by even 5000% would be low-hanging fruit.
It's a tie, check the leaderboard again...
According to the linked leaderboard it's 10.10% for Ensemble, and 10.09% for BellKor's Pragmatic Chaos.
It's interesting that the fearmongering of the prior /. post about AI got hundreds of responses but this /. post, which is far more relevant to real AI, has gotten less than a hundred responses thus far.
Anyway, congratulations to Netfilx for doing the right thing for their business in response to The Hutter Prize.
Seastead this.
Yehuda claims to have the best test error rate. They will win the million dollars. It was super-exciting: It seemed BellKor et al will be defeated thanks to a little-known rule of the competiton (the 30-day last call rule). But they have won after all, thanks to a less-known rule (the quiz dataset - test dataset distinction).
I think this is bringing us one step closer to EPIC (video).
Great point, stop the suffering!
While I would appreciate some good movie recommendations, I can't help but feel a little creeped out that netflix may be able to read my mind one day....maybe I can make up a movie in my imagination and netflix can play it for me! ~Ami
Chicago Web Design
Comment removed based on user account deletion
Since the test set was a lottery draw wherein any team could have come up on top, isnt it a little awkward for BPC to justify why their results would be better since the other team (Ensemble) is visibly better than then them on the leaderboard?