Slashdot Mirror


Augmenting Data Beats Better Algorithms

eldavojohn writes "A teacher is offering empirical evidence that, when you're mining data, augmenting the data beats using a better algorithm. He explains that he had teams in his class enter the Netflix challenge, and two of those teams took different approaches: one used a better algorithm, while the other harvested additional data on movies from the Internet Movie Database. The second team, despite using a simpler algorithm, did much better, scoring nearly as well as the best algorithm on the leaderboard for the $1 million challenge. The teacher relates this back to Google's page ranking algorithm and presents a pretty convincing argument. What do you think? Will more data usually perform better than a better algorithm?"

1 of 179 comments

  1. Slashdot News Flash: BUSH RESIGNS +1, Good by Anonymous Coward · Score: -1, Offtopic

    and flees to Kazakhstan.

    We hope he is apprehended and returned to face a criminal trial with the Satan sympathizer President-VICE Cheney.

    Yours PatRIOTically,
    K. Trout, ACTIVIST