Slashdot Mirror


Yahoo Releases Largest Ever Machine Learning Dataset To Researchers (tumblr.com)

An anonymous reader writes: Yahoo Labs has released a record-breaking dataset containing 110 billion interactions from 20 million Yahoo News users in 1.5TB of zipped data. The anonymized data is intended for research initiatives in artificial intelligence, including user-behavior modeling, collaborative filtering techniques and unsupervised learning methods.

4 of 41 comments

  1. Garbage in... by Anonymous Coward · Score: 2, Insightful

    Garbage out. Enjoy your 1.5TB of crap.

  2. Somewhat outdated by Snotnose · Score: 2

    For the last couple of years I've been hitting their comics page daily; from there I'd sometimes go to finance and then regular news. Last month they nuked the comics page, and when I went to the finance page they had one of those annoying floating opaque ads that want you to click on them to make them go away. No thanks.

    Haven't been to yahoo since. My reasons for going have been either A) removed; or B) made untrustworthy.

    Icing on the cake? For about a week I kept trying to get the comics page, hoping it was a mistake. Then my google newsfeed told me that yahoo had deliberately deleted it. Not yahoo news, google news. Good job, yahoo.

  3. Not going to be anonymous for long. by KingBozo · Score: 2

    My evil AI machine learning algorithms should have this problem licked post haste.

  4. Re:Only if student or faculty at university... by webmistressrachel · Score: 5, Informative

    Wait wait wait... mod me down... it made me sign up, then it made me fill in more forms, then agree to all sorts of EULAs, THEN it demanded a university email address.... Sorry everyone. My download is stopped. And I just corrected the GP, wrongly. Sorry! (ducks and prepares to lose karma)

    --
    This tagline was transcoded to result in at least one smirk. If you experience failure to smirk, please consult your Gen