Slashdot Mirror


Yahoo Releases Open Source Hadoop Distribution

ruphus13 writes "Yahoo has been a vociferous Apache Hadoop user and supporter for several years now, and uses it extensively within its Search technologies. Hadoop has been gaining popularity in the Cloud Computing space, with companies like the NYTimes converting 4TB and 11 million articles to PDFs in under 24 hours using Hadoop and EC2 in late 2007. Hadoop has been made available in Amazon's cloud and Yahoo has now released its own Hadoop version. From the article: 'At today's Hadoop Summit in Silicon Valley, Yahoo! announced the availability of the Yahoo! Distribution of Hadoop, a source-only version of Apache Hadoop that Yahoo! uses within its own search engine. [Hadoop] is an open source software framework that helps process very large data sets, and is widely used in large-scale data mining applications as well as in search tools at sites like Facebook and many others. For developers and users interested in Hadoop, it's worth noting that the Yahoo! Distribution of Hadoop has been widely tested and developed at Yahoo! for years now.'"

5 of 49 comments (clear)

  1. Hadoop? by ickleberry · · Score: 5, Insightful

    Can we bring back the ordinary, sensible pre-Web 2.0 names please?

    1. Re:Hadoop? by Just+Some+Guy · · Score: 4, Insightful

      Like Yahoo!?

      --
      Dewey, what part of this looks like authorities should be involved?
  2. Hadoop is awesome by fancellu · · Score: 5, Informative

    Not only is it used by Yahooo, but also by Facebook, who get 15TB of new data a day to handle. Checkout the very useful free vids from Cloudera. http://www.cloudera.com/hadoop-training-thinking-at-scale You can download a canned VM preloaded with Hadoop/Pig/Hive goodness, even a copy of Eclipse preconfigured. http://www.cloudera.com/hadoop-training-virtual-machine

  3. Yahoo! and OSS by Alethes · · Score: 5, Insightful

    Yahoo! really does get a lot of flack around here, but I have to say, they have contributed quite a bit of free and open-source software for developers to use. The list of of APIs and web services that are available is quite impressive and many of them are better than Google's similar offerings (BOSS vs Google's AJAX search, for example). For anybody who's interested, I really recommend checking out the Yahoo! Developer Network site.

    1. Re:Yahoo! and OSS by hairyfeet · · Score: 5, Interesting

      And folks like to make fun of Yahoo search, but after switching from Google I just can't ever even think about going back. The more/concept tab(that is the blue button below the search box) is just too nice to give up.

      Example- i just picked up "Blacksite:Area 51" for $5. I type in "blacksi" and there it is. From "Blacksite:Area 51" in the search box under more/related I have cheats,patch.system reqs, PS3.Xbox360, Midway games west,multiplayer modes, squad based shooters, release date by region, etc. Just from typing "blacksi" and picking area 51 from the drop down I have all those different avenues related to my search right there at the top where they are easy to get at. It really lets me hone in on an area, and in some cases like movies it finds me interviews with the director which i often don't even know who directed a particular flick.

      So those that haven't tried their search in a few years really ought to give it a whirl. The more/related concept tab at the top makes search so easy to drill down. Plus Yahoo has an opt out for ad matching if you are concerned about privacy. I looked and I don't think Google even has an "opt out" short of using ABP. So give it a go, its free and you might find the more/concepts button as useful as I do. And competition is always a good thing, right?

      --
      ACs don't waste your time replying, your posts are never seen by me.