Slashdot Mirror


Open Source Solution Breaks World Sorting Records

allenw writes "In a recent blog post, Yahoo's grid computing team announced that Apache Hadoop was used to break the current world sorting records in the annual GraySort contest. It topped the 'Gray' and 'Minute' sorts in the general purpose (Daytona) category. They sorted 1TB in 62 seconds, and 1PB in 16.25 hours. Apache Hadoop is the only open source software to ever win the competition. It also won the Terasort competition last year."

7 of 139 comments (clear)

  1. Re:I'm sure that I can rock their scores by Thinboy00 · · Score: 5, Funny

    My sort will totally beat yours!

    --
    $ make available
  2. They won the "Who has the most moneys" award. by nathan.fulton · · Score: 5, Insightful

    ...this cluster had nearly 4 times the number of nodes as the previous records. This competition was testing who had more nodes working together the best, but when you have so many more nodes, it would be hard not to top other clusters.

  3. Java by cratermoon · · Score: 5, Insightful

    OK, so where are the "Java is slow" comments? o.O

  4. 100 bytes, 10 byte keys. by eddy · · Score: 5, Informative

    Probably why the second sentence in the article is "All of the sort benchmarks measure the time to sort different numbers of 100 byte records. The first 10 bytes of each record is the key and the rest is the value."

    --
    Belief is the currency of delusion.
  5. Re:When's it going to be 1.0? by Anonymous Coward · · Score: 5, Informative

    It's 0.20 but it's stable and production ready already. I use it with HBase and it scales awesomely.

  6. Re:Overlords - Trivia by e9th · · Score: 5, Informative

    Hadoop's name (and mascot) came from Doug [the project leader] Cutting's son's yellow stuffed elephant toy.

  7. Re:Overlords by ModMeFlamebait · · Score: 5, Funny

    datasorting for I new one our overlords! welcome

    --
    Pavlov. Does this name ring a bell?