Slashdot Mirror


Google Sorts 1 Petabyte In 6 Hours

krewemaynard writes "Google has announced that they were able to sort one petabyte of data in 6 hours and 2 minutes across 4,000 computers. According to the Google Blog, '... to put this amount in perspective, it is 12 times the amount of archived web data in the US Library of Congress as of May 2008. In comparison, consider that the aggregate size of data processed by all instances of MapReduce at Google was on average 20PB per day in January 2008.' The technology making this possible is MapReduce 'a programming model and an associated implementation for processing and generating large data sets.' We discussed it a few months ago. Google has also posted a video from their Technology RoundTable discussing MapReduce."

21 of 166 comments (clear)

  1. Kudos to Google by Anonymous Coward · · Score: 5, Funny

    for knowing how important the Library of Congress metric is to us nerds!

    1. Re:Kudos to Google by canuck57 · · Score: 5, Funny

      for knowing how important the Library of Congress metric is to us nerds!

      But at least now we know Google can sort out petafiles.

    2. Re:Kudos to Google by shutdown+-p+now · · Score: 4, Funny

      Bah! To pay true homage, they need to add it to the list of units in Google Calc!

    3. Re:Kudos to Google by LingNoi · · Score: 3, Funny

      So Google can sort through 12 LoCs in 6 hours.

      Wow, that's 2 LoC/pH

    4. Re:Kudos to Google by Anonymous Coward · · Score: 1, Funny

      Woooosh, dipshit.

  2. Unit conversion by Zarhan · · Score: 4, Funny

    Yay! We finally have unit conversion from 1 LoC to bytes! So...20 PB = 6LoC, means that 1 LoC = 3,333... PB :)

    1. Re:Unit conversion by xZgf6xHx2uhoAj9D · · Score: 1, Funny

      This is an excellent point. No American football player has used his feet since the NFL adopted hoverchairs into the rules in 1974.

  3. Finally... by aztektum · · Score: 5, Funny

    I will be able to catalog my pr0n in my lifetime:

    Blondes, Brunettes, Red heads, Beastial^H^H^H^H^H "Other"

    --
    :: aztek ::
    No sig for you!!
    1. Re:Finally... by Pugwash69 · · Score: 2, Funny

      How do you catalogue the topics? I mean "Clown" and "Monkey" are so different, but something with both elements could be difficult to sort.

      --
      Pro Coffee Drinker
  4. Re:That's Easy by sakdoctor · · Score: 4, Funny

    And yet google don't even convert petabytes to libraries of congress in the google calculator.
    Or perhaps I got the syntax wrong.

  5. Re:That's Easy by sakdoctor · · Score: 4, Funny

    Huh? This isn't the parent post I was trying to reply to.

  6. Its About Time.... by Anonymous Coward · · Score: 2, Funny

    Finaly... A system with enough power to run vista efficiently.

    1. Re:Its About Time.... by peragrin · · Score: 3, Funny

      Not only that the extra processors aren't covered under the EULA and require special extra licenses.

      --
      i thought once I was found, but it was only a dream.
  7. Not impressive... by g0dsp33d · · Score: 4, Funny

    Not a big deal, that's just the data they have on you.

    --
    lol: You see no door there!
  8. Re:tagging by gardyloo · · Score: 5, Funny

    pr0n for Geeks, volume 18: Sorting On-the-Fly

  9. 0s and 1s by johno.ie · · Score: 2, Funny

    That's a lot of computing power to use just to get 4,000,000,000,000 0s and 4,000,000,000,000 1s.

    --
    872835240
  10. nice one, Google... by Tastecicles · · Score: 2, Funny

    ...fancy doing my mp3 collection?

    --
    Operation Guillotine is in effect.
  11. Re:Sort? Sort what? by Dpaladin · · Score: 5, Funny

    Sorting a petabyte sounds pretty impressive, but I don't think it was a whole yotta work.

    --
    Bad puns gave me bad karma. =(
  12. Amazing feat... by Duncan3 · · Score: 5, Funny

    Today from Google, the god of all things and doer of all things good in the universe, many millions of dollars in computer equipment were able to sort lots of things, in about the amount of time you would think it would take for millions of dollars of equipment to sort things.

    In other news, a woodchuck was found chucking wood as fast as a woodchuck could chuck wood.

    Congrats Google, you have a HUGE data set, and an even bigger wallet.

    --
    - Adam L. Beberg - The Cosm Project - http://www.mithral.com/
  13. Re:BCA's? by SEWilco · · Score: 2, Funny

    Can we convert that to number of bad car analogies?

    Sure, it's -4.15 Edsels.

  14. USSR by Anonymous Coward · · Score: 1, Funny

    In Soviet Russia, 6 petabytes sort YOU in ONE hour.