Slashdot Mirror


Data Sorting World Record — 1 Terabyte, 1 Minute

An anonymous reader writes "Computer scientists from the University of California, San Diego have broken the 'terabyte barrier' — and a world record — when they sorted more than a trillion bytes of data in 60 seconds. During this 2010 'Sort Benchmark' competition, a sort of 'World Cup of data sorting,' the UCSD team also tied a world record for fastest data sorting rate, sifting through one trillion data records in 172 minutes — and did so using just a quarter of the computing resources of the other record holder."

129 comments

  1. What is this by blai · · Score: 2, Funny

    I have one record of size one terabyte.

    Sort by size.
    It is already sorted.
    Sort finished. time: 0s! WTH!

    --
    In soviet Russia, God creates you!
  2. Doesn't sound so hard... by Minwee · · Score: 5, Funny

    As long as you use Intelligent Design Sort.

    1. Re:Doesn't sound so hard... by Anonymous Coward · · Score: 0

      Would be pretty simple using quantum bogosort also.

    2. Re:Doesn't sound so hard... by Thanshin · · Score: 1

      It's actually pretty simple.

      My 1TB Archos is sorted alphabetically at this very moment; in ascending and descending order at the same time!

    3. Re:Doesn't sound so hard... by Anonymous Coward · · Score: 0
    4. Re:Doesn't sound so hard... by humdinger70 · · Score: 1

      Not bad for a NCAA Division II Sports school. Go Tritons.

  3. I used to think it was great by MichaelSmith · · Score: 4, Interesting

    I had a 6502 system with BASIC in ROM and a machine code monitor. The idea is to copy a page (256 bytes) from the BASIC ROM to the video card address space. This puts random characters into one quarter of the screen. Then bubble sort the 256 bytes. It took about one second.

    For extra difficulty do it again with the full 1K of video. Thats harder with the 6502 because you have to use vectors in RAM for the addresses. So reads and writes are a two step operation, as is incrementing the address. You have to test for carry. But the result was spectacular.

    1. Re:I used to think it was great by DerekLyons · · Score: 1, Insightful

      The idea is to copy a page (256 bytes) from the BASIC ROM to the video card address space. This puts random characters into one quarter of the screen.

      Well, no. It puts the decidedly non-random contents of the BASIC ROM on the screen.
       

      Then bubble sort the 256 bytes. It took about one second.

      Which is roughly as surprising and unexpected as the Sun coming up in the East in the morning. Since much of (the non random and limited character set) data is repeated, many of the bytes don't have to move very far before being 'sorted'.

    2. Re:I used to think it was great by MichaelSmith · · Score: 1

      Which is roughly as surprising and unexpected as the Sun coming up in the East in the morning.

      When you are 12, lots of things can be fun.

    3. Re:I used to think it was great by thesupraman · · Score: 1

      That must have been in basic yes?

      A 6502, in assembly language, can sort 256 adjacent byte, faster than that.
      Even with a trivial bubble sort.
      they are not fast,

    4. Re:I used to think it was great by Cold+hard+reality · · Score: 2, Informative

      A 256 byte bubble sort requires about 33,000 operations. At 1 MHz, that means every operation (compare + possible swap) took 30 cycles. While somewhat slow, this is definitely much faster than what basic could have achieved.

    5. Re:I used to think it was great by MichaelSmith · · Score: 1

      No, it was in hand assembled machine code. The CPU ran at 1Mhz but I think it was the bubble sort algorithm which made it slow. For each iteration it looped through the full 256 bytes.

    6. Re:I used to think it was great by noidentity · · Score: 1

      Yeah, vectors to RAM addresses, having to use Y for indexing. The cycle count was one greater than normal indexing, pretty good. Wait, what were we talking about again?

    7. Re:I used to think it was great by Anonymous Coward · · Score: 0

      Which is roughly as surprising and unexpected as the Sun coming up in the East in the morning.

      While I see the value of your knowledge, I don't think that's what GP meant.

      I so desperately require a "-1, Killjoy" mod point right now. Sorry. Unless you have something insightful to share that I know not.

      - troll8901 (at work)

    8. Re:I used to think it was great by Anonymous Coward · · Score: 0

      Dunno if it's true on the 6502, but on other architectures access to device memory (like your video framebuffer) is a lot slower than main memory.

    9. Re:I used to think it was great by hey! · · Score: 2, Insightful

      I always preferred Shell Sort to Bubble Sort. It's just as easy to code as Bubble Sort. Although it's not asymptotically better that Bubble sort, it makes fewer exchanges and so tends to run faster in the average case. In my experience, it can be dramatically faster.

      Basically Shell Sort is Insertion Sort, but it starts by comparing keys that are far apart. The heuristic is that in an unsorted array, keys usually have pretty far to go before they are in their final position. The algorithm removes larger amounts of entropy in the early rounds, leading to fewer exchanges in the later iterations.

      That's what I like about Shell Sort. It takes a very simple sort algorithm and improves it with a clever insight into the problem.

      --
      Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
    10. Re:I used to think it was great by Anonymous Coward · · Score: 0

      On the C64, the processor could access video memory at exactly the same speed as normal memory. This because there was no real distinction between video and main memory.

      The videochip and the cpu took turns accessing memory. Ever wondered why you lost your video when using your cassette recorder? That's because the videochip got turned off, so the cpu could access the memory faster (effectively doubling the memory bandwidth). You could also POKE this status yourself, if you needed the pure, raw speed.

      These were the days... ;-)

    11. Re:I used to think it was great by Tyler+Durden · · Score: 2, Insightful

      Hell, just about anything is better than Bubble Sort. I'm not sure why it's even taught. Whenever someone is left to come up with a sorting algorithm of their own for the first time they'll usually re-create Selection Sort, and that's better than Bubble Sort.

      This might surprise you (I know it did me), but Shell Sort can do better than O(n^2) when implemented the correct way, making it asymptotically better than Bubble Sort as well.

      --
      Happy people make bad consumers.
    12. Re:I used to think it was great by Anonymous Coward · · Score: 0

      Good post. You successfully made the parent (but not your gp) look like an anal cockbag. bravo.

      I have lots of nostalgia for the old days of computing too!

    13. Re:I used to think it was great by anthonyfk · · Score: 1

      Agreed!

    14. Re:I used to think it was great by troll8901 · · Score: 1

      When you are 12, lots of things can be fun.

      That is exceedingly mild-mannered and well-mannered. I am full of respect for you. I am also ashamed of my own postings. ;)

  4. Mechanical Turk by Anonymous Coward · · Score: 0

    Data Sorting World Record -- 1 Terabyte, 1 Minute

    The fact that they needed a computer should disqualify them.

  5. 1-byte records? by mike260 · · Score: 2, Insightful

    TFA variously refers to 1 trillion records and 1Tb of data. So each record is 1 byte?
    Doesn't seem like that requires any real computation - you just go through the data maintaining a count of each of the 256 possible values (an embarassingly parallel problem), then write it back out with 256 memsets (likewise trivially parallelisable).

    1. Re:1-byte records? by mike260 · · Score: 4, Informative

      Ah, my mistake - the "trillion data records" refers to a different benchmark. In the 1TB benchmark, records are 100 bytes long.

    2. Re:1-byte records? by topcoder · · Score: 1, Informative

      Doesn't seem like that requires any real computation - you just go through the data maintaining a count of each of the 256 possible values (an embarassingly parallel problem), then write it back out with 256 memsets (likewise trivially parallelisable).

      That technique is called bucked sort BTW.

    3. Re:1-byte records? by topcoder · · Score: 2, Informative

      Sorry, i commited a mistake too, the right name is Counting Sort (http://en.wikipedia.org/wiki/Counting_sort).

    4. Re:1-byte records? by cffrost · · Score: 1

      In the 1TB benchmark, records are 100 bytes long.

      "100 bytes?" Why this arbitrary, ugly, sorry-ass excuse of a number? An elegant, round number like 64, 128... hell, even 96 would have been a sensible and far superior choice.

      --
      Thank you, Edward Snowden.

      "Arguments from authority are worthless." —Carl Sagan
    5. Re:1-byte records? by Anonymous Coward · · Score: 0

      96 elegant, round and sensible number? Have you been designing ATM networks by any chance?

    6. Re:1-byte records? by jesset77 · · Score: 1

      "100 bytes?" Why this arbitrary, ugly, sorry-ass excuse of a number? An elegant, round number like 64, 128... hell, even 96 would have been a sensible and far superior choice.

      Because those are only round(ish) numbers in binary. Might make sense if you were sorting 1TiB of data, but they were not. Per TFA:

      one terabyte of data (1,000 gigabytes or 1 million megabytes)

      This clarifies that they were working with the SI prefix "Tera", meaning that powers of ten much more evenly divide. To whit, 10^2 byte records in a 10^12 byte dataset = exactly 10^10 (ten billion) specific records. 64 or 128 byte records would divide evenly into a non-round number of records, 96 byte records would not divide evenly.

      This of course also assumes precisely 1TB of data, TFA suggests "more than" 1TB, so your guess is as good as mine. It might even be 1TiB, which is about 10% more.

      --
      People willing to trade their freedom of expression for temporary entertainment deserve neither and will lose both.
    7. Re:1-byte records? by Anonymous Coward · · Score: 0

      Ah, yes, the confusion created by the 'new and improved' SI prefix, which makes it so you can never tell what someone is talking about since you can never be sure if they're aware that the meaning of KB, MB, etc. was 'changed'.

      Retarded in so many ways.

    8. Re:1-byte records? by hawkfish · · Score: 1

      In the 1TB benchmark, records are 100 bytes long.

      "100 bytes?" Why this arbitrary, ugly, sorry-ass excuse of a number? An elegant, round number like 64, 128... hell, even 96 would have been a sensible and far superior choice.

      Because then you can't use all the elegant tricks involving cache lines that you were contemplating?

      A prime number might have been even better (100 is divisible by 4).

      --
      You will not drink with us, but you would taste our steel? - Walter Matthau, The Pirates
  6. Great to see sorting research advance by SuperKendall · · Score: 3, Insightful

    You almost think at this point that with super fast systems attention to algorithmic complexity hardly matters, it's good to see research still advancing and that there are areas where carefully designed algorithms make a huge difference. I look forward to seeing how they fare against the Daytona set.

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
    1. Re:Great to see sorting research advance by Anonymous Coward · · Score: 2, Interesting

      I work in the OLAP realm. Trust me, it matters. Being able to run an adhoc query across terabytes of data with near real-time results is the holy grail of what we do. The industry has known for a while that parallel computing is the way to go, but only recently has the technology become cheap enough to consider deploying on a large scale. (Though Oracle will still happily take millions from you for Exadata if you want the expensive solution.)

    2. Re:Great to see sorting research advance by Anonymous Coward · · Score: 0

      ... I look forward to seeing how they fare against the Daytona set.

      I don't think you need to allow for any great intellectual surprises from people who enjoy turning left for five hours while going 180 mph and chugging beer.

      8)

  7. World Cup of data sorting by the_other_one · · Score: 3, Funny

    How many parallel predicting octopuses were required to predict their victory?

    --
    134340: I am not a number. I am a free planet!
    1. Re:World Cup of data sorting by Anonymous Coward · · Score: 0

      According to Holt you only need one.

    2. Re:World Cup of data sorting by Thanshin · · Score: 2, Funny

      How many parallel predicting octopuses were required to predict their victory?

      Infinite, but one of them wrote some pretty good stories about a ghost.

    3. Re:World Cup of data sorting by pinkushun · · Score: 1

      f = n / x

      where n is the number of records, and x the number of tentacles per octopus.

      So 1000 records only need 125 parallel 8-legged octopi. This is the optimal amount, adding any more will invoke Brooks's Law

  8. Only 52 nodes by Gr8Apes · · Score: 4, Interesting

    You've got to be kidding me. Each node was only 2 quad core processors, with 16 500GB drives (big potential disk IO per node) but this system doesn't even begin to scratch the very bottom of the top 500 list.

    I just can't image that if even the bottom rung of the top 500 was even slightly interested in this record, that they wouldn't blow this team out of the water.

    --
    The cesspool just got a check and balance.
    1. Re:Only 52 nodes by rm999 · · Score: 3, Informative

      I don't know much about them, but perhaps most supercomputers break this rule: "The hardware used should be commercially available (off-the-shelf), and unmodified (e.g. no processor over or under clocking)"

    2. Re:Only 52 nodes by Anonymous Coward · · Score: 0

      Yea I think there is just a lack of interest in doing it. Many of the top 500 can easily do the entire sort in memory, but I bet a lot of people would be pissed off to see that people are using research supercomputers to sort numbers :)

      Besides, Google reported a 68 second terasort mark (http://googleblog.blogspot.com/2008/11/sorting-1pb-with-mapreduce.html) 2 years ago, and I'm positive they can easily beat that number today.

    3. Re:Only 52 nodes by afidel · · Score: 2, Informative

      Many of the systems on the TOP500 list are simple COTS clusters with perhaps an interesting interconnect. Remember the Appleseed cluster that was ranked in the top10 a few years ago, it was nothing more than a bunch of Apple desktops with an interesting gigabit mesh network. Regardless they hardly used optimal hardware, hex core Xeon's with 96GB's of ram per node and a better I/O subsystem could have crushed this result.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    4. Re:Only 52 nodes by straponego · · Score: 4, Informative

      No, most Top500 machines are composed of commercially available, unmodified parts. More than half use GigE interconnects, though Infiniband has nearly caught up. I'm not sure if you'd count Infiniband as COTS-- at least a couple of years ago, it was fairly involved to configure, and it's not cheap. But anybody who has the money can buy it.

    5. Re:Only 52 nodes by Anonymous Coward · · Score: 3, Insightful

      You say that in a way where someone could misinterpret the interconnect as not being a big deal. Poor interconnects are why EC2 wasn't useful for HPC (their new option sounds somewhat better, but is limited to 16 nodes, IIRC) and why iPod Touches or similar devices have very few possible applications for clustering. If good interconnects weren't important, then clustering portable systems would be inevitable -- e.g., if you wanted more computing power for your laptop, you could just plug in your phone or mp3 player or iPad.

    6. Re:Only 52 nodes by Anonymous Coward · · Score: 0

      Perhaps, but the quote "If you are idling your processors or not using all your RAM, you’re burning energy and losing efficiency." could be the reason they chose this paticular configuration, and more cores or RAM would just mean more idling and less power efficiency.

    7. Re:Only 52 nodes by antifoidulus · · Score: 2, Insightful

      Doubtful. The biggest bottlenecks for contests like this are I/O, network, and memory latency. While faster CPUs are always welcome increases in CPU speed rarely make a HUGE differences, esp. when you are talking about the relatively small gains you get from overclocking. Considering the drawbacks of overclocking, namely increased heat production and decreased reliability, it doesn't really seem all that compelling for supercomputer operators to employ it.

    8. Re:Only 52 nodes by Anonymous Coward · · Score: 0

      You've got to be kidding me. Each node was only 2 quad core processors, with 16 500GB drives (big potential disk IO per node) but this system doesn't even begin to scratch the very bottom of the top 500 list.

      I just can't image that if even the bottom rung of the top 500 was even slightly interested in this record, that they wouldn't blow this team out of the water.

      And that's why calling a cluster of separate computers a single "supercomputer" is highly misleading.

      Lots of little computers can solve lots of separate little jobs quickly. So if your problem can be broken up into lots of little chunks where the results of each chunk are independent of the results of other chunks, clusters are great.

      Sorting data is not such a problem.

    9. Re:Only 52 nodes by Anonymous Coward · · Score: 0

      Yes it is. With an algorithm such as merge sort and n processors you can easily divide your problem in n little chunks and sort them separately, then you will only use less than all your processors at the end when you merge all the little chunks.

      With some creativity other sorting algorithms can be done in parallel too. Take quick sort, after the first iteration you have two lists that can be sorted separately and they won't even need to be merged when you're done.

    10. Re:Only 52 nodes by Anonymous Coward · · Score: 0

      Yes it is. With an algorithm such as merge sort and n processors you can easily divide your problem in n little chunks and sort them separately, then you will only use less than all your processors at the end when you merge all the little chunks.

      With some creativity other sorting algorithms can be done in parallel too. Take quick sort, after the first iteration you have two lists that can be sorted separately and they won't even need to be merged when you're done.

      Thank you for proving my point.

    11. Re:Only 52 nodes by talldean · · Score: 1

      This would be a lot more interesting if they all ran on the same hardware?

    12. Re:Only 52 nodes by arth1 · · Score: 2, Insightful

      Doubtful. The biggest bottlenecks for contests like this are I/O, network, and memory latency.

      No, the biggest bottleneck is the algorithm. Which seldom is machine specific (there are exceptions, of course), and thus what's fastest on these low-cost machines will also contribute to higher speeds on the really hot irons, once the best algorithms have been adopted.

      Quite frankly, I would like to see even lower max specs for competitions like this, which would allow competitors who can't afford the equipment to participate and contribute. Even if they don't win, there may be parts of their algorithms that are genuinely useful.

      And it's also on the slowest hardware that you need the optimizations and best algorithms the most, making it more likely that Good Stuff will originate there. I can see plenty of applications for data mining and data sorting algorithms for cell phones, for example. Yellow Pages on a microSD card? Ordered by zip code or company name? No problem, just wait 10 minutes while it sorts!

  9. Amendments to the Geek Heirarchy by Lord_of_the_nerf · · Score: 5, Funny

    LARPers > Fan-fiction writers > Professional Data Sorting Competitors > Furries

    1. Re:Amendments to the Geek Heirarchy by Anonymous Coward · · Score: 0

      Adding...

      Solar racers > ... > Filking

    2. Re:Amendments to the Geek Heirarchy by MobileTatsu-NJG · · Score: 2, Funny

      LARPers > Fan-fiction writers > Professional Data Sorting Competitors > Furries

      Now sort by income level!

      --

      "I like to lick butts!" by MobileTatsu-NJG (#32700246) (Score:5, Informative)

    3. Re:Amendments to the Geek Heirarchy by jesset77 · · Score: 1

      LARPers > Fan-fiction writers > Professional Data Sorting Competitors > Furries

      w8, you left out that a good portion of LARPers and a resounding majority of Fan-fiction writers are Furries to begin with.

      So, are we just randomly bashing on imagination-have, or? ;D

      --
      People willing to trade their freedom of expression for temporary entertainment deserve neither and will lose both.
    4. Re:Amendments to the Geek Heirarchy by Anonymous Coward · · Score: 0

      So, are we just randomly bashing on imagination-have, or? ;D

      Of course we are.

      And I've got to say, for people who've often been at the receiving end of bullying and harassment from the jocks, we're surprisingly eager to find people to look down on and belittle. :(

    5. Re:Amendments to the Geek Heirarchy by f3rret · · Score: 1

      LARPers > Fan-fiction writers > Professional Data Sorting Competitors > Furries

      What about a LARPer who is also a furry and enjoys writing fan-fiction and professionally sorting data in competitions?

      --
      Admit nothing. Deny Everything. Make Counter-accusations.
    6. Re:Amendments to the Geek Heirarchy by Anonymous Coward · · Score: 0

      Gets a Darwin Award since it's outside of the gene pool?

    7. Re:Amendments to the Geek Heirarchy by pete_norm · · Score: 1
    8. Re:Amendments to the Geek Heirarchy by Anonymous Coward · · Score: 0

      Least common denominator rule. A LARPing fan-fiction writing, Data Sorting Furry is a Furry. Therefore an object of scorn even to their brethren in the higher levels of the Untouchable caste.

    9. Re:Amendments to the Geek Heirarchy by Monkeedude1212 · · Score: 1

      SQL ERROR -30: Table or View not found

  10. The most important part by Anonymous Coward · · Score: 0

    The most important part of any system, is its Intelligent Qube.

  11. Isn't this a little slow? by Anonymous Coward · · Score: 0

    I'm pretty sure that modern analytical DW systems like Greenplum or Teradata on the same number of nodes/RAM/disk/commodity servers could do this faster?

  12. good choice by Main+Gauche · · Score: 1

    "a sort of 'World Cup of data sorting"

    It ended in a 0-0 tie?

    1. Re:good choice by straponego · · Score: 1

      Actually, they sorted 100 trillion bits, then did a uniq. So it was a 0-1 tie.

    2. Re:good choice by Anonymous Coward · · Score: 0

      "a sort of 'World Cup of data sorting"

      It ended in a 0-0 tie?

      Yep.

      the UCSD team also tied a world record for fastest data sorting rate

    3. Re:good choice by draconx · · Score: 4, Insightful

      No, a sort of 'world cup of data sorting' ends in 'cup data of sorting world'.

    4. Re:good choice by Anonymous Coward · · Score: 0

      No it was 2-1: two sorts, one cup.

    5. Re:good choice by bugs2squash · · Score: 1

      or...
      >>> x='world cup of data sorting'
      >>> l = list(x)
      >>> l.sort()
      >>> l
      [' ', ' ', ' ', ' ', 'a', 'a', 'c', 'd', 'd', 'f', 'g', 'i', 'l', 'n', 'o', 'o', 'o', 'p', 'r', 'r', 's', 't', 't', 'u', 'w']

      That did not take a minute, including typing it in.

      --
      Nullius in verba
  13. Been doing this for years... by HockeyPuck · · Score: 3, Funny

    Why is sorting 1TB so hard?

    I can just instruct my tape library to put the tapes in the library in alphabetical order in the slots... Y'know AA0000 AA0001 AA0002... moves a hell of a lot more than 1TB.

    1. Re:Been doing this for years... by Carthag · · Score: 1, Informative

      Why is sorting 1TB so hard?

      I can just instruct my tape library to put the tapes in the library in alphabetical order in the slots... Y'know AA0000 AA0001 AA0002... moves a hell of a lot more than 1TB.

      That's not sorting 1TB. That's sorting n records where n = the number of tapes.

    2. Re:Been doing this for years... by cc1984_ · · Score: 2, Funny

      That WHOOOOOOSH noise is the sound of a backup tape flying over your head.

    3. Re:Been doing this for years... by thermopile · · Score: 1

      Why does this remind me of cup stacking?

      --

      "Diplomacy is something you do until you find a rock." --Richard Pound

    4. Re:Been doing this for years... by Anonymous Coward · · Score: 0

      Whoosh!

  14. Library of Congress by Anonymous Coward · · Score: 0

    Is it sorted?

  15. Best case by nikanth · · Score: 1

    Did they sort the best case input and claim record? Or is it the average or worst case? Or the algo's best case time == worst case time?

  16. 1 Trillion Records Sorted by amirulbahr · · Score: 4, Informative

    When I read the summary I thought what's the big deal if the 1 TB of data only contained two records 0.5 TB each. Then I saw that kdawson wrote the summary. So I browsed over here and saw that the impressive thing is that they sorted 1,000,000,000,000 records of 100 bytes each with 10 byte keys.

  17. Anything is faster than Hadoop by Tassach · · Score: 1

    Considering that the previous terabyte sort record was held by Hadoop, it's not surprising that another system can do the same job in a fraction of the time with a fraction of the resources.

    Hadoop a useful framework, but it's a horribly inefficient resource hog. Who was the f*ing genius who thought it was a good idea to write a high performance computing application in Java?!?

    --
    Why is it that the proponents of "one nation under God" are so eager to get rid of "liberty and justice for all"?
    1. Re:Anything is faster than Hadoop by Trieuvan · · Score: 1

      Java was slow 15 years ago , not now !

    2. Re:Anything is faster than Hadoop by Anonymous Coward · · Score: 0

      It is still a resource hog. Try using millions of small objects.
      "Don't do that," you say!
      I don't, thank you. Emulating structs with huge arrays gets around that, clumsy as it is. Feels like C, almost as fast, too.

    3. Re:Anything is faster than Hadoop by Anonymous Coward · · Score: 0

      Java is still slow, it's just that now the hardware is fast enough that you don't notice it so much anymore.

    4. Re:Anything is faster than Hadoop by flanders123 · · Score: 1

      Who was the f*ing genius who thought it was a good idea to write a high performance computing application in Java?!?"

      My Boss: "I Agree. I personally would have copy-pasted the records into this slick little Excel sheet i made, and clicked the "sort A-Z" button. Done and done. This programming stuff is easy!"

    5. Re:Anything is faster than Hadoop by Anonymous Coward · · Score: 0

      Who was the f*ing genius who thought it was a good idea to write a high performance computing application in Java?!?

      I've read some stupid crap on slashdot before but this gets up there is Extreme stupidity.

      Source please on the main reason Hadoop is slow is due to java.

    6. Re:Anything is faster than Hadoop by doti · · Score: 1

      Care to explain how the language changed to became faster?
      It may run faster now (better hardware, duh), but comparatively it's still slower.

      --
      factor 966971: 966971
    7. Re:Anything is faster than Hadoop by Trieuvan · · Score: 1

      It's not about the language changed, the JVM is getting better . Depends on what you try to compare , Java and C++ are comparable http://blog.cfelde.com/2010/06/c-vs-java-performance .

  18. One other area... by SuperKendall · · Score: 2, Interesting

    Come to think of it, one area where it also matters currently is in mobile development. If you aren't considering memory or processor usage you can quickly lead yourself into some really bad performance, thinking hard about how to make use of what little you have really matters in that space too.

    So only desktop or smallish backend development can generally remain unconcerned these days with algorithmic performance...

    I had to work with large datasets in my previous life as a backend IT guy, but nothing at the levels you are talking about. Even then I thought carefully about how any give approach would affect performance.

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
    1. Re:One other area... by raddan · · Score: 1

      Algorithmic efficiency also matters in the mobile space as designers get closer to building low-power systems that harvest energy directly from their environments. Just yesterday I was speaking with a researcher who had worked on animal tracking devices (e.g., for turtles) that collected power from small solar arrays. Any efficiencies that they could take advantage of, they used, from smart scheduling of their GPS units (don't madly run the unit if you don't have the power) to a delay-tolerant networking protocol (only phone home when you have the resources) to a programming language that informed the operating system which computations were power-hungry and should be performed later.

    2. Re:One other area... by Jogar+the+Barbarian · · Score: 2, Funny

      Was the programming language LOGO?

      --
      3. Profit!
      2. ???
      1. On Soviet Slashdot, a Beowulf cluster of alien Natalie Portman overlords welcomes YOU!
  19. Important Details.... by pdxp · · Score: 5, Interesting
    ... to make it clear that you won't be doing it (TritonSort, thanks for leaving that out kdawson) on your desktop at home:
    • 10Gbps Networking
    • 52 servers x 8 cores each = 416 CPUs
    • 24 GB RAM per server = 1,248 GB
    • ext4 filesystem on each, presumably with hardware raid

    I think this is cool, but.... how fast is it in a more practical situation?

    source

  20. Pretty strange contest... by Anonymous Coward · · Score: 0

    Sorting (like most other algorithms) can be optimized for run-time or memory consumption. Assuming a box with infinite memory the sorting algorithm itself doesn't take any significant run-time. Considering the low amount of memory of the test systems compared to the data to be sorted a quite significant portion of the time required is disk-IO. So the main challenge here seems to be to minimize disk-access - and to put the fastest disks available into the test system. Why aren't they simply using SSDs, this would likely speed things up quite a lot...

  21. 52 nodes? So many for 1 TB worth of data? by c0lo · · Score: 1

    when they sorted more than one terabyte of data (1,000 gigabytes or 1 million megabytes) in just 60 seconds.

    Each node is a commodity server with two quad-core processors, 24 gigabytes (GB) memory and sixteen 500 GB disks

    Ok, let's do the math. 52 computers X 24 GB RAM each = 1248 GB. Letting aside the RAM taken by OS, that's roughly 1 TB of RAM to dedicate for the sorting.
    Le'me guess: the main difficulty was to partition the data and perform a merging of the in-memory sorted partitions?

    (of course I'm posting before reading the details of the method they used, it is /. after all. I guess I even merit a price for the weirdness of RTFA article before posting)

    --
    Questions raise, answers kill. Raise questions to stay alive.
  22. What makes it a barrier? by Patch86 · · Score: 5, Insightful

    Impressive and all, but I take umbrage at calling it a "1 TB barrier". Is it disproportionately more difficult than sorting 0.99 TB?

    Breaking "the sound barrier" was hard because of the inherent difficulty of going faster that sound in an atmosphere (sonic booms and whatnot). It was harder than simply travelling that fast would have been.

    If this is just further evolution of sorting speed, it makes it a milestone, not a barrier.

    1. Re:What makes it a barrier? by Anonymous Coward · · Score: 0

      Absolutely nothing makes it a barrier. It's just a neat round number.

    2. Re:What makes it a barrier? by maxwell+demon · · Score: 5, Funny

      Impressive and all, but I take umbrage at calling it a "1 TB barrier". Is it disproportionately more difficult than sorting 0.99 TB?

      At 1TB the attraction of the 1-bits gets so large that if you are not careful, your data collapses into a black hole.

      --
      The Tao of math: The numbers you can count are not the real numbers.
    3. Re:What makes it a barrier? by wvmarle · · Score: 1

      It is psychological. It adds another digit to the size of the number.

      For the rest it is as much of a barrier as "the stock market has broken through the 20,000 point barrier", that idea.

      Breaking the light speed barrier now that would be an achievement.

    4. Re:What makes it a barrier? by Sockatume · · Score: 1

      kdawson thinks in an esoteric language where numbers go "1, 2, 3... 999,999,999, many."

      --
      No kidding!!! What do you say at this point?
    5. Re:What makes it a barrier? by flanktwo · · Score: 1

      People were obviously scared of a whole terror byte.

    6. Re:What makes it a barrier? by Wannabe+Code+Monkey · · Score: 1

      Impressive and all, but I take umbrage at calling it a "1 TB barrier". Is it disproportionately more difficult than sorting 0.99 TB?

      Breaking "the sound barrier" was hard because of the inherent difficulty of going faster that sound in an atmosphere (sonic booms and whatnot). It was harder than simply travelling that fast would have been.

      If this is just further evolution of sorting speed, it makes it a milestone, not a barrier.

      Just think of it like the 4 minute mile. Nothing particularly sacred about running 1.61 kilometers in 240 seconds, but still seen as an important barrier to be crossed. I think sometimes there are coincidental points where round numbers and feats yet to be achieved exist, and I don't have a problem seeing them as barriers. Maybe they're more mental than real though.

      --
      We always knew Comcast was corrupt, here's the proof: http://tech.slashdot.org/comments.pl?sid=1909890&cid=34545432
  23. Re:How to block KDawson articles? by MobileTatsu-NJG · · Score: 0, Offtopic

    Anyway to filter out articles by KDawson without logging in?

    Yes but you have to install Flash.

    --

    "I like to lick butts!" by MobileTatsu-NJG (#32700246) (Score:5, Informative)

  24. Truly Impressive! by hyades1 · · Score: 1

    I have to admit that I'm genuinely impressed. This rate of data processing is almost as efficient as my girlfriend when she hears, evaluates and demolishes my alibis.

    --
    I've calculated my velocity with such exquisite precision that I have no idea where I am.
    1. Re:Truly Impressive! by maxwell+demon · · Score: 1

      Are you sure that demolishing your alibis is as hard as sorting 1TB of numbers?

      --
      The Tao of math: The numbers you can count are not the real numbers.
    2. Re:Truly Impressive! by hyades1 · · Score: 1

      Yes, I am. She is able to parallel process and arrive at what she deems an appropriate response simultaneously. That response frequently involves numbers...time, usually, if you catch my drift.

      --
      I've calculated my velocity with such exquisite precision that I have no idea where I am.
    3. Re:Truly Impressive! by dtph · · Score: 1

      In other words, your girlfriend is a mentat.

    4. Re:Truly Impressive! by hyades1 · · Score: 1

      Exactly! And schooled in kanly, as well.

      --
      I've calculated my velocity with such exquisite precision that I have no idea where I am.
    5. Re:Truly Impressive! by Anonymous Coward · · Score: 0

      So you are basically saying you are an idiot who lies to his girlfriend?

      Try honesty hotshot, it gets you a whole lot further. It's also less stressful than trying to manage a web of lies (Something you obviously can't do properly anyways)

  25. Pretty close to theoretical max by melted · · Score: 3, Interesting

    Let's consider 100TB in 172 minute thing they also did. 52 nodes, 16 spindles per node is 832 spindles total and 120GB of data per spindle. 120GB of data can be read in 20 minutes and transfered in another 15 to the target spindles (assuming uniform distribution of keys). You can then break it down into 2GB chunks locally (again by key) as you reduce. Then you spend another hour and a half reading individual chunks, sorting them in memory, concatenating and writing.

    Of course this only works well if the keys are uniformly distributed (which they often are) and if data is already on the spindles (which it often isn't).

  26. The results are in... by DeanLearner · · Score: 1

    In third place, with a time of 01:02.305 is Sort United.

    In second place, with a time of 00:58.112 is Team Sorted.

    And in first place, with a time of 00:59:836 is UCSD!

    1. Re:The results are in... by Terrasque · · Score: 1

      In third place, with a time of 01:02.305 is Sort United.

      In second place, with a time of 00:58.112 is Team Sorted.

      And in first place, with a time of 00:59:836 is UCSD! .. which also designed the winner sorting algorithm

      Fix'd

      --
      It's The Golden Rule: "He who has the gold makes the rules."
  27. Hah! We'll find you now, ET! by ibsteve2u · · Score: 1

    Assuming UCSD gives the algorithm to SETI, anyway, and doesn't instead go to Wall Street to be used in detecting market trends and major share movements as part of their never-ending quest to separate the small investor from their money faster.

    --
    Orwell: "In a Time of Universal Deceit, telling the Truth is a Revolutionary Act"
  28. Re:Hah! We'll find you now, ET! by Anonymous Coward · · Score: 0

    This isn't about a new algorithm, really, bit about reducing the constants in existing algorithms. As far as I know, sorting is still O(n log n), and it is reasonable to believe it always will be.

  29. Oh brother.. by jackd · · Score: 2, Funny

    Wow, I realize I am seriously a nerd, getting genuinely excited about someone sorting large amounts of data. Please don't tell my girlfriend, she'll realize I've been faking being a semi-normal cool guy for years, and leave me.

    1. Re:Oh brother.. by Dunbal · · Score: 3, Funny

      Please don't tell my girlfriend, she'll realize I've been faking being a semi-normal cool guy for years, and leave me.

            Nah, she'll realize that you're more turned on by a new processor than by a pair of tits, and stay with you forever knowing you'll never cheat on her (except when you go to that site to download algorithms). You might have to give in and move out of the basement though. But not to worry - a good set of blackout curtains and it's almost the same. Plus as you get older the lack of dampness is easier on your rheumatism...

      --
      Seven puppies were harmed during the making of this post.
  30. Re:How to block KDawson articles? by Yacoob+Al-Atawi · · Score: 2, Funny

    Just put a fucking bullet in your head and you'll never have to see another kdawson article again.

    Don't do it while reading one of his articles or it will be imprinted in your eyes forever.

  31. when are we going to have access to this by hesaigo999ca · · Score: 1

    when are we going to have access to this technology? It is always nice to hear new breakthroughs, but governments always control access to these new technologies, and make it 5 to 10 years before we see any of it, while somewhere like japan, you already have it available...

  32. ((Triton)|(Gray)|(Minute))Sort by Lord+Grey · · Score: 4, Informative

    A paper describing the test is here. TritonSort is the abstract method; GraySort and MinuteSort are concrete/benchmarked variations of TritonSort.

    As TFA states, this is more about balancing system resources than anything else. The actual sort method used was "a variant of radix sort" (page two of the linked PDF). Everything else was an exercise in how to break up the data into manageable chunks (850MB), distribute the chunks to workers, then merge the results. That was the real breakthrough, I think. But I wonder how much is truly a breakthrough and how much was just taking advantage of modern hardware, since one of the major changes in MinuteSort is simply avoiding a couple of disk writes by keeping everything in RAM (a feat possible because that much RAM is readily available to a single worker).

    Regardless, this kind of work is very cool. If web framework developers (for example) paid greater attention to balancing data flow and work performed, we could probably get away with half the hardware footprint in our data centers. As it stands now, too many COTS -- and hence, often-used -- frameworks are out of balance. They read too much data for the task, don't allow processing until it's all read in, etc.. This causes implementors to add hardware to scale up.

    --
    // Beyond Here Lie Dragons
    1. Re:((Triton)|(Gray)|(Minute))Sort by Anonymous Coward · · Score: 0

      Everything else was an exercise in how to break up the data into manageable chunks (850MB), distribute the chunks to workers, then merge the results.

      This is the most difficult part of an MPI-type application, especially minimizing communication costs between nodes (which aren't small).

  33. Trivial by Anonymous Coward · · Score: 0

    Hell, I can sort a terabyte of data in less than a second. As long as the records are a terabyte long.

  34. NSA has orgasm by Anonymous Coward · · Score: 0

    upon hearing that news...
    and so did
    credit companies
    banks
    and other overlords ....

  35. Google Cache of paper describing their sort algo by Nimey · · Score: 1
    --
    Hail Eris, full of mischief...

    E pluribus sanguinem
  36. Big-Oh by WilyCoder · · Score: 1

    Who cares about the time it took?

    Show me the big-oh notation for the algorithm or I am simply not interested.

    1. Re:Big-Oh by JustNiz · · Score: 1

      totally agree.
      The unspoken implication here is that they have invented a great search algorithm, however by just giving timings of bytes processed per second, it could also just mean thay have a bubble sort running on some big iron.

  37. How efficient is it really? by Anonymous Coward · · Score: 0

    I bet they mandated Java, used a prototype algorithm or generic library, assigned a tiny slice of a shared virtual server, jacked the virtual memory, and rushed it to production, then threatened their employees until they made it execute in under a minute.

  38. misleading summary: 1 trillion records not bytes by davidwr · · Score: 1

    It's not the number of bytes that's important, it's the number of records and the size of the sort keys.

    I can take two records of 0.5TB each and sort them quickly - very quickly if their sort keys are short.

    The summary and article headline said "a terabyte" but the article says "They sorted one trillion data records." This is impressive with today's hardware. It will be child's play with hardware from "the day after tomorrow."

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  39. dining philospher chopstick sort by epine · · Score: 2, Insightful

    Hell, just about anything is better than Bubble Sort. I'm not sure why it's even taught.

    I once read that bubble sort is the optimal sort on a Turing machine. Too bad it wasn't named tapeworm sort, then every student would understand immediately. It would be analyzed alongside dining-philosopher chopstick sort, where the dining philosopher won't each (finicky like Tesla) unless holding a matched pair, with the option of swapping hands if holding a pair of sticks before release.

    If the purpose is to teach analysis of algorithms under different computational assumptions, there's nothing wrong with teaching bubble sort. When it gains rank in the panoply of commercially viable sorts, that's where the problems begin. But since when did academia much care about A) carefully informing you about the importance of the material they were teaching you, or B) the grim realities of commercial applicability.

    Extreme adherence to practicality made C++ what it is today. All software for the F22 and F35 are written in C++. Not Haskell or LISP. The beautiful languages are the languages best suited to reject design iteration on practicality. If it's not very practical to begin with, why mess up a good thing? C++ never had that problem.

    What my university failed to teach properly was radix-sort, which is both practical and of theoretical interest. If the time devoted to bubble sort had instead been devoted to radix sort, the astute reader would know that record length is a critical parameter in this benchmark protocol.

    Hell, give me a one-bit record length and some fast disk drives, I could sort a terabyte at the same speed as UCal using a single loop of RS485 multidrop instead of that overpriced Cisco kit.

    Don't bother reading TFA. It does not report the sort protocol record length.

  40. Lotta space by toxique · · Score: 0

    "...100 terabytes of data is roughly equivalent to 4,000 single-layer Blu-Ray discs, 21,000 single-layer DVDs, 12,000 dual-layer DVDs or 142,248 CDs ..." that is the space my pr0n takes up

    --
    - This can't be... - Be what? Be real?
  41. Too easy by existentialentropist · · Score: 1

    abeertty There. 15 seconds.