Slashdot Mirror


IBM Designing Superman Servers For World's Largest Telescope

Nerval's Lobster writes "How's this for a daunting task? By 2017, IBM must develop low-power microservers that can handle 10 times the traffic of today's Internet — and resist blowing desert sands, to boot. Sound impossible? Hopefully not. Those are the design parameters of the Square Kilometer Array (SKA) Project, the world's largest radio telescope, located in South Africa and Australia amid some of the world's most rugged terrain. It will be up to the SKA-specific business unit of South Africa's National Research Foundation, IBM, and ASTON (also known as the Netherlands Institute for Radio Astronomy) to jointly design the servers. Scientists from all three organizations will collaborate remotely and at the newly established ASTRON & IBM Center for Exascale Technology in Drenthe, the Netherlands. By peering into the furthest regions of space, the SKA project hopes to glimpse 'back in time,' where the radio waves from some of the earliest moments of the universe — before stars were formed — are still detectable. The hardware is powerful enough to pick up an airport radar on a planet 50 light-years away, according to the SKA team."

38 of 67 comments (clear)

  1. 26 petabytes? by bloodhawk · · Score: 3, Insightful

    Is internet traffic really only 26 Petabytes a month, while that is a big number it sounds awefully low to me as the place I work does 15 Terabytes a month and they are little more than a miniscule pimple on face of the internet.

    1. Re:26 petabytes? by iYk6 · · Score: 1

      Agreed. There are roughly 100 million internet enabled households in the United States. If each of these sent and received, on average, 1GB per month, that's 100 PB.

    2. Re:26 petabytes? by bloodhawk · · Score: 1

      I would seriously doubt where I work even ranks in the top 10,000 internet content providers by volume, yet the article would suggest they would have to be one of the top providers for there numbers to be true.

    3. Re:26 petabytes? by c0lo · · Score: 1

      that is a big number it sounds awefully low to me

      Well, it is actually low. E.g. the entire cloud fits a single server on a cable modem (true, with lots of caching). You ask for citations? Here you go

      --
      Questions raise, answers kill. Raise questions to stay alive.
    4. Re:26 petabytes? by scheme · · Score: 1

      Is internet traffic really only 26 Petabytes a month, while that is a big number it sounds awefully low to me as the place I work does 15 Terabytes a month and they are little more than a miniscule pimple on face of the internet.

      That's just wrong. Open Science Grid transfers about 1.4PB a day and I seriously doubt OSG uses a significant fraction of the bandwidth on the net.

      --
      "When you sit with a nice girl for two hours, it seems like two minutes. When you sit on a hot stove for two minutes, it
  2. Planets 50 Light years away have airports? by Paleolibertarian · · Score: 1

    Wow! How is this possible given that the intensity of radio waves diminish at a factor equal to the square of the distance? That's some powerful radar or a darned big capture area of the antenna here on earth. How is it distinguishable from CBR.

    1. Re:Planets 50 Light years away have airports? by stanlyb · · Score: 2

      Where? On the earth, in the empty space?

    2. Re:Planets 50 Light years away have airports? by Paleolibertarian · · Score: 1

      A lot of great math there but it completely misses the point. A point source radiator, at that distance couldn't be distinguished from the radiation coming from the planets star. I don't care how much processing power you put on the various correlated antenna. The capture area BTW is not the longest baseline between array collectors. That would be the resolving power meaning the beam-width of the antenna array at 50 light years which would be far greater than the distance between the planet and its star. Capture area OTOH would be the sum of all the antennas dish or aperture area. Therefore the planet or any radiator on it's surface would be completely lost in the noise of the star not to mention the noise generated by the system itself. It isn't necessary for me to point out the noise created by the intervening 50 light years of sources which would also be captured by the array.Only with an antenna with a beam width infinitely small would the system be able to resolve the radar on the planet and only when the antenna was pointed directly at the radar source which is so impractical as to be impossible. I studied antennas when I was in high-school 45 years ago when I was in electronics class. It's called signal to noise ratio and believe me there's a lot of noise. Just on logic alone I have proven the absurdity of the statement. You can do the math yourself since you seem to be so good at it.

    3. Re:Planets 50 Light years away have airports? by Anonymous Coward · · Score: 1

      You *do* know we can detect signals well below the noise floor, yes?

    4. Re:Planets 50 Light years away have airports? by flyingfsck · · Score: 1

      ...and the SKA low noise receivers are super cooled.

      --
      Excuse me, but please get off my Pennisetum Clandestinum, eh!
    5. Re:Planets 50 Light years away have airports? by tibit · · Score: 1

      This should be +5 informative. Well done.

      --
      A successful API design takes a mixture of software design and pedagogy.
    6. Re:Planets 50 Light years away have airports? by Paleolibertarian · · Score: 1

      The sensitivity and thermal noise of a receiver has absolutely nothing to do with the fact that a mythical airport radar on a planet orbiting a star at 50 light years distance, that cannot be resolved as a separate source by any known optical or synthetic aperture will be completely overwhelmed by the output of the planet's star. A wildly inaccurate (because I'm to lazy to do the math) analogy would be comparing the output of a microwave oven with the output of 1 billion blast furnaces. While the EM spectrum of the blast furnaces would not be in the gigahertz range there is still some present. Likewise with a star there is EM "noise" that extends well down into the gigahertz range so having a highly selective filter won't help.

      The antenna will essentially "see" a blob of radiation coming from the star and while the airport radar would be in there somewhere it couldn't be picked out at all.

      OK so lets assume for the sake of argument that we're NOT talking about an airport radar on a planet orbiting a star but instead we're talking about a radar aboard a space ship which is far enough from any other source that it COULD be resolved by the antenna. Is the amount of radiation that has traveled 50 light years enough to overcome the CBR? That's questionable as well.

      I'm merely pointing out the absurdity of the original statement and my perhaps faulty assumptions.

  3. Don't worry by Anonymous Coward · · Score: 1

    they'll just do what they did with NCSA and decide they can't make money doing it and walk away.

  4. Re:10x today's internet traffic by c0lo · · Score: 4, Informative

    can handle 10 times the traffic of today's Internet

    Yeah, you can get something on the front page of slashdot if you use stupid, misleading metrics like this. Soulskill has his head buried in the sand.

    A single computer, probably not.
    Otherwise, the entire SKA will indeed produce 10 times the amount of data trafficking the today's internet.

    --
    Questions raise, answers kill. Raise questions to stay alive.
  5. worldwide blackout imminent? by rjejr · · Score: 1

    Is this the machine that lets us see our lives 9 months in the future?

  6. Re:Why? by PlastikMissle · · Score: 3, Informative

    From the Wikipedia entry:

    "Suitable sites for the SKA telescope need to be in unpopulated areas with guaranteed very low levels of man-made radio interference. Four sites were initially proposed in South Africa, Australia, Argentina and China.[16] After considerable site evaluation surveys, Argentina and China were dropped and the other two sites were shortlisted (with New Zealand joining the Australian bid, and 8 other African countries joining the South African bid):"

  7. Re:10x today's internet traffic by Trepidity · · Score: 3, Informative

    True, but that's getting pretty common in large-scale scientific applications these days. The LHC generates about 100 terabytes per second, for example. The numbers on the page you linked say SKA will generate "enough raw data to fill 15 million 64 GB iPods every day", which is actually an order of magnitude lower: 15 million * 64 GB = 960 PB per day. Divide that by 86400 seconds in a day, and you get about 11 TB/s.

  8. sensitive by planckscale · · Score: 1

    The impressive part of the blurb to me was the ability to detect something like an airport radar on a planet 50 light-years away. With that sensitivity I would think this could go a long way towards SETI, nevermind background radiation.

    --
    Namaste
    1. Re:sensitive by femtobyte · · Score: 4, Insightful

      One big obstacle is, as with SETI, not merely gathering super-sensitive data, but processing all the data to identify E.T.'s air traffic control in trillions of other (natural) radio sources. Just because you're sensitive enough to tell whether a signal is present or absent *when you know exactly what to look for* doesn't mean you'll be able to identify previously unknown signals.

  9. Re:Why? by Paleolibertarian · · Score: 1

    Government masters who by their nature violate ZAP are equally evil no matter the color of their skin.

  10. Re:10x today's internet traffic by c0lo · · Score: 4, Informative

    True, but that's getting pretty common in large-scale scientific applications these days. The LHC generates about 100 terabytes per second, for example. The numbers on the page you linked say SKA will generate "enough raw data to fill 15 million 64 GB iPods every day", which is actually an order of magnitude lower: 15 million * 64 GB = 960 PB per day. Divide that by 86400 seconds in a day, and you get about 11 TB/s.

    While LHC generates 10 times more data in a single experiment (usually scheduled months or years ahead), think that SKA will generate data each day every day.

    --
    Questions raise, answers kill. Raise questions to stay alive.
  11. Re:10x today's internet traffic by Anonymous Coward · · Score: 3, Interesting

    The LHC only records about 25 PB a year though, as the raw data is heavily filtered by custom hardware before getting to the more off the shelf computers that record data for later use. SKA on the other hand, needs to hold on to raw data for a couple hours until a run is complete, requiring intermediate storage of data of about a PB an hour, which will then get reduced to about a 1-5 PB a day for longer term storage and analysis. The intermediate data will use conventional hardware for processing, but even ignoring that, the long term data, that which needs to be stored and distributed, will out pace LHC's year' production in about a eek. If you wanted a more apples-to-apples comparison to LHC's raw data collection, you would need to look more at the amount of raw data produced before filtered down to commodity computer hardware. And with a final goal of thousands of antennas collecting up to 30 GHz signals across nearly the full spectrum, that is a lot more than the 10 terabits/s LHC roughly generates, and the intermediate 1 PB/hr data for SKA is much more than LHC's intermediate ~ 1 TB/hr.

  12. Re:10x today's internet traffic by Trepidity · · Score: 1

    Interesting numbers; thanks for the clarification! I agree that's a significantly more ambitious goal by some of those metrics. A PB/hour is indeed quite a lot of intermediate storage, and even the reduced 1-5 PB/day is more than any existing experiment.

    I realize it's a lot to ask for popular science journalism, but that's one reason I'd like more specifics and precision in some of these stories. What do we mean by data being generated: where is it generated, how long is it stored for, what are its characteristics, etc. And ideally figures in units of bytes (or a multiple) rather than "iPods" or "internets" would be nice; it was sort of ridiculous that I had to multiply out what 15 million 64GB iPods hold to arrive at a real number (and even that number could be wrong if they really meant GiB rather than GB, though the actual iPod space is indeed in GB).

  13. LOFAR - interferometric array by Frans+Faase · · Score: 2

    ASTRON is the organisation that is also running LOFAR, which is basically a smaller version of SKA in a different frequency range. It is an interferometric array which requires a central system to process all the signals into one result. LOFAR is using a lot of dedicated hardware and a IBM Blue Gene/L supercomputer for this purpose. Because all the signals are digitized at the receivers, this result is a very large stream of data, which are processed (but not stored) by a pipe-line of processors, each combining more and more signals, into one final image.

    1. Re:LOFAR - interferometric array by broekema · · Score: 1

      The Blue Gene/L was replaced by a Blue Gene/P in 2008. It is interesting to see that, at the beginning of the project, the most powerful supercomputer in Europe (for abour a month or so) was required. Now, only a few years later, more work can be done using only a few servers with accelerators. The SKA will ofcourse be a whole different challenge, which requires a completely different way of thinking about computing.

  14. Re:Why? by lexsird · · Score: 1

    Why not build a monster one in space instead? Think space elevator please.

    --
    Take the Red Pill.
  15. Bah. by Black+Parrot · · Score: 1

    I'll pick a Batman server over a Superman server every time.

    --
    Sheesh, evil *and* a jerk. -- Jade
    1. Re:Bah. by ButchDeLoria · · Score: 2

      A Goku server would totally kick a Superman server's ass!

    2. Re:Bah. by captjc · · Score: 4, Funny

      Yeah, but you couldn't install Linux on it because as we all know Batman hates the Penguin!

      --
      Slow Down Cowboy! It's been 1 hour, 47 minutes since you last successfully posted a comment
  16. Re:10x today's internet traffic by fyngyrz · · Score: 1

    And with a final goal of thousands of antennas collecting up to 30 GHz signals across nearly the full spectrum

    Hmmm. Sounds like a marvelous database for us SDR freaks to troll through, big chunks of spectrum at a time, eyes on the waterfall and spectral displays.

    All ya need to do is create a server that will supply a file that is a chunk-o-spectrum as baseband IQ data, and you'll likely have a whole bunch of eyes on it for you. You'd certainly have mine!

    --
    I've fallen off your lawn, and I can't get up.
  17. Re:10x today's internet traffic by Anonymous Coward · · Score: 1

    I think they've used a low estimate for the SKA. SKA phase I consists of three parts: a single-pixel dish array in South Africa, a focal-plane-array dish array in Australia, and an aperture-array tile array in Australia. The second part, with the focal plane arrays, is about twice the size of the precursor instrument ASKAP. The data rate for ASKAP is:

    (36 antennas) * (192 elements per antenna) * (384 MHz bandwidth) * (factor of 2 to get the Nyquist rate) * (1 byte/sample) = 5.3 TB/s

    So data rate for one of three parts of SKA phase I will have a data rate of twice this, or about the 11 TB/s you calculated. SKA phase II will be a lot bigger, probably exceeding the LHC's 100 TB/s.

  18. “Resist blowing desert sands” by LMariachi · · Score: 1

    So they’ll be putting the servers indoors then?

  19. Re:10x today's internet traffic by ldobehardcore · · Score: 1

    it is about making things relatable to those that have little understanding of the topic.

    What's ironic is that 15 million iPods is no more relatable than saying 960 Petabytes. What do you imagine a pile of 15million iPods looks like? Or is it 15million retail boxes stacked in neat rectangular prisms? Or is it 15 million iPods laid end to end? Big numbers need to be expressed in ways that are easily divisible into realistic chunks people can hold in their heads. I have no idea what kind of volume this many iPods takes up, or how expensive it would be, I doubt anyone really does off the top of their head.

    --
    Hectice, baby, Mercator says hello to you
  20. Superman? by wonkey_monkey · · Score: 1

    Since when did "Superman" become a good adjective for describing powerful computer systems? I mean, if you must appeal to the hardcore comic book reading geek, wouldn't Brainiac be a better choice?

    --
    systemd is Roko's Basilisk.
  21. How many? by dutchwhizzman · · Score: 1

    How many olympic sized swimming pools can be filled with the servers?

    --
    I was promised a flying car. Where is my flying car?
  22. Here is an geek info on SKA. by hamster_nz · · Score: 4, Informative

    This link is a really interesting info on some of the SKA signal processing.

    The SAK's power budget is 58MW for signal processing - this is such a high running cost that by spending 30 Million Euro on developing a few custom ASICs to halve that power usage will pay off in 9 months!

    1. Re:Here is an geek info on SKA. by tibit · · Score: 1

      Very informative. While it's a cool project, in a way I'm fine only dealing with distributed signal processing that can power a dozen nodes through one PoE ethernet drop :)

      --
      A successful API design takes a mixture of software design and pedagogy.
  23. Apollo hardware on moon by nelsonteixeira777 · · Score: 1

    Maybe then we'll be able to see Apollo mission hardware on the moon :|