Slashdot Mirror


Supercomputer Becomes Massive Router For Global Radio Telescope

Nerval's Lobster writes "Astrophysicists at MIT and the Pawsey supercomputing center in Western Australia have discovered a whole new role for supercomputers working on big-data science projects: They've figured out how to turn a supercomputer into a router. (Make that a really, really big router.) The supercomputer in this case is a Cray Cascade system with a top performance of 0.3 petaflops — to be expanded to 1.2 petaflops in 2014 — running on a combination of Intel Ivy Bridge, Haswell and MIC processors. The machine, which is still being installed at the Pawsey Centre in Kensington, Western Australia and isn't scheduled to become operational until later this summer, had to go to work early after researchers switched on the world's most sensitive radio telescope June 9. The Murchison Widefield Array is a 2,000-antenna radio telescope located at the Murchison Radio-astronomy Observatory (MRO) in Western Australia, built with the backing of universities in the U.S., Australia, India and New Zealand. Though it is the most powerful radio telescope in the world right now, it is only one-third of the Square Kilometer Array — a spread of low-frequency antennas that will be spread across a kilometer of territory in Australia and Southern Africa. It will be 50 times as sensitive as any other radio telescope and 10,000 times as quick to survey a patch of sky. By comparison, the Murchison Widefield Array is a tiny little thing stuck out as far in the middle of nowhere as Australian authorities could find to keep it as far away from terrestrial interference as possible. Tiny or not, the MWA can look farther into the past of the universe than any other human instrument to date. What it has found so far is data — lots and lots of data. More than 400 megabytes of data per second come from the array to the Murchison observatory, before being streamed across 500 miles of Australia's National Broadband Network to the Pawsey Centre, which gets rid of most of it as quickly as possible."

40 of 60 comments (clear)

  1. Raijin assists with other big data tasks. by auric_dude · · Score: 2

    As the appetite for super computing and associated use of big data expands as Raijin in brought online http://www.theregister.co.uk/2013/06/21/australias_latest_top_super_fills_up_in_a_day/

  2. 400 Mb per seconds by vikingpower · · Score: 1

    Is that much ? If it is structured, and if the processing of it requires taking the structure into account - well hell yes, then that is humongous.

    --
    Religous speak to God. Insane are spoken to by God. When all shut up, one can finally hear Shostakovich in peace
    1. Re:400 Mb per seconds by Anonymous Coward · · Score: 2, Informative

      Most of it is noise you can throw away quickly. After that point it gets more and more difficult to choose so you need balance processing+storage+bandwidth
      CERN ran into similar problems but at least they had a part of the science done on-site. (a week in geneva is way better than a week in the middle of the fucking desert)
      Space people have kind of the opposite problem, since they have very limited on site storage/processing power and limitations in bandwidth/telemetry and they cant just dump more computers to solve the problem (rad hard electronics are not cheap and weight is counted in million$ up there). Usually the end result is bitter sacrifices of valuable data and bitter fights in the community on whose instrument will get to send back stuff.

    2. Re:400 Mb per seconds by the_other_chewey · · Score: 4, Informative

      Most of it is noise you can throw away quickly.

      In the case of the Square Kilometer Array (named for its total collection area by the way,
      not because it is "spread across a kilometer of territory", whatever that's supposed to mean),
      none of it is noise.

      The SKA relies heavily on processing everything, using advanced phased-array
      and other "inverse beam-forming" techniques to look at multiple targets in multiple
      frequency ranges at once (the final design will have continuous coverage from
      70 MHz to 30 GHz!).

      This is only possible with centralised processing, so none of the antenna sites can throw
      anything away: They don't know what will be important.

    3. Re:400 Mb per seconds by NixieBunny · · Score: 1

      That's not much data by radio astronomy standards. The typical millimeter-wave VLBI experiment records data about 10 times that fast in aggregate, onto hard disk drives that are shipped to a central correlator facility.

      --
      The determined Real Programmer can write Fortran programs in any language.
    4. Re:400 Mb per seconds by vikingpower · · Score: 1

      Shipping hard disk drives ? Aha. Now that is a way to increase bandwidth, remember the old adagium "Never underestimate the bandwidth of a Boeing 747 full of DVDs" ;-)

      --
      Religous speak to God. Insane are spoken to by God. When all shut up, one can finally hear Shostakovich in peace
    5. Re:400 Mb per seconds by epiphani · · Score: 1

      It's not. It's really not much at all. For $150k, you could build a hadoop cluster that would happily accept the data stream, process it, and make it available for consumption. If you just want to store it, you don't even need that much.

      That's a waste of a Cray. Well, a Cray is a waste of money these days anyway.

      --
      .
    6. Re:400 Mb per seconds by TheTrueScotsman · · Score: 1

      It's actually 400 MB. You need to get this sort of thing right if you're planning a career in the tech industry when you finish school.

    7. Re:400 Mb per seconds by war4peace · · Score: 1

      " before being streamed across 500 miles of Australia's National Broadband Network to the Pawsey Centre, which gets rid of most of it as quickly as possible."

      I imagine a bunch of Indian and Chinese people pressing Shift+Delete randomly on files. Their target: 90% resolution rate on incoming data :)

      --
      ...gis sdrawkcab (usually not responding to ACs; don't bother posting as AC)
    8. Re:400 Mb per seconds by war4peace · · Score: 1

      As a matter of fact...
      OK, I may be too pedantic, but a 747 full of DVDs is just large storage, wildly different from large bandwidth.
      When you stream something, the data is immediately ready for processing as it comes (provided it's structured with that goal in mind). On the other hand, a 747 full of DVDs is data that must be read before it's ready for processing, and the average DVD read speed is more or less 100 Mbps, maybe a bit more than that but not by much. Throw time spent writing those DVDs into the mix and you'll get a shitty bandwidth, if you really want to go as far as calculating a bandwidth equivalent.

      Let's assume fly time is zero, just for kicks. Now for a DVD it takes you 10 minutes to write it (at high speeds) and 10 minutes to read it, that's 20 minutes per DVD. Say you can write/read 100 DVDs at the same time, that's roughly 430 GB every 20 minutes, that's roughly equivalent to a bandwidth of 367 MB/second. That's provided all DVDs are readable and you have tens of people you allocate to this project.

      --
      ...gis sdrawkcab (usually not responding to ACs; don't bother posting as AC)
    9. Re:400 Mb per seconds by Immerman · · Score: 1

      >This is only possible with centralised processing, so none of the antenna sites can throw
      anything away: They don't know what will be important.

      Even more than that, *all* of it is potentially important. As I understand it phased arrays pretty much require the whole signal from all the antennas to get the benefit of having the antennas at all, it's not until *after* the signals are combined and processed that you can weed out the data you're not interested in. In fact based on 20s of reading on phased arrays I get the impression that the multi-directionality ability, etc. is likely determined by the transformation function you use to combine the raw data, so the same raw data can create "images" of various directions.

      --
      --- Most topics have many sides worth arguing, allow me to take one opposite you.
    10. Re:400 Mb per seconds by Immerman · · Score: 1

      So use SSDs instead. The point being though that I can get maybe 1GB/s with a high-speed data link, or umpteen PB/s with a truck full of storage media (I first heard the maxim as "...a station wagon full of floppy disks".

      As for bandwidth reading the data, sure you'd need a lot of connections to get anywhere near that. Heck, you'd need a lot of computers to process data that quickly - a single PC with dual channel DDR2-800 RAM has a maximum data throughput (no processing, just reading it from memory) of only 800M lines/second * 64bits/line * 2 interfaces = 12.8GB/s

      --
      --- Most topics have many sides worth arguing, allow me to take one opposite you.
    11. Re:400 Mb per seconds by epiphani · · Score: 1

      Not really. The real-time components (aka correlation) are basically just straight up FFTs. Custom hardware in correlators might make sense (and probably does at scale), but through ASICs or FPGAs. They're not doing that (...yet). Throw a GPU or two into each node, and you'd get far more FLOPS than you would with a cray. This work is mostly embarrassingly parallel, so throwing money into cray's is a total waste of time.

      --
      .
    12. Re:400 Mb per seconds by Rich0 · · Score: 1

      Traditionally phased array was done by feeding the raw signals to a central point and then the "processing" is analog (circuits, not algorithms). The output is a single signal that contains the desired "image" which then goes into an A/D. At any time you only can look at the data in one way, since the raw data is not captured (raw being the data from each individual antenna).

      That works great for a radar on a ship where the antennas are all next to each other and where you can just rapidly steer back and forth, or where you are only tracking a single point and just don't want to use servos to do it. When the antennas are spread across a large area then you can't just run antenna feeds directly into a central box due to signal loss (even with amplification). It also doesn't work if you want to capture wideband data and look at all directions simultaneously after the fact. For that you need to digitize every antenna feed, and you also need an absolute time reference (which means lots of atomic clocks unless the sites are close enough to share a reference).

    13. Re:400 Mb per seconds by vikingpower · · Score: 1

      Thanks, mate. Must have been in the wrong career for 19 years, then. Glad you are not my not-a-single-typo-forgiving-boss ;-)

      --
      Religous speak to God. Insane are spoken to by God. When all shut up, one can finally hear Shostakovich in peace
  3. What a bad summary. by Anonymous Coward · · Score: 2, Insightful

    A lot of waffling that tells me nothing about the premise. Why did they do it, why did they need to, what made that thing uniquely suitable so nothing else would do?

    HEY EDITORS. DO YOUR JOB ALREADY, DAMMIT. STOP WASTING MY TIME.

  4. 400MB/s by Thanshin · · Score: 4, Funny

    More than 400 megabytes of data per second come from the array to the Murchison observatory, before being streamed across 500 miles of Australia's National Broadband Network to the Pawsey Centre

    They forgot to mention the step where the 400 MB go to the NSA to be checked for signs of extra terrestrial terrorism.

  5. Two, actually! by Impy+the+Impiuos+Imp · · Score: 1

    What it has found so far is data — lots and lots of data. More than 400 megabytes of data per second come from the array

    Well, I knew someone on this planet actually needed gigabit Internet if we looked hard enough.

    --
    (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
    1. Re:Two, actually! by rex.clts · · Score: 1

      Note that GigaBIT Ethernet tops out at ~119 MegaBYTEs per second. You're going to need a ~3.3 Gbps link, not including overhead.

  6. Summer? by mjwx · · Score: 5, Informative
    I live in Western Australia and it's winter here.

    Later "this summer" doesn't start until December.

    500 miles

    For those of us who dont use archaic measurements, it's 800 KM from the city of Perth, which makes it 800 KM from the closest city. If anyone is interested, here's the google maps link and it's distance to Perth, Western Australia.. There's literally nothing out there, picking up an AM radio station is difficult, making it the perfect place for a telescope.

    If you truly want to get lost, you need to go somewhere like Murchison, no-one will find you. Of course just about everything there is trying to kill you, from King Brown snakes to Land Sharks and Koala Drop Bears.

    --
    Calling someone a "hater" only means you can not rationally rebut their argument.
    1. Re:Summer? by Javaman59 · · Score: 3, Informative

      I live in Western Australia and it's winter here.

      I live in South Australia, and it's winter here, too.

      Later "this summer" doesn't start until December.

      I would say it does, because using seasons as a unit of time is a distinctly Northern hemisphere convention. In my observation, American's and Canadians are the main users of it (more than the British).

      I often get confused talking to an American when they talk about doing something "in the summer", and it's not so much that they have a different summer, but that I'm not used to measuring time like this. (We only use it for things that are specifically related to the weather, such as sports).

      In Australia we wouldn't say "later this winter", we'd just say "around August/September".

      --
      I'm a software visionary. I don't code.
    2. Re:Summer? by Swampash · · Score: 1

      Yeah, "product scheduled for release this autumn", wtf does that mean?

      Still, the USA uses Imperial measurements so it's not exactly hip to, you know, measurements that people can actually understand.

    3. Re:Summer? by Artea · · Score: 1

      Thats because we only have "summer" and "wet summer", it kind of makes the measurement somewhat vague.

      ..Ok fine, so it did hit 8 degrees this morning, but it's 18-20 during the day which would be considered a warm spring day for some parts of the US.

    4. Re:Summer? by fast+turtle · · Score: 1

      18-20 isn't a warm spring day. Hell I consider anything less then 25-35 to be a cold day during the spring as we routinely hit 40-45 during this time of year.

      Of course, if we even hit 8 during the period Dec-March we're suffering a heat wave as it's usually closer to -8 here during that period and the funny thing is, I'm only 300Km from Los Angeles.

      --
      Mod me up/Mod me down: I wont frown as I've no crown
    5. Re:Summer? by Anonymous Coward · · Score: 1

      In my observation, American's and Canadians are the main users of it (more than the British).

      Not true. We, Canadians, use the following seasonal measurements to indicate the time of the year: Almost winter, winter, still winter, and construction.

    6. Re:Summer? by ebno-10db · · Score: 1

      In my observation, American's and Canadians are the main users of it (more than the British).

      Not true. We, Canadians, use the following seasonal measurements to indicate the time of the year: Almost winter, winter, still winter, and construction.

      On the plus side there is very little risk of heat stroke.

    7. Re:Summer? by ebno-10db · · Score: 1

      I live in Western Australia and it's winter here.

      It's currently the middle of the night in Perth, and still 12C. Tomorrow's high is forecast to be 20C. That is not winter.

    8. Re:Summer? by ebno-10db · · Score: 1

      the USA uses Imperial measurements so it's not exactly hip to, you know, measurements that people can actually understand.

      Don't they teach arithmetic in your country?

    9. Re:Summer? by mjwx · · Score: 1

      I live in Western Australia and it's winter here.

      It's currently the middle of the night in Perth, and still 12C. Tomorrow's high is forecast to be 20C. That is not winter.

      Yes it is.

      Summer in Perth is 40 Deg C.

      --
      Calling someone a "hater" only means you can not rationally rebut their argument.
  7. I would use this for by ozduo4 · · Score: 1

    an intergalatic radio station to beam "24 hour rap at full volume" which should scare off any aliens.

  8. Misleading summary and first article by amaurea · · Score: 5, Informative

    The Square Kilometer Array will have a *collecting area* of one square kilometer. That means that if you add up the area of all the detectors, you get one square kilometer. Since there is some distance between each detector, the SKA will cover a ground area *much* larger than a square kilometer.

    Part of the SKA will be built in the MRO-area in Australia. But it is far from finished - construction won't begin in earnest until 2016 I think. So the most powerful radio telescope in the world is not at MRO now. It is LOFAR in Europe.

    1. Re:Misleading summary and first article by ogre7299 · · Score: 5, Informative

      The article also washes over the fact that there are different telescopes for different parts of the radio spectrum. The MWA and LOFAR are the most powerful in the MHz regime, but the VLA is still the most powerful between 1 to 50 GHz, and ALMA is the most powerful from 85 and 700 GHz.

    2. Re:Misleading summary and first article by amaurea · · Score: 5, Informative

      Right. And then there are the issues of resolution and survey area. Planck covers the same frequency range as ALMA, but measures the whole sky in total intensity and polarization, for example, and is much better at measuring the CMB than ALMA. So the term "powerful" is an over-simplification.

  9. Petaflops by Nedmud · · Score: 2

    Well it sure can do a lot of floating point operations per second; how does that help for networking applications exactly?

    1. Re:Petaflops by White+Flame · · Score: 1

      Ditto. Also, in many "big data" projects, FLOPS is of little use anyway. There is a ton of textual processing and predicate matches to be done in the rest of the world. With ARM entering the HPC space, hopefully more broadly meaningful integer & IO ops numbers will be bandied about rather than just this laser-focus on vector floats.

  10. Getting rid of data? by Celarent+Darii · · Score: 1

    From the article:

    before being streamed across 500 miles of Australia's National Broadband Network to the Pawsey Centre, which gets rid of most of it as quickly as possible.

    Get rid of data? Don't you mean routing the data to its destination? And you would hope the Pawsey Centre actually DID something with the data and not just get rid of it.

  11. Routing? by mc1138 · · Score: 2

    So... anyone actually know more about the "routing" part of this. All I saw was that they turned it into a "really big router" whatever that means, and then talk about the array. I'm assuming they're using the super computer to actually make the decisions of who is getting what data in real time, and sending it to the correct place, but they don't really talk about that at all. Anyone have a better link?

    1. Re:Routing? by ebno-10db · · Score: 2

      Good ol' Wikipedia has a decent description of the overall system: http://en.wikipedia.org/wiki/Murchison_Widefield_Array

      An educated guess is describing it as a router is ridiculous. It's more like intelligently combining the M incoming data streams (beam forming) so that the data can be shipped at a lower bandwidth to N universities (each of which may be using a different combination of incoming data and hence looking at a different beam).

      One of the nice things about phased array (electronically steered) antennas is that you can simultaneously receive signals from N "virtual antennas" (usually called beams in the business), each of which may be pointed in a different direction and have a different beam width, frequency and bandwidth. You create those N virtual antennas by combining the input signals from the M physical antennas in N different ways. The combined signals are of much lower bandwidth than the incoming signals. Hence you could have people at university A looking at one place in the sky, the people at university B simultaneously looking at a different place in the sky, and have both of them receiving real-time signals.

  12. Keep an axe handy.... by TimO_Florida · · Score: 1

    If it renames itself Colossus and starts looking for routes to Guardian, CUT THE LINES!

  13. Not the National Boardband Network by gdtau · · Score: 1

    It doesn't use the NBN. That's an optical access network for residential housing and small business with an access rate of 100Mbps. It uses AARNet -- Australia's Academic and Research Network -- which has installed multiple 100Gbps links across Australia for this project.