Slashdot Mirror


A Look At CERN's LHC Grid-Computing Architecture

blair1q writes "Using a four-tiered architecture (from CERN's central computer at Tier 0 to individual scientists' desk/lap/palmtops at Tier 3), CERN is distributing LHC data and computations across resources worldwide to achieve aggregate computational power unprecedented in high-energy physics research. As an example, 'researchers can sit at their laptops, write small programs or macros, submit the programs through the AliEn system, find the necessary ALICE data on AliEn servers, then run their jobs' on upper-tier systems. The full grid comprises small computers, supercomputers, computer clusters, and mass-storage data centers. This system allows 1,000 researchers at 130 organizations in 34 countries to crunch the data, which are disgorged at a rate of 1.25 GB per second from the LHC's detectors."

53 comments

  1. I wonder when.. by Superken7 · · Score: 1

    I wonder when we will have the equivalent computing power at home? :)

    1. Re:I wonder when.. by Anonymous Coward · · Score: 1, Funny

      According to Moore's Law.. about 4 Hitlers away

      Damn thats Godwins.. Moore's says about 12 years.

    2. Re:I wonder when.. by William+Robinson · · Score: 1

      I wonder when we will have the equivalent computing power at home? :)

      When you will create a black hole at home. Simple !!!

    3. Re:I wonder when.. by Just_Say_Duhhh · · Score: 0

      You can have that kind of computing power at home TODAY. Stumble over to http://boinc.berkeley.edu/ to see where LHC and other projects get their global computing power (SETI@HOME is probably the most famous). From there you can start your own project, and in no time you'll have millions of computers around the world crunching away for you! Maybe that'll help you pick next week's winning lottery numbers?

      --
      I need trepanation like I need a hole in the head.
    4. Re:I wonder when.. by splogic · · Score: 0

      And I wonder when somebody will think of a better waste of time, energy, and computer cycles. I think the LHC is pretty much a Black Hole for money that gets pumped into various "projects" that otherwise would not get funded. Other than that, it doesn't serve much scientific value. A little blip, in a sea of particles is not enough evidence to verify anything.

  2. 1.25GB/sec not that much. by Anonymous Coward · · Score: 0

    A single 10gb ethernet connection can handle that quite easily. a single symmetrix/shark should be able to keep up.

    1. Re:1.25GB/sec not that much. by $RANDOMLUSER · · Score: 1

      Apparently the LHC detectors spew out 1.25GB of data per second. That's about six times the contents of Encyclopedia Britannica including the index every second.

      That's 3.7 Libraries of Congress per hour for those of us on the other side of the pond.

      --
      No folly is more costly than the folly of intolerant idealism. - Winston Churchill
    2. Re:1.25GB/sec not that much. by Alef · · Score: 3, Insightful

      A single 10gb ethernet connection can handle that quite easily.

      Eh. A 10 Gb ethernet connection can't handle 1.25 GB/s at all, not to mention doing it reliably. Theoretically, 10 Gb is exactly 1.25 GB, but then you need to account for protocol overhead, packet loss and so on.

    3. Re:1.25GB/sec not that much. by Anonymous Coward · · Score: 0

      mmm, channel bond 2 of them together, then you have a 'single' logical 20Gbps link!

    4. Re:1.25GB/sec not that much. by evilviper · · Score: 2, Interesting

      Theoretically, 10 Gb is exactly 1.25 GB, but then you need to account for protocol overhead, packet loss and so on.

      You don't have to use TCP/IP over ethernet, you know. AoE & FCoE come to mind.

      There are very few ways you might lose packets in a well-built local data-link network. Collisions/congestion are a thing of the past. Modern networking gear is fully capable of forwarding packets at full speed. Packets don't just go flying out of CAT-6A, never to be seen again.

      And yes, bonding two 10Gbps ethernet interfaces would give you greater than 1.25GByte/s, though certainly NOT double that...

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
    5. Re:1.25GB/sec not that much. by Alef · · Score: 1

      You don't have to use TCP/IP over ethernet, you know. AoE & FCoE come to mind.

      Mechanisms for ensuring reliable transfer of data aren't exclusive to TCP/IP. There is also some overhead in the packet headers (yes, it could be made very small if you use non-standard Ethernet frames).

      There are very few ways you might lose packets in a well-built local data-link network.

      Well, yeah, depending on how you define local. I don't know what distances they need to transfer the data at CERN, and I imagine there could be all sorts of nasty EM fields around a 14 TeV particle accelerator.

      Anyway, to say you could easily handle this data stream on a single 10 Gb/s Ethernet connection is a bit of a stretch, don't you think?

  3. truly amazing by Goldsmith · · Score: 5, Funny

    I was having lunch with some CERN guys a couple weeks ago, and was asking them about the speed of their analogue to digital converters. I don't remember what the number was, but it seemed low to me, something like 200kHz. So, of course, I had to point out that *my* cheapo converters ran faster than theirs by more than an order of magnitude. They responded with "well, each of our converters does 200kHz on all of our 4000 channels at the same time, so we're really recording at..."

    They won.

    1. Re:truly amazing by Anonymous Coward · · Score: 0

      Um.

      This is a really weird post and so dramatically offtopic in this forum that it probably makes people either annoyed or uncomfortable. Perhaps it is even an automatic bot that picked up on the words "truly amazing". As such it should probably not get a reply.

      In the case I am wrong:
      1. It appears you have no support, and you are unlikely to get support here on a public offtopic forum. Find a person you can talk to directly that either loves you but has no vested interest in your situation, or has training in counselling but no bias in the decisions you have to make. Internet forums cannot help you here.

      2. Your perception of God are dark and punitive. That God will do nothing for you or your mental health. If you pray, pray to the one that wants the best outcome for you and those around you.

      3. The Christian/Atheist debate is not helpful for any strong or healthy person, and even worse for you if you feel not strong or not healthy. Stay away from it. Beware of labelling yourself or others, because those labels create unattainable expectations and do not reflect all that you or other people are. Labelling is a dehumanizing technique.

    2. Re:truly amazing by Anonymous Coward · · Score: 0

      YHBT YHL HAND

    3. Re:truly amazing by Anonymous Coward · · Score: 0

      Sounds like someone is jealous...

    4. Re:truly amazing by shentino · · Score: 1

      This, coming from someone who is too chicken to post their troll material under their real name and take the karma hit like a man.

  4. I have a dream... by Mathinker · · Score: 1, Funny

    Your post makes me wonder about a future where I have a home computer powerful enough to run an algorithm which downloads as many tracks off of iTunes as it needs and then can compute by extrapolation the future hits of RIAA, before they are released.

    One wonders whether the courts would find that such a program is a circumvention of DRM for the purposes of the DMCA. Unfortunately, the computer, which can answer that question, will be destroyed by the construction of a .....

    (Ouch. I should go get some sleep....)

    1. Re:I have a dream... by jvin248 · · Score: 1

      put enough monkeys on typewriters and you'll get shakespeare (http://en.wikipedia.org/wiki/Infinite_monkey_theorem_in_popular_culture)

      And someone started such a simulator : http://everything2.com/title/Monkey+Shakespeare+Simulator That's what you use this for. And Pr0n.

  5. In other words: The LHC Grid by Anonymous Coward · · Score: 1, Funny

    is a botnet !

    Thanks in advance.

    Yours In Akademgorodok,
    K. Trout

  6. That's awesome... by Anonymous Coward · · Score: 0

    That's awesome, but will it run Crysis at max settings?

  7. Good boffin! Who's a smart boffin? Yes you are! by $RANDOMLUSER · · Score: 0

    Good thing they didn't use PS3s to build it.

    --
    No folly is more costly than the folly of intolerant idealism. - Winston Churchill
  8. Oblig. by lloydsmart · · Score: 0

    Wow, imagine a Beowulf cluster of those!

  9. I want more! by lemur3 · · Score: 1

    I was quite saddened to find that this 'Look at CERN's LHC GRID' ....didnt include any pictures. :-(

    1. Re:I want more! by Arathrael · · Score: 4, Interesting

      Have a look here: http://lcg.web.cern.ch/LCG/image.htm for Google Earth based dashboards showing WLCG live grid sites, links, data transfer and job activity.

  10. Just out of curiosity what kind of db? by Anonymous Coward · · Score: 0

    What kind of database do you think they would be using? would it be sql based, as they would obviously need to query? but storing that amount of data? at that extremely fast rate??

  11. LCH@Home by Luyseyal · · Score: 1

    I just wish they would send some more work units down the LHC@Home pipe. None of my computers have done any work for that project in ages.

    -l

    --
    Help cure AIDS, cancer, and more. Donate your unused computer time to worldcommunitygrid.org. Join Team Slashdot!
    1. Re:LCH@Home by Anonymous Coward · · Score: 0

      You might want to check your statistics page. I've had (let's see...) four work units complete in the last two weeks. Granted this is after months of silence, but there seems to be a trickle of new data coming.

    2. Re:LCH@Home by Luyseyal · · Score: 1

      Hrm, maybe they think my computers are too slow or something. It's been well over a year since I did any LHC work.

      -l

      --
      Help cure AIDS, cancer, and more. Donate your unused computer time to worldcommunitygrid.org. Join Team Slashdot!
    3. Re:LCH@Home by sznupi · · Score: 1

      Not likely, one Athlon XP 1700+ with 768 MiB of RAM, that I keep around here, got maybe a dozen workunits in the last month.

      It seems now the project is gearing up specifically for calculations of later stage of LHC, when it will transfer to operating at greater power.

      --
      One that hath name thou can not otter
    4. Re:LCH@Home by Luyseyal · · Score: 1

      I have another theory. I don't leave any computers on at night so perhaps the jobs are going out during the European daytime when mine are offline and there is simply not enough work to go around, yet.

      -l

      --
      Help cure AIDS, cancer, and more. Donate your unused computer time to worldcommunitygrid.org. Join Team Slashdot!
  12. LHC IT people are far from top notch by Anonymous Coward · · Score: 0

    IT people at LHC are the biggest liars in the universe, they are also pretty incompetent and get confused by simple questions...
    One actual example (specifics removed to protect the innocent):
    "So you configure the IP like this thru this software?"
    "Oh dear no! I am a hardware guy, I dont deal with software! IPs are a hardware problem..."
    "No they are a software setting..."
    "No they are hardware!"
    and so on...

    1. Re:LHC IT people are far from top notch by lepidosteus · · Score: 1

      Anonymous Cowards people at Slashdot are the biggest liars in the universe, they are also pretty incompetent and get confused by simple questions...
      One actual example (specifics removed to protect the innocent):
      "So you have any proof of such IT incompetence by them, it's not just jealousy right ?"
      "Oh dear no! I am a story teller guy, I dont deal with proof! Proofs are a wikipedia problem..."
      and so on...

  13. Correction by mike260 · · Score: 2, Informative

    Using a 4-tiered architecture (from CERN's central computer at Tier 0 to individual scientists' desk/lap/palmtops at Tier 3) [...]

    Sorry for being pedantic, but the article says there are three tiers between the central computer and the scientist's machines (which are tier 4, not 3).

    1. Re:Correction by olsmeister · · Score: 1

      Tier 0
      Tier 1
      Tier 2
      Tier 3

    2. Re:Correction by mike260 · · Score: 1

      And finally Tier 4 which comprises, according to TFA, the scientists desk/lap/palmtops.

    3. Re:Correction by Arathrael · · Score: 1

      Correct. Tier 3 is local batch farm facilities, etc., which aren't really part of the project as such.

    4. Re:Correction by blair1q · · Score: 1

      Interesting. I don't remember writing it that way. Did someone edit the submission before it was posted? Or has it just been a long weekend?

  14. Data to crunch by Lord+Byron+II · · Score: 4, Interesting

    As someone who worked on the processing of HEP experimental data for awhile, let me say that there is a ton of work to do. You have particles entering the detector every ~40ns and hundreds of different instruments making measurements, which leads to a ton of data very quickly. You then have to reconstruct the path of the particle based off of the detector information, but it's not straight-forward. The detector can have gaps in coverage; neutrinos (which are undetectable) can be created removing momentum; particles from the previous event can still be in the detector et cetera.

    And all of the data crunching you do must be done in 40ns, so that you're ready for the next set. (Of course, you can do some processing offline, but if you don't maintain a 40ns average, then your data will start piling up.)

    1. Re:Data to crunch by McTickles · · Score: 0, Flamebait

      More like "noise crunching", all I can see LHC producing right now is recordings of noise... Now I am not saying there is nothing interesting in that noise but honestly is it worth all that money in trouble to perhaps find some sort of relevant thing in there? One can find any sort of pattern in noise if one really wants to... Doesn't prove anything.

    2. Re:Data to crunch by $RANDOMLUSER · · Score: 1

      ...there is a ton of work to do ... which leads to a ton of data...

      Holy crap! I had no idea that the relativistic speeds involved would cause the mass to increase that much!

      --
      No folly is more costly than the folly of intolerant idealism. - Winston Churchill
    3. Re:Data to crunch by Anonymous Coward · · Score: 0

      Dear LHC scientists,
      Well, it was fun while it lasted but it's time to pack up the LHC and head home. Why, you ask? You need look no further than the incredibly subtle insights of esteemed professor/gibbering slashdot troll McTickles. Next time you make one of these funny contraptions, remember to fill your it with expanded polystyrene and talk no louder than 30dB - that should help cut down on all of this noise he is so worried about.

    4. Re:Data to crunch by klazek · · Score: 5, Informative

      You have particles entering the detector every ~40ns and hundreds of different instruments making measurements, which leads to a ton of data very quickly.

      Not exactly true. It's running at 40 MHz, so that's 25 ns bunch spacing. Further, you don't exactly have to 'crunch' the data as it comes in, there are multiple triggers that throw lots of data away based momentum cuts and other criteria before it ever makes out of the detectors.

      In ATLAS, for example, there are ~ 10e+9 interactions/sec. The Level1 Trigger, consists of fast, custom electronics programmed in terms of adjustable parameters to control filtering algorithms. Input is from summing electronics in the EM and hadron calorimiters, and signals from the fast muon trigger chambers. The info is rather coarse at this point (transverse momentum cuts, narrow jet criteria, etc), and at level one the info rate is decreased in about ~2us (including communication time), from 40MHz to about 75KHz. Level2 now does a closer look, taking more time and focusing on specific regions of interest (RoIs). This process takes about 10ms, and data rate is reduced to about 1KHz for sending to the event filter. Here, the full granularity of the detector (the 'detector means all the bits - Inner detectors: Pixels, strips, Transition Radiation tracker - The calorimiters - The muon tubes at the outside radius) and runs whatever selections algorithms are in use. This takes a few seconds, and output is reduced to about 100Hz and written to disc for a gazillion grad students (like myself) to analyze endlessly and get our PhDs.

      There is much more to it of course, but you can find info about it on line if you really are interested in the details. Have a look at the ATLAS Technical Design Report: http://atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/TDR/TDR.html

  15. USENIX LISA presentation by Anonymous Coward · · Score: 2, Interesting

    There was a good presentation at LISA '07 on this entitled "The LHC Computing Challenge":

    http://www.usenix.org/event/lisa07/tech/tech.html#cass

    It was given by Tony Cass, who is/was "responsible for the provision of CPU and data storage services to CERN's physics community". They're planning on collecting 15PB/year.

  16. lhc infrastructure history? by flok · · Score: 1

    I wonder what they at cern were expecting to have for infrastructure at the end of the project (e.g. now).
    I mean: who could have guessed the processorspeed and diskspace we have now.

    --

    www.vanheusden.com - home of Multitail, HTTPing, CoffeeSaint, EntropyBroker, rsstail, bsod, listener, nagcon, nagi
    1. Re:lhc infrastructure history? by TheRaven64 · · Score: 3, Funny

      I mean: who could have guessed the processorspeed and diskspace we have now.

      Gordon Moore?

      --
      I am TheRaven on Soylent News
  17. As a Tier-3 manager... by imevil · · Score: 2, Interesting

    I can say that the article doesn't explain it very well. Since CERN has been calling the sites "Tier", this terminology has become a buzzword, and everything is a Tier (the managers call their services "Tiered" just to make them sound important).

    Tier0 and Tier1 are well described by the article. Tier2 are mostly computing clusters, with of course big storage, but they're mainly for analysis. Tier3 are like Tier2 but not really. They are "uncertified" Tier2 in the sense that they do not strictly adhere to the Tier2 standards in terms of middleware and configuration and policies.

    Tier4... never heard of that, I think the buzzword Tier backfired and they're calling their desktops Tiers. When I started managing the Tier3 we did not even call it like that... it was just a cluster.

  18. sounds familiar by Anonymous Coward · · Score: 0

    in fact it sounds just like what i do to get
    my latest TV-series (from bit-torrent).
    -
    oh come ON! anybody that still believes IONIZING,
    thus changing and INTERFERING with the "to-be-studid"
    object at hand is FUNDAMENTAL is plain ... STFU.

  19. 200kHz x 400 channels is nothing by Roger+W+Moore · · Score: 1

    It depends on which CERN guys you talk to. When I was a grad student we had a 1 GHz ADC (with fewer channels and only 8 bits IIRC) reading out scintillator which timed protons in a beam to O(10ps) timing resolution. I've been more involved with triggers than front end digitization since then but 200 kHz and 400 channels is nothing - the ATLAS calorimeter alone has 110,000 channels and its ADC's operate in the 10's MHz range (IIRC - you'd have to look up the ATLAS detector paper for exact numbers).

    1. Re:200kHz x 400 channels is nothing by Goldsmith · · Score: 1

      You know, I don't really remember the exact numbers we were talking about at lunch. I'm sorry if I got some of the numbers wrong, particularly if they ended up being far to small, that would annoy me.

      I just remember being very impressed at the insane amount of total data coming in. I'm definitely more used to the setup of a single GHz ADC, switching between a handful of channels.

      There's a big difference in scale between a condensed matter experiment, where I get to do absolutely everything myself, and something like ATLAS. I'm amazed that these various groups (hardware, software, design, construction, cryogenics, analysis... all on each detector) are able to work together so well.

  20. Speaking as an LHC physicist... by Roger+W+Moore · · Score: 1

    ...I already do! One of the great things about the LCG is that you can submit and monitor jobs from anywhere. It is used by far more than the 1,000 physicists the article mentions. There are 2,500+ on ATLAS alone and then there is CMS and LHCb to count as well.

  21. Imagine by MistrX · · Score: 1

    Imagine a Beowulf cluster of these! \o/