Storing CERN's Search for God (Particles)

News for Nerds! by KlomDark · 2007-07-20 16:23 · Score: 4, Insightful

Wow! Actually geeky science news, not enough of that here lately!

Re:News for Nerds! by zeugma-amp · 2007-07-20 16:36 · Score: 3, Interesting

Interesting article.

Many years ago when the SSC (Superconducting Super Collider) was still being built in Texas, I went to an HP users group meeting as I was working primarily with HP-3000 systems at the time. The fellow addressing the meeting was the head of the physics department at the SSC. It was a really neat presentation, in which he described a similar, though orders of magnitude smaller data storage requirement, though he was talking terabytes of data per month IIRC. At the time, they were planning on using two arrays of 40 workstation computers to handle the load. This would have been fairly early loosely coupled setup similar to a Beowulf cluster.

After the presentation I went up to him and told him that all I wanted to do is sell him mag-tapes.

These types of experiments evidently produce tons of data. I wonder if the processing could be parcelled out like Stanford's Folding@Home or SETI to speed up data correlations.

--
This is an ex-parrot!
Re:News for Nerds! by Anonymous Coward · 2007-07-20 16:41 · Score: 3, Insightful

ive often wondered if i could sneak into cern and just look around. i think the only two things you would need to do it would be a white lab coat and a really grizzled look on your face.

i remember when i was under 18 i used to go to alot of places i wasnt allowed in just to check things out. i wasnt a malicious kid that would run around breaking things for fun, i just loved seeing various things that most people never see or think about, especially feats of engineering.

when i turned 18 i looked back and was actually sad i didnt do it more often. after 18 you dont just get escorted out with a warning. now that im older im really really sad for the upcoming generations. genuinely good kids wont go peeking around at stuff as often, and the ones that do will be severely punished because everyone will think they were 'terrorists'.

for many of the up and coming kids, all they have to look forward to are pointless unnecessary techno gadgets and the warped MTV social culture where money, drugs, and sex are all they are taught to appreciate and strive for.
Re:News for Nerds! by FractalZone · 2007-07-20 17:33 · Score: 2, Interesting

I'm not so sure about the "huge disk buffer". Smaller disks can be spun faster and tend to have lower latency. I'd like to see the drum drive make a comeback for disk cache...expensive, but fast!

--
"You're young, you're drunk, you're in bed, you have knives; shit happens." -- Angelina Jolie
Re:News for Nerds! by Rodolpho+Zatanas · 2007-07-20 17:45 · Score: 5, Interesting

From my experience, generic blue work clothes (preferably with your name on the breast pocket) work best. I once got into some research facility (they had lasers and everything) because I got out of the elevator on the wrong floor and some guy in a lab coat opened the door for me (I was wearing my work clothes because I was on my lunch break). I wandered about at the place for something like 10 minutes before I found a way out. There was even a security guy of some type sitting at a hallway but he lost interest in me after I looked him in the eye and said hello.
Re:News for Nerds! by xyvimur · 2007-07-20 18:10 · Score: 4, Informative

Just go there and take a guided tour. If you'll hurry you'll be able to go to the detector pit and see it. Otherwise after starting up it will be inaccesible for visitors for the life-cycle of the experiments (10-20 years). Google for CERN visit service.

Milosz
Re:News for Nerds! by uolamer · 2007-07-20 19:56 · Score: 2, Funny

ftp.alice.cern.ch/incoming/ /~~--allah-was-here--~~/0day/

--
s/©//g
Re:News for Nerds! by Anonymous Coward · 2007-07-20 21:36 · Score: 2, Interesting

I managed to see this at Easter. It's huge. I've posted some photos at: http://grantpe.googlepages.com/cernpics [googlepages.com]. The last shows one of the rooms of computers they're using. The others are just views of the huge detector. It's in a man-made canvern 100 metres tall and 100 metres wide, all below ground! All just taken on the visitors tour.
Re:News for Nerds! by lixee · 2007-07-20 22:35 · Score: 1

I called them earlier this week to schedule a visit. They're booked until January 2008!

--
Res publica non dominetur
Re:News for Nerds! by GooberToo · 2007-07-21 00:40 · Score: 0, Redundant

It's not real geeky science news until they tell us how many library of congresses it is per second.
Re:News for Nerds! by Anonymous Coward · 2007-07-21 01:35 · Score: 2, Interesting

It is! Sorta, at least... On my experiment (CMS), data gets a first pass handling on site at CERN, then gets parceled out to about 7 other sites (of which Fermilab is one) where their section of data gets another look. Each Tier 1 station, as it's called, also services requests from affiliated research institutions, both to get reconstructed data, and also to run and store their simulated data.

It's a really neat system that makes the geek in me happy =)
Re:News for Nerds! by Ginger+Unicorn · 2007-07-21 03:01 · Score: 1

I showed up at CERN in 2002 when i happened to be on holiday in geneva. They didnt have a tour then, damn those lazy scientists, so i had to make do with the visitors centre, but it was quite interesting, and they had bits of an old accelerator on display in a courtyard outside.

--
(1.21 gigawatts) / (88 miles per hour) = 30 757 874 newtons
Re:News for Nerds! by pintpusher · 2007-07-21 05:50 · Score: 1

and now they'll be slashdotted and you'll be able to get a tour, but you can only take one step every 30 seconds. or maybe there's a coral cache?

--
man, I feel like mold.
Re:News for Nerds! by 32Na · 2007-07-21 10:10 · Score: 2, Interesting

The folks at CERN maintain a set of libraries for analyzing nuclear and high-energy physics data sets, known as 'root'. These also include the Parallel ROOT Processing Facility, or PROOF. I'm guessing that PROOF will play an important role in the analysis of this experiment once it comes online.
Re:News for Nerds! by Anonymous Coward · 2007-07-21 15:06 · Score: 0

Why not 10,000 PCs with 100 Gbit/s ready to fluid operations?

100 x PCs 1GB/s x 1 month = 267,840 TB/month

Buy them 1,000,000 TB and there is not problem of capacity, it reaches upto aprox. 25%, the rest 75% is free, with a margin of 3 months more to follow receiving a lot of information.

I has time to ERASE them less than one month!!!

What problem have you?
Re:News for Nerds! by dargaud · 2007-07-22 06:37 · Score: 1

Try to get an interview for a job there. I had a few and every time I got a tour of (different) facilities: Alice, Atlas, the accelerator. Monster physics. And now I work in particle physics too...

--
Non-Linux Penguins ?
Re:News for Nerds! by Anonymous Coward · 2007-07-22 17:10 · Score: 0

Mmmmm... hot dense soup.
Re:News for Nerds! by perturbed1 · 2007-07-30 08:27 · Score: 1

I am a guide and I know that the CERN visits service is overwhelmed with requests. So if you want a tour, contact the experiment's secretariats directly and ask for a tour. This works a lot better. The major experiments that one should see are ATLAS and CMS. I dont know who to e-mail for CMS but for ATLAS, send e-mail to Atlas.Secretariat@cern.ch

PC's? by deopmix · 2007-07-20 16:25 · Score: 1

I don't precisely think that CERN is going to be purchasing thousands of dell PCs to analyze the data that they collect. maybe they are talking about a distributed computing project?

Re:PC's? by SnoopJeDi · 2007-07-20 16:28 · Score: 1, Redundant

From TFA:

The ALICE experiment grabs its data from 500 optical fiber links and feeds data about the collisions to 200 PCs, which start to piece the many snippets of data together into a more coherent picture. Next, the data travels to another 50 PCs that do more work putting the picture together, then record the data to disk near the experiment site, which is about 10 miles away from the data center.
Re:PC's? by Anonymous Coward · 2007-07-20 16:36 · Score: 2, Funny

Actually their plan is to store all that data on Commodore 64 cassette tapes.
Re:PC's? by Falstius · 2007-07-20 16:59 · Score: 2, Informative

Actually, there really is a gigantic room at CERN full of commodity PCs that form the first level of computing for the different experiments. The data is then shipped off to sites around the world for further processing. There is a combination of 'locally' distributed computing and world-wide grid being used.
Re:PC's? by 1310nm · 2007-07-20 16:59 · Score: 2, Funny

load"*",8,1
Ok, who put California Games x 10000000000000000000000000000000000000000000000000 0 on the tapes?
Re:PC's? by Anonymous Coward · 2007-07-20 17:56 · Score: 1, Informative

Initially some data is being filtered at the detector pits by the farms of PCs doing the triggering. After that the data will be fed to storage and analysis. CERN has been upgrading its computer centre for quite some while (the main problem is not power supply, but cooling system - thus some of performance benchmarks also include it). Besides CERN (Tier-0) will have high-speed connections (via means of LCG backbone) with many sites around the world and the data processing will be done in a 'global manner'.

You can google on phrase 'service challenge' site:cern.ch, or just go to the LCG site.

--
Milosz
Re:PC's? by Rodolpho+Zatanas · 2007-07-20 18:42 · Score: 5, Informative

load"*",8,1 would load something from a diskette, not a cassette.
Re:PC's? by gardyloo · 2007-07-20 19:25 · Score: 1

No way. Fisher Price spikey plastic records.
Re:PC's? by Anonymous Coward · 2007-07-20 20:28 · Score: 0

Good... but can someone please explain why I have a key labelled "RUN/STOP", and how exactly I am supposed to use it ?!??
Re:PC's? by pipingguy · 2007-07-20 21:13 · Score: 1

What, no Sig?
Re:PC's? by Rodolpho+Zatanas · 2007-07-20 21:25 · Score: 1

If you want to load the first programme on a cassette tape, just push . Also, is sort of like on an IBM PC or compatible. Those are the most common uses. Of course, you already knew that, didn't you?
Re:PC's? by Rodolpho+Zatanas · 2007-07-20 21:29 · Score: 1

This is what you get for not reviewing... "SHIFT+RUN/STOP" is what's missing from the first sentence. "RUN/STOP+RESTORE" goes between "Also," and "is sort of". Sorry.
Re:PC's? by Fizzl · 2007-07-21 01:22 · Score: 1

Hmm, bugger. What was the command to load from a tape?
Just 'LOAD' and press play?

--
Bot Assisted Blogging
Re:PC's? by 1310nm · 2007-07-21 05:05 · Score: 1

My experience with the Commodore 128 came very early in life while playing games to distract myself from chicken pox, so I'm not what you would call an "expert".
Re:PC's? by Rodolpho+Zatanas · 2007-07-21 05:12 · Score: 1

That would do it.
Re:PC's? by Sipos · 2007-07-21 07:17 · Score: 1

The article is actually just talking about the PCs used to store ALICE's data.
The data analysis for the LHC experiments uses the LHC Computing Grid . The analysis is spread out between different sites (exactly what happens at which sites depends on the experiment). The PCs which make up the grid though are largely (fom what I have seen) Dell PCs.
Data analysis for particle physics is highly parallelisable (large number of events, on which you want to run the same analysis) so large numbers of inexpensive computers makes more sense than super computers.

If Only... by i_ate_god · 2007-07-20 16:32 · Score: 4, Funny

If only I could get porn that fast

there I said it, let's move on now.

--
I'm god, but it's a bit of a drag really...

The mere thought of that much bandwidth... by Khyber · 2007-07-20 16:33 · Score: 0, Offtopic

I think I just creamed myself. The hardware needed to push that much data must be insane!

--
Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.

Re:The mere thought of that much bandwidth... by dosguru · 2007-07-20 16:42 · Score: 3, Interesting

A standared dual CPU dual core HP server with Windows can keep a 4Gb FC pretty full if set up correctly. I work for a large bank, and we have many a Solaris box that can keep 4 or even 8 2Gb FC cards full into our FC and SATA disk arrays. Not to trivialize the extreme coolness of what they are doing at all, but a PB of data with a few PB of I/O in a day isn't what it used to be. I'm just glad to see they don't use Polyserve, it is worthless for clustering and has caused more downtime at work than it has ever prevented. If they really have that much data they should use 10Gb FC or Infiband. Even our stodgy old bank is implementing our first infiband system so we can move IO at 12Gb instead of the slow 4Gb links.

Um no...it's a product placement for Quantum by xxxJonBoyxxx · 2007-07-20 16:34 · Score: 4, Informative

Um...no. Actually, it's a product placement PR piece about Quantum's StorNext. (Read page 2...)

Re:Um no...it's a product placement for Quantum by Anonymous Coward · 2007-07-20 17:55 · Score: 5, Funny

Um...no. Actually, it's a product placement PR piece about Quantum's StorNext. (Read page 2...)
We knew there were some serious nerds on Slashdot, but to be potential customers for the same RAID system as CERN, whoa! :)
Re:Um no...it's a product placement for Quantum by Anonymous Coward · 2007-07-20 22:31 · Score: 0

We knew there were some serious nerds on Slashdot, but to be potential customers for the same RAID system as CERN, whoa! :)

1 GB/second would be quite a lot of porn.
Re:Um no...it's a product placement for Quantum by MMC+Monster · 2007-07-21 01:20 · Score: 1

1 GB/sec of porn likely just means very high resolution.

Imagine being able to zoom in on some porn and seeing the actual DNA of the actress(es)...

--
Help! I'm a slashdot refugee.
Re:Um no...it's a product placement for Quantum by Midnight+Warrior · 2007-07-21 06:16 · Score: 5, Insightful

You may think of it as product placement, but I use it. I even provide the occasional blog entry on it on Advanced Topics. I sat through a RedHat performance tuning class that was quite excellent. But when they came to the part about ext3 and tuning it, well, let's face it - ext3 just isn't going to scale. I started with Veritas' Filesystem which is pretty nice. If you're a small-time admin, then you never get beyond a local, 4U disk array. Once your group spends more than US$2million on servers though, it's obvious what the problem is: Storage - The Final Frontier. SAN and clustered filesystems allow a level of scalability completely unheard of before.
They also completely left out anything but a tagline of their multi-tiered solution. I wish they'd talked more about how CERN supports 500Gbit per second aggregate throughput to their disks (at least they implied that). 50GB/sec (or so) is probably the toughest I/O problem you've ever dealt with, or will deal with for a long time. Whose RAID controllers did they use? Did they focus on speed (ASIC and ISL minimization), availability (redundant fabrics), or both? Did each node get dual 4Gb links or just one?
If this had been an advertisement, they would have discussed some 3.0 features like LAN clients.
So, in short, it's easy to say it sounds like an advertisement. Quite possibly, Quantum (formerly ADIC) coerced them into getting the piece written. But if this had been an advertisement, there is so much more that is going on under the hood that would have been said. Large, fast, distributed filesystems are non-trivial and take an extreme amount of engineering and testing. StorNext really is good at what they claim to do.
If you want to read about some of the drawbacks though, I yak about them on my blog. Sorry for the plug.
Re:Um no...it's a product placement for Quantum by jesboat · 2007-07-23 13:19 · Score: 1

50 GB/s = 50*8 Gbit/s = 400 Gbit/s.

400 is nowhere near the order of magnitude different from 500 as your choice of units imply.

Too many video games may stunt your growth. by Futurepower(R) · 2007-07-20 16:39 · Score: 0, Offtopic

Quote from the Slashdot story, as it is now: "... and the SAN tasked with catching the flood of data."

I think the correct word, considering the meaning, is "caching".

"Don't run with scissors" advice: If you play video games too much, it will stunt your growth. People need time to learn about the real world around them, not just a fantasy world. Part of learning about the real world is learning how to communicate with other people.

Re:Too many video games may stunt your growth. by OverlordQ · 2007-07-20 18:22 · Score: 2, Insightful

I think the correct word, considering the meaning, is "caching".

No, I believe the word was catching. As in:
They're throwing all this data at me and I gotta catch it.

--
Your hair look like poop, Bob! - Wanker.

Striped FS by Anonymous Coward · 2007-07-20 16:42 · Score: 1, Interesting

They're probably using an object based parallel filesystem like Lustre or something similar. I heard at At Sun they build these all the time with one customer striping data against 214 PCs acting as data engines all within one Lustre Filesystem. All the storage is direct attach but SAN can't even come close to the speeds generated and all the equipment being used is commodity hardware.

Re:Striped FS by BoberFett · 2007-07-20 17:01 · Score: 1

Even so, this is a big project. If they're projecting 1GB/sec for a month, even if they use the latest massive hard drives (1TB) they'll still need around 3000 of them. Presumably they won't need them all online full time, I'd imagine they'll use some sort of hot swapping. Still, that's a lot of data.

Gigabits or Bytes? by Easy2RememberNick · 2007-07-20 16:46 · Score: 1

2,629,743 seconds in a month, so... 2,629,743 GB or 328,717 GB?

It's too late to do math.

Re:Gigabits or Bytes? by snowraver1 · 2007-07-20 16:55 · Score: 2, Informative

2.6 Petabytes. The article says that they will be collecting petabytes of data. Also, the article clearly said GB. GB= Gigabyte Gb= Gigabit. The thing that I thought was "Wow that's ALOT of blinking lights!" Sweet!

--
Copyright 2010. All rights reserved. This comment may not be copied in any way including, but not limited to caching.
Re:Gigabits or Bytes? by Easy2RememberNick · 2007-07-20 17:02 · Score: 1

Yeah that's why I was confused they had a big B but if you're talking network speed it's usually described in Gigabits, small b.

"In total, the four experiments will generate petabytes of data."

Divide at least 1 PB by four and you get 256 TB, I was close with 328 TB, so it must be Gigabits.
Re:Gigabits or Bytes? by ireallylovelinux · 2007-07-20 17:21 · Score: 0

its 328 Terrabytes
http://www.translatorscafe.com/cafe/units-converte r/data-storage/calculator/gigabyte-to-gigabyte/
Re:Gigabits or Bytes? by Anonymous Coward · 2007-07-20 19:23 · Score: 2, Funny

"2,629,743 seconds in a month, so... 2,629,743 GB or 328,717 GB?"

If they were smart, they'd choose February. They could save ~172800 seconds and therefore some disk space!

Pfft... by Anonymous Coward · 2007-07-20 16:52 · Score: 0

You think that's bad? I've gotta download 2 Grateful Dead torrents for, like, 3 months from Lossless Legs! I scoff at your God (particles)!

Re:Pfft... by Anonymous Coward · 2007-07-20 17:07 · Score: 0

Dammit, I meant 2 torrents per day. Too late, think, caffeine, etc.

A rough calculation on disk size by mad+zambian · 2007-07-20 16:53 · Score: 2, Interesting

based on 1GB/sec * ((3600 * 24) * 31) means over 2.5 Petabytes.
Wow.
Something like 3000 of the current ITB drives.
How long until Exabyte level storage is required for some project or another?

--
Trying to associate Microsoft with "fun" is like trying to associate Satan with aromatherapy. -Tycho

Re:A rough calculation on disk size by BlueCollarCamel · 2007-07-20 17:52 · Score: 1

Thursday, maybe Friday. Depends on the weather.

--
1&1 - Cheap domain and web hosting.
Re:A rough calculation on disk size by Regolith · 2007-07-20 20:05 · Score: 1

Tsk, tsk... you forgot about redundancy.

--

Bow before my sig, for it is good.
Re:A rough calculation on disk size by VisionMaster.NL · 2007-07-21 00:45 · Score: 2, Interesting

Estimates are that the four LHC experiments will produce about 15 PetaByte/year. The LHC will be online for about 15 years (maybe more). All data is kept permenantly. This means that there is a fail-safe copy stored at CERN on tape, which is a big task to perform constently. But that data is not worked on there, it is spread through the huge tubes of the academic fibers to big data centers around the world. All that online copy is replicated and is stored at two geographical locations. At each location most of the data (depends on the type) is mirrored to tape. So the largest volumes is on tape but there is still a need for mucho-grande cache servers, which are mostly huge disk-arrays. The 10-11 biggest data centers will store and perform (re-)processing of the data at the rate in which it is produced. The other 190 data centers are calculating the physics analyses of all the (local) science groups. ps: Most data is analyses/processed multiple times.
Re:A rough calculation on disk size by Anonymous Coward · 2007-07-21 02:59 · Score: 0

Something like 3000 of the current ITB drives.
How long until Exabyte level storage is required for some project or another?

And people wonder why Sun designed ZFS to be expandable to 128 bits (though POSIX interfaces only currently handle 64 bits).
Re:A rough calculation on disk size by Anonymous Coward · 2007-07-21 04:11 · Score: 0

But you don't need to store all 2.5 Petabytes on disk. You can use an HSM on the back end of you filesystem to pull the data off to tape and just stage data back on an as needed basis. http://www.hpss-collaboration.org/ is a good example.

Pseudo-Dupe? by DTemp · 2007-07-20 16:55 · Score: 1

http://science.slashdot.org/article.pl?sid=07/05/2 2/009216 From two months ago.

Re:Pseudo-Dupe? by Easy2RememberNick · 2007-07-20 17:05 · Score: 3, Funny

Nah it's just, spooky article submission at a distance.

The other article appeared because it knew this one would be submitted later in the future.
Re:Pseudo-Dupe? by Anonymous Coward · 2007-07-22 17:18 · Score: 0

"Nah it's just, spooky article submission at a distance."

Much better than unfair, spooky article submission at a distance.

Searching for God by Cassini2 · 2007-07-20 17:04 · Score: 1

These physicists always say they are searching for God or "the God Particle". But what happens if they switch the big God Particle generator on, and God suddenly appears? What if we really do find God?

What are all these geeky physicists going to do then? Do we really want to find God? Do we really want physicists finding God? Is this a good thing?

Just wondering ...

Re:Searching for God by Easy2RememberNick · 2007-07-20 17:08 · Score: 1

Considering the Creationist versus Science debate that would be be quite a hoot! Irony at it's best, Science discovers God.
Re:Searching for God by Ai+Olor-Wile · 2007-07-20 17:16 · Score: 1

What percentage of you is Dan Brown, and how can we extract the other parts? o_O
Re:Searching for God by Svenne · 2007-07-20 17:17 · Score: 1

Well, they could find pink invisible unicorns as well. What then?

--

Slagborr
Re:Searching for God by ammonynous · 2007-07-20 17:28 · Score: 2, Funny

With a God Particle generator, wouldn't you *generate* God? Wouldn't that be a hoot?!?
Re:Searching for God by Umbral+Blot · 2007-07-20 17:51 · Score: 1

Im going to assume, for the sake of charity, that you are being facetious, and know that what is called the "god particle" has nothing whatsoever to do with god as the word is usually understood (as an invisible sky wizard). The higgs particle is only called the god particle as a joke by physicists to emphasize how awesome finding it would be. Once upon a time I was bothered by that, because I thought it was confusing for no good reason, but now I see it as a kind of intelligence test.

--

Philosophy.
Re:Searching for God by edsyc · 2007-07-20 18:01 · Score: 2, Funny

The physicists don't really want to find god, it's just the only way they can get research funding under the bush administration.
Re:Searching for God by Saikik · 2007-07-20 18:05 · Score: 1

I think this whole thing is a farce. These scientists should spend time on more interesting problems, even looking for aliens seems less ridicules than this. If God wants to be seen/found, he would be. If he doesn't want to be seen/found, he won't be. If we find something that is hiding, then by the very definition of finding it, that thing we find can't be God. That thing becomes fallible, it is something else.
Re:Searching for God by damiam · 2007-07-20 18:24 · Score: 1

Dude, just STFU if you don't understand the nature of the research. No one's trying to find God here.

--
It's hard to be religious when certain people are never incinerated by bolts of lightning.
Re:Searching for God by JMZorko · 2007-07-20 19:41 · Score: 1

* sigh *

When society finally moves past this silly "god" idea, as it has with a lot of other silly superstitions, we will all be _so_ much better off. I really don't mean this in a disrespectful way (i'm perfectly willing to be your friend, even if we disagree on the "god" thing, and I can respect people for a lot of reasons, even if we differ on matters of religion), but it's really what I think / feel. It's done waaaaay too much harm, and the sooner we jettison it, the better.

Regards,

John, your friendly neighborhood happy-go-lucky heathen godless atheist :-)

--
Falling You - beautiful
Re:Searching for God by Anonymous Coward · 2007-07-20 20:24 · Score: 0

This isn't about finding god, it is about finding the force that gives everything mass. This is EXTREMELY important work, it isn't religious mumbo jumbo.
Re:Searching for God by jdh41 · 2007-07-20 20:28 · Score: 1

Dissection. Questioning. Years of resulting research.

Possibly ask him for more funding.
Re:Searching for God by StarfishOne · 2007-07-20 22:08 · Score: 1

OMG! Ponies!! :D
Re:Searching for God by Cassini2 · 2007-07-21 00:01 · Score: 1

Thanks to all the posters for the witty responses.
Re:Searching for God by master_p · 2007-07-21 01:16 · Score: 1

Well, if God can be downloaded over fiber links and stored in a few thousand TBs of data, then why not?
Re:Searching for God by Ant+P. · 2007-07-21 02:36 · Score: 1

Do we really want physicists finding God? Is this a good thing? Of course.

They'd have proven God exists, AND done so using scientific methods. It'd be the death blow for half of the worst organisations in the world.
Re:Searching for God by Chutulu · 2007-07-21 03:33 · Score: 0

thanks to those geeky physicists you are now using a computer and have a television set at home. If it improves your quality of life i don't know...
Find God? which one? Zeus? The Muslim god? The christian god? Dagon? Thor? Endovelicus? The Spaguetti monster? Santa Claus? The tooth fairy?
Re:Searching for God by alienmole · 2007-07-21 05:51 · Score: 2, Funny

Find God? which one? Zeus? The Muslim god? The christian god? Dagon? Thor? Endovelicus? The Spaguetti monster? Santa Claus? The tooth fairy?
Geez, haven't you been paying any attention to physics for the last fifty years? Just as atoms (or protons, neutrons and electrons) can be used to create any kind of matter, so the God particle can be used to create any kind of god. Once they isolate the God particle, they'll be able to create a god who actually likes science. Or create a god who isn't always short of cash, so doesn't need to raise money via telethons. Or the ultimate challenge: create a god that doesn't exist.
Re:Searching for God by ResidntGeek · 2007-07-21 06:31 · Score: 1

No, you've got cause and effect backward. Religion is a symptom. People don't kill each other and fail to understand how the world works because they're religious, they're religious because they don't understand how the world works and want to kill each other. If religion goes away people will find plenty of other things to fight about, and plenty of other reasons to watch American Idol instead of reading books.

--
ResidntGeek
Re:Searching for God by Sipos · 2007-07-21 07:39 · Score: 1

Searching for the Higgs boson (the particle that the media are talking about when they say God particle) has nothing to do with God.
I have never heard another physicist refer to it as the god particle (except when satirising media articles on the subject). It is an unfortunate expression and I don't like it.
The Higgs boson is an important part of a theory called the Standard Model, our best theory of the interactions of fundamental particles. It is the last fundamental particle in this theory left to be discovered and is an integral part of it. Looking for the Higgs is about understanding nature at its most fundamental level and I think that makes it worthwhile. The LHC is aiming to find or disprove the existence of the Higgs and explore other new physics. There is also a huge amount of spin out technology which comes from it which is also useful (although I am far less interested in that). If you are interested in what the LHC is for have a look at CERN's FAQ or wikipedia .
Re:Searching for God by Saikik · 2007-07-21 07:45 · Score: 1

thx
Re:Searching for God by ScrewMaster · 2007-07-21 14:54 · Score: 1

I can't argue with you on that point ... but, on the other hand, if more people didn't have their religion blinding them to the true nature of reality (which is a lot less predictable than they would like, yet for all that much more interesting), they might have a chance to really learn to think. And you can't deny that, down through the centuries, religions of one sort or another have provided a ready excuse for mayhem.

--
The higher the technology, the sharper that two-edged sword.
Re:Searching for God by ResidntGeek · 2007-07-21 18:30 · Score: 1

Yeah, but something else will provide an excuse for mayhem, and people will never really learn to think, even when given the chance. It'd be nice, though, wouldn't it?

--
ResidntGeek

FTL by unchiujar · 2007-07-20 17:09 · Score: 3, Funny

"Due for operation in May 2008, the LHC is a 27-kilometer-long device designed to accelerate subatomic particles to ridiculous speeds, smash them into each other and then record the results."
Next up ludicrous speed!!! Better fasten your seat belts...

--
Shakespeare poems - infinite monkeys with infinite time.Computer tech support - a few trained ones working from 9 to 5.

Re:FTL by Roger+W+Moore · 2007-07-20 20:37 · Score: 1

Due for operation in May 2008, the LHC is a 27-kilometre-long device designed to accelerate subatomic particles to ridiculous speeds

Actually it would be better to say "ridiculous energies" because the speed of the protons in the LHC will barely be any faster than those in the Tevatron...but the energy is seven times larger thanks to relativity.
Re:FTL by Eravnrekaree · 2007-07-20 23:51 · Score: 1

Are there any practical applications of this research in technology? And what will this research tell us about the universe?
Re:FTL by Anonymous Coward · 2007-07-21 06:22 · Score: 0

will barely be any faster

Wow, you should go back and study the Lorentz factor and rapidity again. The gamma difference is huge, as you note.

Thousands of disk drives. by Anonymous Coward · 2007-07-20 17:11 · Score: 3, Funny

Hmm, lets see. ~2700 TB of data over one month. Let's store it on 500 GB drives. That's 5400 disk drives just to store the data. Add in the the extra drives for parity, and a few hundred hot spares, this thing could easily use OVER NINE THOUSAND drives.

Re:Thousands of disk drives. by noggin143 · 2007-07-20 18:42 · Score: 5, Informative

We are expecting to record around 15PB / year during the LHC running. This data is stored onto magnetic tape with petabytes of disk cache to give reasonable performance. A grid of machines distributed worldwide analyses the data. More details are available on the CERN web site www.cern.ch.
Re:Thousands of disk drives. by complex(179,-70) · 2007-07-20 20:36 · Score: 1

Using, what, 2A @ 5V + 0.5A @ 12V peak while spinning up? That's 18kA @ 5A + 4.5kA @ 12V ; 144kVA. That's a lot of whacking big AC/DC convertors, and some serious plumbing to make sure all these drives get a stabile power feed at all times.
Re:Thousands of disk drives. by UnHolier+than+ever · 2007-07-20 21:57 · Score: 2, Insightful

How much is a 500Gb drive worth nowadays? 150$? So your OVER NINE THOUSAND drives are worth about, hum....1.35M$. CERN has a budget of about 5B$. It's the speed at which data is coming that's a problem. Not the total amount of data.
Re:Thousands of disk drives. by Anonymous Coward · 2007-07-21 00:38 · Score: 0

I'm not sure whether to be ashamed or proud of slashdot that no other responder got the joke.
Re:Thousands of disk drives. by OzRoy · 2007-07-21 02:49 · Score: 1

$150 for a SATA disk maybe. These would be Fiber Channel whose price would easily be about $600 per disk
Re:Thousands of disk drives. by Anonymous Coward · 2007-07-21 05:10 · Score: 0

WHAT 9000?!
Re:Thousands of disk drives. by VisionMaster.NL · 2007-07-21 07:01 · Score: 1

Actually ... all the data that is kept is going on to a huge tape store. The live data is flowing to the biggest data centers on the Grid. So, basically all the huge distributed data centers will need to pay for the needed storage.
Re:Thousands of disk drives. by Iron+Condor · 2007-07-21 15:25 · Score: 1

Just because the data goes through fiber doesn't mean the disks have to be FC disks. Actually that's usually a bad idea, as the physical operations of the HDD (moving heads around and such) limit the rate at which the disk can actually accept data to much less than FC speeds.
I have a couple local RAID boxes here and I feed data into them through 4Gb fiber channel, but the box just consists of 16 run-off-the-mill SATA drives in a RAID5 config (yielding 14 data disks (plus 1 parity plus 1 hot spare) times 750GB =~10TB storage). From the outside the box looks, works and acts like a 10TB fiber channel drive, but replacing the actual physical drives is much cheaper and since I'm effectively writing to 14 disks in parallel, they can actually digest the ~300MB/sec I'm throwing at them.
Yeah, 300MByte/sec. Which is ~2.4Gbit/sec. Through a single fiber. Sustained. Which isn't even the limit of a 4Gb FC. All commodity hardware. Why does CERN need 500 fibers for moving ~3 times as much data than me? (granted, I do it for hours, not months - but the number of fibers should be a throughput question, not a capacity one).

--
We're all born with nothing.
If you die in debt, you're ahead.
Re:Thousands of disk drives. by Johnny+Mnemonic · 2007-07-21 17:48 · Score: 1

You ought to have Google store that data for you. Seriously.

Google has collaborated on other scientific projects before, and one in particular has many of the same needs as the LHC, the LSST. Of course, it doesn't hurt that one of the primary backers of the LSST is an ex-Google exec.

I'm confident that Google is capable of dealing with large data stores, even those on a multi-PB scale, with reliability and redundancy.

--

--
$tar -xvf .sig.tar

Not Informative by x_MeRLiN_x · 2007-07-20 17:18 · Score: 1

That refers to the number of PCs involved in storing the data.

Where's the problem? by femto · 2007-07-20 17:40 · Score: 1

1GB/s * 1 month = 1GB/s * 30 day/month * 24 hour/day * 3600s/hour = 2,592,000 GB.

A big disk (Seagate ST3750640AS) is 750GB.

324,000 GB / 750GB/disk = 3,456 disk.

At AUD467 per disk this will cost AUD1,613,952 (plus computers+net). Even cheaper if you allow for the fact these are retail
prices for wholesale quantities. Let's take the startup current of 2A@12V as the worst case power
consumption and we end up with a maximum power of 83kW. That's less than 35 domestic heaters (2.4kW ea).

Okay, it's not trivial stringing together 3,456 disks, but it's not exceptional either. It is no bigger in
scale than a typical university network. Or just buy a few of the Internet Archive's Petaboxes off the shelf.

Re:Where's the problem? by Anonymous Coward · 2007-07-20 17:51 · Score: 0

They should use this Sun Microsystems switch which comes with 3,456 ports. Perfect fit.
Re:Where's the problem? by cjanota · 2007-07-21 08:56 · Score: 1

I agree with the parent. The investment company that I worked at over this summer used about 400TB in magnetic tapes per week for the past month and a half. And CAT 6 can carry just over 1GB/s.

--
You can fix anything with duct tape and sticks.
Re:Where's the problem? by kayditty · 2007-07-21 16:23 · Score: 0

uhh.. you used the wrong dividend (you incorrectly converted the 2.592 * 10^6 * 2^30 bytes to bits), but you got the right quotient. how did you manage that?

Ah, engineers. by onebuttonmouse · 2007-07-20 17:51 · Score: 1

'During this one month, we need a huge disk buffer,' says Pierre Vande Vyvre, CERN's project leader for data acquisition. One might call that an understatement.

I expect he referred to the problem of finding the God Particle as "distinctly non-trivial".

--
MacBook Pro. Worst name since the Bicycle

Fun problem by bob8766 · 2007-07-20 17:58 · Score: 2, Insightful

The network is one thing, but just processing that amount of data is incredible.

200 computer breaks the 1GB chink into more manageable 5MB/Sec chinks of data, but then they still need to handle the metadata that figures out how to put it all back together. On top of this they'll need to have some redundancy in case of data loss, and how the load is redistributed if a machine croaks.

These are good problems, it would be a fun system to work on.

Re:Fun problem by xyvimur · 2007-07-20 18:05 · Score: 1

It's a hell of fun to work on.
Re:Fun problem by pipingguy · 2007-07-20 21:11 · Score: 1

What's this 'Google' thing I keep hearing about?

A correct use of the word "catch". by Futurepower(R) · 2007-07-20 17:59 · Score: 4, Insightful

Not only did the Slashdot editor not catch a spelling mistake, he apparently didn't catch the fact that the linked article is an advertisement from CXO Media, which, according to its web site, mixes articles and advertisements: "Through our integrated media and marketing programs we provide..."

From the linked article: "... the team is using Quantum's StorNext software as its file system..."

Question: Did a Slashdot editor get paid directly for running an advertisement disguised as an article? Or was someone in Slashdot's parent company paid "under the table"? Or did the parent company get paid?

Anyone wanting to read a real article from 2005 about CERN's data handling, data storage, and data processing can download this PDF file: Grid Computing: The European Data Grid Project.

Real articles begin this way: "The computing challenges for LHC are: * the massive computational capacity required for analysis of the data and * the volume of data to be processed."

Advertisements begin by talking about God and murder, this way (from the article linked by Slashdot): "CERN's Search for God (Particles)..."

and "Maybe you last read about CERN (the European Organization for Nuclear Research) and its massive particle accelerators in Angels & Demons by Dan Brown of The Da Vinci Code fame. In that book, the lead character travels to the cavernous research institute on the border of France and Switzerland to help investigate a murder."

Re:A correct use of the word "catch". by Anonymous Coward · 2007-07-20 19:23 · Score: 2, Interesting

Absolutely correct. (I didn't read the article - i work with the Grid [LCG])

Just two points which may seem to ignore:

Firstly, the Data is of no use if it just sits on some tape/disk drives at cern, because it has to be analyzed as well if you actually want to find something. Back when the whole thing started, it was deemed to expensive to build a central analysis facility at CERN, so the LHC Community Grid was created, some ~100 datacenters around the world with lots (>20k) of CPUs and lots of diskspace. The Data from CERN is automatically distributed over high-speed links to the main site in every "cloud" (called Tier 1, for example Karlsruhe in Germany) and then from there to the smaller centers. Then, if a physicist sends an analysis job, it finds its way to the site where the data is and works there, so there is no unnecessary copying.

Secondly, in addition to the real data coming out of the detector physicists need also quite a lot of simulated "Monte-Carlo" data. The production and storage of that has already been going on for some time, and is already taking up some millions of Gigabytes.

By the way, the data storage management system preferred by a lot of the lhc guys is called D-Cache ( dcache.org ), developed at DESY in Hamburg and free for non-commercial use (this is only for you if you have lots of disks. and preferably a tape robot as a backend.)

Idea by bguzz · 2007-07-20 18:13 · Score: 1

./go.sh | bzip2 > results.bz2 Problem solved!

Re:Idea by KillerCow · 2007-07-20 18:29 · Score: 3, Funny

./go.sh | bzip2 > results.bz2 Problem solved!

No. No, my friend; you do not grasp the scale of this project.

./go.sh | bzip2 | bzip2 > results.bz2
Re:Idea by VisionMaster.NL · 2007-07-21 07:09 · Score: 1

I'd like to see you cards by asking you to bzip2 one month of data on your prefered desktop. Now you're contributing to science and you have an extended coffee break.
Re:Idea by Anonymous Coward · 2007-07-21 07:39 · Score: 0

> ./go.sh | bzip2 | bzip2 > results.bz2

That's wasteful. Do it in one command: ./go.sh | bzip2 -1337 >results.bz2

(of course, someone still needs to implement the 133700 KB block size - in which case you could just use lrzip)

Not So Huge by PenGun · 2007-07-20 18:45 · Score: 5, Informative

It's only 5x HD SDI single channel ~ 200MB/s. Any major studio could handle this with ease.

SDI is how the movie guys move their digital stuff around. A higher end digital camera will capture at 2x HD SDI for a 2K res, 4:4:4 colour space. A few of em' and you got your 1GB/s easy. Spools onto godlike RAID arrays.

Get em' to call up Warner Bros if they have problems.

30 racks, $1.8M in disks by this+great+guy · 2007-07-20 18:46 · Score: 3, Informative

Assuming a non-RAID 3x-replication tech solution (what Google do in their datacenters), using 500-GB disks (best $/GB ratio), they would need about 16 thousands disks:

.001 (TB/sec) * 3600*24*30 (sec/month) * 3 (copies) * 2 (disk/TB) = 15552 disks

Which would cost about $1.8M (disks alone):

15552 (disk) * 110 ($/disk) = $1710720

Packed in high-density chassis (48 disks in 4U, or 12 disks per rack unit), they could store this amount of data in about 30 racks:

15552 (disk) / 12 (disk/rack unit) / 42 (rack unit/rack) = 30.9 racks

Now for various reasons (vendors influence, inexperienced consultants, my experience in the IT world in general, etc), I have a feeling they are going to end up with a solution unnecessarily complex, much more expensive, and hard to maintain and expand... Damn, I would love to be this project leader !

Re:30 racks, $1.8M in disks by bockelboy · 2007-07-22 04:31 · Score: 1

So, 30 racks per month ... for a 15 year project. Say you only buy the first 5 years worth of disks - a simple 1800 racks.

The LHC went with a tape-based, distributed storage system. Seven T1 sites around the world keep the data on tape (one copy at CERN, another copy at a T1 site). They do reconstruction of the raw data, and write the reconstructed data on disk. They then distribute the reco data to a T2 site, which has a large amount of disk-only space (like you suggest). The individual physicist does the analysis at the T2 site.

"Catching a flood"? by Futurepower(R) · 2007-07-20 19:12 · Score: 1

I thought about that, but when was the last time you heard someone talk about "catching a flood"?

Re:"Catching a flood"? by JamesTRexx · 2007-07-20 20:23 · Score: 1

I think it was some 2000+ years ago, some guy with an animal fetish. I believe his name was Noah.

--
home
Re:"Catching a flood"? by Ultra64 · 2007-07-20 21:11 · Score: 1

When was the last time you heard someone talk about "caching a flood"?

It's not that much, really by diamondsw · 2007-07-20 19:17 · Score: 1

1GB/sec is 3.6TB/hour, or 86.4TB/day, or 2.5PB in a month. That's really not all that huge for enterprise or scientific storage. I see that all the time in hosted environments.

--
I don't know what kind of crack I was on, but I suspect it was decaf.

Just A Thought by Anonymous Coward · 2007-07-20 19:24 · Score: 0

I was wondering how this ranks against what Google handles in a month. Either way, I'm sure Google's got plenty of storage to handle the needs for the experiment.

E-Mail it to Google by Nom+du+Keyboard · 2007-07-20 19:52 · Score: 2, Funny

Just e-mail it all to Google. By then gMail should be able to handle that much per user.

--
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."

CERN DAQ is generally impressive by torako · 2007-07-20 20:01 · Score: 5, Interesting

It's important to distinguish between the amount of data generated during an event right in the detector and the filtered data that in the end will be kept and saved on permanent storage. The ATLAS detector, for example, has a data rate in the order of terabits per sec during an event. There's a pretty sophisticated multi-level triggering system whose purpose it is to throw out most of that data (~98%) and only look for interesting events.

Right now, the average event size for ATLAS is 1.6 MByte and the system is designed to keep around 200 events per second, or roughly 300 MByte. This isn't much of course, but you have to consider that the bunch crossing rate (i.e. the rate at which bunches of protons will collide and generate events) is 40 MHz.

So you have to design a system that boils this rate from 40 MHz down to 200 Hz and only keeps the interesting parts, while also buffering all the data in the meantime. For this reason, the first trigger level is entirely implemented in hardware right in the detector and reduces the rate down to 75 KHz with a latency of 2.5 s. The rest of the trigger works on clusters using Linux computers and has a latency of o(1s).

Better yet... by curryhano · 2007-07-20 20:11 · Score: 2, Interesting

...all this data will be distributed to a handfull of TIER1 sites (CERN is TIER0) all over the world (about 10). At the TIER1 sites the data will be preprocessed. The TIER1 sites distribute their preprocessed data to TIER2 sites which are the places where the international scientists work. I work at a TIER1 site and we face a lot technical challenges with this project. At a TIER1 site as I mentioned, the data is preprocessed too, so we will need a compute cluster and the necesary bandwith internally to move the data around. With each new software release (about every six months), ALL raw data has to be reprocessed with the new software. All results have to be stored. So for every part of raw data we will have to store preprocessed data for every software release. Of course a lot of data will be stored on tape but we expect that the dataflow from CERN (for us 150MB/s to disk and 75 MB/s to tape) will be the least of our problems. Moving the data around and preprocessig the data is probably a bigger problem in the long run. An the fact that the machine will be running for about 15 years or so, this will be a very long run!

way too late/early by ducomputergeek · 2007-07-20 20:16 · Score: 0

After coming home from a party I read this as "CERN Searches for GOLD Particles." Thought to myself, WTF?

--
"The problem with socialism is eventually you run out of other people's money" - Thatcher.

Not really that much storage bandwidth... by RulerOf · 2007-07-20 20:27 · Score: 2, Insightful

TFS makes a point about storing 1 GB (presumably GigaBYTE) of data per second, but THAT feat is already in widespread use, spefically for the digital manipulation of 4k film. The company that produces the systems that process this film data is called Baselight.

Basically, 4k film, at a resolution of 4096x3112, requires approximately 50MB per frame @ 24 fps. That comes out to about 1.22GBps, and maninuplating the data doubles it to 2.44GBps. The systems[PDF] that Baselight sells run 8 nodes and 16 processors, and it's all built with commodity hardware and some flavor of Linux. Apparently they use 3ware RAID cards... and I found out about this by browsing 3ware's site when I was shopping for a RAID controller.

Either way, my point is, it's been done, and there's a real world application that requires that type of data storage bandwidth and has nothing to do with scientific data. :P

--
Boot Windows, Linux, and ESX over the network for free.

I'd like something like that by Anonymous Coward · 2007-07-20 20:37 · Score: 0

...will use 500 optical fiber links to feed particle collision data to hundreds of PCs at a rate of 1GB/second, every second, for a month.

My God, think of the porn you could download with that setup. It would be biblical (in a pornographic sense).

Choice of filesystem by Tracy+Reed · 2007-07-20 20:42 · Score: 1

I am really surprised they did not use the Lustre filesystem for their data storage since it is vendor neutral, open, and designed for exactly this sort of thing. The lustre guys report being able to obtain tremendous bandwidth and scalability. I have not yet been able to play with Lustre but I look forward to doing so.

ALICE is not Higgs Hunting by Roger+W+Moore · 2007-07-20 20:48 · Score: 2, Informative

The ALICE experiment is actually concentrating on heavy ion collisions which is why they only worry mainly about one month/year, the rest of the time the machine is running protons for the other experiments, ATLAS and CMS, which will look for the Higgs. ALICE will hopefully study the quark gluon plasma but, as far as I know, has no plans to look for the Higgs.

God Particles by pipingguy · 2007-07-20 21:08 · Score: 2, Funny

According to a guy that I met yesterday on the street (he was talking to himself or somebody) the only way I could meet God (and hopefully His particles) was through his son. WTF? Can't even *God* get a good secretary these days?

Finding God by Mark_MF-WN · 2007-07-20 21:11 · Score: 3, Funny

Don't worry -- the products of particle accelerators only exist for a few picoseconds. If God is created during a collision event, he will wink out of existence so fast that we'll only become aware of his presence by the shower of Mormonions and PatRobertsonite particles impinging on the detection apparatus.

CERN: been there as teenager by Anonymous Coward · 2007-07-20 21:22 · Score: 0

About 15 years ago, I was around 16, we made a one week school-trip to Geneva and we also visited CERN for one day. Even if you don't understand anything of what they are doing there, the place is impressive. I was very surprised that you actually can visit such a facility. I bet there are similar labs/institutions near you happily showing around and showing off what they do :)

Re:CERN: been there as teenager by ScrewMaster · 2007-07-21 14:44 · Score: 1

I've taken tours of Fermilab and a couple of other big labs. I'm only a software engineer so what they're working on is usually over my head, but it is interesting nevertheless. My father was a physicist, and it was fascinating to see a lot of what he talked about when I was growing up in actual working hardware.

--
The higher the technology, the sharper that two-edged sword.
Re:CERN: been there as teenager by Anonymous Coward · 2007-07-30 08:23 · Score: 0

Err... Sorry... So if you understand something of what we are doing here, you wouldn't be impressed?! (yes, you friendly AC from CERN!)

Decoded Transcripts from experiment: by sm4096 · 2007-07-20 21:53 · Score: 0

1>Oh God!
1>Oh God!
2>Oh God thats is awesome. more!!!

3>Hey wait, you guys are studying the wrong kind of collisions.
1>Sorry just stress testing the hard drives.
2>Yeah we couldn't help it, the vibrations of so many drives...

The CIO article is incomplete by quarkie68 · 2007-07-20 22:12 · Score: 2, Informative

OK, we got a half way overview of CERN's decision, with some bold statements of questionable validity. I am submitting the criticism purely on the grounds of being really interested in large data storage, I don't work for any large storage vendor, but I am an architect of storage systems.

First of all, with the statement "and it's (StorNext) completely vendor independent": Lot's of other solutions provide flexibility about choosing the hardware vendor from a theoretical perspective. The theory says that if vendor A makes a SAN, vendor B makes a RAID controller, C a disk cabinet and D offers a clustered FS, and all comply to the relevant standards, you can plug them together and expect them to function. However, imperfections in the standards, hidden proprietary optimizations, always dictate certain configs and combinations for optimum performance. There is a lot of work to be done in the StorNext and other similar products, until they claim full flexibility. My experience in deploying a StorNext based solution on a 1200 node setup says so and to keep the post short, I shall exclude at this stage vendor details, but if someone is interested, I am happy to go over the details. There is vendor dependence if you wish optimum performance. Not to mention that if you mix and match the RAID and SAN cards in the setup, any unfortunate issue might end up in a multi-headache, even if you have solution support (A blaims B, B accusses A, and the game of ping-pong begins). You can never exclude vendor dependence in such a large setup, you have to deal with it.

Then you have the "Clustered file systems are still an evolving category, she says, but enterprise IT is warming up to it.". I can imagine what the author classes as enterprise IT here, but I think there is a bit of an orientation issue. CERN is not exactly the classical enterprise IT environment, is it? Not in terms of their requirements for resilience and capacity. These FAR EXCEED enteprise IT requirements. CERN is a research setup. And the mentality of a research setup (that incubated the WWW after all) is (or should be) that of innovation and playing with some of the latest and the greatest. In fact, some US based research setups have long experimented with other cluster FSes. They are not warming up. CIO claims that StorNext is scalable. It is. But to what extent? Have they excluded for example things such as Lustre? http://wiki.lustre.org/index.php?title=Main_Page If yes, why?

Hang on... by skinfitz · 2007-07-20 23:40 · Score: 1

...just because a SAN is connected at 1Gbit to a machine does not mean there is 1 Gbit of data passing over there all the time.

If I were to write up my house network I could say 'network switches feed data to several computers at 1Gbit per second' - this would be true if I only use it for web browsing - doesn't mean I'm saturating my bandwidth.

Backup options by Mostly+a+lurker · 2007-07-21 00:26 · Score: 5, Funny

I assume they will want to have more than one copy of this for backup purposes. Here is my analysis on their choices. The total data to be backup up (for the month) is taken as a lazy 1 * 60 * 60 * 24 * 30 = 2,592,000 gigabytes

Printed hardcopy. Many authorities recommend this as you do not need to worry about changes in data formats over time. For exact calculation, we would need to know the font they were planning to use and the character encoding. However, let's take a working assumption that they can cram 10KB of data onto an A4 sheet. That implies 259,200,000,000,000 pages. They will probably not want to use an inkjet printer if they use this solution and may, indeed, choose to acquire multiple printers and split the load. A single printer at 10 ppm would take approximately 50,000 years to complete the backup. On 70gm paper, it would weigh a little over two million tons. At any rate, this would certainly produce reams of output.
Diskettes. This was good enough for nearly everyone 15 years ago. It is curious that such a tried and trusted technique is no longer in fashion. I assume regular 3.5" 1.44MB diskettes, generally recognised as easier to handle than 5.25". We shall need around 1,800,000,000 diskettes. One drawback is the person changing the diskettes as each one filled up might become a little bored after a while. On the positive side, the backup will be quite a lot faster than the printed solution. Assuming about one diskette per minute, inclusive of changing disks, the backup could be complete in less than 3,500 years.
Now considered somewhat old fashioned, punch cards were once a mainstay of every programmer's personal backups. Like printed hardcopy, anyone familiar with the character encoding used, could read the data without needing any access to a computer. If we assume 80 column cards, we would need 32,400,000,000,000 cards. I would be somewhat concerned about the problem of getting this stack of cards back in the correct order if I dropped it. With a weight of about 30 million tons and stretching perhaps 6 million miles end to end, handling certainly would be challenging and an accident very possible.
Paper (punched) tape was the only alternative on the first computer I used, a basic early model Elliott 803 without the optional magnetic tape. If I recall correctly, you could manage about 10 characters per inch, so you would need a paper tape over 4,000,000,000 miles long. Hmmm, that would be silly. The other solutions are clearly better.

I am sure other options will be considered, but I just wanted to bring these up in case CERN had failed to consider them

Re:Backup options by ookabooka · 2007-07-21 06:26 · Score: 1

Why not just have volunteers remember the data? If you made a linked list of individuals so that each individual would remember the name/face of the individual after him and also either a 0 or a 1 representing the data he stores. By doing this you would need just under 21 quadrillion people (20,736,000,000,000,000 people to be exact). A doubly linked list would only require that each individual remembers 2 people (before as well as after) which is quite managable. The number of people required obviously goes down with the more information an individual is able to remember.

--
If you are about to mod me down, keep in mind that this post was most likely sarcastic.
Re:Backup options by fatphil · 2007-07-21 07:43 · Score: 2, Funny

Nice figures. If they did use 3.5" diskettes, then they'd have to write 1000/1.44 per second or roughly 700/s. Assuming they could be written to instantly, they'd need to move through a single drive at 700*3.5"/s = 224km/h. Assuming you need to get them stationary to write to them, then they'd need a maximum speed of 448km/h to keep up the mean speed. Don't stand in their way...

Of course, the tower of floppies for each day would be 151km high...

No, I don't know what that is in football fields.

--
Also FatPhil on SoylentNews, id 863

"Too much information" by Gription · 2007-07-21 02:49 · Score: 2

Imagine how deep the personality problems must run in a person who gets all hot because of someone's DNA sequences!

Re:"Too much information" by bhiestand · 2007-07-21 04:03 · Score: 1

Imagine how deep the personality problems must run in a person who gets all hot because of someone's DNA sequences! XX? I'm interested!

--
SWM seeks new sig for a brief fling

"Caching a flood of data" by Futurepower(R) · 2007-07-21 02:50 · Score: 1

"Caching a flood of data" sounds fine to me.

well if no one else is going to say it by Main+Gauche · 2007-07-21 04:10 · Score: 3, Informative

"Imagine how deep the personality problems must run in a person who gets all hot because of someone's DNA sequences!"

You must be new here.

cassette load? by Anonymous Coward · 2007-07-21 04:52 · Score: 0

on my old trash-80, the command was CLOADM
ahh, the memories....

Try 5,000 years ago by benhocking · 2007-07-21 06:30 · Score: 2, Informative

I think you're thinking of that guy who got nailed to the cross (Jesus). Noah was born about 5,000 years ago.

--
Ben Hocking
Need a professional organizer?

but ... by Anonymous Coward · 2007-07-21 12:24 · Score: 0

what is really fascinating is the data collector array in the first place.
the "thing of sensors" that makes these huge amounts of data.
oh well, good luck.
my guess is that it's lossy, from the first "smash" to the last pixel analyzed, sorry ...
maybe somebody will tell us how much got lost in between.
PR doesn't seem something important to CERN anyway, even if they invented the WWW.

Offtopic beyond comprehension [Re:News for Nerds!] by Iron+Condor · 2007-07-21 14:39 · Score: 1

Your units need work: power per velocity is action, not force.

--
We're all born with nothing.
If you die in debt, you're ahead.

Re:Offtopic beyond comprehension [Re:News for Nerd by Ginger+Unicorn · 2007-07-21 22:33 · Score: 1

tell that to google

http://208.69.34.230/search?ie=UTF-8&oe=UTF-8&sour ceid=navclient&gfns=1&q=1.21+gigawatts+%2F+88+mile s+per+hour

--
(1.21 gigawatts) / (88 miles per hour) = 30 757 874 newtons

Re:Offtopic beyond comprehension [Re:News for Nerd by lachlan76 · 2007-07-21 23:36 · Score: 1

It's force.

P/v

= (W/t) / (s/t)

= (W/s)

W = Fs, therefore P / v = W / s = F.

Slashdot Mirror

Storing CERN's Search for God (Particles)

154 comments