CERN Releases LHC Data
An anonymous reader writes: Ever wished you had access to CERN's LHC data to help with your backyard high-energy physics research? Today you're in luck. CERN has launched its Open Data Portal, which makes experimental data produced by the Large Hadron Collider open to the public. "The first high-level and analyzable collision data openly released come from the CMS experiment and were originally collected in 2010 during the first LHC run. This data set is now publicly available on the CERN Open Data Portal. Open source software to read and analyze the data is also available, together with the corresponding documentation. The CMS collaboration is committed to releasing its data three years after collection, after they have been thoroughly studied by the collaboration." You can read more about CERN's commitment to "Open Science" here.
I wonder what people will do with this data or what can be learned from it?
have you seen my sig? there are many others like it but none that are the same
You can see the exact moment the earth was destroyed by a black hole.
Can the availability of these data help me — or Iran — develop a nuclear weapon faster?
In Soviet Washington the swamp drains you.
This is just a ploy by SERN to divert our attention away from their clandestine plans to take over the world.
I wonder how this relates to the LHC@home project?
I've got that plus a couple more running and it seems the LHC project has had some down-time lately.
Are they switching to releasing the data in this form now?
I was told, at a NSF meeting not many months ago, that CERN never makes its data openly available and never would and that US scientists should just plan on getting European collaborators if they want to work on it.
Now, if we just get ESA to start releasing the Rosetta data...
Hopefully Bennett is looking at the data closely and can give us some insight on fusion research. He is a frequent contributor. Additional input by mdsolar and HughPickens would be appreciated as well.
I downloaded some of the cern data, and while its in xml format, it seems to be just a bunch of numbers. I originally thought that it would be nice to take some data, and plot them to 3d or something. But the data format just seems so strange that it's probably not going to happen. Nothing useful kind of graphs can be extracted from that data :-)
Here's the CMS collaboration web front. It has a map showing where the collaborators are - lots in the US
http://cms.web.cern.ch/content/cms-collaboration
What you are getting is the reconstructed data. To be able to do anything scientifically valuable with it you have to understand the intricate details of the reconstruction software, the trigger, the calibration etc. etc. To be honest I would be amazed if anyone outside CMS will be able to do much with it at all. I'd also expect that there will be bandwidth restrictions on accessing the data since the dataset is multi-PB (if it is the full set of run I data).
We did a similar exercise with the D0 experiment at Fermilab several years ago and it was of interest to practically nobody. I expect there may be somewhat more interest with this being the LHC data but I'd be surprised if anything useful comes of it given the massive amount of work required to be able to do a useful analysis. The best I can think of is that this might make a really nice undergraduate course project or, with some pre-written, high level analysis code, perhaps even as outreach for high school students.
It’s laudible that they release it after 3 years, but then rather reprehensible that they are so scared that someone else might discover something faster than them, and they don’t release it straight away.