CERN Releases 300TB of Large Hadron Collider Data Into Open Access (techcrunch.com)
An anonymous reader writes: The European Organization for Nuclear Research, known as CERN, has released 300 terabytes of collider data to the public. "Once we've exhausted our exploration of the data, we see no reason not to make them available publicly," said Kati Lassila-Perini, a physicist who works on the Compact Muon Solenoid detector. "The benefits are numerous, from inspiring high school students to the training of the particle physicists of tomorrow. And personally, as CMS's data preservation coordinator, this is a crucial part of ensuring the long-term availability of our research data," she said in a news release accompanying the data. Much of the data is from 2011, and much of it is from protons colliding at 7 TeV (teraelectronvolts). The 300 terabytes of data includes both raw data from the detectors and "derived" datasets. CERN is providing tools to work with the data which is handy.
A legitimate use for my seedbox!
I just can visualize a horde of crackpots using this data to fuel fringe theories, find messages from God and prove the existence of aliens.
That being said, this is awfully cool from CERN. The raw data will be really useful in academic environments, and the Linux visualization tools are great.
Should only take 5 million years or so to download...how much is each extra meg after 1GB again???
If I'm not mistaken, the LHC has been publicly funded, so these data should have been public to start with. Anything else is bs.
I understand there are tools to work with these data, but even so, 300TB is a lot. Wouldn't it be better, assuming they want to encourage future generations of particle physicists, to open source the tools and provide better instruction on how one should manage these data? That seems like half the problem. No way will anyone in high school download 300TBs to play with. Even if they could, what would they use to play with it?
300 TB?
How many Libraries of Congress is that?
"Grab them by the pussy" -- President of the United States of America
Yeah you really need to upgrade from Telex to something a bit more modern.
Staff was paid.
It was available to all scientists of the funding and visiting countries. Now as the scientists are through with it you can have a look too.
It may be better to stick this behind an API of some form where we can call subsets of the data. No one earth, outside of a handful of people, would have the infrastructure to play with this. Its not like we have 300TB SANs in our homes or schools.
With an API some useful things like sampling, etc, could already be performed and made available along side the raw data. If people really wanted more that an API could deliver, they could define a sibset and have the API generate iso images of that data for download.
It should have been available to the whole population...
By the time you have downloaded the 300 TB, they'll have built another, bigger, particle collider, and released an even bigger tarball about that one.
how many people have 300 TB of storage? it won't fit on your iPhone, buddy.
Maybe now, we can unlock the mysteries of Steins Gate! Mwahaha!
It is now. Before that the people who developed the experiments got first access. I personally understand that perfectly. They invested decades of their lives.
human is dead, mismatch.
I know I know, replying to AC, but you think the staff cared about being paid?!
"If I'm not mistaken, the LHC has been publicly funded, so these data should have been public to start with. Anything else is bs."
"It should have been available to the whole population..."
Such massive ignorance about how International Science actually works means only one thing- x0ra is not very... bright.
So the Cern Supreme Soviet Central Committee just met and decided that everybody with the means and intelligence, and who wanted access to the raw and filtered data, could have it- except for x0ra, who is a poopy-head. He even admits it; this is just from the last month:
"...What you call "press" is an awfully one sided propaganda machine selling the UN agenda...." /. is a hideout for SJW and other anti-capitalist crypto-communist anarchists..."
"...I don't mind being called a racist / bigot / whatever..."
"...hummm! my dick just getting hard again !..."
"did you mean http://goatse.info/ ?"
"we're not only trying to fuck "stuff", but pussies, tities, mouthes, asses as well !"
"...I *do* believe we have a population problem. Hopefully, war / disease / climate change will take care of the problem within the next decades."
"I'd like to, but libtards fuck it up beyond recognition."
"Oh, and by the way, while I'm a pretty selfish prick,..."
"SJW need targets to spur their hatred..."
"Screw millennials, they need a crash course into real life...
"Typical SJW asshole argument..."
"yeah, I always forgot
"We should ban bananas !"
"my dominant hand is busy doing something else..."
Some may object that I'm quoting x0ra out of context. In context, he is even worse. But I still think that he should stay around as a cautious example for our Youth- One should not put a diaper on their head after already soiling it.
Can't wait to print it all out!
Cool. Where's the torrent? It's not in TPB yet.
const int one = 65536; (Silvermoon, Texture.cs)
SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
Before this, the largest collection of collision data was the Russian dash-cam footage on YouTube
Just curious how many floppy disks would it take to store 300 TB?
Sure, staff cares about being paid. However, research is research... and speaking as a long-time researcher, having worked in several major universities, there is no way to know on a given day whether a colleague is onto the next great idea or is staring at the wall. Frankly, most of the time, I don't even know if I'm wasting my time or not. So some basic salary to keep people in academic research is needed, and then some extra perks are still required too. Since we judge people (for hiring, for promotions, ...) based on publications, rights to first publish results you worked on are standard throughout science.
Sure are a lot of articles about the Large Hardon Collider lately.
http://opendata.cern.ch/about/CMS
Why? What interest does the general population have in access to the LHC data? They've already release a subset of the data for educational purposes, in addition to this considerable data dump. It serves no public interest to make the whole data set available to everyone, and in fact would run contrary to the public interest: the data set is absolutely massive (the LHC produces petabytes of data per day), and the costs associated with making that data available to the public would be non-negligible.
If a specific individual is interested in access to the data, they're certainly free to email their local (or not even necessarily local) university department associated with the LHC and ask for it, and they could probably get access to a subset of it, if they've shown genuine interest. And by "genuine interest", I mean have already downloaded, processed, examined, and understand much of the already publicly available data, to the point where they are capable of performing actual scientific research on the data, and aren't simply interested in wasting already-precious scientific research money and time in making some kind of political or philosophical point.
"None can love freedom heartily, but good men; the rest love not freedom, but license." --John Milton
CERN releases data. And the Jihad begins?
ponder how high tech God is.
Jesus is the way to God.