SKA Telescope Set To Generate More Data Than Current Net
angry tapir writes "The forthcoming $2.1 billion Square Kilometre Array (SKA) radio telescope could generate more data per day than the entire internet when it comes online in 2020, according to the director of the International Centre for Radio Astronomy Research (ICRAR), Professor Peter Quinn. SKA — which Australia with New Zealand and South Africa are competing to host, and which will help the search for Earth-like planets, alien life forms, dark matter and black holes — will be 10,000 times more powerful than any telescope currently used. Slashdot has previously discussed the proposal to use 'Skynet' — a grid-computing-based solution for processing and storage."
Cue in the HAARP freaks in 3...2...1...
Democrats or Republicans. They are both taking us to the same place and they are not afraid of us anymore.
finally we can retire that saying/meme about Internet is for porn....or that the mass storage market is driven by porn
unless...I guess we happen to be able to spot the alien equivalents via this SKA.....
Really, how much "data" is "generated" by the internet every day?
Sure, there's lots of traffic, but that's millions of copies of the same data.
The new data going on to the internet probably isn't too heinous in quantity.
And the summary blew the meme. It's not "generate more data per day than the internet", it's "generate more data per day than the earth does in a year, and conduct more internal networking traffic than the internet."
I prefer the PUNK telescope myself.
Given that we just had a Slashdot article about how the space based James Webb telescope is already on the ropes with Congress, http://science.slashdot.org/story/11/07/07/0038247/Congress-Dumps-James-Webb-Space-Telescope, perhaps we should be worried that the same will happen for SKA. Unlike Webb, SKA is an international project, so it won't necessarily go down the tubes if the US backs out. Moreover, the US has backed out of European lead science projects before often with very little warning. SKA is going to allow some very interesting work. Among other things, SKA might be able to detect extraterrestrial life, either through direct radio signals (from intelligent life) which would be a really big deal, or more indirectly detect non-intelligent through the analysis of extrasolar planets' atmospheres (such as the detection of large amounts of oxygen). SKA will also be used for many other astronomy and astrophysics projects, such as examination of supernovas. SKA is very good science, let's hope that the penny pinchers who repeatedly cut tiny science programs while leaving defense, social security and medicare alone will not touch it. In the long run, science helps everyone.
Sure but think about how much more data the Internet generates today than it did 10 years ago. I'm sure we'll be able to cope in 10 years. Also, I know the LHC produces a ton of data. Does anyone know how it compares?
More data/traffic than the internet NOW, or then the internet will be doing in 2020? 'cause let me tell you, the last 9 years has had a pretty sharp increase...
Those who fail to understand communication protocols, are doomed to repeat them over port 80.
This will true until someone post the data on the pirate bay.
Is it just me, or are the statements
1) "could generate more data per day than the entire internet when it comes online in 2020" and
2)"Slashdot has previously discussed the proposal to use 'Skynet' — a grid-computing-based solution for processing and storage"
Somewhat incompatible?
(Warning: Australian content ahead!)
I hope this lays down a water-tight case for the NBN going ahead - or the combination of the two being a catalyst for each other. If there's one thing this is good for demonstrating, it's that future data requirements will outgrow the current infrastructure very quickly, and a project which is as far-sighted as installing FTTH throughout the country has a justification for the unforeseen benefits it can help happen.
(and bah humbug to anyone who thinks the SKA isn't justified to begin with!)
Man who leaps off cliff jumps to conclusion.
I did not know that the music created by the likes of "Reel Big Fish", "The Clash" or other Ska bands could be used to find intelligent life somewhere...
If the CIA invented a device that listened into every phone call in the entire world, real time and dumped it all as a WAV file on a storage device in the basement, would that really do them any good at all?
...calling your global computing initiative 'theskynet' might be detrimental to your acceptance by the technologically affluent who operate the computers you need help from.
Finally a new unit for large amounts of data, appropriate for this new golden age of progress and hyperbole What I want to know is how many library of congresses are in an 'internet-year'...
The project is expected to deliver up to an exabyte a day of raw data, compressed to some 10 petabytes of data in images for storage.
So, 10 petabytes of data - who cares about the raw source. I work for a video streaming company and we have several petabytes of H.264 video. If that were to be uncompressed into 30 FPS 1080p raw data, it would be 50-100x that, so already approaching a couple hundred petabytes. And think of all of the JPEGs out there - why don't we just uncompress all of those for the comparison as well?
A (likely conservative) back of the hand calculation by Google estimated at least 5 exabytes accessible on the Internet (so even the wrong estimate is wrong). I'd imagine a huge percentage of that is compressed video, audio, and images. So, basically 5 exabytes vs 10 petabytes - it's off by 3 orders of magnitude.
Calling the Clash a ska band is like calling the Beatles a ska band.
Really, how much "data" is "generated" by the internet every day?
Sure, there's lots of traffic, but that's millions of copies of the same data.
Where do you put the line between data that's entirely new and a copy? Will an MP3 file with it's ID tags modified be classified as completely different? Does a web page count as entirely new traffic if the URL is different because of the session ID? Would you count each line of code in a program's source as an unique entity? Also, compressed data traffic makes sure any larger chunk of a file is pretty much unique.
I find these kinds of imprecise estimations pointless, because I know they're made up to entertain, not to inform.
Hmm. Old news, was reading about this about 6 months ago.
Peter Quinn appeared on the ABC Science Show with Robyn Williams talking about exactly this in Oct 2010, here is the transcript/podcast. Very interesting stuff.
Aussie, aussie, aussie, oi, oi ,oi!
In nine years my home DVR system will generate more data than the entire internet!
"They that give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety"-B.Franklin
3.5" floppies sent through the mail. You get an extra back in the form of floppy disks and the extra billion or so pieces of mail being sent every month will help out the post office.
This begs for research on how the neural system in the eye compresses raw data into information that can be transmitted through the limited bandwidth optic nerve. Collecting data is great but we can drown in data. We need information processing near to the source.
Something Coming online a DECADE from now may have the same amount of data as the internet TODAY?!?
It's not like data on the net is growing exponentially or anything...
I know this guy who is working on the SKA project here in Australia. Have a look at this site http://www.icrar.org/education/edxn_resources - I have seen this presentation before, and it gives a rough idea of how much data is involved and how much processing is required.
As an excerpt from the PDF of the presentation, each image of the sky will be multiple wavelengths of light (i.e. 2d image x multiple wavelengths gives a 'cube' shape). Each 'cube' of data will be between 4 and 5 TB, and there will be about 1000 of these every 5 days to make up an image of the entire sky visible by the array. This of course is the data that has been pre-processed and had the atmospheric noise removed, so the total bandwidth and processing requirements are many orders of magnitude higher.
It's an exciting project.
storage called Skynet... ...I'm sure this won't turn out bady.
I wonder if the system will run on a Perl one-liner.
This merely says they are unable to filter out all the noise. Producing huge numbers of bits isn't a particularly remarkable achievement. I guess there is far less than 1% actual information in that bit stream.
I could generate just as much with a big computer and exclusive access to random.org :)
was a checkerboard pattern in black and white. When reached for comment, the leading scientist on the project had this to say,
"Pick it up, pick it up, GO!"
In Soviet Russia jokes are formulaic and decidedly non-humorous.
THE SKY IS FALLING!
With so much data processing being generated what will they rely on to pickitup pickitup pickitup?