GPU Supercomputer Could Crunch Exabyte of Data Daily For Square Kilometer Array
An anonymous reader writes "Researchers on the Square Kilometer Array project to build the world's largest radio telescope believe that a GPU cluster could be suited to stitching together the more than an exabyte of data that will be gathered by the telescope each day after its completion in 2024. One of the project heads said that graphics cards could be cut out for the job because of their high I/O and core count, adding that a conventional CPU-based supercomputer doesn't have the necessary I/O bandwidth to do the work."
I could beat that with my VIC-20.
Interesting but not surprising that they are looking at the GPU route. The fine article doesn't explain enough though. Does anyone know exactly what they are trying to process?
They're not going to even start collecting data for another 12 years, yet they're basing their hardware estimates on what's available today. Compare today's GPUs with those made 12 years ago. I'm guessing they'll be able to crunch their data in 2024 by just using a video game console.
09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
I guess they did not get anyone that technical to write that article or the summary.
For I/O I guess they mean memory bandwidth. GPUs have a LOT of memory bandwidth from their cache memory, the problem is that they sit at the end of a PCIe bus from the CPU and the CPU has to handle most of the book keeping (and the actual IO, i.e. taking data from an external source).
So what is important is the compute density i.e. how much computation you do for each piece of data. Getting stuff into the GPU is slow, getting stuff out is slow, but doing stuff on the data is very very fast (because you have so many compute units and so much memory bandwidth).
That is also the way they are programmed, with the main code running on the CPU, and then the kernals getting launched on the GPU with explicit or implict transfer of data from the CPU memory to the GPU memory and back again.
I do have high hopes for stuff like Fusion ( http://en.wikipedia.org/wiki/AMD_Fusion ) which gets rid of the PCIe bus, and make it a lot easier to get data to the GPU cores and back again.
And if you are going to mention GPU machines, why not mention titan ? ( http://www.olcf.ornl.gov/computing-resources/titan/ )
We use GPU cards for computed tomography, and large reconstructions went from taking days, to hours to minutes. OpenCL should be mature in 12 years so they can go with that instead of CUDA, and by then GPGPU computing will probably be using the hybrid APU chips that AMD is starting to market. The bandwidth on the Tesla cards right now is the bottleneck as the PCI bus transfer speeds can cause huge wait times for large data sets. Plus even the biggest Tesla cards only have 4GB or on-board memory, which is not enough. I'd rather have the chips be on-board and have direct access to 512GB of ram for large data sets. Although I can't wit for the Kepler chips to come out, they'll probably reduce computation times by another factor of 3 for our image processing problems.
> believe that a GPU cluster could be suited to stitching together the more than
> an exabyte of data that will be gathered by the telescope each day after its
> completion in 2024
Nah, I'll let you use an app on my Galaxy S12
(-1: Post disagrees with my already-settled worldview) is not a valid mod option.
They've finally created a computer that can run Crysis at 60 fps.
Thanks, Reagan+Thatcher. Now academics basically jack around with extremely expensive and utterly useless crap.
I miss the days when scientists applied their brains rather than their grant money.
"One of the project heads said that graphics cards could be cut out for the job because of their high I/O and core count, adding that a conventional CPU-based supercomputer doesn't have the necessary I/O bandwidth to do the work." And maybe one of these days even the national labs will realize that billions and billions of CPU's that can barely talk to one another do not a supercomputer make.
a conventional CPU-based supercomputer doesn't have the necessary I/O bandwidth to do the work.
I work in HPC and the trend is towards heterogeneous architectures ( CPU+accelerators). Moore's law, power requirements and economics are dictating that trend. It's definitely a stretch to claim that you get better I/O bandwidth with GPUs. Even with PCI Gen 3, the effective bandwidth you get per CPU core is greater than that of an 'equivalent' GPU core.
Remember those old tape fridge days where data buses where 256bit and could transmit stuff faster than they could process them?
Twelve years is a long time for the state of the art to change, GPUs being a perfect example. Let's have this discussion again in ten years...
GPUs have small memory footprints. SKA will be processing HUGE images and data sets. And the image creation cannot be broken up into discrete independent chunks. So the I/O between GPUs is a real problem. Obviously CPUs have the same problem as the on chip memory is (relatively) tiny, but they are designed to pull on the much larger system memory which should be adequate.
Image analysis may well be a different kettle of fish.