Slashdot Mirror


Researcher Shows How GPUs Make Terrific Network Monitors

alphadogg writes "A network researcher at the U.S. Department of Energy's Fermi National Accelerator Laboratory has found a potential new use for graphics processing units — capturing data about network traffic in real time. GPU-based network monitors could be uniquely qualified to keep pace with all the traffic flowing through networks running at 10Gbps or more, said Fermilab's Wenji Wu. Wenji presented his work as part of a poster series of new research at the SC 2013 supercomputing conference this week in Denver."

10 of 67 comments (clear)

  1. That's it? by drcheap · · Score: 4, Informative

    So in violation of /. convention, I went ahead and read TFA in hopes that were would actually be something more than "we solved yet another parallel computing problem with GPUs." Nope, nothing. Not even some useless eye candy of a graph showing two columns of before/after processing times.

    And the article just *had* to be split into two pages because it would have killed them to include that tiny boilerplate footer on page one. What a fail...at least it wasn't a blatant slashvertisement!

    1. Re:That's it? by timeOday · · Score: 5, Funny
      They said it achieves a speedup of 17x, here is the graph:

      CPU: X
      GPU: XXXXXXXXXXXXXXXXX

    2. Re:That's it? by NothingMore · · Score: 4, Informative

      I saw this poster at the conference and I was not impressed and in fact it was one of the weaker posters that I saw at the conference (it was light on details and had some of the information on the poster when talking about GPU's in general was not entirely accurate). It is really a poster that should not have been at SC at all. While it is interesting in the network sense the amount of data they can process is not anywhere close to the amount that is actually flowing through these large scale machines (up to 10 GB/sec per node) and there was no information about scaling this data collection (which would be needed at extreme scales) to obtain meaningful information to allow for tuning of network performance.

      This poster should have been at a networking conference where the results would have been much more interesting to the crowd attending. Also of note, IIRC the author was using a traditional GPU programming model for computation that is not efficient for this style of computation. The speedup numbers would have been greatly improved by using a RPC style model of programming for the GPU (persistent kernel with tasking from pinned pages). However this is not something I totally fault the author for not using since it is a rather obscure programming technique for GPU's at this time.

    3. Re:That's it? by TeXMaster · · Score: 5, Informative

      Yeah but with this kind of applications the real bottleneck is the fact that the discrete GPU needs to access data through the high-latency, low-bandwidth PCIe bus. For this kind of application, an IGP, even with the lower core counts, is often a much better solution, unless you manage to fully cover the host-device-host transfers with computations.

      I'd be really curious to see this thing done in OpenCL on a recent AMD APU, exploting all the CPU cores and the IGP cores concurrently.

      --
      "I'm never quite so stupid as when I'm being smart" (Linus van Pelt)
    4. Re:That's it? by hairyfeet · · Score: 2

      But at what cost? The power of the high end GPUs like the top Tesla units are frankly insane while the power of both AMD and Intel CPUs has been going down. Last I checked those top o the line GPUs sucked so much juice you could probably run a 32 core Intel and still use less juice and require less cooling.

      --
      ACs don't waste your time replying, your posts are never seen by me.
  2. not new by postmortem · · Score: 2

    NSA already does this, how else you think they process all that data?

    1. Re:not new by JustOK · · Score: 2

      Fuck that. If I have to go into a field to program the gnome arrays, I'm not doing it.

      --
      rewriting history since 2109
  3. Sorry, but ... (wrong tool for the job) by Ihlosi · · Score: 2
    ... the main task of GPUs is floating point calculations, and I doubt you need many of those when monitoring networks. Wrong tool for the job.

    It's like saying that GPUs are "terrific" for Bitcoin mining, until you realize that they require one or more orders of magnitude more power for the same amount of processing than specialized hardware. And network monitoring is probably a common enough task that it's worthwhile to use hardware tailored to this particular job.

  4. Re:I'd like a regex computing device by Ihlosi · · Score: 2

    Get an FPGA development system and implement your hardware in the FPGA, then ask a chip manufacturer to turn it into an ASIC. Expect to pay bucketloads of money on the way, though. It's only feasible if either costs are not an issue or you expect the resulting device to be mass-proced (six or better yet seven digit numbers manufactured per yeat).

  5. Re: wishful thinking by loufoque · · Score: 2

    In practice, most people who publish results of a new algorithm ported to GPU do not have a version well-optimized for CPU, or aren't that good at optimization in the first place. I've had several cases where I could make the CPU version faster than their GPU version, despite them having claimed a x200 speed-up with the GPU.
    If you have a fairly normal algorithm in terms of data access and your speed-up is bigger than 4, you're probably doing it wrong.