aggregate.org · Domains · Slashdot Mirror

Re:Try These by Bodhammer · 2012-07-31 09:00 · Score: 1 · on Ask Slashdot: Good Books and Tools For a Software/Hardware Hobbyist?

forgot a couple:
http://www.phy.davidson.edu/instrumentation/NEETS.htm
http://aggregate.org/hankd/piaee12.pdf

Cluster software & GPU experence by PAPPP · 2011-09-13 11:11 · Score: 5, Informative · on Ask Slashdot: Best Use For a New Supercomputing Cluster?

I assume this is an epic troll, but am going to give an honest answer anyway, because there are some legitimate questions buried in there.

I work with a aggregate.org a university research group which has a decent claim to having built the very first Linux PC Cluster, set some records with them (KLAT2 and KASY0 were both ours), and still operates a number of Linux clusters, including some containing GPUs, so I feel like I have some idea of the lay of cluster technology. It is *way* overdue for an update (and one is in progress, we swear!), but we also maintain TLDP's widely circulated Parallel Processing HOWTO, which was the goto resource for this kind of question for some time.

In a cluster of any size, you do _not_ want to be handling nodes individually. There are several popular provisioning and administration systems for avoiding doing so, because every organization with a large number of machines needs such a tool. The clusters I deal with are mostly provisioned with Perceus with a few ROCKS holdovers, and I'm aware of a number of other solutions (xCat is the most popular that I've never tinkered with). Perceus can pass out pretty much any correctly-configured Linux image to the machines, although It is specifically tailored to work with Caos NSA (Redhat-like), or GravityOS (a Debian derivative) payloads. Infiscale, the company that supports Perceus, releases the basic tools and some sample modifiable OS images for free, and makes their money off support and custom images, so it is pretty flexible option in terms of required financial and/or personnel commitment. The various provisioning and administration tools are generally designed to interact with various monitoring tools (ex. Warewulf or Ganglia) and job management systems (see next paragraph).
Accounting and billing users is largely about your job management system. Our clusters aren't billed this way, so I can't claim to have be closely familiar with the tools, but most of the established job management systems like Slurm, and GridEngine (to name two of many) have accounting systems built in.
The "standard" images or image-building tools provided with the provisioning systems generally provide for a few nicely integrated combinations of tools, which make it remarkably easy to throw a functioning cluster stack together.

As for GPUs... be aware that the claimed performance for GPUs, especially in clusters, is virtually unattainable. You have to write code in their nasty domain-specific languages (CUDA or OpenCL for Nvidia, just OpenCL for AMD) and there isn't really any concept of IPC baked in to the tools to allow for distributed operations. Furthermore, GPUs are also generally extroridnarly memory and memory bandwidth starved (remember, the speed comes from there being hundreds of processing elements on the card, all sharing the same memory and interface), so simply keeping them fed with data is challenging. GPGPU is also an unstable area in both relevant senses: the GPGPU software itself has a nasty tendency to hang the host when something goes wrong (which is extra fun in clusters without BMCs), and the platforms are changing at an alarming clip. AMD is somewhat worse in the "moving target" regard - they recently deprecated all 4000 series cards from being supported by GPGPU tools, and have abandoned their CTM, CAL, and Brook+ environments before settling on OpenCL, and only OpenCL. Nvidia still supports both their C

Cluster software & GPU experence by PAPPP · 2011-09-13 11:11 · Score: 5, Informative · on Ask Slashdot: Best Use For a New Supercomputing Cluster?

I assume this is an epic troll, but am going to give an honest answer anyway, because there are some legitimate questions buried in there.

I work with a aggregate.org a university research group which has a decent claim to having built the very first Linux PC Cluster, set some records with them (KLAT2 and KASY0 were both ours), and still operates a number of Linux clusters, including some containing GPUs, so I feel like I have some idea of the lay of cluster technology. It is *way* overdue for an update (and one is in progress, we swear!), but we also maintain TLDP's widely circulated Parallel Processing HOWTO, which was the goto resource for this kind of question for some time.

In a cluster of any size, you do _not_ want to be handling nodes individually. There are several popular provisioning and administration systems for avoiding doing so, because every organization with a large number of machines needs such a tool. The clusters I deal with are mostly provisioned with Perceus with a few ROCKS holdovers, and I'm aware of a number of other solutions (xCat is the most popular that I've never tinkered with). Perceus can pass out pretty much any correctly-configured Linux image to the machines, although It is specifically tailored to work with Caos NSA (Redhat-like), or GravityOS (a Debian derivative) payloads. Infiscale, the company that supports Perceus, releases the basic tools and some sample modifiable OS images for free, and makes their money off support and custom images, so it is pretty flexible option in terms of required financial and/or personnel commitment. The various provisioning and administration tools are generally designed to interact with various monitoring tools (ex. Warewulf or Ganglia) and job management systems (see next paragraph).
Accounting and billing users is largely about your job management system. Our clusters aren't billed this way, so I can't claim to have be closely familiar with the tools, but most of the established job management systems like Slurm, and GridEngine (to name two of many) have accounting systems built in.
The "standard" images or image-building tools provided with the provisioning systems generally provide for a few nicely integrated combinations of tools, which make it remarkably easy to throw a functioning cluster stack together.

As for GPUs... be aware that the claimed performance for GPUs, especially in clusters, is virtually unattainable. You have to write code in their nasty domain-specific languages (CUDA or OpenCL for Nvidia, just OpenCL for AMD) and there isn't really any concept of IPC baked in to the tools to allow for distributed operations. Furthermore, GPUs are also generally extroridnarly memory and memory bandwidth starved (remember, the speed comes from there being hundreds of processing elements on the card, all sharing the same memory and interface), so simply keeping them fed with data is challenging. GPGPU is also an unstable area in both relevant senses: the GPGPU software itself has a nasty tendency to hang the host when something goes wrong (which is extra fun in clusters without BMCs), and the platforms are changing at an alarming clip. AMD is somewhat worse in the "moving target" regard - they recently deprecated all 4000 series cards from being supported by GPGPU tools, and have abandoned their CTM, CAL, and Brook+ environments before settling on OpenCL, and only OpenCL. Nvidia still supports both their C

Empowering... by electron_plumber · 2009-07-16 04:29 · Score: 1 · on Low-Budget Electronics Projects For High School?

Way too any intro electronic "experiments" are either underwhelming (I lit an LED!) or black box magic (Build a radio transmitter by following these 37 simple steps! The following paragraph explains how the circuit works...). So I sympathize with the original request. Personally, I think the trick is to use a low end microcontroller and some cool I/O, and get the kids doing some simple, minimal programming so that they feel ownership. The best answer to this used to be a Basic Stamp, but the cost is prohibitive. Doing this on a budget today, I'd probably get some low-end 8-pin PIC, some switches and lights, and a cheap servo motor (one of the sub $4 HXT ones from HobbyCity). Then I'd have the kids share a few PC's loaded with MPLAB (free) and maybe a cheap Basic or C compiler (there are free ones). Finally, you'll need a cheap programmer (a PICKIT 2 or a third party one). It's a bit more work, but that's enough for the kids to do some really cool stuff. The goal here should be to give the kids tools so that they can be confident enough to go off and make their own cool stuff. To get the flavor of some of these ideas, check out: http://aggregate.org/hankd/piaee12.pdf

Re:Kentucky by jadedoto · 2008-09-25 09:30 · Score: 1 · on State of Kentucky Seizes Control of 141 Domain Names

Here in Lexington we've had them for at least 8 years... http://aggregate.org/KLAT2/press.html

A Pragmatic Introduction to the Art of EE by Anonymous Coward · 2008-05-07 04:34 · Score: 0 · on Books On Electronics For the Lay Programmer?

I'm surprised no one mentioned this one: http://aggregate.org/hankd/piaee12.pdf It's a bit dated now (from the mid 90's), but still has lots of good info. It was originally designed as a college text for non-EE majors, but is much more project-oriented than a classic text. By the way, the title was meant to be an homage to the Art of EE - a great book, but a bit intimidating in scope for a newbie...

FTFA: the first to cost less than $100/Gflop? by vikstar · 2007-08-31 16:03 · Score: 1 · on Student and Professor Build Budget Supercomputer

What about KASY0, which had $84 per GFLOP in 2003?

Price/Performance not new... by wilw410 · 2007-08-30 23:37 · Score: 2, Insightful · on Student and Professor Build Budget Supercomputer

The University of Kentucky (where he is coincidently going to grad school) beat his price point years ago on a "real" supercomputer. This super computer was built for about $84 per GFLOP in 2003 and it made the Top500 list when it was built. The Aggregate team at UK is one of the tops in the field when it comes to supercomputers on the cheap.

This has been done before... by wilw410 · 2007-04-14 03:08 · Score: 1 · on Building a Video Wall out of Old Laptops?

by a cluster computing group at the University of Kentucky called the Aggregate http://aggregate.org/. They built a nine laptop display panel that is basically what you are trying to do. It is much more difficult than I thought it would be to do. Here is a video of the panel in action http://aggregate.org/IMG/mvi_5158.avi. And here is the software they created to do it http://aggregate.org/VWLib/.

This has been done before... by wilw410 · 2007-04-14 03:08 · Score: 1 · on Building a Video Wall out of Old Laptops?

by a cluster computing group at the University of Kentucky called the Aggregate http://aggregate.org/. They built a nine laptop display panel that is basically what you are trying to do. It is much more difficult than I thought it would be to do. Here is a video of the panel in action http://aggregate.org/IMG/mvi_5158.avi. And here is the software they created to do it http://aggregate.org/VWLib/.

This has been done before... by wilw410 · 2007-04-14 03:08 · Score: 1 · on Building a Video Wall out of Old Laptops?

by a cluster computing group at the University of Kentucky called the Aggregate http://aggregate.org/. They built a nine laptop display panel that is basically what you are trying to do. It is much more difficult than I thought it would be to do. Here is a video of the panel in action http://aggregate.org/IMG/mvi_5158.avi. And here is the software they created to do it http://aggregate.org/VWLib/.

My Experence by PAPPP · 2006-07-26 13:46 · Score: 3, Informative · on Building Your First Cluster?

I have a stack of five origional Pentium boxes with 32mb of RAM and 2gb harddrives (except for one, with a larger drive for a software repository). Origionally built it to experiment with AFAPI based clustering, but since AFAPI is a reasonably non-invasive setup, it works well for trying other techniques too, everthing from simply running distcc on the nodes to speed up i586 software builds to briefly fiddling about with some of the other clustering options mentioned. Fiddling around with options on a real cluster (running cluster software on a single node really isn't a good impression) that could be reinstalled from scratch in a few hours, and the machines aren't worth enough to matter if it is physically damanged is a great way to learn.

I was expecting something more detailed than this by Zork+the+Almighty · 2006-01-05 12:14 · Score: 2, Insightful · on Rounding Algorithms

I was expecting something a little better than this, like maybe some fast code to study and use.

On on U of K... by tom8658 · 2005-12-06 18:51 · Score: 0, Offtopic · on Reduce Transistor Power Consumption

Seriously, we have some really good programs. Hank Dietz, Bill Dieter, and Tim Mattox have some exceptional results in parallel computing. Until recently, their $40,000 home-made cluster beat UK's million dollar HP Superdome cluster in Linpack ratings. The $40k even factors in the cost of student labor (in the form of pizza) to wire the cluster.

I just wish I could say the same for our CS department... It's been getting steadily better since the College of Engineering adopted it, but they switched to M$ Visual Studio .NET this year, and that really worries me... program internals shouldn't be hidden from the student at lower levels of computer science.

I hope I didn't /. aggregate.org too badly...

Aggregate.org by PAPPP · 2005-09-28 14:11 · Score: 5, Informative · on High-Performance Linux Clustering

For some very good information on F/OSS based clustering, check out aggregate.org. They have really neat ideas, that are reasonably well doccumented and freely implementable/usable. I built a little cluster (AFAPI on a WAPERS switch) with them for my highschool senior project, and it was a great experence.

just ask Hank Dietz! by Anonymous Coward · 2004-10-26 15:50 · Score: 0 · on SGI & NASA Build World's Fastest Supercomputer

http://www.aggregate.org/

just ask Hank Dietz! by Anonymous Coward · 2004-10-26 06:02 · Score: 0 · on Virginia Tech Supercomputer Up To 12.25 Teraflops

http://www.aggregate.org/

just ask Hank Dietz! by Anonymous Coward · 2004-10-25 20:38 · Score: 0 · on Cray XT-3 Ships

http://www.aggregate.org/

Just ask Hank Dietz! by Anonymous Coward · 2004-10-05 19:01 · Score: 0 · on Cray XD1 Now Available

http://www.aggregate.org/

FNN by bofkentucky · 2004-09-20 08:55 · Score: 1, Informative · on Can Anyone Suggest a Good Switch?

Flat neighborhood networks, basically you get to use "cheap" cards and switches in a web configuration to provide a fast interconnect between nodes.

Other than that, a Cisco 6513 with 11 10/100/1000 48 port switch cards would fit the bill to provide a single chasis switch for all 500 nodes. Hope you've got a decent budget, because it will cost you.

Re:http://aggregate.org/ by Nynaeve · 2004-08-06 06:04 · Score: 1 · on Where to Spend $1M on a Cluster?

I've been keeping up with Dr. Dietz's work since Purdue. I really admire his work, and I even ran a small 2-node PAPERS cluster at home using his AFAPI library.

PeTS may be applicable here, especially his research into Flat Neighborhood Networks (FNNs). However, I think that AMD/Intel sytems use too much power (70 watts or so each). A computationally-equivalent cluster of VIA EPIA motherboards (maybe 10 watts each) would be both physically smaller and much easier on the electric bill. At $100 each for a VIA EPIA V10000A or $163 for the newer VIA EPIA M10000 Nehemiah I could afford to both buy a cluster and run it. Running an AMD cluster would use more electricity than I could afford.

The picture in the middle of the PeTS page, KAOSlab.jpg, is my background desktop at work, and I often get comments. I wish I were so lucky as to work with that sort of thing every day. :)

http://aggregate.org/ by donniejones18 · 2004-08-05 15:08 · Score: 1 · on Where to Spend $1M on a Cluster?

I would have the research group that I work with at the University of Kentucky build it. Maybe you should contact my professor, Dr. Hank Dietz.

KAYS0
University Of Kentucky Supercomputer Breaks The $100 Per GFLOPS Barrier

They built the supercomputer for under $40,000 with 128 nodes + 4 spare nodes, just think how many nodes and how powerful it could be with $700,000!

http://aggregate.org/ by donniejones18 · 2004-08-05 15:08 · Score: 1 · on Where to Spend $1M on a Cluster?

I would have the research group that I work with at the University of Kentucky build it. Maybe you should contact my professor, Dr. Hank Dietz.

KAYS0
University Of Kentucky Supercomputer Breaks The $100 Per GFLOPS Barrier

They built the supercomputer for under $40,000 with 128 nodes + 4 spare nodes, just think how many nodes and how powerful it could be with $700,000!

Low Cost Cluster Computing by r0xah · 2004-08-04 21:51 · Score: 1 · on Where to Spend $1M on a Cluster?

I would very much recommend this research site from one of my professors at the University of Kentucky. He has been doing work with cluster super computing for quite some time now and has managed to build some very impressive systems at low costs. Much lower costs than what your current grant is for. With a grant of that size using this professor's techniques you could build a whole bunch of clusters. I would suggest you taking a look at his group's research site aggregate.org.

You can also see one of the specific examples of a very low cost efficient cluster computer. KASY0

Low Cost Cluster Computing by r0xah · 2004-08-04 21:51 · Score: 1 · on Where to Spend $1M on a Cluster?

I would very much recommend this research site from one of my professors at the University of Kentucky. He has been doing work with cluster super computing for quite some time now and has managed to build some very impressive systems at low costs. Much lower costs than what your current grant is for. With a grant of that size using this professor's techniques you could build a whole bunch of clusters. I would suggest you taking a look at his group's research site aggregate.org.

You can also see one of the specific examples of a very low cost efficient cluster computer. KASY0

disagree by Anonymous Coward · 2004-04-13 04:02 · Score: 0 · on Cray CTO: Linux clusters don't play in HPC

I'm sure Hank Dietz would disagree : http://www.aggregate.org

Re:Important items of note by tmattox · 2003-10-22 15:55 · Score: 2, Informative · on Big Mac Benchmark Drops to 7.4 TFlops

I have yet to find a satisfactory description of the network topology they are using. The specs on the Infiniband switches they are using are quite impressive for latency and bandwidth numbers, but without knowing how they are interconnected, its' hard to say if it's latency or maybe bisection-bandwidth issues limiting their efficiency. From the early report of 80% efficiency on 128 CPUs (or was it 128 nodes?) would seem to indicate the problem is with the switch fabric in some way. With ~1100 nodes, communications are having to cross through mutliple switches in any traditional network topology, resulting in higher latency, and possibly bandwidth bottlenecks.

I saw some indication that they were using a Fat-Tree topology, which would eliminate any bandwidth bottlenecks between switches, but the number of switches used didn't seem large enough for a fat-tree. But again, VT just hasn't, as of the last time I looked, released enough information about the cluster to tell.

BTW - My thesis work on Flat Neighborhood Networks (FNNs) used in the KLAT2 and KASY0 supercomputers is finding better ways to interconnect the nodes, given a particular set of network components.