30,000-Core Cluster On Amazon EC2
Joining the ranks of accepted submitters, hooligun writes with an article in Ars Technica about a rather large cluster built on EC2. From the article: "The details are impressive: 3,809 compute instances, each with eight cores and 7GB of RAM, for a total of 30,472 cores, 26.7TB of RAM and 2PB (petabytes) of disk space. Security was ensured with HTTPS, SSH and 256-bit AES encryption, and the cluster ran across data centers in three Amazon regions in the United States and Europe."
what exactly was it used for?
If you don't know the scale from yocto to yotta, then you need hand in your geek card.
Imagine the possiblilities. /. on steroids.
Obi-Wan: "I felt a great disturbance in the Force, as if millions of voices suddenly cried out in terror and were sudden
Let's hope their European nodes didn't use any certs from Diginotar.
But at least they weren't using RSA tokens for authentication.
Before anyone else asks what I was about to, the full title of the article is: $1,279-per-hour, 30,000-core cluster built on Amazon EC2 cloud
How does that compare to the cost-per-core-hour for other Amazon EC2 offerings? Is this a value meal deal or just a lot of burgers?
fair.org counterpunch.com truthout.com indymedia.org salon.com
eff.org guerrilla.net debian.org gentoo.org
Security ensured by HTTPS, SSH and 256-bit AES? That's alright then, no need to worry about security any more.
Help me understand something here ... isn't EC2 really one gargantuan cluster far bigger than 30,000 cores? So why is it news that it ran a big job? Was there some significant step forward in software that allowed features that were not previously available on EC2?
Put my fist through my alarm clock with its ding-dong death inside my ear. - The Blackjacks.
Finally - a computer that can fully handle Windows 8!
I assume the next step is Permutation City?
But, what was their password? So many details about that computer, but no password...
when you dodge taxes
How powerful would one estimate linking multiple cloud and, of ten percent of the top 500 supercomputers would be? That would be one massive number cruncher.
Didn't we just read that the US has fallen to #25 on the international speed list? So, is this like serving up Skynet over a 28.8 modem?
---- Please be nice in case my Slashdot karma ~= my real life karma.
They should donate a couple of hours a month to curing a disease.
Nobody gives two fucks. There's over 2 million registered UIDs on this site. Slashdot isn't some popularity contest. Quit turning Slashdot into fucking Digg or Reddit.
Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
It is great that folks can do this kind of stuff in the cloud. However, looking at the cost structure for this cluster, it becomes very expensive, very quickly, at around $11,204,040 per year to operate assuming 24x7 operation. Also, this kind of HPC configuration is not for everyone. HPC environments can require high speed interconnects with super low latency, like Infiniband or Myrinet which is something I doubt that Amazon has invested in due to the cost of these types of solutions. If your application utilizes message passing interface (MPI) this solution is most likely not for you.
So for one off types of jobs, Amazon may be a great choice where you do not need to make a large initial upfront investment in technology and your HPC usage is spotty at best. However if you need a long term solution that will be operating 24/7 or you have security requirements that prevent operation in the cloud, you are better off making the investment into high performance computing. Just keep in mind that this type of solution is not for everyone.
You can verify the certificates used with DigiNotar... well.. site looks down... maybe when they are back up...
Imagine a beowulf cluster of these!
Seeing how their total lack of reliability kept some webpages (meneame.net) down for about two days after that storm last month i just hope they put it to work on weather prediction.
Why don't you post an html address for this machine, so we can see if it can survive being slashdotted?
But will it run Crysis at max settings?
A few months back, old Anonymous tried to 'take out' Amazon by using the LOIC (Low Orbit Ion Cannon) in 'hive mind' mode. They found that even with dozens of kids sending thousands of requests per second they couldn't DDOS a.k.a. "Slashdot effect" Amazon. There was disappointment in geekdom. Lesser targets fell easily, but this one proved too strong, "We don't have enough machines" one hapless and dejected nerd wrote. Now you know why Sparky! A single machine running LOIC can take out one or two hyperthreaded cores, but if they have thousands of hyperthreaded cores, you will need half as many machines to run the attack (for it to be effective). They never had that many recruits willing to let their IP address be collected by the feds (or enough who didn't realize that their IP address would be sniffed by the feds), and so were unable to take Amazon down. A big fat load balancer feeding 30000 hyperthreaded cores can swallow everything the LOIC can feed it and not lose a byte or break a sweat. I heard that Amazon has 'on demand' systems that power up and respond to external requests. If 1000 new machines started sending loic requests at a given time, you could temporarily DDOS amazon (for the amount of time it takes for their servers to ramp up and handle the load). I suspect that would last only for a few minutes.
Neat, but for any job that isn't embarrassingly parallel, communication latency and speed will kill you when your nodes are spread across continents. If you're not doing any communication, well then groovy. Usually these large core servers are only 'earning their keep' when you're taking advantage of very fast interconnect hardware and doing things that can't be done by just a bunch of CPUs.
A squid eating dough in a polyethylene bag is fast and bulbous, got me?
Mainly because they're LITERALLY QUITE IMPERVIOUS to the "unstoppable attack online" - the DoS/DDoS!
How/Why?
Well, because they've "overbuilt" their ENTIRE infrastructure for one thing, hardware-wise, for telecommunications!
They also monitor their levels of "hits" their sites get, & IF they get too high (as they do in DDoS)? They can stall any that are coming from unrouteable addresses (think 172.x.x.x, 192.x.x.x, etc./et al).
Microsoft also has settings in its IPStack that help "stall out" DDoS/DoS too:
SynAttackProtect, EnableDynamicBacklog, MaximumDynamicBacklog, MinimumDynamicBacklog, TcpMaxDupAcks, TcpMaxHalfOpen, TcpMaxHalfOpenRetried
Those ALLt work IN COMBINATION with one another @ THE OPERATING SYSTEM'S IP STACK LEVEL!
(Also in combination with hardware measures noted above both MS & Amazon do, to stall off "the unstoppable attack method" (the DoS/DDoS)).
APK
P.S.=> It's the "how & why" you NEVER see Amazon OR Microsoft getting news that "anonymous/lulzsec" (& the like) "took down MS/Amazon via DoS/DDoS"...
(Because you KNOW that'd be "big news" IF it went down, of course... especially around here with all the "Pro-*NIX" sentiment (from the sockpuppet FUD spreading trolls that keep 100 user accounts to attempt to fool others with that bullshit))... apk