More on Virginia Tech G5 Cluster: 17.6 Tflops
daveschroeder writes "BBC World's Click Online has a video report (with text transcript) on Virginia Tech's new 1100-node dual 2.0 GHz G5 Terascale Cluster. The report quotes the performance as 17.6 Tflops. As a point of reference, the cluster would be number 2 on the most recent June Top 500 list, behind only Japan's Earth Simulator, and considerably more than doubling the performance of the current number 3 1152-node dual 2.4 GHz Xeon MCR Linux cluster. Assuming the performance figure accurately reflects the LINPACK score (which it should; since the deadline for submissions for the upcoming list of Oct 1 has already passed, one would imagine VT would quote that figure), and depending on new entries for November's upcoming list, the cluster should almost certainly rank in the top 5 - all for only US$5.2 million. The video report is available in Windows Media 9 and Real formats; the relevant portion starts at 13:00."
http://www.bbcworld.com/content/template_clickonli ne.asp?pageid=666&co_pageid=3
Surley they only need 1099 G5s.
can it defeat an iMac in Apple's Photoshop benchmarks ;-)?
They have previously discussed this, they use error correction algorithms, no ECC RAM necessary.
Everybody and their dog use systems without ECC ram, yet the world has not come to an end yet. I would be more concerned of programming errors than flipping bits.
...considerably more than doubling the performance of the current number 3 1152-node dual 2.4 GHz Xeon MCR Linux cluster.
If I understand this correctly, it's saying that a G5 is more than twice as fast as a dual 2.4 GHz Xeon? (1152 dual 2.4 GHz Xeons vs 1100 dual 2.0 GHz G5s -- there are fewer G5s and they run at a slower clock speed.)
This is a pretty staggering statistic. I hadn't really believed the hype about how fast the new G5s were.
________________________________________________
suwain_2
So they bog down the software doing something that could be done in hardware?
A Good Intro to NetBS
ECC memory doesn't solve world hunger, it just corrects single bit errors when a request is read from memory. If ECC memory isn't there and an error occurs, the system reboots.
Data isn't returned corrupted, it don't come back at all
if you used computers in the early 90s, it was AKA 'PARITY ERROR'
As a rock-in-roll Physicist once said, No matter where you go, there you are.
The BBC page of the devil!!
(look at the link URL)
For all the people involved in it's construction. I helped in the construction. Wouldn't have known about it if not for the Slashdot article. I don't doubt the Linpack score. Allthough the e-mail I got about the reception said the score wasen't supposed to come out for another week or so. Anybody know what went on in the reception?
mck
That is the true test.
Windows Media and Real Player about a G5 cluster? Don't you mean a nice Quicktime movie? (yes I have both players on Mac, sans spyware and other undesirable "system integration". But Apple will be sad)
I don't wan't to start a holy war here, but what is up with you G5 cluster zealots? Ive been sitting at my freelance gig of a G5 cluster (1200 Dual G5s) for twenty minutes as it attempts to copy a 17 megabyte file from one folder to another! Twenty minutes! At home my Beowulf cluster of 100 Celerons running Linux, the same operation takes just 2 minutes, if that. While this is happening, Itunes won't work, and everything else grinds to a halt. Even vim is struggling to catch up as I type this.
Yes, I do have DMA enabled, and I am using 25K SCSI disks. My old 3.2 Ghz 64 way xeon runs faster than this G5 cluster at times. G5 zealots, flame me if you like, but I'd rather hear some intelligent reasons why I should use G5s over faster, cheaper clusters.
oh wait.
I am the Alpha and the Omega-3
The new "top 500" list will be announced right before SC2003 and discussed in detail at a session of SC2003 on November 18.
Look for another (less speculative) story on Slashdot around then.
How is the system to know whether an error has occurred? If the value at some byte address should be 202 but it comes out 201, how to determine that the result is wrong? For that you need parity memory, which is able to detect (though not recover from) single bit errors. As you say, the detection is called a parity error - but this requires memory with parity information (usually one extra bit for every eight).
I believe that common ECC memory is able to recover from single bit errors and detect two bit errors.
Or does the Mac G5 use parity memory? It seems hard to believe, hardly anyone manufactures parity memory these days, it's either cheapo no-checking sticks or full ECC.
-- Ed Avis ed@membled.com
What's been left behind is wether or not these systems are using Panther as the OS. It would seem that with this kind of performance, an Apple supplied OS -- as opposed to Yellow Dog would -- only be capable of performing well on the G5 since Panther has processor optimizations for the G5.
If the original XServers were too costly and low performance (since they came with a G4) wouldn't a G5 server (since the performance is apparently much better) be a great option for small/medium size businesses for a web/mail/database server?
Humm, isn't the second fastest ASCI Q at Los Alamos has now been measured with 13.88 TF/s?
As mentioned here: http://www.top500.org/lists/2003/06/top5.php
MCR uses Quadrics - What does this cluster use?m e.nsf/ DisplayPages/7C18E51DBC215D3E80256D5900535959
http://doc.quadrics.com/Quadrics/QuadricsHo
Ugh.
It has been said thousands of times by now I'm sure.
Running Mac OS X does not mean running FreeBSD Mac OS X is a system of frameworks running on top of a Mach Kernel. The only thing that relates Mac OS X to FreeBSD is the userland. In addition to the userland you have: Cocoa, Carbon, Aqua, Java, etc. The FreeBSD portion is minimal.
And yes, if you want you can run this lower level unix without the rest of Mac OS X. It is called Darwin. It runs on Intel and PPC if you're wondering. No, this doesn't mean that Mac OS X runs on both or ever will.
Here is a short description of the BSD families.
Could someone who knows the going rate of these top 5 supercomputers please tell me how much less expensive 5.2 million is? I know that it sounds like a lot of money to me, but I'm guessing that it is orders of magnitude cheaper than the other top computers.
10 Gbps Infiniband from Mellanox
Each machine has a PCI-X Infiniband card, interconnected with several 96-port switches.
The video is available for either Windows Media Player or RealOne Player -- but not QuickTime!
Ouch!!!
quiquid id est, timeo puellas et oscula dantes.
Air Conditioners, switches, etc....
more than just CPUs
Yep, and they're going to be top 5. Between you and them I wonder who has the best knowledge of how to build a cost efficient cluster?
Main Entry: troll
Pronunciation: 'trOl
Function: verb
Etymology: Middle English
Date: 15th century
transitive senses
1 : to cause to move round and round : ROLL
2 a : to sing the parts of (as a round or catch) in succession
b : to sing loudly c : to celebrate in song
3 a : to fish for by trolling b : to fish by trolling in (troll lakes) c : to pull through the water in trolling (troll a lure)
intransitive senses
If you want to use the term to refer to a message that invites one to respond or otherwise lures you into a discussion, you want this definition of the noun, from Webster's:
Main Entry: troll
Function: noun
Date: 1869
: a lure or a line with its lure and hook used in trolling
For those of you who were in diapers when the Internet was created, a "troll" is a message designed to lure you into responding, to rise to the bait, so to speak. Please learn this bit of Internet lore before we have to start the canings again.
Shut up and eat your vegetables!!!
I didn't see anywhere the source of these benchmarks - I believe what they have done is multiply the SPECfp_rate_base2000 for a dual processor 2GHz G5 (which according to Apple is 15.7) and multiply it by 1100.
The thing is, that only comes to 17.27TFLOPS, and in addition it does assume that the original spec scores were accurate.
Would anyone care to shed some light onto this?
The project leader, Dr. Srinidhi Varadarajan, will be speaking at a session entitled Building Virginia Tech's G5 Supercluster on Oct 28 at the upcoming O'Reilly Mac OS X conference.
He'll probably reveal some of the technical details, such as the version of Mac OS X used, at that session.
Also, according to a blog at O'Reilly:
Next year, all the little known details [about the cluster] will be revealed in a new book. By that time we'll know what the project means for supercomputing and for Apple.
How many seti packets can it do in an hour?
And which part of the score was released by apple?
Just because it's in hardware doesn't mean it's free. The ECC logic is going to add a small delay to each of trillions of memory accesses. Plain memory can most likely be tuned to run faster than ECC memory.
If you're running a constrained problem and can verify the results at the end, a single error check in software could consume far less overall time than the continuous ECC hardware checks. The software check would probably catch other types of errors as well (including many errors caused by software bugs).
what the frames per second would be on Q3A...Has anyone tried it yet?
"Love is like pi - natural, irrational, and very important." (Lisa Hoffman)
Easy: you yourself point out that 1100 * 15.7 = 17.27 ... not 17.6.
Since the call for papers for the new Top 500 list was Oct 1, and the BBC show aired on Oct 9 with a companion BBC News story dated Oct 12, you'd hope that VT was simply regurgitating the figure that has already been sent to the Top 500 organization.
And why are you trolling around with one of those super-old benchmarking stories? We've already established that every manufacturer does what they can to show their products in the best possible light. At least Apple documented their test results and methods in full.
So acually, your logic doesn't make any sense: you jump to the conclusion that it's not real results - even though real results already exist and have been submitted, and the entire story is pretty much about that process, making performance figures a critical piece to get accurate - and that they must have just multiplied some benchmark number by 1100. Then, even though the subject of your own post indicates your recognition that "it doesn't add up", you still apparently assume that the results are somehow doctored, this time for the worse, and you manage to weave in one of the stories that tries to make it look like Apple lied with its benchmarks - which it didn't - which is unrelated to the current issue! How does it "assume" the original scores were accurate?? YOU are assuming that they're just multiplying. You might have been onto something if the multiplication actually came out, but it doesn't, meaning that is NOT what they did.
Bravo, +1 Troll.
And why would it be misleading? If Xeons would be used, surely the researchers would optimize for SSE2, as x87 on the Pentium 4 has been crippled.
Scientific Computing has always emphasized numerical linear algebra. An entire strain of supercomputer processors was developed to support such requirements.
Besides, the final "score" will be produced by benchmarking with LINPACK. It's not merely a matter of taking manufacturer supplied numbers, multiplying them together, and claiming a spot.
http://www.bbcworld.com/content/template_clickonli ne.asp?pageid=666&co_pageid=20
Skip ahead to 13:00
Since the freakin' Windows Media files won't play on OS X.
simon
home page
The power supply was another huge challenge. The Supercomputer uses the same amount of electricity as 3000 average sized homes.
How do 1100 dual G5s end up using the same amount of eledtricity of 3000 average sized homes? Even with the cooling and networking, this number seems way too big.
Clearly they do. As I have no interest in having 'one of the five fastest clusters' for however many weeks they remain in that spot. I mean, gee whiz.
However, it's still an interesting topic. Making memory error correction a processor intensive task seems like a kludge. Kind of the 'Win Modem' of the whole Apple design.
Sorry if raising it as an interesting point to ponder gets you all in a defensive fluster.
A Good Intro to NetBS
As I've asked before, what will they *do* with it once they've posted their performance benchmarks?
Japan's Earth Simulator will be used for climate studies. What will VT use it for?
Chip H.
And they have random data corruption ALL THE TIME.
It's amazing that no one seems to care about random data corruption. Random, unexplainable crashes.
Guaranteed to happen. Your non-ECC memory flips bits all the time.
Solaris doesn't even log ECC corrections until it it happens repeatedly to the same bit. That's because bits get flipped all the time. Physics and all that.
They meant 3,000 prison-cell sized dorm rooms.
1 Hollywood = 100 Oscars 1 Oscar = 100 flops therefore 100 Oscars = 10,000 flops Therefore 1 Hollywood = 10,000 flops
What I would like to see is the excess heat being used to heat the rest of the building, instead of simply being junked outside. By using this heat in the winter they would reduce the amount of heat the need to generate using their conventional heating system.
That would be smart use of technology.
Jumpstart the tartan drive.
Running Mac OS X does not mean running FreeBSD
Yes, but no one is going to use Acqua, Java, Cocoa and stuff...
Its gonna work mostly w the kernel (theres no need for graphics and stuff), hence it's going to matter the most.
how long until
In fact the heat is so intense that ordinary air conditioning units would have resulted in 60 mph winds
Help fight continental drift.
The VT cluster cost about $5.2 Million and get approx. 17TFlops - The NEC Earth Simulator gets 35TFlops and cost one billion dollars. That makes it 192 times more expencive. So you can build 192 VT Clusters. And then in theory get. 3.2PFlops for the same amount of money. If you detract performance for cable lenght etc. - You will most definitly get around 1PFlops.
So, you supercomputerusers out there - build a 1PFLOPS cluster NOW!
- To understand recursion, we must first understand recursion -
The 'project' uses the same amount of electricity as 3,000 average sized homes. There are many more devices deployed than just the 1100 G5s. The cooling system alone is a major power eater. Read the articles :)
Excellent use of the blur filter to pretend that some of the footage was shot poorly. Wait a minute...it WAS shot poorly.
Thanks for the link. I have no immediate plans to install WMP and Real.
Supercomputer is #2 & Football team is #3
-Go Hokies!
Only semi-smart.
Places have done this, but to do it you need to design (or re-design) the heating system to do it. And then, in five years, when you upgrade the computers to newer, smaller, cooler (overall) models the entire system fails.
It's a good idea, but computer speed/heat needs to stabilize first. Which isn't going to happen anytime soon.
'Sensible' is a curse word.
Yep, the numbers must be wrong.
And if not, they have a 6 megawatt coal-fired power plant within spitting distance of the engineering buildings. That's 5 kilowatts per computer; plenty for either a house or a computer.
Incidently, after years of putting up with coal dust, I thought we would finally see the benefits of living next to a huge coal pile. We had a severe ice storm and most of blacksburg lacked power. but so did campus!.
HIV Crosses Species Barrier... into Muppets
Indeed.
But, as I said, the kernel is not of FreeBSD lineage. XNU is the Darwin mach kernel. Not that it really matters.
I see no link at all for the Real One Player version of the video. Will someone post that link or just reply to my post.
They do? Then how come nobody ever sees it? My linux setup does newer crash. At at the university we got something like 100 indy/Octane/O2 workstations widtout ecc memmory and they all got a uptime which is equal to the last time someone cut the power. But we might just be luckey.
Martin
Indeed, I forgot that the top500 site requires 64 bit precision or rather, specifies an error bound that requires such precision.
Dude, I love macs, but something tells me they didn't order all those computers with DVD-Burners in EACH ONE. And I don't know how much of a workout the PCI bus will be getting, unless they've got some new networking cards.
I said nothing about Apple, I merely commented on a benchmark procedure. It is you who are carrying the baggage of preconceptions; release yourself, my friend!
Tubal-Cain smokes the white owl.
Then there's only the need of coming up with the applications to use it (besides research), apart from gobbling up seti@home data snippets for breakfast.
However, XCode and Rendezvous enable something in this front, enabling distributed seamless compilation of big projects.
So, they are just going to junk this setup in 3 years and start over? Doesn't make sense to me. The PowerMac is always had the better processor upgrades then PC's. I'm sure PowerLogix or Sonnet would love to make 1000+ CPU upgrade cards for them. Sure, the bus speed would be slightly behind a new system at that time, but the cost of buying an upgrade versus a whole new machine is worth looking at. Maybe after the first round of upgrades they should consider scrapping it.
-m
http://www.invisik.com
Yeah, sure. How do you thnik that they did implement that magic software correction in the linpack benchmark?
Simply run the matrix multiplies twice and check if the result is the same? (no, and i dont need to explain why)
HI O WISE PRINCE. WHT TOOK U SO DAM LONG?
Wow, those GRAPE 6 machines must be great for modeling tertiary structure of proteins...
Finally I'll be able to play SimCity 4!!
The next comment I write will be ready soon, but subscribers can beat the rush and see it early!
If you order the g5 with 8 gigs of RAM, then it costs a lot more than 3,000 dollars.
does it run windows?
Simple equation: 4 FPops/cycle (IBM-PPCs) * 2GHz * 1100 G5s * 2 CPUs/G5 = 17.6 TFLOPS!
No *real* Rmax linpack scores are known yet, and from what i figured the submissions on Oct 1st are just for *inclusion* in the list, real Linpack scores can be submitted till shortly before (or even on!) the conference mid-November..
This article is BS and should be removed...
P.S.: 4 FPops/cycle per clock with 2 FPUs i hear you scream - Impossible! - That's due the Multiply/Add FMAC thing that counts as 2 FPops!
It's amazing what a bunch of turkeys can accomplish when we set our minds to it. This makes us #3 in the nation in football and #2 in the world for computing power. Dont mess with the Hokies ;-)
I'm glad someone out there thinks these things through...
BTW, an acquaintance told me of her ILLIAC IV days. With 64 independent processors it was the fastest pre-Cray machine, but sometimes did produce wrong values. Standard practice was to have 3 processors run the same problem and compare the results at the end, deciding that the 3x performance hit was worth it, if the results actually meant something...
The clearance system sounds logical. It is not. It is completely arbitrary. -- John Bolton
They do? Then how come nobody ever sees it? My linux setup does newer crash.
Either that's a typo or the most ironically appropriate ascii bit error in the history of computing.
ECC memory uses a set of extra bits - 7 bits to cover each 32 bits or 8 bits to cover each 64 bits. Each error-checking bit works like a parity bit for a different subsection of the memory it is correcting. By working out which subsections contain an error, the errant bit can be deduced and then corrected by simply flipping that bit.
ECC can also detect (but not correct) 2, 3 or 4-bit errors in each block, though that would be unusual.
I had this pointed out to me in May of 2002 when Apple introduced the Xserve... in this Slashdot thread.
Apple's page about the G5's execution core states that its two FPU's are double-precision, and that it can thus "complete at least two 64-bit mathematical calculations per clock cycle." That's obviously an improvement over the G4, but let's see...
They've got 1,100 machines. Each has 2 CPU's for a total of 2,200 CPU's. Each CPU has 2 FPU's for a total of 4,400 FPU's. Each FPU is capable of doing a 64-bit (double-precision, I presume) floating-point calculation each clock cycle. The machines are running at 2GHz, or 2 billion clock cycles per second. So... 4,400 * 2 billion = 8.8 trillion 64-bit floating-point calculations per clock cycle.
To me, that's 8.8 TeraFlops, which is conveniently precisely half of the 17.6 figure. Did I forget to multiply by two, or did someone else multiply by two an extra time? Or... is the situation (most likely) that there are situations under which the G5's additional mathy bits can actually turn out a couple more FLOPs per cycle, but that's not going to be the case most of the time, and probably won't be the case running LINPACK?
Of course, it is important to point out that this is a theoretical maximum, and that things like interconnects, RAM limitations, chip failures, entropy and unauthorized use of the cluster to render hot babes in "Poser 5" will detract from it.
So... I don't think this will be the second-most-powerful machine in the world. Even relative to the previous list. I would not be at all surprised to see it in the top 5, though, and would be surprised if it didn't make the top 10.
Cool post. This is almost always what goes into numbers
quoted by IT groups that don't know anything about the machines they're running (I know; it's a redundant sentence). Just take the marketing drivel from the manufacturer, multiply by how many you bought, and quote it! Everything I've read from VT appears to have been synthesized that way.
I'm running a G5 at home now. I'm still wondering how a machine with a 32-bit OS "breaks the 4 GB barrier", given that it can't do anything that a 32-bit Xeon with more than 4GB of RAM can't do.
But it hasn't quadrupled my home energy bills, at least, and that include air conditioning.
Yep, that's an accepted practice in mission critical real-time systems. I recall reading about the IBM computers used in the Space Shuttle, they have triple redundancy, all 3 computers operate in parallel, and they "vote" on all results. If one computer doesn't agree with the other two, it is outvoted. Of course this is an extreme oversimplification of the software design, but you get the idea.
I suppose I should mention that the pageid for this thing is '666' :P
I believe the space shuttle has 6 computers that vote. I read somewhere that if they have one failure, they are fine. But they have to abort if they have two, because another failure would being them down two three, the minimum required for a proper voting scheme.
ASCII stupid question, get a stupid ANSI
It's a good day to be a Mac man.
I mod down pyramid schemes in sigs.
Hmm.. a quick google reveals we are both wrong, they have 5 computers. How odd (literally).
(Of course, there may be other fused 2-operation things it can do in one cycle... but it seems unlikely that code would consist of them so completely as to attain that theoretical peak.)
The dual G5 that should be arriving soon (are you listening, Apple?) has, if I recall, a 600 watt power supply. 600 watts * 24 hours = 14.4 kwh per day.
Yowza.
Um... yeah, I think I had better leave all the power-saving features turned on, and put it to sleep at night and all that other good stuff. Don't want to have to explain to the wife why the electric bill is suddenly double...
Anyone have any idea where he was standing at the end of the video and mentioned a building would be built to house a new supercomputer? I go to VT. I know all the MACs are over at the corporate research center but I can't figure out exactly where he's standing at the end of the video.
http://www.maximum-cars.com - My little hobbie.
blimey ... someone got hold of some cheap speed, didn't they?
What's the frequency, Kenneth?
You did forgot a factor of 2. First you say
Each processor can do 2 fpu instructions/clock. So each computer can do 4 fpu instructions/clock.
if you have g5 and mathematica, can you guys go here
http://smc.vnet.net/timings40.html
http://www.scientificweb.de/mathstef3.html
download the benchmark file and report the result?
thanks.
Xah
xahlee.org
http://xahlee.org/PageTwo_dir/more.html
Each computer can do 4 fpu instructions/clock.
(and those are double-precision 64-bit flops).
There are 1,100 computers. Times 4, that gives us 4,400 flops per clock.
(Note: given 1 DP flop per clock per FPU, this ties in nicely with my use of 4,400 as the number of FPU's, in my comment.)
4,400 flops per clock * 2 billion clock cycles per second = 8.8 trillion flops per second.
I'm still getting the same number. Where's the factor of two I missed?
Because nobody is checking to make sure their data isn't silently corrupted. And the corruption might be extremely minor - one bit flipped in audio or video might never be noticed. Or it might corrupt data that won't ever be looked at, or at least not for years. Or it might introduce a wrong answer one someone's homework, and they never realize why they got it wrong. Or it crashes a random application and no one suspects a thing.
The error rates on good hardware are pretty small, but guaranteed to be nonzero.
Seems to me what matters most is simply how fast is it and how much did it cost?
$300,000 per Tflop - I doubt any other on the list is even close...
No, ECC ram typically is just made with faster internals. As an example most ECC comodity ram is CAS2 latency whereas most generic ram is CAS3, so the ECC ram will perform exactly the same as the non-ECC ram. You can buy CAS2 non-ECC ram but it's nearly as expensive as the ECC ram. If you have a simple idiot check at the end of a complex calculation then saving the cost of going with ECC may be worth it but most clusters this large will be used on too many different projects to assume that all of them will have such checks. For an idea of how important ECC is read (a href="http://www.ibm.com/servers/eserver/pseries/c ampaigns/chipkill.pdf">This IBM whitepaper on their chipkill ECC scheme. Even normal SEC ECC ram (what most ECC ram is today) will have aproximately 900 failures per 10TB per three years. I think that IBM is right and that eventually all ram will be RAID-M, that is a RAID5 style array of redundant memory banks that are composed of ECC banks. At future densities this will be necessary because a single high energy particle will have the ability to scramble an entire memory word including it's ECC checking bits.
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
The simple way to do it is to continuously pump the waste heat into the heating ducts during the winter and have the main heating system on a zoned thermostat system. That way your main heating system is only producing (n-m) btu's where n is its normal btu output and m is the waste heat output from the cluster. If you later remove m then the system still has the ability to produce n btu's. Besides that most datacenters have INCREASED in btu output and power consumption over the years not decreased. The old systems were large and hot but the new systems are small and REALLY hot. Trust me I have been in quite a few datacenters that have had their HVAC systems upgraded from the days of room sized mainframes.
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
Read the article...they don't just plug them in and turn them on.
"The Supercomputer, unofficially nicknamed Big Mac, was built in just three months. Right from the start there were major hurdles that could only be overcome with significant construction in and around the building. Running 1100 computers in a 3000 square foot area sends the air temperature well over 100 degrees Fahrenheit. In fact the heat is so intense that ordinary air conditioning units would have resulted in 60 mph winds. So specialized heat exchange cooling units were built that pipe chilled water into the facility."
yeah, but scientific computation *does* consist entirely of multiply-adds (ish). How do you computationally solve a problem? You find approximations in some convenient linear space - you reduce the problem to a whole lot of linear algebra problems. And linear algebra problems are solved by a whole lot multiply-adds. i'm being incredibly simplistic here. really mindbogglingly simplistic. and anyway, why bother to speculate when there will be real numbers in a month?
If they went to RackSaver (http://www.racksaver.com/shop/UserCustomize3.asp? lSystemID=70) they could have created a super computer with over 20 TFlops of peak performance for that same 5 million.
It's been previously discussed, and their "error correction algorithms" are going to do dick-all for them if they can't trust their data, and without ECC, you can't trust your data.
The first poster is correct, results from this machine can not be trusted as being 100% accurate. It may be that they can live with the lack of accuracy, but it's definitely something that they will have to figure into their work. With 4.4TB of memory, they are going to have soft memory errors on a VERY regular (daily?) basis, and they can NOT be caught by some sort of software algorithm unless you store every bit of data twice and do every single calculation twice. If you're doing that, you might as well save yourself a few million and buy a 550 node cluster with ECC and get the same result.
1) your link doesn't work
2) 5 million is the cost of the project.... that is 1100 Macs with (I think) 4 GB of ram and 160 GB HDs, GigE and FW800 on board, Infiniband cards, Customized 19" racks, a customized cooling solution.... The Macs were just a fraction of the overall cost, a sizeable fraction but there were a lot of other significant costs.
I'm not feeling witty so bite me
2 CPUs in each Mac!
FYI: The formula FPops/cycle * MHz * number of CPUs is the formula for all Rpeak-Values in the Top500! Try it yourself by dividing the listed Rpeak-numbers down and you'll end up with the FPops/cycle for each CPU-Type!
This 4 FPops/cycle "FMAC-feature" (besides the price and Altivec!) probably explains why they went with G5s in the first place!
To all the ones who like to scream "cheating" and "lying" trying to blame Apple:
;-) Intersting: IBM-PPCs (Power4, Power3, G5) have 4 FPops/cycle, Itanic dito, x86 has 2 FPops/cycle and Alpha (!) aswell! Yes, Alpha, the FPU-Monster! ;-)
;-)
The formula FPops/cycle * MHz * number of CPUs is the formula for all Rpeak-Values in the Top500! It's the way Rpeak is calculated! Try it yourself by dividing the listed Rpeak-numbers down and you'll end up with the FPops/cycle for each CPU-Type!
This 4 FPops/cycle "FMAC-feature" (besides the price and Altivec!) probably explains why they went with G5s in the first place!
P.S.: Did you know that test by the german Magazine C't using IBMs XLC/XLC and P4s/Xeons with Intels ICC basically proved what apple claimed with their GCC comparison? That the G4 is faster than a 3Ghz P4/Xeon Dual 3GHz in FPU but slower in Integer? And the IBM-Compiler is still in beta, mind you!
So much for cheating lying Apple, hehe! Ofcourse, Slashdot, The Register, the Inquirer, Wired and all the rest would NEVER write an article about that, a supposed "Mac-User" who's accusing Apple of cheating without having the glance of a clue is just so much better, and who are they to admit they made a mistake?
And soon to be considered a felony in a courtroom near you.
cat
What a majority says is not right per se, it's just less likely to be questioned...
But really, how different need the voters to be, to get the same results independantly?
The best way to popularize Open Source Software is to support the BSA.
In the slashdot spirit of things, I would like to differ- Apple has historically (at least for the last 3 years or so) shipped it's computers with 2-2-2 spec'd RAM, OS X typically doesn't like to install on a computer that has 3-2-2 latency RAM installed. A workaround that many people with the slow (read: cheap) RAM must deal with is uninstalling their non-OEM RAM for the OS X install, then reinstall the RAM after the OS installation. The reason being the RAM tests performed during bootup from the install CD/DVD are more stringent than the normal booting-from-the-HD startup RAM tests. There are multiple threads on macintouch, macfixit, [and probably even] apple.com's message boards et el to document this.
Cursing in the French language is like wiping your ass with silk.
Damn. Beaten to it.
is it snappier?
First, I am amazed that you support the annihilation of small livestock. (That's what your post means to me.) Second, check who you're replying to. It's Chuck Shotton. Stop reading Slashdot and read some internet history.
(Gosh... it's almost like they designed it to be really fast, or something!)
So... hmm. I guess like everyone else, I'll have to wait and see what the numbers are on the TOP500 list. That's an interesting formula you've got there, by the way.
(I wonder how fast it'll run The Sims?)
A buddy of mine is hosting a clip he re-encoded in Quicktime for those of us who don't want to use WMP or RealPlayer. The link was posted earlier on the Accelerate Your Mac website. The direct link is here.
Perhaps. But there might be a bit more involved to getting all those computers to work on the same problem. I'm sure Rendesvous had something to do with that. It's also interesting to note that they chose to go with OS X, rather than Yellow Dog Linux. Again, probably a configuration issue.
Possibly the high bandwidth bus may have more effect with a network of cpus than the individual performance of CPUs. Perhaps an expert would know.
Also, that 17.x terraflops is for Peak. No figures on sustained performance. But perhaps more than 5.7 TF, putting it in top 5.
>>"ad space available -- low rates!!!"
I do, but Microsoft's out of date WMP for Mac won't play the audio in the BBC files.
The system knows because the parity bit is checked and if there was an error it's not in parity, causing a parity error and rebooting
As a rock-in-roll Physicist once said, No matter where you go, there you are.
The slide-show says MacOSX which sounds more likely, They could even mean just Darwin(The command-line totally open source part of MacOSX). I imagine that the press people meant GNU software. The slide-show also says that they are using IBM's compiler that was just released for MacOSX.
http://don.cc.vt.edu/tcfslides.pdf
page 13 of the slide has the interesting stuff.
On a side note, without ECC Ram this thing can not be trusted to give the right answer all the time, I wonder what the researchers will do about that.
Are you saying the Mac uses parity memory? I assumed it did not (as mentioned above).
-- Ed Avis ed@membled.com
Hassan Aref, Dean of Engineers at Virginia Tech, answered a lot of questions about the setup of the computer during a 20 minute interview last month. Go to the Your Mac Life archives page and pull down the MP3 or AAC archive of the Sep 10 show. The interview itself is 1:15-1:35.
They do not use parity memory on the G5.
"Your Identity is Filipino. Think about what that means.... " I SAY IT MEANS AWESOME!!!!!!
You little, pathetic, ugly, piece of shit. Filipinos requested NOT TO BE CLASSIFIED WITH OTHER ASIANS on racial/ethnic classifications, so how could you say they want to be like the fucking Korean culture ripping Japanese or the Chinese???????? YOU ARE A ROACH that badly needs to be SQUASHED.
Here is a list of Philippine Achievement in the U.S.
WRITER: Jessica Hadedorn
COMEDIAN: Rob Schneider
ACTORS: Lou Diamond Philipps, Nia Peeples
ACTRESSES: Tia Carrerre, Tamlyn Tomita, Phoebe Cates
SINGERS: Leah Salonga (Broadway star & Tony Award winner), Julio Iglesias Jr., Prince
MISS UNIVERSE: Gloria Diaz, Margarita Moran
MISS U.S.A.: (can't recall her name)
ATHLETES:
* Pancho Villa - World Champion Flyweight Boxing Champion 1924
* Gabriel Elourde - World Champion Boxing 1962
* Vicki Draves - U.S. Olympic Diving 1984, 1st woman in Olympic history to win springboard and platform diving, America's sports sweetheart in 1948
* Bobby Balcena - Cincinnatti Reds Baseball Player 1948-1963
* Roman Gabriel - Quarterback NFL MVP 1969 Hall of Fame
* Tai Babilonia - Olympic Ice Skating 1979 Champion
* Benny Agbayana - Baseball Outfielder 2002 Colorado Rockies
* Jerome Williams - baseball pitcher, S.F. Giants
POLITICIANS & BUSINESS PEOPLE & PROFESSIONALS
* Governor of the State of Hawai`i is a Filipino-American named Benjamin Cayetano
* Co-founder of Los Angeles. In 1781, Antonio Miranda Rodriguez Poblador, a Filipino, along with 44 other individuals were sent by the Spanish government from Mexico to establish what is now known as the city of Los Angeles.
* Loida Nicolas Lewis is chairman and CEO of TLC Beatrice International Holdings, Inc., a multinational food company with sales in 1995 of $2.1 billion. Working Woman magazine hailed her as the top businesswoman in the country for 1994. Mrs. Lewis was the first Asian woman to pass the New York State bar exam without having studied law in the U.S
* Ruben Aquino and Cynthia Ignacio, two Filipino-Americans, were instrumental in the creation of the Disney animation film "THE LION KING."
* Eleanor "Connie" Concepcion Mariano - President Bill Clinton's physician
Virginia Rep. Robert Cortez-Scott, a Harvard alumnus.
* Philippines Herald war journalist Carlos P. Romulo in 1941. (He was also the first Asian to become UN President
****
(Not an American history, but I wanted to include) my Philippine hero, the great revolutionary leader of the rebellion against Spain before U.S. colonizatoin:
Jose Rizal could read and write at age 2, and grew up to speak more than 20 languages, including Latin, Greek, German, French and Chinese. What were his last words? "Consummatum est!" ("It is done!")
INVENTORS
* Nikola Tesla - inventor of fluorescent light
* Eduardo San Juan designed the Lunar Rover or "moon buggy" which was used by the Apollo astronauts to explore the moon?
* Dr. Abelardo Aguilar, a Filipino-American, discovered the now widely used antibiotic known by its generic term as erthyromycin. Chances are, you've already used this antibiotic.
Kali martial arts - unknown inventor in Philippines