Specs On New SGI Onyx And Origin
An anonymous reader wrote in to tell us that
SGI has announced their latest and greatest MIPS-based computers, the Onyx and Origin 3000 line. Up to 1 TB RAM and 512 processors, all on a single system (not a cluster).
Beyond Boxes has a nice summary, too. This is definitely a great system for anyone who wants to
have their computer be the size of several refrigerators ;)
luckman
luckman
I don't involve myself with flames, much less know how to bait one.
with the PowerMacCube with gigabitethernet and Mac OS X you'd have something wonderful that would make the Beowolf type architectures mindblowingly fast and amazingly small..
Imaging a room full of these ----
Actually, one of the benefits of the system's modular building blocks is that one rack can hold up to 32 processors. This is double the density of the o2k. The refrigerator comment is correct though. 4p in 4U is bound to kick off some heat.
Yeah, I know. After I posted it, I realised that noticably wasn't the right word. Perhaps measurably would have been better. The point I was trying to make (and I guess I didn't succeed very well) is not that you could use a cluster of off-the-shelf machines instead of an O3K, but that the O3K (and other large machines) had some cluster-like properties housed in a single case. Bandwidth and latencies may be orders of magnitude better, but architecturally, they're similar (although not identical).
"The invisible and the non-existent look very much alike." -- Delos B. McKown
Ermmm... no. Data General and Sequent have both been shipping NUMA boxen for many years now.
"The invisible and the non-existent look very much alike." -- Delos B. McKown
Check out the internal bandwidth of those boxes, 5 year old O2's and Indigo2's still have greater memory bandwidth than the latest pentium motherboards. I still have yet to see a multi processor athlon board. Voodoo5 cards are meant for two things.
1. Getting high FPS on games
2. Getting high benchmark scores
SGI concentrates on realtime rendering and processing. Their video cards have dedicated mpeg/jpeg rendering engines and don't forget the cpus are 64 bit. Given the choice of a 1Ghz pIII or a 200Mhz Onyx, I would take the Onyx in a heartbeat.
Only the State obtains its revenue by coercion. - Murray Rothbard
Ah!
That was the crucial detail I was missing.
Thank you very much.
---
pb Reply or e-mail; don't vaguely moderate.
pb Reply or e-mail; don't vaguely moderate.
I, personally, wouldn't have a use for one (other than bragging rights :-), but it's not actually that big. At 48bpp (16 each for RGB), you could get 3900x1792, an aspect ratio and resolution that may well be suitable for motion pictures using digital projection. Alternatively, you could have a triple-headed 1600x1200 display. I've worked at companies where three 1280x1024 displays per machine were commonplace, so it's not that unreasonable.
"The invisible and the non-existent look very much alike." -- Delos B. McKown
I was thinking prelauch type stuff :)
But ya thats the one time these monsters would look cheap is when you check how much it would cost to put one in orbit
Alright! Now I know what to ask my Grandpa to get me for Christmas! I think this will finally give me the edge I need to win the CPL. I should pull about 98763214 frames per second in Quake 3. Guess I have to buy a new car to bring it to the next LAN party though.
Dissenter
Dissenter
"There is no knowledge that is not power."
Wait, by my calculation, a 1600x1200 display has 1920000 pixels. So, each pixel takes 8 bites to store, so we are using 15360000 bytes per frame buffer. That right around 15 megs. Now, we of course want double buffered, so thats 30megs per display. 3 displays is 90 megs.
Now, we can only drive a maximum of 8 displays per pipeline, so to drive 8 displays at 1600x1200 means 240megs. That means that there is still 80 megs left. Hmm. I guess if we bump up the resolution to 1920x1200 that would increase our frame buffer useage to 294megs, which only leaves us a little over 20megs free.
Now, 8 displays may sound rediculous, but now that I think about it, these machines are made to run CAVEs and video walls, so maybe 8 displays ain't so bad.
-- Superlame http://catpro.dragonfire.net/joshua/
The nicest thing about the SGI machines is that they have low-latency interconnect. Complete cache coherency is on the order of nanoseconds - not your microsecond latency on SCI or Myrinet, or your millisecond latencies on Ethernet (and those latter latencies are for data transfer only). A lot of supercomputing tasks can be done by a cluster of Linux machines these days; but for exactly the class of applications you're talking about (lots of communication/contention) this is the machine you'd want to run it on. The other class of applications (of course) is detailed simulations with a fine grid size - where else can you get 1TB of shared memory? ;)
As far as the kernel goes, it's been scaled from 1..512 processors. There is almost no kernel overhead in computational code to begin with anyway (sure, that simulation may run for 100 hours, but it makes about 1000 system calls), but Irix does a pretty decent job of staying out of the way (aside from periodic stupidness of the scheduler anyway).
No offense, but comparing Linux/BSD/whatever kernel overhead to commercial high-end UNIX overhead is like comparing apples to oranges. Sure, Linux may scale to 8 processors ok, but that's way different than scaling to 512 (which is very difficult to do).
Yep, mea culpa. I was dividing 320MB by 48, not by 6 (or 8, if you assume 32-bit word aligned accesses).
"The invisible and the non-existent look very much alike." -- Delos B. McKown
Even if that Gates quote is really true (it seems to be more of an urban legend than anything else), he was most definetly not talking about serious computers (workstations, servers, mainframes, super computers).
:)
He was refering to personal computers running DOS (i'm not sure that anyone outside of Xerox had seriously done any experimenting with GUI's before at least the early 80's). But regardless, it's just silly to keep bringing that up 20 years later...
End of off-topic comment...
And yes, I know you were being sarcastic, I just failed to laugh is all...
So the rest of the industry is playing "catchup" to SGI ?! I don't really think there's a huge market for large-scale multiprocessor machines when equivalents can be built up easily from cheap hardware and fast network infrastructure.
Actually, they can't be.
This is not a cluster - it's a multiprocessing supercomputer designed as a single unit. The internal busses have far, far greater bandwidth than even the expensive networks in a high-end cluster.
It does have competition - the Sun Starfire. But that's about it.
Clusters are definitely useful, and give you by far the best bang-for-the-buck on problems with relatively light communications load, but problems with a heavy communications load are best run on machines with high communications bandwidth, like this one.
I think, if you need an SGI box with 256 or 512 processors, nothing else will do... There's no way you can make a beowolf cluster that would come close to the performance of one of these boxes (in my opionion) because the bandwidth inside each box interconnecting the boxes dwarfs anything else available.
Besides which, this isn't a single user machine. No one in the world is going to buy one so they can sit a user in front of it to make animations. This is the type of box that will get shipped to the DoD, NSA, various univerities, and large corporations that need to build virual prototypes (Detroit and Japan).
And no. At this level of the market, Linux just can't compete. Lower-end, yes, but not up here...
I think you missed the "UNIX is amazing" comment.
1. The CDROM is on an internal FireWire bus.
2. The system disk is Fibre Channel.
3. SGI hasn't made a big deal about it yet, but the system will accept either MIPS or Intel processors in the same CPU modules. The MIPS processors come on one kind of daughtercard, and the Itaniums (Itania?) on another. You can't mix-and-match MIPS and IA-64 CPUs in the same machine, but you can mix-and-match in the same cluster.
4. The IA-64 based versions of the 3000 series will include the Linux kernel along an some IRIX compatibility layer.
Amusing bits from the page:
Debra Goldfarb, group vice president at analyst firm IDC, agrees: "Modular computing empowers end users to build the kind of environment that they need not only today but over time. SGI, with this product, is really ahead of the curve in the market. We are seeing the [rest of the] industry absolutely trying to catch up" with SGI.
So the rest of the industry is playing "catchup" to SGI ?! I don't really think there's a huge market for large-scale multiprocessor machines when equivalents can be built up easily from cheap hardware and fast network infrastructure. The last time I saw an SGI was the NASA AMES crew using one for their amazing Viz tool, and even they were making mutterings about porting it to NT and Linux for ease of maintenance and actual use.
In addition, SGI Origin 3000 servers and SGI Onyx 3000 visualization systems reflect a return to SGI's core competencies.
At least that's true. The NT machines were a joke. Anyone tried SGI Linux yet?
Yes, BUT Linux can't at this time even think about supporting somethin like that. If they sent Linus one, maybe, but I suspect he'd rip out the MIPS processors, replace them with Crusoes, and send them back a 512-processor Crusoe machine... with no cooling :-P
I'vealways drooled after these things. but I am curious of bang for buck. How do these compare to sun, and the latest greatest IBM and inel based clusters. Cost is such an important issue these days and with technology ramping forward do you really need something this mean. This ofcourse is in the domain of server machines and not Imiging stations, here I believe SGI will always be king, or queen if you please.
Insanity is the last line of defence for the master diplomat. But you have to lay the groundwork early.
Aah, yes. The $64000 question. The answer to this is NUMA and hypercube structured interconnect. Check out the specs. Its not an SMP. It is shared memory like an SMP. Looks and acts like an SMP at all processor counts.
At Academy we use still Onyx RE2, 4xR4400@200, 640MB RAM, ASO (4 sound channels 6 serial ports) Sirius Video option, as our primary system to build real time virtual sets, our new NT's and G4's are faster for some simple tasks but when game comes to realtime uncompressed video I/0 this 6 years old monster is still killer, details as 1,2 GB of internal bandwith and 48bit color will be reached on x86 once... Yes there are several o2, Octanes and Indigo2, Impacts for 3D and video preprocessing. This machine grandpa of O3k is still usefull and warm ;-)
(didn't SGI have a rather big lead on sun with SMP ??)
On the sun "starfire" (E10000) CPU boards( with up to 4 CPU's) can be hot-swapped ...)
(after migrating the procceses running on them of course
Also AFAIK CPU(system boards) can be added to the System (with no reboot)
As well as moving CPU boards between domains ( again: with no reboot)
Of course the E10000 has no NUMA & tops out at 64 Processors
BTW You may find it interesting that sun is claiming the E10000 replacment(using UltraSparc III) will have NUMA & scale to 1000 processors
You may also have noticed that a lot of this functionality (except domains) has been "downported" to the E[3456]500 models
--
Rafi
(to Email replace "NOSPAM" with "meron")
As for the E10k followon being NUMA, I'll believe it when I see it. Sun has previously said that they don't think NUMA is a good thing. Also, we heard they were working on an architecture called COMA (bad name, but it stands for Cache Only Memory Architecture) where you treat all of memory as a cache and let cache lines move wherever. If they *are* really doing NUMA on 1000 processors, they are going to find that the jump from 64 to 1024 is more like scaling a cliff than a gentle slope... Besides, Sun's NUMA stuff is vapor - ours runs *now* :)
Go Badgers! -- #include "std/disclaimer.h"
The whole system has one contiguous view of memory. The NU means "non-uniform" as in the memory access time is non-linear. If a process's memory is located on the processor module it's running on, the memory access is fastest. If it has to jump one module away, the memory access time increases by 100 nanoseconds (roundtrip). Architecturally, it's completely different than anything Sun has to offer. Sun has been promising a NUMA mahcine for years and still hasn't delivered. The closest company to SGI is Compaq(DEC), and there top of the line offering can almost compete with Origin _2000_. All other companies high-end servers use symmetric multiprocessing, which becomes limited as more and more processors try to access the shared memory bus, ultimately bringing in negative returns as you add more processors. This NUMA architecture incurs very little (if any ) penalty by ading more processors, as long as the hardware and OS do a good job of placing processes and memory (keeping them physically near). Also, the machine is _not_ limited by 512 processors. To give an example of the power of this box, a company has certain calculations that they run day to day. On their top-of-the-line Sun hardware, it takes about seven hours. On O3k, it takes seven seconds! What does being a modular system have to do with being a cluster? By being "modular" it simply means that you can plug in more of whatever you want, whenever you want. I believe you can even mix faster cpu modules with existing ones as they become available, protecting your investment. This is not a cluster.
Intel transfer the difficult from Hadware to software, for get more power, programmer need more technology. -- chinaitn
Don't get me wrong, I love SGI's machines and use one daily. Even passed up on a faster PC (running Windows) because I like it so much. But there is no way I could cost justify getting a new one. They simply do not provide enough performance to justify the cost anymore. All the demos of their stuff we've seen doesn't indicate that their new machines are a huge leap in performance. (meaningfully faster to be sure but not nearly enough to justify the cost of a new one) Fortunately for SGI they make a ton of money on each Onyx & Origin they sell but if they aren't careful this could easily evaporate out from under them. They make very cool systems but it is not a well run business IMO. I'll be somewhat suprised if SGI doesn't get bought out by someone in the next year or two.
part of the reson why they are selling it is the second processor tray is bad, and it cost about the price to replace the system to repair it
Slight clarification - the key in the design is the N in NUMA (NON, as in Non-Uniform Memory Access). A processor accesses memory on it's own board quickly, slower to access memory elsewhere. The question is how much slower "slower" is. This also means you really want processes to stay on their processor, and to have good locality of memory, because otherwise you take latency hits.
Check out Chapter 7 of Greg Pfister's "In Search of Clusters" (ISBN 0-13-899709-8)
"This is definitely a great system for anyone who wants to have their computer be the size of several refrigerators."
I foresee a day when computers may be as small as one refridgerator. Probably there will be a world market for no more than 10 of these.
--
Give us our karma back! Punish Karma Whores through meta-mod!
Linux MAPI Server!
http://www.openone.com/software/MailOne/
(Exchange Migration HOWTO coming soon)
I think you are spreading a myth about locking, Godfrey. It's not the addition of locks themselves that cause a performance penalty, after all, they result in a nop on a up system. The gratuitous adding of locks without regard to design (this is what lm complains about often) does add maintainence problems, but that is a matter of design not performance. The only performance penalty to making a system scale in number of cpus is when you have to make tradeoffs in the design of structures and algorithms. Usually you can (with a little more thought) find a design that will help both small system performance and large system performance... But sometimes that isn't possible. In those cases you could split and maintain two separate subsystems for the two designs but this again introduces maintainence headaches. Luckily we haven't gotten to that point yet... And I honestly don't know whether it will be worth it to pursue. (I'm making the assumption that we can scale into the 10's of processors without hitting that point.)
And wants several refrigerators to cool the system, too. Can Linux even handle that many processors, let alone make good use of them? UNIX is simply amazing...
I think these machines are simply awesome, but you have to wonder how many of these really gets sold? Yesterday Ascii white was announced to be sold to the public and now we see this bad boy. Does anyone have a link or figures on how many of these sell? How long does a company keep a supercomputer after buying one? The specs are impressive and so is the price tag, but do many companies, or countries buy these?
System Bandwidth
3200: 11.2 Gigabytes/sec
3400: 44.8 GB/sec
3800: 716 GB/sec
...methinks they skipped a decimal point here.
(if not, please explain!)
---
pb Reply or e-mail; don't vaguely moderate.
pb Reply or e-mail; don't vaguely moderate.
Break out the motion-sickness pills!
This is a MIPS-based processor architecture. And MIPS machines have extremely sketchy support in Linux right now; that's what irix is for.
And the efficiency of the system depends on the efficiency of the processes running on it; if your program knows how to use stuff like MPI and forks itself off lots of times, then yes, you will get extremely good performance. But one process of SETI@Home won't do very well...
# debian/rules
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
Well.... before we can support 512 processor MIPS boxen we need to support single processor and dual processor.... IMHO low end multiprocessor SGI box support is where Linux needs to go on the SGI architecture
I don't know a whole lot about SMP.
That said, what's to stop each running thread from using one or four or whatever processors. I mean, unless the software is specifically to use 512 processors, wouldn't it kind of work as a really great multitasker?
Like I said, I don't know much about SMP.
Goatse.cx is not a troll thing! It is a spammer thing!
All you spammers ran off with the perfectly good troll name and defaced it! It's like the l33t d00dz script kiddies who ran off with the hacker name! You are not trolls... you are spammers!
</rant>
Moderators!!! I believe this was intended as a joke
To fix a few misconceptions: 1) The bricks are (mostly) 3U [5.25"], or 4U [7"] high, and the same bricks are used to construct a wild range of systems, with huge variations in CPU-I/O-storage ratios. 2) In some cases, the bricks will be sold separately and embedded into airplanes, vans, etc, by defense contractors. I'm told the submarine folks really love the idea. 3) In a half-rack (SGI Origin 3200), you can have 2-8 CPUs [1-2 C-bricks], a required I/O brick [I-brick], and either another I/O brick (I, P, or X) or a disk brick (D-brick). 4) People always announce a wide range of systems: realistically, most of these machines will be 1-2 rack systems, just like they are for everybody else. People who buy lots of computers use racks anyway - the last thing in the world they want to do is waste precious floorspace. 5) IRIX already scales to 512P fairly well, and NASA AMES runs individual shared-memory jobs on their older Origin2000. It already saved you a lot of tax money. 6) SGI is not shipping Linux on the MIPS-based machines. This is a "Caterpillar" announcement, with a lot of shoes left to drop, like IA-64-Linux versions coming later. A major point of the brick thing is that you can change bricks while re-using most of what you already had; you can for example, introduce a PCI-X, or later, Infiniband brick without obsoleting older I/O bricks. Also, you can build C-bricks with Intel IA-64s, and those will run Linux, not IRIX. All of the rest of the hardware infrastructure & bricks are the same. 7) SGI is working hard with the Linux community on scalability, i.e., to let it handle more CPUs well without damaging the basic Linux. Personally, I doubt that it will make sense to try to scale Linux to where Irix is, but it will certainly scale big enough to be interesting [say 32-64P in single system image]. Using partitioned hardware, one can get NUMAlink speeds between partitions, and that satisfies many customers. 8)The customer should be able to pick the size of machine, and then cluster that size together. For some customers, 1P + 64MB is just dandy, and they buy clusters of IA-32 boxes. I know customers where the right size happens to be 32P, 16GB of memory, 2 disks, and 3 Ethernets [one full rack], and then they cluster a lot of those. I know customers that cluster 128Ps, and there's one who would cluster 512Ps if they had the money. If the NASA Ames folks had the money, what they really want is a single machine with Petabytes of memory and Petaflops. I was sorry to tell them, Not Likely Soon. 9) Don't get too crazy with the fact these systems can go really big. I've lost track, but I think there are 30,000 of the Origin2000s & 200s out there, and most systems are small to medium. Of course, the big systems account for many CPUs. 10) The NUMAflex brick approach has many subtle benefits, but is hard work. In some thread, people mentioned backplanes ... but there aren't backplanes in the normal sense. Each C-brick has 4 MIPS CPUs, memory, and an ASIC Crossbar, with 2 ports out the back for cables that run (peak) rates of 3.2 GB/sec (2 * 1.6GB) and 2.4 GB/sec for I/O to separate I/O bricks. Each brick has internal circuit boards, but there is nothing that looks like a normal CPU backplane. To do this, you have to be able to run 3meter/5meter cables at these rates, and do tricky circuit engineering. Later versions will independently improve the interconnects as well, not just upgrade the bricks.
But does it really do the same graphics processing? Can a Voodoo5 or GeForce2 handle 48-bit colour for example (as used by the motion picture industry)? How about a 320MB framebuffer with 256MB texture RAM?
"The invisible and the non-existent look very much alike." -- Delos B. McKown
We got one of the earlier Onyx machines (creatively named onyx.astro.wisc.edu) back in 1993. It was pretty novel with its dual processors and fast OpenGL hardware. When some SGI programmers ported Doom (and later Quake) to the MIPS chip, some of us grad students used to play on the dept SGI boxes, including that dorm-fridge-sized machine. But for all its lofty framerate scores, our Onyx had no sound, so the poor sucker sitting at that terminal often got fragged with no warning.
But alas, the proprietary $15,000 memory module fried itself after the warranty expired and the machine was sold (for parts, I guess). No heated footstools in our computer room any more...
One thing I never understood is what the heck is a person to do with a 320mb frame buffer? I just can't come up with any way to use a significant amount of that.
-- Superlame http://catpro.dragonfire.net/joshua/
In response to the person who asked "who buys these supercomputers". I read an article in the Silicon Valley Metro newspaper, that said that the number one file server brand used by the on-line porn industry is SGI.
Use a VR cave or one of those 120 degree triple projector screens. Peripheral vision is more important than a big screen.
Sure!
http://reality.sgi.com/sgiquake
With boxen this size, the boundary between a single machine and a cluster tends to get a little blurred anyway. Even SGI are stressing the fact that it's a modular system. Basically, each module has it's own CPUs and memory, and has connectivity to the other modules in the system. What's the difference between that and a conventional cluster? Mostly the phenomenal inter-module bandwidth, but that's just a matter of numbers. Architecturally, is there much difference? OK, so you have a single OS image running across all CPUs, but is that even true any more? Certainly other large systems (e.g., from Sun or Data General) let you run multiple versions of the OS concurrently on a single box as you see fit.
"The invisible and the non-existent look very much alike." -- Delos B. McKown
Wouldn't you prefer a good game of Global Thermonuclear War?
http://reality.sgi.com/sgiquake
Keep in mind that until a month ago, SGI's top-of-the-line graphics board sets (MXE and IR2) were the same designs that originally appeared as Maximum IMPACT on Indigo2 and InfiniteReality on Onyx R10000. About five years ago, give or take.
During that time, entire graphics hardware companies have come and gone. The really good ones have caught up to, and occasionally surpassed, what SGI was doing in 1994. Impressive. Most impressive. ;-)
Now SGI has released Vpro, which despite having one name is actually two totally different workstation graphics designs. The Vpro you can get in the IA-32 workstations is basically high-bin commodity graphics hardware from a company that shall remain nameless.
But the Vpro that comes in the Octane2 looks outstanding. I haven't had a chance to use it yet, so I won't endorse, but the design specs for the Buzz chip make it look like InfiniteReality performance on the desktop. Way better than anything in the commodity market right now, and way more expensive, too. It's one of those things: if you have to ask how much it costs, you can't afford it.
And so we are all a part of the great Circle of Life.
Perhaps he should get the biMac or dMac :) check them out here
Something this size does well for meterological simulations, atomic weapons research, something that entails MASSIVE numbers of computations. I wouldn't be surprised if you see these in placed like NCAR (National Center for Atmospheric Research) NCSA, the Government labs like Lost Alamos and Larry Livermore. I still remember seeing NCSA's purple monster 1024 node cluster of Origin 2000's (using an experimental node bridge)
;)
As for how this is different than a Beowulf cluster, look at the bandwidth! Even with switched 10/100 Ethernet as your Beowulf 'backplane' most switches have just enough backplane bandwidth to handle every 100 Mb connection, some have a little less. sgi has always had amazing bandwidth numbers, this is just taken to the N'th degree.
AND this is one machine, one OS, unlike a cluster of many independant machines, much easier to administer.
These are simply awesome machines, now maybe sgi can sell a bah-zillion of them and I can get my Indy sold
g:wq
What if it is just turtles all the way down?
Any operating system that requires a reboot to detect a new config is not worthy to ever be called a Server OS. Uptime is uptime, even when memory needs to be swapped or a disk added.
We will only come out of these dark ages of clunky cumbersome computing is we insist on it. Requiring reboots is evil, and should be minimized, whether for hard or software. Architect for it.
It's tricky enough to design file-systems that are properly distributed. I did some design for a school thesis for a serverless distributed file-system with useful fault tolerance features. Thats pretty tricky in and of itself, even to support UNIX file-semantics. Building on something like that to build a strong and safe RDBMS would be quite a feat.
People _really_ like the single-machine programming paradigm. The OS at every level needs to emulate that behavior as much as possible, regardless of the reality of the situation. Hence, the need for a good file-system. (see Berkely xFS for the right approach, or Centravision for a shipping product looks interesting). RDBMS are already choked by locking algorithms and contention on SINGLE CPU machines. It should be no surprise that a fast RDBMS that is fully distributed and scalable isn't widely available. To do it right you've got to have transparent internal replication of basially everything. Not just data and meta-data, but even logic. Coming up with a serverless (and thus usefully scalable) scheme that gives strong enough guarantees for RDBMS applications yet still survives and survives corectly and quickly and doesn't bog down the system with locking will be quite a feat for whoever manages to do it.
My opinions are my own, and do not necessarily represent those of my employer.
Crap name, but I really think this 'brick' implementation is a great idea, and although I don't doubt the backplane/bus adds a certain amount of overhead to the cost, it'd be nice to see this sort of thing on Workstation and Desktop systems. And yes, I know similar things have been tired before (Acorn?)
PC getting slow or out of date? Add a new processor brick, that gets detected and used with just a reboot. Keep the old brick if you want. Graphics too slow? Just bought a second 19" monitor? Add a new graphics brick.
Im not suggesting this is a cheap or easy solution (yet) but its a much nicer one that PCI slots, and a tidier one than USB...
Pax,
White Rabbit +++ Divide by Cucumber Error ++
free experimental electronic music netlabel at www.viablehybrid.com
Has anyone done any benchmarks for these boxes yet? Our research group currently has a few SGI Origin 200 systems (178MHz R10000 based, 4 processor), and I am curious how these new systems stack up in comparison.
Looking over their web pages, I notice that they are careful to never even mention what clock rates the R12000 and R14000 configurations are available in. Is this because they have not decided which parts they are using, or they fear negative PR due to low clock rates? (MIPS cpus get a lot of computation done per clock, and typically run at lower rates than, say, Intel chips).
As far as Beowulf clustering: why bother? SGI already ships some excellent software for running these large configurations with every box. It works as soon as you turn the system on (no tedious setup). We use our current systems as build and simulation servers, and they do an excellent job of load balancing tasks and running parallel makes very quickly...
Chris
Please do not take my post as a big thumbs down for SGI. I love their gear. Applications such as Alias/Wavefront are testament to the power that these boxes have. It is just like buying a pair of Nike shoes, are they really worth the premium, or can you get a cheaper version that can do 90% of the task. Now that Alias are releasing Maya for Linux, I am assuming that some high end boxes will be in order. Can Linux with heaps of hardware thrown at it compete at all with SGI's new range of gear.
It isn't just the graphics, the the architecture of the entire box. You cannot compete with the pipelines on an SGI box with a x86. Its just pointless.
Ever wonder why Pixar has so many SGIs? It isn't because they have the extra money to burn. Its because SGI _IS_ the best at graphics. Until you use one for visualization (my department does a LOT of vis work - combat simulation), you have no idea the power of these things.
-- toolie
Parallelized software is almost always written to take advantage of N processors, for any N.
Non-parallelized software isn't, though. The worst kind of non-parallelized software isn't threaded, doesn't fork, just runs in one process forever.
A 512-processor machine will run a process like that in exactly the same amount of time as a 1-processor machine.
However, the 512-processor system will also run 511 other instances of the process in the same amount of time.
One woman, nine months, one baby. Nine women, one month, no baby. Nine women, nine months, nine babies.
That is not really what this system is meant for (running one job on all of the processors). On our Origin2000's (we have 3), we run many "small" jobs through a queueing system. (Small = a handful of processors and a dozen GB of memory or so.)
Each of these jobs are larger than most jobs one would run on a cluster like the IBM SP (got a few of these) or a Linux cluster (got one in my lab, getting a big one soon), becuase they are memory bound rather than CPU bound. That is, it is easier to address 10GB of RAM than to fiddle with message passing between nodes and such.
P.S. "we" are msi.umn.edu, and you should be able to find at least one of our supercomputers on the Top500 site.
VPro is better than commodity parts? Actually, no. NVidia made SGI's "VPro" graphics chipsets as part of SGI's "we'll drop our lawsuit against you if you sell us some chips really really cheap". The VPro, depending on the model, is either the Geforce or the Geforce2 with a SGI-customized bios and a slight overclock. Very slight.
People are asking things like "why would I use this" and "who wants these?" Let me tell you, in the era of bloatware like Oracle and any of the content management systems out there (possible exception of Zope), the incredible scalability of these systems will be a huge selling point. Oracle, for example, is very careful to build and market their software to be monolithic so that you have to buy big hardware to run it, and then they charge you based on the size of the hardware you're running. Thus, they drive the purchase of huge systems like this, and then charge you up the ass for their "Enterprise class" database.
Believe it or not, this is actually the kind of business model that the Fortune 500 are not only happy with, but demand.
Personally, I'd be happy with a database that could run on a loose, fault-tollerant network of a dozen or so small (e.g. 2-processor Intel or Alpha) systems.
Then again, I'd really like to play with some of SGI's big iron....
no kidding, sgi is the only one with a fat enough pipe to get the job done. Currently, I run three streams of uncompressed standard def through an Octane. Yahoo! http://www.ridiculopathy.com
NOW it's all so clear to me as to why iD would want to sell off it's SGI PowerHaus(tm).
The new models are on the way!
Rami James
Guy with Duh.
--
rJames.org - illustration
Geeky girls are often impressed by the size and power of your computer equipment. However, size is not the most important thing, it's how you operate it.
Mas vale cholo, que mal acompañado.
And probably need some refrigerators to keep the thing cool when you get up to 512 processors
I dont have a
Anonymous is uninformed. Digital Domain used a Linux render-farm for Titanic, but as usual at DD, the bulk of the 3D interactive work was done on SGIs (and some Macs, and PCs with NT). This is very typical: renderfarms are whatever the company can get for the lowest cost/rendermark (or equivalent), and they don't use any graphics hardware, just the CPUs. For example, Sun gave Pixar a great deal on a renderfarm ... and they still buy OCTANE2s for their interactive work.
It is trivial to check:
http://www.d2.com/text/faq/main.html
and see what tools they use.
In the last 10 years, consider all of the films that won Academy awards for Computer-Generated special effects, and add in all of those nominated. Of these films, can you name the films that did *not* use SGI?
Finally, to avoid this being an SGI versus LInux, do recall that SGI is seriously investing in LInux work and contributing to the community in this turf, so it's not like we dislike it, just the facts.
SGI closed its doors for the last time today despite announcing record profits.
"We just ran out of names beginning with O" said Spokesman Otto Olson, head of names. The Ohshit and the Omygod were really scraping the bottom of the barrel.
Oliver Ottowan added "We really should have used a more common letterlike T or S."
I don't think that there is curently a 3D video that has been done purely on Linux systems yet.
I know that our video's will be about as close as we can get. We animate and model on Octantes. I would say 75+ % of the frames are rendered on linux and the rest on our SGI boxes. We use only SGI's to create video to import on our NT AVID system.
I applaud all of the work that SGI has done with the linux communtiy and I wish that we could get some more SGI boxes in for quick, single frame renders.
I still think that linux is the best choice for a render farm, though. Small, fast, cheap. I like to treat my boxes as appliances. Maya files in, frames out.
I agree with all that you say, and I thik that SGI boxes are still needed, but less and less.
-I just work here... how am I supposed to know?
Speedwise - yes, probably. Although a lot of the fast cards sacrifice some image quality for speed. SG probably go for a high quality output. For the type of applications that are used, a jerky display can be forgiven more easily than a display which screws up occasional polys.
NVidia made SGI's "VPro" graphics chipsets...
You're half right. There are three-and-a-half flavors of Vpro right now. There's V3/VR3, which is an nVidia board with 32 or 64 MB of DDR RAM.
Then there's V6/V8, also known as Odyssey. These are available only in Octane2. They're an all-SGI design with the Buzz chip-- "OpenGL on a Chip!"-- at the heart.
There's talk of a V12, which I think is supposed to be a two-Buzz version of V8. That, if it happens, will be exactly twice the geometry performance of V8.
Odyssey-- V6, V8, V12-- look on paper like they're light-years ahead of the nVidia stuff you find in the 230/330/530 systems. I say "look on paper" because I haven't used one myself. Disclaim, disclaim.
Finally-- my very own regeneration alcove :)
-j
-sigs of the world unite
If you wish to see what SGI graphics can do, visit the Hayden Planetarium in New York. http://www.sgi.com/features/2000/feb/hayden/index. html
There's nothing worse than a braggart. ;-)
What could the average slashdot troll do with such a box?
- faster first posts
- use the heat to cook some hot grits
- render natalie portman naked and petrified in realtime
- buy several for a super dooper beowulf cluster
- forge real looking slashdot logins (ie. bruc3 p3r3nz)
- tease signal 11
the list goes on and on....windows 2000 would actualy be as fast
as linux is on a 486 if it did
this is a joke for those who have had
there funny bone removed
Mmmmm 4 Gigs of video ram, 512 processors all with 8 meg of cache, and quake 3 mulit-proc inabled. Yummmy Wonder if there is a way to hook this to an Imax projector and get some SERIOUS fragin underway?
Or Biorythms... Then I could test the moderator water before attempting jokes
Dear "Slashdot.org" Management:
It has come to our attention that a user made use of my client's exclusive trademark ("Yahoo!") in a post to your "Specs on New SGI Onyx and Origin" messageboard on 25 July 2000. Said user did not credit my client in any manner whatsoever, or even provide a link to my client's website (http://yahoo.com). Therefore, we can only assume that my client seeks so misrepresent and libel against my client for his own personal gain. Please provide us with a means to get in touch with this user ("tehzig_112") so that we may give him a proper beatdown and molest his children. Alternatively, you may remove the posting from your messageboard, and provide oral sex to yours truly before the end of the week.
Sincerely,
Horace J. Netherson, Esq.
Law Offices of Johnson, Johnson, and Kerblinsky
I know, I know... (-1, Unfunny).
---------///----------
--
I like to watch.
I found a page on sgi.com giving a bit on the difference between VPro/Intel and VPro/Irix.
Thank you. You're my favorite poster of the week.
Ah, I love UNIX... :)
---------///----------
--
I like to watch.
That said, if you check out the "partitioning" feature of the machine, you can break it up into multiple smaller machines that can share memory (well, not yet but we're working on it :) and communicate via direct memory->memory copies.
Go Badgers! -- #include "std/disclaimer.h"
Ahh, but unfortunately, a whole new group of open source zealots are posting. I remember when Linux zealots were on everyone's nerves. Now there is BSD.
<br><i>
"BSD is a better liscense. Sooo wah!"<br>
"Look at this nifty 350mb/ps throughput we benched on freeBSD! No, I promise, we didn't doctor it"<br>
<br></i>
I have nothing against FreeBSD, but Linux is more developed/stable system. And eventually, people will just use whatever system they feel comfortable with. If BSDers and Linux users stopped fighting we could convert more ppl to the Open Source movement. And isn't that what's really important?
====
Crudely Drawn Games
Plus, usually these machines aren't strictly single-use; more often than not, there are several large simulations/computations running on them. In general, more CPUs = more large independent programs running simultaneously. --Eric
I still wonder about SGI for its title of kings of graphics. They used to be the best, but have lagged behind. They charged large sumns of money for grunty boxes, but why not just get a custom box with 4 Voodoo5's or GeForce2 GTS's and whacks of RAM and throw in a couple of Athlon's. Box that does the same graphics processing, 1/10th the price. The boxes always look so cool though.
Sounds like someone is playing a numbers game to me. The SMP effect is going to chew their lunch on performance. Say you have an application that runs on ten processors. Now, can you imagine the new performance level if you change that to 100 processors? It won't be even close to a 1000% increase. You'll have contention between the processes, and contention (especially) within the kernel. (Hell, I get it with just 12 processors, depending on the application that is running.)
If they know something I don't here, I'd love to see it.
As great as it is to see SGI's moves to utilize Linux, computers like these demonstrate that Irix still has a place in the larger picture. Irix is really a pretty neat operating system, and frankly, it can scale in ways that Linux just isn't ready to yet. As long as SGI is still making systems like these on the high end, I don't see Irix being displaced anytime soon.
Of course, Irix also has a lot of graphics production tools that you don't find on any OS, Linux included. That's something else that'll keep Irix around, at least until equivalents exist. Ideally, we'd see SGI continue to take steps toward open source/Free software, with Irix components.
Anyway, looks like a pretty cool new system from the people who brought us the original colored computer. Can't wait to get my hands on one of these.
yours,
john
A beowolf cluster of these?
seriously -- Does/Can/Will Linux/*BSD be able to scale to something like this? I know SGI supports Linux on their x86 boxes, but do they encourage Linux/MIPS development? SGI: Send Linus, Alan and friends some of these!