Cool Linux Tricks With Atlas
dpilgrim writes: "Looks like some powerful players want to see Linux going toe to toe with Unix 'big iron.' Would you like to be able to run two Linuxes simultaneously on the same box? Or seemless swap processor and memory in and out of your machine? The Atlas project aims to bring you all that and more. There's a press release from TurboLinux reported here, and a more in-depth article running on SourceForge's
Linux on Large Systems Foundry."
If I'm going to spend lots of money for hardware like this anyway, why would I use Linux?
:-)
I'm not trolling, I mean it. What does Linux offer me that Solaris doesn't?
And please avoid the philosophical ramifications -- I have nothing against commercial software, except that 99% of it sucks.
--NBVB
Atlas seems to be relying upon this new Intel chipset and the Mckinley processor (one of intels new 64bit processors). This new chipset will support hotswapping it sounds like, and any motherboard maker that would use that chipset, would make sure that the slots could do that. So yes you do need a special mobo, but it wont be available for a while.
This goes hand in hand with a lot of the work planned for 2.5 to make it scale to larger systems. Linus feels that the current architecture is just about ideal for SMP's of any size, but there are really two obstacles to Linux working on big iron and competing against solutions from IBM, Sun and others. The first is scalability to NUMA machines. This issue is being addressed by the kernel development team. The second is support for the reliability features that the really high end hardware provides. That's the work described here. Together they will make Linux the winning combination even on the very high end!
Even Slashdot wants to hide some things
It's important to remember that much of Linux's competition comes not from the dreaded MS, but from commercial UNIX vendors, like Sun and IBM.
Most companies that currently employ Linux tend to use it for things like DNS, Web servers, and file sharing. Fitting Linux with enterprise features is critical in moving beyond these types of services and truly entering the enterprise world of hot plugging, scalability, and *proven* reliability.
While I realize that its reliability is more than proven to most of us here, it's important that it be proven to executives as well. Not only must it be reliable, but proven companies must have track records of standing behind the product 100%.
One concern I've heard voiced is that no company providing support for Linux will take ultimate responsiblity for a product that isn't theirs.
Get a few more years and services behind Linux, and we should see it explode.
The pomposity of the professor is inversely proportional to the difficulty and importance of the subject being taught.
Would you like to be able to run two Linuxes simultaneously on the same box?
Actually, I'd like to be able to run one Linux on N boxes, or M Linuxes on N boxes where M!=N. Just immagine a cluster of 50 machines where the failure of one machine has no effect on the operation of the cluster as a whole. There are some good projects in this area, but I don't think they can quite offer this kind of transparency.
JET Program: see Japan, meet intere
I read that the Linux kernel developers are planning a number of kernel improvements to increase overall system reliability. Some of the minor updates include a completely new VM, new block IO layer, new VFS layer, new kernel NFS server, new device naming management, new SCSI lyaer, new IDE layer, and an in-kernel web server (khttpd and TUX) for improved system reliability.
Just like the similar complete rewrites in Linux 2.0, 2.2, and 2.4, Linux once again finally be a winning combination on the very high end!
cpeterso
A lot of people are going on and on about what linux can do that solaris can't.. cross platform, open source, etc...
But I think your question was, given the Sparc platform.. why not use solaris?
At this point, you are right. Solaris is where it's at.. I mean, if you are buying Sun.. you obviously want more than just a fast machine... you want the support, etc.
But... as to why I prefer using linux to solaris, in general...
Linux is the new reference platform. new tools are developed on linux first, then ported to other unixes (the mahjority, anyway).
The number of tools quickly & easily available for linux vastly outnumbers the same for Solaris. Yes, you can get, compile, and run pretty much everything on solaris.. it's easy to port form linux to solaris.. but it's still easier to use linux.
Linux is open.. I just, well, I DO like that. Sure, I'm not gonna go out and modify a kernel.. but it means I'm not necessarily stuck with what Sun tells me I'm stuck with.
Would I buy a server farm of solaris boxes and run linux on it now? no.
Would I if Linux sparc supportw as as good as it's intel support? probably (once SPM is fixed)
This isn't a linux issue. It's a hardware issue.
The significant thing about 'big iron' is that it's an enabling hardware technology.
Once you have it you can write firmware and software that creates the illusion that the hardware never fails
Until you have it, you can't.
The hardware described looks about right - if they handled machine checks properly. (And the fact that they even used the term implies they either did or are trying.) Basic idea: The machine catches ANY error, with enough state saved that you can:
CORRECT the error
IDENTIFY any failed components,
MOVE tasks to non-failed components or reconfigure the failing components to limp along,
NOTIFY the OS of any problem, so it can do things like start moving things off a dying component, and
pick up the computation where it left off WITHOUT the error.
When you can do this you can write a modified Linux, Windows, BeOS, or what-have-you that can do the things a mainframe can. (But you'll need to have a REALLY reliable OS for your starting point - you're now talking uptimes measured in decades. The software better not take the system down in the absense of hardware trouble, and there IS NO hardware troube. B-) )
Hot-swappable parts are more a side-effect than something key. You have to be able to hot-swap to replace a broken part with the system live. Once you have the ability to hot-swap in a replacement for a failed part and add it back into a running domain, it's trivial to generalize that to "fix" parts that were "bad" because they had never been installed.
Partitioning is also implied: You need a minimum of two domains ("virtual machine" subsets of the total device) - working (where the live system is) and diagnostic (where the maintainence guys check out the parts). Once you have that mechanism, making a LARGE number of working domains (with varying amounts of resource, including full or time-shared CPUs) is straightforward.
Bantam Dominique roosters crow a four-note song. Once you've heard it as "Happy BIRTHday" you can't NOT hear it that way