HA-OSCAR 1.0 Beta release - unleashing HA Beowulf
ImmO writes "The eXtreme Computing Research (XCR) group at Louisiana Tech University is pleased to announce the first public release of HA-OSCAR 1.0 beta. High Availability Open Source Cluster Application Resource (HA-OSCAR) is an open source project that aims to provide non-stop services in the HPC environment through the combined power of High Availability and Performance Computing solutions. Our goal is to enhance Beowulf cluster systems for mission-critical applications and downtime-sensitive HPC infrastructures. To achieve high availability, HA-OSCAR clusters adopt component redundancy to eliminate single points of failure, especially at the head node. HA-OSCAR also incorporates a self-healing mechanism: failure detection and recovery, automatic failover, and fail-back. The 1.0 beta release supports new high-availability capabilities for Linux Beowulf clusters based on OSCAR 3.0. It provides an installation wizard GUI and a web-based administration tool that allow a user to create and configure a multi-head Beowulf cluster. A default set of monitoring services is included to ensure that critical services, hardware components and important resources are always available at the control node."
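For readers unfamiliar with the failover and fail-back idea mentioned in the announcement, here is a minimal sketch of how a monitor might switch between a primary and a standby head node. This is not HA-OSCAR's actual code or API: the `HeadNode` and `FailoverMonitor` names, the `max_missed` threshold, and the polling model are all hypothetical, chosen only to illustrate the general technique.

```python
from dataclasses import dataclass


@dataclass
class HeadNode:
    """Hypothetical stand-in for a cluster head node."""
    name: str
    healthy: bool = True


class FailoverMonitor:
    """Illustrative monitor (an assumption, not HA-OSCAR's implementation).

    Promotes the standby after `max_missed` consecutive failed health
    checks on the primary, and fails back once the primary recovers.
    """

    def __init__(self, primary: HeadNode, standby: HeadNode, max_missed: int = 3):
        self.primary = primary
        self.standby = standby
        self.max_missed = max_missed
        self.missed = 0
        self.active = primary

    def poll(self) -> HeadNode:
        """Run one monitoring cycle and return the node currently serving."""
        if self.primary.healthy:
            # Primary is up: reset the counter and fail back if needed.
            self.missed = 0
            self.active = self.primary
        else:
            # Primary missed a health check; fail over once the threshold is hit.
            self.missed += 1
            if self.missed >= self.max_missed:
                self.active = self.standby
        return self.active
```

In this sketch a single missed check does not trigger failover; only `max_missed` consecutive misses do, which is one common way to avoid switching heads on a transient glitch.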
So on a multi-CPU server, if you started the same process synchronously on multiple CPUs, how close in time would they finish, assuming there is sufficient memory and disk controller capacity to prevent severe contention?
I make my face look like this and concerned words come out.
I actually go to Louisiana Tech. Chokchai Leangsuksun (Dr. Box), the director of the HA-OSCAR program, also teaches my Operating Systems class. He came into class today looking tired... he said he'd been working very hard on it.
I think it's about time LaTech got some recognition.
"A diplomat is a man who always remembers a woman's birthday but never remembers her age." -Robert Frost