Slashdot Mirror


Mars Rover Spirit Back Online

Skyshadow writes "Just in time for the arrival of its twin, the Spirit Mars Rover is back in working order. Programmers at the JPL have traced the problem to the rover's flash RAM, which it uses to maintain its filesystems. They are using a ramdisk in the rover's RAM to bypass the bad flash memory, and are working on a workaround for the bad flash. Good news, but the rover is still potentially weeks away from full operational status."

25 of 386 comments (clear)

  1. heh... /. was right! by Smitty825 · · Score: 5, Interesting

    During all of the "Spirit is broken" columns, I kept reading /. comments saying that it was likely a memory error due to the non-consistent errors...I guess a million monkeys with a typewriter can be correct :-)

    --

    Doh!
    1. Re:heh... /. was right! by anubi · · Score: 2, Interesting
      Yeh.. hats off to you guys!

      I was barking up the tree of spike on the power supply. For the exact same reasons.

      I have had power supplies or bypassing go bad due to the increasing ESR of aging capacitors, and by golly they come up with the damndest intermittent failures you would ever want to see. They will have you debugging every process in the system until you put a storage oscilloscope on the power supply line and watch it like a hawk.

      --
      "Prove all things; hold fast that which is good." [KJV: I Thessalonians 5:21]

  2. The epitome of remote administration by Faust7 · · Score: 4, Interesting

    Engineers guessed that Spirit's troubles were in its Flash memory and set about sending the rover a complex series of instructions to see if they could get it to bypass the corrupted memory. Theisinger said engineers sent Spirit a command just before its daily "waking up," telling it to shut down and restart in what is known as "cripple mode," using RAM instead of Flash for its start-up instructions.

    Some people may take this sort of thing for granted, but I for one find it remarkable that we can essentially reboot and perhaps even fix a system that is on a whole other planet.

    Just wait until we have Interplanetary, Interstellar, Intergalactic Remote Desktop. I'm only half-joking.

    1. Re:The epitome of remote administration by blincoln · · Score: 4, Interesting

      It's all good until tech support says, "So... Do you have a boot disk?" :-)

      You joke, but newer servers can do this remotely too.

      We have a bunch of Compaq servers at work, and one of the really cool features of the remote administration software is that you can send a virtual floppy image to the machine from anywhere in the world that can open a web browser connection to the server's remote administration board.

      A few months ago one of our servers in Denver died, and I had to boot it up in Windows 2000's command prompt only safe mode... but the local admin password had never been written down. I was able to make virtual floppy images of a tool that resets the local admin password, send them over the wire, and boot off of them from the remote administration system.

      Okay, it's not fixing a super-expensive robot on another planet, but I thought it was pretty cool.

      --
      "...always new atoms but always doing the same dance, remembering what the dance was yesterday." -Richard Feynman
    2. Re:The epitome of remote administration by Psychotext · · Score: 3, Interesting

      Oh god... I really, really hope you have a superb firewall & username & password blocking that machine off from the world. I did just read you right, you didn't have the admin password, so you you used a tool over the remote administration to hack past it?

      Mmmm... hackalicious. :-)

      (I've actually used a similar remote kvm system with lights out boards but until you write it down it just doesn't sound that risky!)

      --
      People that believe in their opinions don't post AC.
  3. Where is the redundancy? by MWChapel · · Score: 3, Interesting

    Shouldn't they have like 5 Flash RAM's? Really,they shouldn't have one of anything. In my computer if my BIOS fries, I pop open the box and replace it. If it fries on mars, obviously I kiss my megamillion dollar project goodbye, all for a $5 Flash ROM.

    1. Re:Where is the redundancy? by Hawkxor · · Score: 2, Interesting

      First of all: obviously they've thought of that. Adding to what others have said in their responses tearing the parent apart, I'd like to mention that the problem is probably not due to a general defect in the RAM card. It probably has to do with the conditions on Mars, the landing, etc - in which case the same problem would be affecting all of the (even redundant) Flash RAM cards: so it really is amazing that they got this working at all.

  4. Static Discharge? by seven+of+five · · Score: 5, Interesting

    Is there a chance that the problem could've been caused by electrostatic discharge? Rover bounces on rubber airbags on sand, bags fold up, Rover rolls off, Rover touches rock - zap!??

    1. Re:Static Discharge? by myowntrueself · · Score: 2, Interesting

      I had been wondering.

      The sequence of events that lead up to this was, IIRC,

      1. Rover extends arm ready to take a grinder to a rock.

      2. Contact with Rover lost due to bad weather in Australia.

      3. Rover bad.

      So it had just moved part of its structure closer to the rock just before this happened.

      --
      In the free world the media isn't government run; the government is media run.
  5. Cosmic rays... by bc90021 · · Score: 4, Interesting

    ...will apparently cause one out of every trillion bits on Earth to flip randomly... I guess with less of an atmosphere, it is a bigger problem on Mars! ;)

    1. Re:Cosmic rays... by shadowmatter · · Score: 5, Interesting

      Funny you mention that. I'm taking a class on design of digital systems at my university, and my professor works for JPL. He helps design the control systems onboard space vehicles such as the Mars rover. Anyway, a majority of the class grade is based on an end-of-the-quarter project, which we complete in groups of 2 to 4. On Wednesday he expressed interest in a group developing some sort of redundancy for FPGAs that would be suitable in spacecraft. You see, on Mars, you're not shielded from huge doses of radiation as you are on earth. A healthy dose of radiation bombardment could easily reprogram an FPGA chip on the surface of Mars; ASICs chips are used to overcome this problem.

      Maybe he was gung-ho about anti-radiation redundancy because he already knew the likely problem of the Spirit. Who knows?

      - sm

    2. Re:Cosmic rays... by Anonymous Coward · · Score: 1, Interesting

      Someone may have already said this or you may have already seen it, but Xilinx has a press release about having their chips (FPGAs of course) in the rovers. They're touting the FPGA as being used as the "main brain" so you may be right on.

      http://www.xilinx.com/prs_rls/design_win/0412_mars rover.htm

      A funny side note, I think they posted the press release on the same day as the Spirit failure. That's some good pr timing ;)

  6. Nice by Omega1045 · · Score: 4, Interesting

    I have a friend who works in the field. Space travel hoses electronics bad. Triple redundancy and over-engineering is the name of the game. This is nice to hear. I would imagine that something went wrong intransit or on-landing, but they can keep going,

    --

    Great ideas often receive violent opposition from mediocre minds. - Albert Einstein

  7. flash ram is known to fail on writes after a while by Anonymous Coward · · Score: 2, Interesting

    I know a lot of ppl are using flash ram in smaller computers for booting linux or what not. Well if they are writing their logs and other things to that flash be aware that you can only write to it so many times before it fails.

    Was NASA writing to that flash or just reading? A ram drive in flash sounds like it will access/write thousands of times a ?minute? This should wear it out quickly.

  8. Information on the MER hardware. by elrond1999 · · Score: 5, Interesting

    Ive been unable to find any hard information on the design of the MER memory systems. If anyone can point me to a technical brief id be very happy.

    From what ive pieced together the MER system is something like this:

    One RAD6000 powerpc cpu.
    Connected via probably compact pci to 128 mb of ecc sdram.
    256 mb of flash. No info on what make of flash, but likely Intel since they are the biggest. There was some info from the press conference that there are actually two flash chips and that the flight software is redundantly stored on each. So does this mean that there is actually 128mb of redundant flash? Also it was said that they had problems even with the redundancy, could they possibly have overwritten something? We all know that even a redundant raid does not stop filesystem corruption.

    No information on how the flash is connected, parallell / serial? How the redundancy works?

    Btw, I guess flash is rather radiation hard since they require 10 - 20V to erase / write.

  9. Why are they sending its twin so early? by James+Lewis · · Score: 1, Interesting

    I know those guys at NASA are smart... so does anyone know why they sent opportunity right after spirit? I would think it would be better to wait and see if any problems occured with the Spirit, and learn from their mistakes. I'm sure there is an optimum aligning of the planets for a launch... but is that really so rare that they couldn't wait?

  10. Opportunity by loconet · · Score: 3, Interesting

    Opportunity is fast approaching the red planet. It should be an interesting night at JPL. Execellent work guys, good luck.

    --
    [alk]
  11. Re:Monday morning quarterback: RTOS tradeoffs by anubi · · Score: 2, Interesting
    From where I sit, I think they did damm good.

    I don't know that much about VXWorks, but I heard that one of its main assets is having a very small tight multitasking kernel.

    They were able to regain the system, despite loss of a major computational component. Remotely. Through a debug link. That sure says a helluva lot for the robustness of the OS and how they configured it.

    Good job, JPL.

    --
    "Prove all things; hold fast that which is good." [KJV: I Thessalonians 5:21]

  12. We learn from our mistakes... by Chordonblue · · Score: 4, Interesting

    So... I wonder if they'll consider validating MRAM more quickly if Flash is found to be more error prone.

    You know how NASA works. The Space Shuttle running on 486's and whatnot. I understand the science behind that reasoning, as sad as a 66 MHz processor seems to us geeks nowadays, but I wonder if MRAM will prove more flexible and stable for future space missions.

    --
    "...Well, there's egg and bacon; egg sausage and bacon; egg and spam; egg bacon and spam; egg bacon sausage and spam..."
  13. Re:You think that's neat by be-fan · · Score: 3, Interesting

    This is a bit OT, but I need to rant:

    A quote from his site: "It is incredibly frustrating watching all this happen... I can't even say the word Lisp without cementing my reputation as a crazy lunatic who thinks Lisp is the Answer to Everything"

    I feel his pain. I was introduced to Lisp not too long ago, and within a short time, a Lisp-derived language (Dylan) became my favorite. I also found that many of the features I loved from Python were very Lisp-y in nature. Now, I see Java and C# either neglecting all the knowledge garnered from the Lisp-family of languages, or reinventing it --- badly. The features in C# 2.0 have either been in Lisp for decades (lambdas, closures) or are not necessary in Lisp (iterators, enumerators --- which, btw, are theoretically not necessary in C# 2.0 either because of lambdas and closures!) This new "Xen" (or X#) language Microsoft Research is pushing takes a great idea (extending the language to fit the problem domain) that has been a part of Lisp for decades, and chops it off at the knees. Instead of having proper macros, so you can extend the language to fit *your* problem domain, they hack support for a single problem domain (back-end business programming) into the language itself!

    That said, the Lisp community is to blame as well. Part of the reason people stop listening the moment somebody says Lisp is that the Lisp community is *so* rabid and *so* unyielding. Especially some high-profile members who are highly respected within the community despite the fact that they are completely obnoxious and lack any human sense of manners.

    --
    A deep unwavering belief is a sure sign you're missing something...
  14. Solution for Next Time by wildsurf · · Score: 2, Interesting

    Flash RAID array.

    (Can this even be done?)

    --
    Weeks of coding saves hours of planning.
  15. Radiation by panxerox · · Score: 2, Interesting

    with all the radiation and very high energy particles zipping thru the spacecraft on its way there, I'm suprised any computerized spacecraft get anywhere intact.

    --
    "It's so convenient to have a system where everyone is a criminal" - A. Hitler
  16. Re:You think that's neat by be-fan · · Score: 2, Interesting

    Lambdas are just an aesthetics/keystrokes issue.
    --- :: jaw drops :: :: gets on knees ::

    I bow down to your ignorance, oh mighty King of the Cluless!

    Seriously, though, please research lambdas. They don't just save typing. They are *everything*. All of computation can be described just with lambdas of a single parameter. Everything else is just syntax suger. If you ease one restriction of the lambda calculus (no side-effects), lambdas can do procedural code, functional code, and even object-oriented programming. That's why I said iterators, enumerators, etc, are not necessary in C# 2.0, because it has proper lambdas. All of thsoe can be implemented very easily on top of lambdas.

    and start focusing on really useful subjects - like formal verification
    ---
    You can do formal verification in Lisp. Look up ACL2 (first link in Google search for that term). I'd still say that Haskell or Clean are a bit better for such purposes, but mainly because they are designed from the beginning to be comfortable to program without side-effects, while Lisp was not.

    and strong typing (which go very well together).
    ---
    C'mon. You're making this too easy. Lisp has a strong type system. All type errors are caught, unless you disable type-checking in your compiler. Maybe you mean *static* typing instead?

    That's probably because the only people who think Lisp Is The Answer to Everything are a little bit insane.
    ---
    Um, wasn't that my point? Its the "Lisp is the Answer to Everything" people that make it harder for normal people to push Lisp to areas where it would be really useful.

    --
    A deep unwavering belief is a sure sign you're missing something...
  17. We need open source rover software by HangingChad · · Score: 3, Interesting
    I'd put the /. community up against NASA any day. Instead of trying to be so secret about everything, open the software up to the community and let the collective propose solutions to some of these issues. Hey, it's our tax dollars developing all this stuff, why can't we play too?

    Besides robot exploration software would be handy right here. It would be neat to be able to send a research bot out in the deserts, deep oceans and jungle canopies of the world. Machines can go where we can't.

    Individually you can be damn annoying sometimes, but I'm constantly amazed and delighted by the collective intelligence of the /. pack.

    --
    That's our life, the big wheel of shit. - The Fat Man, Blue Tango Salvage
  18. Hear! Hear! I couldn't say it any better! by Teancum · · Score: 3, Interesting

    I was going to mod this one up, but I decided to give this reply some more emphasis by actually replying with some thoughtful encouraging words instead.

    It would be nice to be able to have some folks at JPL throw down the source code and engineering schematics and say to the geek/space/engineering community at large "We have a problem here and could use your suggestions to see if we can get this fixed."

    This (the mars missions) is obviously a big hit, as measured by replies on Slashdot, the number of hits on the website at JPL, stories in mainstream media, and other reasonable metrics to gague popularlity of a project. I'm sure that there are several geeks out there that wouldn't mind digging into the source code.

    The only reason I could see the engineers not wanting to do that is to open themselves up to obvious scrutiny for poor engineering and coding. (Whadda you mean the global variable named temp is the only variable. We also have temp2, temp3, and temp4. What do the numbers mean in those mean? You can get it from context, can't you?) That and some people just aren't used to allowing other into their "domain".

    Being 100% funded by public money should also be further reason for why this should be opened up. I also totally agree.