Slashdot Mirror


Reformatting a Machine 125 Million Miles Away

An anonymous reader writes: NASA's Opportunity rover has been rolling around the surface of Mars for over 10 years. It's still performing scientific observations, but the mission team has been dealing with a problem: the rover keeps rebooting. It's happened a dozen times this month, and the process is a bit more involved than rebooting a typical computer. It takes a day or two to get back into operation every time. To try and fix this, the Opportunity team is planning a tricky operation: reformatting the flash memory from 125 million miles away. "Preparations include downloading to Earth all useful data remaining in the flash memory and switching the rover to an operating mode that does not use flash memory. Also, the team is restructuring the rover's communication sessions to use a slower data rate, which may add resilience in case of a reset during these preparations." The team suspects some of the flash memory cells are simply wearing out. The reformat operation is scheduled for some time in September.

155 comments

  1. Hey, Bob, this is Jim by Anonymous Coward · · Score: 5, Funny

    We're gonna need you to go out to the rover and reboot it. Yeah, it got stuck. You should probably leave ASAP.

    1. Re:Hey, Bob, this is Jim by Anonymous Coward · · Score: 0

      NASAP?

    2. Re:Hey, Bob, this is Jim by Anonymous Coward · · Score: 0

      "Hey, I see you're trying to rewrite some flash there. Would you like me to help?" - NASAClippy

  2. Simple fix by SternisheFan · · Score: 2

    Easy, just gotta' replace the button battery.

    1. Re:Simple fix by sillybilly · · Score: 2

      It's running on solar power, that's how it lasts 10 years. Though the rechargeable battery must be tough to take so many recarchings.

      Ideally, you have redundant systems for such a situation, where you can take one of them down and use the other to do the booting, formatting, programming, as if there were a user sitting right next to it. They say it has a flashless mode of operation, but the way I think of it, as in a regular PC, with a BIOS, you can reformat the harddrive without booting off of and using the harddrive, such as booting from a floppy, or even ROM chip they used to have back in the 80's (ROM-DOS 3.3 or ROM BASIC). So when flashing a BIOS or a ROM chip, there is no lower level to boot from, but if you have Tandem, dual redundant systems for everything, you can boot from the lowest of lowest levels and have the partner system execute all the commands. So with Tandem failure is less frequent, as in, you're down to 50% capacity but still fully functioning ok, and can work on regaining the 100% capacity, while not using regular operations, for two days and the like. The problem with Tandem is the double or higher cost, and, in space missions, the extra power consumption and extra weight, and in space missions, weight is almost everything, as each lb has to be paid for dearly, on the order of $10,000/lb low Earth orbit, and who knows how many gazillion dollars per lb for a Mars mission.
      There used to be a company named Tandem, designing dual CPU redundant resilient failure tolerant systems, but they fell behind on chip design because of small size, plus high expense, and did not compete well in the computing field. For instance back in 1999 when Google started, they started with regular pc's of whateve the vogue of the day was, I don't know, 700 MHz PIII, maybe? And just jerry rigged a bunch of them into a daisy chain and voila, you have a Tandem-like, more than dual, more like thousandfold or millionfold duplicated, resilient supercomputer. But the principle of tandemness and fault tolerance was there. Maybe for space missions that need fault tolerance like that, it may be worth the extra rocket fuel weight in the first place to double the weight and duplicate most critical systems. The human body duplicates kidneys, lungs, but not liver, or heart, so there is a balance on what you want to go redundant on and what not. Life is easy with 2 kidneys, some people can live with only one kidney, but it's really difficult to live with zero kidneys.

    2. Re:Simple fix by Anonymous Coward · · Score: 1

      Wow. Talk about missing an obvious joke and over-thinking the response. Seriously epic *WHOOSH*

    3. Re: Simple fix by Anonymous Coward · · Score: 2, Funny

      Ass-burgers.

    4. Re:Simple fix by davester666 · · Score: 2

      what's this step 4??

      Press the reset button.

      Who the hell designed this stuff?

      --
      Sleep your way to a whiter smile...date a dentist!
    5. Re:Simple fix by sound+vision · · Score: 1

      tl;dr on the whole post BUT... I've had my iPod nano in daily use for the past 8 years and it's still going strong. True, it doesn't need to power any motors - but the design specs probably also allocate a lot less weight to the battery.

    6. Re: Simple fix by LinuxLuver · · Score: 1

      Don't think he missed it. The previous comment probably just provoked a related tangent.

      --
      Only boring people are ever bored.
    7. Re:Simple fix by Anonymous Coward · · Score: 0

      What conditions do you use your iPod Nano in? What's the daytime temperature, nighttime temperature, how much radiation is it exposed to, what is the atmospheric pressure it's exposed to routinely, and what is the chemical composition of said atmosphere?

      Also, how many rocket launches and atmospheric entries from traveling at thousands of miles per hour to a relative HALT has your iPod Nano made? What magnetic fields is your iPod Nano routinely exposed to? Finally, does Apple warrant any model of iPod for use on any planet other than Earth, or indeed in the vast, mostly empty space between said planets?

      Personally, I wish them all the luck in the world on this, and as impressive as your iPod may be, it's no Mars Rover. Perhaps you were being sarcastic; in that case, so am I.

  3. And I thought I was cool... by toygeek · · Score: 1

    When I reboot machines in Asia or UK/EU using IPMI from the US.

    1. Re:And I thought I was cool... by marcello_dl · · Score: 4, Funny

      And I thought I was cool when I reboot servers around the world thinking I am rebooting mine.

      --
      ---- MISSING MISCELLANEOUS DATA SEGMENT --- [sigdash] trolololol
    2. Re: And I thought I was cool... by Anonymous Coward · · Score: 0

      You should try my random shutdown program.

    3. Re: And I thought I was cool... by Anonymous Coward · · Score: 0

      Apparently I have.

  4. If there is a problem and need to call "support" by Anonymous Coward · · Score: 1

    do they get sombody in or from India?

  5. Send someone by TheDarkMaster · · Score: 1

    With a replacement SLC SSD and a screwdriver

    --
    Religion: The greatest weapon of mass destruction of all time
  6. ECC? by TechyImmigrant · · Score: 5, Funny

    They didn't do any ECC on the flash memory? I thought these people were rocket scientists.

    --
    I should use this sig to advertise my book ISBN-13 : 978-1501515132.
    1. Re:ECC? by Anonymous Coward · · Score: 0

      In space, no one can hear you trolling.

    2. Re:ECC? by Anonymous Coward · · Score: 3, Insightful

      As it happens, for flash, read errors are often transient. A better model than DRAM style ECC is to treat it more like a disk drive with checksums on each block. If you get an error, reread the block. And if you have a problem writing a block (e.g. the readback is wrong), just use a new block. Surely you've noticed that your USB thumbdrive gradually gets smaller with time as blocks wear out. (In space hardware, back in 2000, wear leveling was done manually.. still is as far as I know.. there's no nice rad-hard flash controller chips to make a big pile of MLC flash look like a disk drive, etc.)

      The long duration radiation performance of flash memory (particularly back in 2000, when these things were being designed) was/is not particularly well understood. There are a lot of what is called Enhanced Low Dose Radiation Effects (ELDREs) that are poorly understood for all semiconductor devices: you can't just blast the part in an accelerator at 1kRad/hr for a few days to get to a few hundred kRad and expect that this is the same as taking a few tens of Rad/hr over days and days and days, with 12 hours off after the sun goes down to anneal and heal.

      And, because resources on spacecraft are very precious, one doesn't blindly head off and say "let's just TMR everything". You make a rational choice based on the expected design life and the data you do have and pray for the best.

      And, of course, the design life was 3-6 months, and here we are 10 years later, still cranking along. I think it's done pretty well, all things considered.

    3. Re:ECC? by Anonymous Coward · · Score: 2

      The rocket scientists did their job ten years ago. They're working at McDonalds now.

    4. Re: ECC? by Anonymous Coward · · Score: 0

      ELDRS actually stands for "enhanced low dose rate sensitivity", and it's normally simulated using a Co-60 gamma source, not a particle accelerator. It's primarily a concern for bipolar devices, not MOS devices like flash memory, although radiation effects in general are difficult to predict.

    5. Re:ECC? by Anonymous Coward · · Score: 0

      They didn't do any ECC on the flash memory? I thought these people were rocket scientists.
      Rocket scientists *on a limited budget*. ... another version of the same:
      I guess the question I'm asked the most often is: "When you were sitting in that capsule listening to the count-down, how did you feel?" Well, the answer to that one is easy. I felt exactly how you would feel if you were getting ready to launch and knew you were sitting on top of two million parts -- all built by the lowest bidder on a government contract. -John Glenn talking about his early days at NASA preparing for lift off in Friendship 7

    6. Re:ECC? by Anonymous Coward · · Score: 0

      This is the most important post in the discussion.

    7. Re:ECC? by schlachter · · Score: 2

      Well, in their defense, ECC on the flash memory isn't exactly rocket science.

      --
      My God can beat up your God. Just kidding...don't take offense. I know there's no God.
    8. Re: ECC? by Anonymous Coward · · Score: 1

      This would make an interesting movie plot where they have to recall all the older, laid off rocket scientists working at McDonald's and bagging groceries at the supermarket to reboot an idle probe on a far away planet because it's the only one that can be repurposed to save the earth from an asteroid impact. But only the old guys know the hardware and can reprogram the firmware.

      Yeah I'm a laid off old guy. Get off my lawn!

    9. Re: ECC? by rickb928 · · Score: 1

      And add in the volunteer group that decided to save the project, working out of an abandoned McDonald's.

      Oh, wait....

      --
      deleting the extra space after periods so i can stay relevant, yeah.
    10. Re:ECC? by TechyImmigrant · · Score: 0

      If you're so smart, why aren't you advocating using BCH codes or Reed Solomon codes or some form of forward error correction code over code and data stored in flash so random bit errors in flash won't affect the code that is stored in the flash? What is your super clever alternative?
       

      --
      I should use this sig to advertise my book ISBN-13 : 978-1501515132.
    11. Re:ECC? by Nimey · · Score: 4, Insightful

      You're a poster child for Dunning-Kruger: some random on the Internet who thinks he's smarter than the folks who designed a Mars rover that lasted over 10 years past its 90-day expected life.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    12. Re:ECC? by Anonymous Coward · · Score: 0

      Even with error correction, there will be undetected errors and unrecoverable errors. Obviously, the degenerate case is where the drive isn't functional - no amount of error correction can help you. Mars is a high-radiation environment, so these drives have probably seen a fair amount of abuse.

      My guess is that they *do* use error correction (after all, even consumer flash drives use ECC), and that the high-radiation environment and normal wear-and-tear are simply causing the flash drives to start failing.

    13. Re:ECC? by DMUTPeregrine · · Score: 1

      There's also the matter that better ECCs cost more overhead. You can detect single bit errors with a simple parity bit, but double errors will go undetected. And even something like Reed-Solomon can't correct all the errors it can detect. Spacecraft going to mars have very limited mass budgets, there are often better places to spend the extra mass than on an additional redundant flash chip (and associated circuitry).

      --
      Not a sentence!
    14. Re:ECC? by Noah+Haders · · Score: 1

      They clearly overspecced it. Maybe if had been designed more reasonably they could have sent more.

    15. Re:ECC? by Anonymous Coward · · Score: 0

      They didn't do any ECC on the flash memory? I thought these people were rocket scientists.

      NASA laid-off the rocket scientists replacing them with astroturfers as a cost-saving measure. HR thought the astroturfers could double as landscapers for future colonies.

    16. Re: ECC? by Anonymous Coward · · Score: 0

      This sounds like what Deep Impact should've been.

    17. Re:ECC? by Anonymous Coward · · Score: 0

      Yeah, they over-engineered the thing by at least a factor of 40. Clearly they were geniuses.

    18. Re: ECC? by Anonymous Coward · · Score: 0

      And the pile of hooker corpses in the freezer.

    19. Re:ECC? by M1FCJ · · Score: 2

      Most of the hardware cost is the launch vehicle, not the rover.
      Most of the people (salary) cost is the people working on the data generated (all accross the universities around the world who analze the data and write papers), not the designers.

      Underspeccing it wouldn't have saved much.

      There's one that breaks this rule, the JWST. Just the endless redesigns have gobbled up so much money, I don't believe there will be enough Science generated by it to cover the build & launch costs.

    20. Re: ECC? by Anonymous Coward · · Score: 0

      Not a Co-60 source, too fast. Maybe a Cs-137 (e.g. about 1krad/month). But there are no known ELDS mechanisms for flash. And the radiation environment on the surface of Mars is not that bad. It has SOME atmosphere, not as much as earth. It's really great that it went this long without a real problem.

    21. Re: ECC? by daremonai · · Score: 1
      No wonder the last quarter-pounder I had tasted a little odd.

      Odder than usual, I mean.

    22. Re:ECC? by lightbounce · · Score: 2

      ECC use is standard with all flash storage. Flash is so unreliable that it can't be used without it, and it has nothing to do with the hard radiation environment on Mars. As for wear leveling, it's been standard since at least 1990 with the first attempts at flash storage. Why the rovers don't do it, I don't know. Maybe because it requires too many cycles of an already limited processor, plus dedicated storage space to keep "use counts" of all the flash blocks.

    23. Re:ECC? by Anonymous Coward · · Score: 0

      so dunning-kruger explains president obama?

    24. Re:ECC? by Agripa · · Score: 1

      The long duration radiation performance of flash memory (particularly back in 2000, when these things were being designed) was/is not particularly well understood.

      Flash is another form of floating gate memory. Wouldn't the known long duration performance of EPROM and EEPROM apply?

    25. Re:ECC? by Agripa · · Score: 1

      ECC use is standard with all flash storage. Flash is so unreliable that it can't be used without it, and it has nothing to do with the hard radiation environment on Mars.

      NOR Flash does not normally use ECC and has reliability closer to that of EEPROM than NAND Flash.

    26. Re:ECC? by TechyImmigrant · · Score: 1

      >You can detect single bit errors with a simple parity bit

      You can detect (2^32-1)/(2^32) of every possible failure pattern with a CRC. With a combination of a multiple bit error correction algorithm (with most correction schemes n bits can be corrected with 2n redundant error correction bits) and then the CRC can be used to tell if you correctly corrected the data.

      --
      I should use this sig to advertise my book ISBN-13 : 978-1501515132.
    27. Re:ECC? by WaffleMonster · · Score: 1

      You're a poster child for Dunning-Kruger: some random on the Internet who thinks he's smarter than the folks who designed a Mars rover that lasted over 10 years past its 90-day expected life.

      Not too often but occasionally the stupid get lucky and in some perverted way lack of knowledge and consideration of detail can lead to better outcomes.

      After awhile one has to admit having to be careful when you transmit for fears it would even be possible for commands to be misinterpreted or designing something which knowingly continually writes to flash memory using DOS era FAT filesystems is not a winning play no matter how much you throw the reliability arguments at the wall and expect them to stick.

      And all those commenting about what they instinctively noticed with their ignorant eyes as curiosity's chintzy wheels turning out to in fact be objective reality.

      The engineers might be smarter than us fools and idiots yet it does not automatically follow they were actually correct to make a particular tradeoff or the fools and idiots don't have a point.

      Usually best to stick to the facts and make arguments from merit vs accusing people of staying at a holiday inn express last night.

    28. Re:ECC? by TechyImmigrant · · Score: 1

      >You're a poster child for Dunning-Kruger

      Actually I am an engineer who has designed many error correction circuits for communication and storage systems. I think I know how much I know about error correction systems, which is plenty for this conversation.

      While the statement was made in Slashdot jackass style, the question is legitimate. Why didn't they do any or more ECC on the flash that is failing. There is probably a perfectly fine answer like "We knew the expected error rate and It was designed to last 10 times longer than the system", but the system lasted 40 time longer so the ECC correction capacity was exceeded". or "We had TMR, but the give then age of the system the error rate is now such that the error collision probability is too high". or "This flash claimed to be rad hardened but it turns out it isn;t".

      I'd like to know the answer because I like techy shit.

      --
      I should use this sig to advertise my book ISBN-13 : 978-1501515132.
    29. Re:ECC? by romons · · Score: 1

      On the other hand, it is all still working. It reboots occasionally. My computer does that. By reformatting, they will map out any bad sectors, which is probably the issue, and it'll run for another 10 years. Sounds like a smart technology tradeoff to me. Use cheap, off the shelf hardware, and KISS it to death. Write a special driver, or build special hardware to do ECC, and you end up with a bug that causes the system to freeze in an unrecoverable way.

      --
      Go to Heaven for the climate, Hell for the company -- Mark Twain
  7. Remote management by Anonymous Coward · · Score: 0

    Not that's a serious remote management of system :P

    Just wondering if flash memory cells are dying, they need to comeup some way to go arround those...

    1. Re:Remote management by ledow · · Score: 5, Informative

      Not really...

      The chances are that "reformat" isn't what we think and includes one of more of:

      1) Rewriting cells and allowing wear-levelling and sector-replacement to take place, and make bad sectors as bad.
      2) Write-testing and manually avoiding those sectors that don't perform as expected.
      3) Rewriting all the critical storage functions to avoid the already-known bad sectors.

      It's the kind of thing that anyone can play with. Not saying it's not risky on a remote device, but BadRAM etc. patches have been in places for years and that's a way to run Linux on machines with faulty ***RAM****, not just long-term storage.

      Many years ago, a bad sector on your hard drive was something you found out with scandisk (or previous tools) and then it was marked as bad and that was the end of that. Your PC wouldn't use it and so long as it wasn't the boot sector, that was the end of that. It was only the "creeping" bad sectors, where you got more bad sectors over time, that would really worry anyone.

      I imagine that it's not at all difficult to make sure that multiple boot sectors were in place if you really wanted to but why bother? The chances are billions to one. Chances are this hardware has MUCH better fault tolerance and multiple hardware watchdogs, firmware, and boot attempts to make sure it eventually gets back up SOMEHOW.

      There's a reason that even FAT stores two copies of the allocation table, why Linux ext filesystems store multiple copies of the superblock, etc. They come from a legacy where the occasional bad sector wasn't a problem and where 20Mb of hard drive cost more than the computer did so it was better to cope with the fault than just tell people to buy a new one. And their predecessors were (and still are) mainframes with hardware that's just that fault-tolerant in the first place anyway.

      It's not at all hard to write a filesystem that can cope with not only damage, but even recurring damage. You've seen PAR files presumably? The same could easily be done on a filesystem-level basis (and I imagine, somewhere, already is for some specialist niche).

      It's not that big a deal once they KNOW that's the problem. The biggest problem is that they only "suspect" that's the problem.

    2. Re:Remote management by pegdhcp · · Score: 1

      Ultrix used to mark bad sectors on the fly, as far as I could remember, if the disk was not a SCSI...

    3. Re: Remote management by Anonymous Coward · · Score: 0

      Only the earliest MFM disks needed the OS to mark bad sectors because there was no intelligence in the disk controller. As you said. .. if not SCSI. Modern ATA and SATA controllers will mark bad blocks on each write and remap the bad area so the OS never knows. Blocks that degrade over time or get damaged at power on/off are the troublesome ones and are only found when one day you can read those blocks any more.

      So scrub your disks weekly to find these blocks early.

      I always thought that the disk controller should do idle scrubbing. Are there any modern SATA disks that do this?

    4. Re:Remote management by lgw · · Score: 1

      You've seen PAR files presumably? The same could easily be done on a filesystem-level basis (and I imagine, somewhere, already is for some specialist niche).

      While all hard drives now do their own Hamming error correction (or something better), RAID2 is the same idea for "raw" storage that doesn't: you write explicit ECCs to redundant volumes to allow recovery from both drive loss and bad sectors.

      RAID5 with modern drives gives all the same resiliency, as the drives do the block-level ECC themselves, so you never see RAID2. But for a pile of flash memory, that's the filesystem-level equivalent of PAR files.

      --
      Socialism: a lie told by totalitarians and believed by fools.
    5. Re: Remote management by M1FCJ · · Score: 1

      Hah, I remember running the DOS debugger, poking into a certain address in the memory to access the MFM BIOS, then you could do a low level format where you could enter the sectors to mark as bad. Those were the days...

    6. Re: Remote management by M1FCJ · · Score: 1

      "g=c800:5."
      Hah, I almost remembered that one, good old Seagate controllers. I had the 800 but not the rest. :)

    7. Re:Remote management by MasterOfMagic · · Score: 1

      It's not at all hard to write a filesystem that can cope with not only damage, but even recurring damage. You've seen PAR files presumably? The same could easily be done on a filesystem-level basis (and I imagine, somewhere, already is for some specialist niche).

      You mean like RAID-5? Because RAID-5 was part of the inspiration for the PAR2 format.

    8. Re: Remote management by lightbounce · · Score: 1

      I always thought that the disk controller should do idle scrubbing. Are there any modern SATA disks that do this?

      No, the drives themselves don't do this because it pulls the head away from where the host wants/expects it to be. This would result in a lot of unexpected thrashing. If scrubbing is to be done, it is best done by the OS as a background task.

    9. Re:Remote management by Anonymous Coward · · Score: 0

      They did shit like this all the time with earlier probes. They'd get a cosmic ray hit that took out some memory address(es) and then have to rewrite the code and tell the system not to use those addresses.

      Old news.

      As to their HW skills, Pioneer craft are still out there talking and listening; I leave it to you to look up how old they are.

  8. Alternative Title by wisnoskij · · Score: 4, Insightful

    How to brick a 2.5 billion dollar device.

    --
    Troll is not a replacement for I disagree.
    1. Re:Alternative Title by TapeCutter · · Score: 1

      Not sure if it was opportunity or its twin, but one of them required a modem reset not long after landing.

      --
      And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.
    2. Re:Alternative Title by Anonymous Coward · · Score: 0

      Can't tell if trolling or just stupid.

    3. Re:Alternative Title by rasmusbr · · Score: 3, Insightful

      I would imagine that the system probably boots itself off of a ROM chip that has a routine for receiving data from Earth and storing it in RAM and then flashing that data onto the flash chip.

      If the rover does not boot from ROM then it is a miracle that it hasn't bricked itself yet.

    4. Re:Alternative Title by wisnoskij · · Score: 1

      And the state of the hardware. Some unknown number of systems on the real curiosity are degraded to the point of malfunctioning; And they have little to no way of exactly measuring what and where.

      --
      Troll is not a replacement for I disagree.
    5. Re:Alternative Title by dotancohen · · Score: 1

      And the state of the hardware. Some unknown number of systems on the real curiosity are degraded to the point of malfunctioning; And they have little to no way of exactly measuring what and where.

      Opportunity. Curiosity is on the other side of Mars, nuturing holes in its wheels and looking for cats to kill.

      --
      It is dangerous to be right when the government is wrong.
    6. Re:Alternative Title by Zarhan · · Score: 5, Interesting

      Not modem reset. The filesystem on Spirit had bunch of temp files and other stuff from the Earth-Mars flight, and apparently it just ran out of inodes. So basically they had to remote into whatever constitutes a bootloader with 20 mins of latency and remove some of the no-longer-needed files.

      See http://science.slashdot.org/st...

    7. Re:Alternative Title by Nimey · · Score: 1

      Never attribute to malice that which is adequately explained by stupidity.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    8. Re:Alternative Title by Agripa · · Score: 1

      I would imagine that the system probably boots itself off of a ROM chip that has a routine for receiving data from Earth and storing it in RAM and then flashing that data onto the flash chip.

      I wonder if the ROM would actually be a floating gate ROM instead of mask ROM or fuse based PROM in which case it would be more like EPROM or NOR Flash.

      Does anybody even make mask ROM or fuse based PROM any more?

    9. Re:Alternative Title by rasmusbr · · Score: 1

      I checked and it is EEPROM. And there are two EEPROM:s, I presume those are for redundancy in case one gets zapped.

  9. Is it running Windows? by mark_reh · · Score: 2

    Is it?

    1. Re:Is it running Windows? by Psychotria · · Score: 4, Informative

      You're probably joking, but the OS is VxWorks.

    2. Re:Is it running Windows? by Anonymous Coward · · Score: 0

      which has a similar reputation as a costly humongous beast sucking away any discussions of alternatives

  10. Err, if you're a system admin.. by Viol8 · · Score: 4, Funny

    ... you're not cool. Period. Sorry.

    1. Re:Err, if you're a system admin.. by Anonymous Coward · · Score: 1

      Everybody is a system admin when linux is involved, and I like it that way. But I digress.
      alias halt='echo Use shutdown instead'
      alias reboot='echo Use shutdown instead'

    2. Re:Err, if you're a system admin.. by Anonymous Coward · · Score: 0

      alias halt='echo "Use shutdown instead"'
      alias reboot='echo "Use shutdown instead"'

      FTFY

    3. Re:Err, if you're a system admin.. by Anonymous Coward · · Score: 1

      Typing out "Period" makes you look retarded.

    4. Re: Err, if you're a system admin.. by Anonymous Coward · · Score: 0

      Shell noob. echo doesn't need its arguments quoted unless one of those arguments might expand because of shell globbing and you don't want that globbing to occur. In this simple case of fixed strings definitely not.

    5. Re: Err, if you're a system admin.. by Anonymous Coward · · Score: 0

      Not just globbing -- also string splitting (and unless you know IFS, you don't know what an expansion will split on). Granted, not the case here, but good practices are good practices.

    6. Re:Err, if you're a system admin.. by marcello_dl · · Score: 1

      Err, if taking a server offline, no matter the reason, is a serious problem, then you are not a good - or properly funded - sysadmin.

      --
      ---- MISSING MISCELLANEOUS DATA SEGMENT --- [sigdash] trolololol
    7. Re:Err, if you're a system admin.. by TeknoHog · · Score: 3, Funny

      Chill out. They're just having that time of the month.

      --
      Escher was the first MC and Giger invented the HR department.
    8. Re:Err, if you're a system admin.. by Anonymous Coward · · Score: 0

      parent here, before fixing you might want to see if it was broken...

    9. Re:Err, if you're a system admin.. by bbsalem · · Score: 1

      I think he was talking about "In the current directory" :-)

  11. 2.5 billion? by Viol8 · · Score: 1

    I dunno so much these days. Its 10 years old and got a few miles on the clock plus collection for the new owner would be an issue. On the plus side vandalism won't be a worry. For a few centuries anyway.

  12. MRAM now or some other tech by Anonymous Coward · · Score: 0

    What non-volatile memory would you use now

  13. Alternative Title by Whiternoise · · Score: 2

    They will almost certainly do a dummy run on an identical piece of flight hardware on Earth. The only difference is how the data are sent.

  14. Why is it not trivial? by nurb432 · · Score: 1

    Why didn't they plan ahead for this sort of operation in the beginning, making it painless and 'reliable' ( as possible ).

    --
    ---- Booth was a patriot ----
    1. Re:Why is it not trivial? by beelsebob · · Score: 1

      Who says they didn't?

    2. Re:Why is it not trivial? by Anonymous Coward · · Score: 0

      wtf? Obviously, they did plan ahead, which is why they are able to reflash the chip. But if you mean plan ahead by running patrol reads or scrubbing, I imagine they didn't waste time and resources in much the same way as you dont to an extensive memory diagnostic test before you open an email. What kind of handwaving armchair wannabe are you?

    3. Re:Why is it not trivial? by SeaFox · · Score: 2

      Why didn't they plan ahead for this sort of operation in the beginning, making it painless and 'reliable' ( as possible ).

      That's a joke, right? We are talking about one of the two rovers that was sent to Mars on a mission planned to only last 90 days. They didn't see "flash memory wearing out from use" as a contingency they needed to plan for.

    4. Re:Why is it not trivial? by nurb432 · · Score: 1

      What kind of handwaving armchair wannabe are you?

      One that plans ahead well enough that this would not considered 'news'. Instead it would be just SoP.

      --
      ---- Booth was a patriot ----
    5. Re:Why is it not trivial? by Nimey · · Score: 0

      You're a poster child for Dunning-Kruger: some random on the Internet who thinks he's smarter than the folks who designed a Mars rover that lasted over 10 years past its 90-day expected life.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    6. Re:Why is it not trivial? by nurb432 · · Score: 0

      Who said i *think* that? I am smarter, which is an actual fact. ( tho to you, not knowing who i really am, I could appear to be a random person so i will give you that )

      --
      ---- Booth was a patriot ----
    7. Re:Why is it not trivial? by Anonymous Coward · · Score: 0

      You are a tool. It's a "fact" that you are smarter? Prove it, asshole.

    8. Re:Why is it not trivial? by Nimey · · Score: 1

      Put up or shut up: who are you, really?

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    9. Re:Why is it not trivial? by nurb432 · · Score: 1

      Don't need to prove anything to a potty mouthed coward.

      --
      ---- Booth was a patriot ----
    10. Re:Why is it not trivial? by nurb432 · · Score: 0

      Ever hear of VX gas?

      --
      ---- Booth was a patriot ----
    11. Re:Why is it not trivial? by Nimey · · Score: 1

      If you're going to troll and be full of shit, save us both time and say so up front.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    12. Re:Why is it not trivial? by Nimey · · Score: 1

      Further, if this is you:
      http://www.fiero.nl/cgi-bin/fi...

      You're conclusively an idiot. Only an idiot believes in homeopathy.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    13. Re:Why is it not trivial? by nurb432 · · Score: 0

      It was just a question. It had nothing to do with who I am. Think of it as a quiz to see if you deserve to know who i am.

      However, you failed question #1 miserably, by not answering a simple yes/no question, and instead being a smart-ass. Thus not deserving.

      --
      ---- Booth was a patriot ----
    14. Re:Why is it not trivial? by Nimey · · Score: 1

      Boring troll is boring. Put your back into it, boy.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    15. Re:Why is it not trivial? by nurb432 · · Score: 1

      Sorry, not the same person. No "nickname" is overly unique in the online world these days. Its not 1980 anymore.

      But i dont want to blow my karma.

      --
      ---- Booth was a patriot ----
    16. Re:Why is it not trivial? by Nimey · · Score: 1

      Yeah, bullshit. Your nickname has enough entropy that it's exceedingly unlikely this is not you.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    17. Re:Why is it not trivial? by nurb432 · · Score: 0

      See, the problem is i wasn't doing that. You simply failed to impress me enough to deserve the truth. I rarely give anyone that chance, so you should feel special. But its nice to know you performed as expected.

      --
      ---- Booth was a patriot ----
    18. Re:Why is it not trivial? by nurb432 · · Score: 1

      Nah, its not. But believe what you wish, it is a free country. ( assuming a US citizen here )

      --
      ---- Booth was a patriot ----
    19. Re:Why is it not trivial? by Nimey · · Score: 1

      I don't believe a word of it.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    20. Re:Why is it not trivial? by Nimey · · Score: 1

      You really need to level-up your trolling. This is 101-level shit, son.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    21. Re:Why is it not trivial? by nurb432 · · Score: 1

      You are more than welcome to get a court order and demand IP addresses, then compare them. I'm sure both places log IP+posts.

      --
      ---- Booth was a patriot ----
    22. Re:Why is it not trivial? by Nimey · · Score: 1

      *snore*

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    23. Re:Why is it not trivial? by nurb432 · · Score: 0

      Your choice. I offered a valid solution.

      --
      ---- Booth was a patriot ----
    24. Re:Why is it not trivial? by Anonymous Coward · · Score: 0

      SoP of a certain size (or mental importance) is always news. News about how a well-planned and executed operation goes smoothly is great... and news about vintage space gear is certainly "news for nerds".

    25. Re:Why is it not trivial? by Anonymous Coward · · Score: 0

      Come on Anthony, we all know already.

    26. Re:Why is it not trivial? by sound+vision · · Score: 1

      The specifications for this mission were that the rover should last 6 months on the surface. Currently we are at over 100 months.

    27. Re:Why is it not trivial? by TheDarkMaster · · Score: 1

      Do not waste your time with the Nimey, he's just another troll who can not understand what others write and he take pleasure in offending others when writing.

      --
      Religion: The greatest weapon of mass destruction of all time
    28. Re:Why is it not trivial? by nurb432 · · Score: 1

      Ya, i figured that out, a bit too late.

      --
      ---- Booth was a patriot ----
  15. Format by Anonymous Coward · · Score: 0

    Just format, not reformat. /troll

  16. Assumptions by DaMattster · · Score: 1

    I believe NASA is operating under the assumption that the rover's on board flash memory is still serviceable. 10 years ago flash memory was still in its relative infancy. A reformat and reload risks bricking the rover completely.

    1. Re:Assumptions by beelsebob · · Score: 2

      I believe you're assuming that the flash used on a rover that went to mars, and encounters all kinds of crazy radiation, is in some way similar to the crappy OCZ thing you stuck in your PC 10 years ago.

    2. Re:Assumptions by ssufficool · · Score: 0

      And that is why RAID 0 wear leveling is useful in this case.

    3. Re:Assumptions by Nimey · · Score: 0

      You're a poster child for Dunning-Kruger: some random on the Internet who thinks he's smarter than the folks who designed a Mars rover that lasted over 10 years past its 90-day expected life.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    4. Re:Assumptions by M1FCJ · · Score: 1

      Don't forget, we don't hear what the techies are talking about. What we're hearing is what the techies told to the PR guy distilled down to a journo, being summarized in The Register (!) and some other soft-tech sites, finally an inaccurate summary on the frontpage of Slashdot.

      I wouldn't be surprised if it were just a "fsck.ext4 -cc" (I know it's not an ext4, it was't even released when Opportunity soft-crashed and bounced around on Mars nor it runs Linux).

    5. Re:Assumptions by M1FCJ · · Score: 1

      Assigned a macro to this reply, haven't you? Clever boy, want a medal?

  17. Err, if you're a system admin.. by Anonymous Coward · · Score: 0

    Jokes about irrelevant characteristics aren't funny. Period.

  18. Re:If there is a problem and need to call "support by sillybilly · · Score: 2

    I'll be glad to help you with that Sir.

  19. Alternative Title by Anonymous Coward · · Score: 0

    Right, cause trying to extend the life of a 2.5 billion dollar device which was intended to last ~92 days and has instead lasted 4,072 days so far would be considered a total failure it that attempt failed and bricked it. We should just let it continue to crash.

  20. Failing flash cells? by Twinbee · · Score: 1

    I didn't realize they used OCZ for the storage tech. ;)

    --
    Why OpalCalc is the best Windows calc
    1. Re:Failing flash cells? by Anonymous Coward · · Score: 1

      It was designed to last 3 months and failed after 10 years.
      If OCZ was involved, it'd be the other way around. ;)

  21. I"d hate to be the guy by TigerPlish · · Score: 1

    I'd hate to be the guy a) pitching this operation at the change control meeting, and b) the guy signing off on this change.

    --
    The "Civilized World" jumped the shark ca. 1973.
    1. Re:I"d hate to be the guy by Anonymous Coward · · Score: 0

      Anymoron can use a computer, if you need things like
      Control and change, like Hope and change, put an Oxymoron in charge.

  22. I told them by Anonymous Coward · · Score: 0

    I told them to use TRIM, but noooo!

  23. Rebooting? by david999 · · Score: 0

    If it is rebooting all the time may I suggest do not use Windows as the OS : )

  24. Re:If there is a problem and need to call "support by Noah+Haders · · Score: 0

    A tech support guy from India helped me with my licensing problem. He was very nice and efficient and solved it right away. No complaints.

  25. It worked on Spirit by lemur3 · · Score: 4, Interesting

    they had to do this type of thing on spirit shortly after it arrived on mars..

    read more here: http://trs-new.jpl.nasa.gov/ds...

    or the PDF linked therin here http://trs-new.jpl.nasa.gov/ds...

    its got all sorts of awesome details.

    We commanded a shutdown, which terminated the
    current communication window, and the loss of signal occurred at the predicted time. Fifty minutes later, we commanded a beep at 7.8125 bps to alert us if the shutdown command did not work, and much to our disappointment, the beep was received!

    really a fun read. ..im guessing theyll be doing a lot of similar stuff

  26. Launch 2 rovers by Anonymous Coward · · Score: 0

    What would be the marginal cost of replicating the rover and launching a second rover at the same time? They could do twice as many experiments and explore two different areas.

  27. Protip by Anonymous Coward · · Score: 0

    Reformatting your flash will not solve the problem of cell wearout.

    What idiots used flash on a Mars mission anyway? We're talking about a multi-million dollar rover with a pretty much "sky's the limit" budget... and they use flash.

    1. Re: Protip by Anonymous Coward · · Score: 1

      They should have used what genius?

  28. Lucky for us not unionized work by Anonymous Coward · · Score: 0

    Yeah Yeah I'll go and fix that!

    But I am claiming 53 cents per kilometers as per stipulated in my collective agreement.

    I am also taking food and other expenses too.

  29. Deploy the Paperclip! by apraetor · · Score: 1

    I'm picturing something akin to those Shuttle missions to repair flaws in the Hubble telescope's optics, except involving a NASA-engineered paperclip.

  30. stressful job by lkernan · · Score: 1

    Man, hope they don't select the wrong partition.....

  31. Capacitors by njhunter · · Score: 1

    I'm sure they didn't get any of their capacitors from that bad batch a few years past.

  32. ROM by Anonymous Coward · · Score: 0

    Perhaps for the next rover - they can put the most widely use code in ROM the old fashion ROM (burn mask/fuse)
    and last for 100 years kind - not the EEROM, EROM(uv erase).

  33. Sort of like ... by PPH · · Score: 1

    ... handing over remote desktop access to tech support in Bangalore.

    Now if only we could get a Martian to IM during the process: "Yes. The little red LED is blinking ....."

    --
    Have gnu, will travel.
  34. sad by electrosoccertux · · Score: 1

    can parent be modded funny not insightful? Insightful is too depressing...

    Do unto others...

  35. 2014-- year of Linux on Mars? by electrosoccertux · · Score: 1

    ???

  36. anything starting with "why didn't they just..." by electrosoccertux · · Score: 2

    shoot the asker?

  37. Call Steve Gibson by Anonymous Coward · · Score: 0

    I wonder if Spinrite would do the job.

  38. Re:If there is a problem and need to call "support by sillybilly · · Score: 1

    Sometimes when I sound mocking, ironic and sarcastic, I'm actually serious, as in ironic-ironic, or sarcastic-sarcastic. A lot of Americans simply smack the phone down on Indian tech support, saying gimme somebody who speaks English. I patiently listen to them struggle through it.

  39. Ob IT Crowd by Anonymous Coward · · Score: 0

    Have you tried turning it off, and turning it back on again?

  40. Re:If there is a problem and need to call "support by Noah+Haders · · Score: 1

    that makes no sense.

  41. Re:If there is a problem and need to call "support by Anonymous Coward · · Score: 0

    Are rovers compatible with Team Speak?

  42. Re:If there is a problem and need to call "support by Anonymous Coward · · Score: 0

    That's true of 99.9% of sillybilly's comments. We haven't figured out if he's off his meds, retired and bored, or perhaps he's a little tinged w/ Asperger's (or perhaps all 3?), but he doesn't seem to be endangering himself or anyone else here, so kinda just let him rant and bumble along.

  43. No problem by Anonymous Coward · · Score: 0

    On the Comcast scale, 1 mile is 100 million miles because of all the hard work those poor guys put in. Since Comcast customer support (sometimes) operates over a distance longer than 1.25 miles, I think humanity has it in itself to be able to fix such distant objects.

  44. It's not exactly brain surgery by Anonymous Coward · · Score: 0
  45. The Real Rover Problem Explained by magicandjewel · · Score: 1

    Flash memory isn't the Rover's problem. It's still running XP and there are no more hot fixes. At this point the Rover's system has massive "bit rot," not to mention that it's been hacked countless times by the Chinese. Undeterred by this seemingly insurmountable problem, Microsoft has donated a Windows Phone for communications back to earth and a Surface Pro to power the Rover "because it's just like a computer." They didn't say just who's going to operate their touch-only interfaces. It all makes perfect sense because nobody in their right mind buys those things down on earth. Thus, new markets like Mars are vital to both products' successes. You might wonder how they will get into space. Microsoft has also kept mum on that, but the word is that there is still so much gas leftover from the Ballmer era that achieving liftoff is a trivial undertaking. -- Cary R., Microsoft Senior Technical Writer (ret.)

  46. Hmmm... by Anonymous Coward · · Score: 0

    Why don't they just download more RAM?