Spirit Sends Debug Information to Earth

ISDN to mars by UnderAttack · 2004-01-24 03:47 · Score: 5, Funny

128 kBits/sec! Quite a bit up from the ealire 100Bit/sec. Too bad Mars is too far from the next CO to qualify for DSL

(first post?)

--
---- join dshield.org Distributed Intrusion Detec

Re:ISDN to mars by tomstdenis · 2004-01-24 03:48 · Score: 5, Insightful

Most likely it's not a protocol that involves a lot of ACK'ing [e.g. huge packets with FECs]

Tom

--
Someday, I'll have a real sig.
Re:ISDN to mars by Guppy06 · 2004-01-24 04:59 · Score: 4, Funny

"Too bad Mars is too far from the next CO to qualify for DSL"

I guarantee you Mars will have DSL before I do.

You know what they say by Smallpond · 2004-01-24 03:47 · Score: 4, Interesting

A diagnostic is what runs when nothing else will.

The data rate is pretty good... by JessLeah · 2004-01-24 03:48 · Score: 5, Funny

...but the ping times suck. Can you imagine playing Quake over that kind of link?

--
Honey, I shrunk the Cygwin

Re:The data rate is pretty good... by Anonymous Coward · 2004-01-24 04:17 · Score: 4, Funny

[jplbob@nasa ~]$ ping spirit
PING spirit (192.168.1.10): 56 data bytes
64 bytes from 192.168.1.10: icmp_seq=0 ttl=237 time=960125 ms
64 bytes from 192.168.1.10: icmp_seq=1 ttl=237 time=961019 ms
64 bytes from 192.168.1.10: icmp_seq=2 ttl=237 time=960843 ms
64 bytes from 192.168.1.10: icmp_seq=3 ttl=237 time=959980 ms
64 bytes from 192.168.1.10: icmp_seq=4 ttl=237 time=960333 ms

--- spirit ping statistics ---
5 packets transmitted, 5 packets received, 0% packet loss
round-trip min/avg/max = 959980/960460/961019 ms
Re:The data rate is pretty good... by feidaykin · 2004-01-24 06:40 · Score: 5, Funny

...but the ping times suck. Can you imagine playing Quake over that kind of link?
Ping matters not to a true master such as myself. You would just have to wait ten minutes to find out that I owned your sorry ass. ;)

--
"To confine our attention to terrestrial matters would be to limit the human spirit." -Stephen Hawking

Wow by mbadolato · 2004-01-24 03:49 · Score: 5, Funny

Spirit sent 73 megabits at a rate of 128 kilobits per second.

Pretty damn scary that that's faster then most pr0n download's via Kazza... :)

Can low-power corrupt memory? by corebreech · 2004-01-24 03:49 · Score: 5, Interesting

I watched the press conference on NASA-TV and they talked about how the thing wouldn't go to sleep at night and so it got me to wondering about the low power question. Obviously they have the rover power off when power gets to a certain level, but what if that level is slightly off?

In other words, if the onboard CPU has enough power and continues to run but the memory doesn't have enough power, doesn't that cause all kinds of wackiness?

They keep talking about the data pointing to simultaneous faults... well, as programmers we know these are the very worst kinds of bugs to deal with, but with something as (I'm assuming) well written as their code, so doesn't that point to a memory problem? I mean, the think is working flat-out beautifully one moment, and then the next moment it goes tits up.

The other question I had concerned this motor they had turned on but which didn't complete its sequence. When they command the motor to do something, do they tell it to run for some interval of time, or do they tell it to achieve a specific position? I was thinking that if it's the latter, and then if it gets stuck somehow, this could create the low power situation as the motor just grinds away.

--
Is this truly the only Earth I can live on?

Re:Can low-power corrupt memory? by APDent · 2004-01-24 05:30 · Score: 5, Funny

using IANANE like it's an established acronym just makes you look stupid.

Or inane.
Re:Can low-power corrupt memory? by ultrasound · 2004-01-24 05:47 · Score: 5, Interesting

Generally there are low-voltage detection circuits inside and/or connected to the microprocessor that detect that power is fading, and wrap things up, terminating any writes in an orderly fashion if possible. Generally any power-down is going to be very slow (orders of 10s to 100s of milli-seconds or more) because of capacitor storage in the power supply. The LV device gives sufficient notice that power is fading so that the remaining processor time is more than ample to shut things down gracefully.

Obviously with volatile RAM without battery backup we shouldn't need to care about the state of the RAM on power-down as it is only temporary storage and will be re-initialised on power-up. Generally the storage components will have wider operating tolerances than the microprocessor so it is very unlikely that the RAM will get corrupted during the powerdown proceedure.

With non-volatile hardware such as battery backed RAM, flash, eeprom, fram etc we have a problem because these contain NV config data and firmware that must be consistent. And with some such as FLASH the write times can be very long, may be longer than the power-down time. In this case the general philosophy is to write the bytes, and the very last step is to update the checksum and set a valid data flag. Which means at worst the device boots up and knows its got some dodgy code or data on its hands, and hopefully handles it in a graceful fashion.

With something like the Spirit I would guess that some form of multiple redundancy is used so that there are multiple firmware images, with a switchable bootloader so that a new image or dataset can be uploaded to an area that is offline, and only once all of the checksums/message hashes are confirmed is the switch made. And hardware watchdogs are running so that if the worst happens and it hangs it can always boot an alternate image. I would also expect a backup OTP PROM image that is guaranteed never to change and known to work.
Re:Can low-power corrupt memory? by AndroidCat · 2004-01-24 06:33 · Score: 4, Funny

And with some such as FLASH the write times can be very long
Ahh! So you're saying that while the Spirit is willing, the flash is weak?

--
One line blog. I hear that they're called Twitters now.

Spirit rebooting 60 times a day by G4from128k · 2004-01-24 03:50 · Score: 5, Informative

CNN is reporting that spirit is self-rebooting 60 times a day. NASA suspects a hardware fault that is causing the processor to detect trouble and automatically reboot.

--
Two wrongs don't make a right, but three lefts do.

Re:Spirit rebooting 60 times a day by pongo000 · 2004-01-24 05:07 · Score: 4, Interesting

Something like 2/3 of NASA's recent missions have failed in some way or another. Is it quite possible that NASA engineers simply have not mastered the art and science of designing hardware and software operable in the harshest of environments?

In some ways, there is an air of arrogance in everything NASA does, from their press conferences to their marketing agreements. We have dead shuttle astronauts being transformed into "national heroes," even though their demise wasn't the result of any heroic sacrifices on their part, but rather a materials and systems failure scenario that NASA failed to handle properly. We have Spirit as the "little train that could," sending back waves of photographs of rocks that NASA engineers have actually named. Does the naming of rocks somehow bring NASA's mission closer to the unwashed masses who relate better to Beanie Babies than to the stark facts of reality?

Harsh as it sounds, NASA is reaping what they sow: A string of hardware and software failures that is serving as a backdrop to newly-mandated initiatives by Bush to send miners to the moon and astronauts to Mars. Yet NASA can't even seem to get a remote-control buggy to work correctly. The mind just reels at the catastrophes that await us between now and 2015 should NASA continue down this road of inept management and hardware/software designs insufficiently tested against the harsh envrions of space. As geeks, we owe it not only to ourselves but to the non-geek public to recognize these failures as serious shortcomings in the NASA culture. We must resist the temptation to blindly set NASA on a pedestal in the name of scientific achievement without first critically analyzing their failures.
Re:Spirit rebooting 60 times a day by cosmo7 · 2004-01-24 06:47 · Score: 4, Informative

Mars is still Microsoft-free.

Spirit uses Wind River's RTOS, VxWorks. The main computer (the Rover Electronics Module) uses a 20MHz 32-bit Rad 6000 CPU, a radiation-hardened PowerPC variant manufactured by BAe in England. The computer has 128M of ECC RAM and a 3M EEPROM. It connects with hardware via a Versa Module Europa (VME) bus.

The software was compiled with a compiler from Green Hills' MULTI development environment, but the developers coded using the Wind River Tornado IDE.
Re:Spirit rebooting 60 times a day by spaceyhackerlady · 2004-01-24 07:50 · Score: 5, Informative

Something like 2/3 of NASA's recent missions have failed in some way or another. Is it quite possible that NASA engineers simply have not mastered the art and science of designing hardware and software operable in the harshest of environments?

Maybe they have. That's how they know how difficult a task it is to get it right.
I am something of an aerospace engineer, and work professionally with real-time systems (based on VxWorks - fancy that!). Let me illustrate the kind of bizarre bug that can happen on a spacecraft, and how it was fixed from the ground.
Consider a satellite with a simple on-board computer. To guard against the OS locking up (no matter how good the software is, you can't protect against radiation-induced bit flips in memory), it has a hardware watchdog timer. The software resets the timer periodically, before the hardware can reboot the system. Things run well for a while.
Then the on-board system starts resetting for no apparent reason. No suggestion of memory problems, no apparent hardware problems. The problem is traced to a radiation-induced change in component values in the watchdog timer, causing the timer to go off sooner than expected. Until the satellite is finally turned down a few years later, an important task of the ground stations was checking for watchdog resets and adjusting the software watchdog task accordingly. When the software eventually spent all its time resetting the watchdog timer, the satellite could no longer function and was turned down.
The moral of the story: space is weird and hostile. Things happen. No matter how hard you try, you cannot always get it right.
...laura

CNN article by pvt_medic · 2004-01-24 03:50 · Score: 4, Interesting

Cnn has an article on some updates. Apparently the engineers been having all sorts of fun with the thing here a quick excert. "Cautioning that they will need more time to understand what went wrong, project engineers said they have determined that Spirit has rebooted or tried to reboot itself more than 60 times a day since the failure."

--
30% Troll, 50% Underrated, 10% Interesting
Score:5, Troll

Re:CNN article by AbbyNormal · 2004-01-24 04:54 · Score: 4, Funny

I understand NASA was in a tight budget situation, but overclocking a Martian Planet robot was probably not such a good idea.

--
Sig it.

It wasn't exactly 'filler' by Eevee · 2004-01-24 03:50 · Score: 5, Informative

Only a couple of frames were fillers of random values. Most of the frames were engineering data. No actual scientific data came down, though.

Still, it's a good sign that it's still able to talk.

Re:Linux Cost Tax Payers at least $410M...nothing by TheGrayArea · 2004-01-24 03:53 · Score: 5, Informative

You might want to check your facts before you spew. While the ground system is heavy on Linux according to the article you referenced, the actual OS on the rover itself is VxWorks from Wind River.
http://www.windriver.com/news/press/20040105.html

--

This space for rent.

Who to blame it on... by darth_silliarse · 2004-01-24 03:54 · Score: 5, Funny

Spirit Sends Debug Information to Earth

A Fatal Exception 0E has occurred at 0028:C0231810 in VXD VMM(0D) + 00001810

Cool!

--
I've noticed that everyone who is for abortion has already been born - Ronald Reagan

128 Kilobits by mcleodnine · 2004-01-24 03:56 · Score: 5, Funny

128 kbps over 35 million miles... looks like we'll need another benchmark to replace the station wagon full of DAT tapes

--
one better than mcleodeight

Little Green Men by BenBenBen · 2004-01-24 04:03 · Score: 4, Funny

Did the "filler data" look anything like this?

--
The Slashdot Paradox: "100% Overrated"

I wonder by skinfitz · 2004-01-24 04:04 · Score: 4, Funny

Doesn't Spirit's twin, Opportunity, start it's landing tomorrow?

It's probably some bizarre licensing issue for the OS causing it to shut down as it's detected that NASA are trying to run two copies at the same time.

Kind of like Beagle 2's problems caused by the transmissions being intercepted by the RIAA as they file a lawsuit against Colin Pillinger for offering illegal music downloads from Mars.

The cause has been found by Smallpond · 2004-01-24 04:04 · Score: 4, Funny

Fortunately, the cause of the blackout has been located and will be corrected soon.

Cause of Spirit problem known! by Jacco+de+Leeuw · 2004-01-24 04:06 · Score: 4, Funny

Pesky Martians! :-)

--
-------
Warning: Slashdot may contain traces of nuts.

Silly Spammers by pardasaniman · 2004-01-24 04:07 · Score: 5, Funny

It appears that while editing the crontab of the rover to send spam, the script-kiddie accidentally added a shutdown -r 24m . "Having the rover send spam was a great idea! When people ping the X-Originating IP, they'll surely timeout!!"

mars dvd message by xk · 2004-01-24 04:09 · Score: 5, Interesting

Has anyone cracked this yet?

-bk.

CONTENTS OF MESSAGE by tjstork · 2004-01-24 04:12 · Score: 4, Funny

"I can't swim.. I CaN'T sWim ... I cannot swim... I can't swim.. I can't swim.. I can't swim.. I can't swim.. I can't swim.. I can't swim.. sdf@#$@#$@#$

--
This is my sig.

NASA was trying to hide illegal mp3's on mars by Anonymous Coward · 2004-01-24 04:15 · Score: 5, Funny

rover: 128kbps
most mp3's: 128kbps
COINCIDENCE?
i think not.

Re:NASA was trying to hide illegal mp3's on mars by EpsCylonB · 2004-01-24 04:23 · Score: 4, Funny

Nasa wants us all dead!
Nasa sent up monkeys. Are they all accounted for?
Nasa sent up robots. Where are they now?

"We can defeat the monkeys. We can defeat the robots.
BUT NOT AT THE SAME TIME!!!"
- Lewis Black

I, for one, welcome our... by Anonymous Coward · 2004-01-24 04:18 · Score: 5, Funny

...European, constantly rebooting, battery draining overlords. Now we know Beagle 2 was not lost but was in transit to Gusev crater. It took a little time to silently creep up behind spirit. If we had a high-enough resolution camera we would see that damn dog continuously poking at the rover, pressing our reset button.

Cheers to the European engineers who caught us with our pants downs and jeers to the American engineers who thought our little rover needed an external reset button for some reason.

Re:Linux Cost Tax Payers at least $410M...nothing by sir_cello · 2004-01-24 04:23 · Score: 4, Informative

Some of us Engineers work with RTOS all the time, not just for fun-and-dandy projects, for for multi-million dollar outcomes. Consensus is that Linux is not good enough. QNX, VRTX, VxWorks etc are still the preferred choices, but everyone admits that Linux is getting there. Most of us don't hang out on slashdot, yet many Linux zealots do: you don't get a good opinion here.

New superlative: by Snork+Asaurus · 2004-01-24 04:27 · Score: 4, Funny

Extreme Remote Debugging

--
Sigs are bad for your health.

It's all done for the TV ratings by roman_mir · 2004-01-24 04:40 · Score: 5, Funny

you remember, the Apollo 13, the one with Tom Hanks? Where the austronauts believe that their transmission is watched by the viewers on Earth but in fact all TV networks refused the transmission, stating that NASA made flights to the Moon as exciting as trips to Pittsburgh (or something of this kind)?

This is what is happenning people, the new in reality TV - our own Mars Rover - The Ultimate Survivor. The Opportunity will be landing today, so the audience should be able to vote for which rover is going to be kicked out of the show.

The Drama, The Excitement, The Unknonw, The Sex... oh, wait!

--
You can't handle the truth.

Re:400 million and only one CPU by 0x0d0a · 2004-01-24 04:41 · Score: 5, Interesting

Nasa systems that involve human life are highly redundant. I remember a lecture by a NASA engineer about systems on the Shuttle. There are *seven* redundant computers which calculate data. That data requires identical answers from four to be accepted.

On Spirit, power is an issue. More CPUs == more power drain.

Furthermore, I remember the folks initially speculating that something was wrong with the power system. I stopped following it, but it said that this transmission was composed of power subsystem diagnostic data. Could be it's a response requested earlier that it didn't have enough juice to send, in which case more CPUs would have actually exacerbated the problem. :-)

--
May we never see th

Re:400 million and only one CPU by Hollinger · 2004-01-24 04:47 · Score: 5, Insightful

Well, you know, what's interesting about that is:
1. you'd have to increase the complexity of the device even more, exposing it to a higher risk of failure statistically
2. you'd need more complicated software and hardware that would require more time and effort (money & delays)
3. the hardware would need more power (limited batteries and solar panel capacity)
4. the system would be heavier and bigger (costs are measured in grams, iirc).

While you have a valid point, the constraints of this design give very strong tradeoffs among safety, feasability, and cash flow (and I'm sure there are others, but I'm not a rocket scientist). I'd imagine that some time was spent on redundant systems, but the adage of "Why have one when you can have two at twice the price?" only works when your budget can support the extra price of man-hours and cash.

I'd argue that where you work has unlimited available power, and if you need more, you can ask your power company for more. You have the money to spend on a X-thousand-dollar sever that's been pre-fabbed by whatever company you like. If you need more, you get more drop-shipped to you within days. NASA had to build these little buggers from the ground up.

<RANT>
You know, if you take your philosophy of simply duplicating the entire machine, there is a backup. It's called "Opportunity." It lands tomorrow.

I highly resent the fact that you've called some of the greatest engineers of our time "retarded." If you can't understand the problem (I certainly don't, but I do understand the concept of tradeoffs in design) you have no right to speak on the issue. Of course, this is slashdot. Everyone can mouth off about everything. Nevermind.
</RANT>

~MCH.

--
Michael C. Hollinger

Re:Suspiciously good pics of landing site from orb by AndroidCat · 2004-01-24 05:01 · Score: 5, Interesting

They're trying a new technique. From this article:

The MOC image of the Spirit lander and its landing site was acquired using a new technique that was pioneered by the MGS project in 2003. Called "cPROTO" (for Pitch and Roll Only Targeted Observation with planetary motion compensation), the approach allows MOC, which normally takes pictures 1.5 meters (5 feet) per pixel to 12 meters (40 feet) per pixel, to acquire images with a higher resolution. By pitching the MGS spacecraft at a rate faster than it orbits around Mars, and moving it in a way that compensates for the rotation of the planet, MOC is able to obtain images with a down-track resolution of about 50 cm/pixel (~20 inches/pixel), although the cross-track resolution remains ~1.5 m/pixel (5 ft/pixel). These images have a better signal-to-noise ratio than typical 1.5 m/pixel MOC images, as well. This technique allows the lander and other details not normally visible in a full-resolution MOC image to be seen.

--
One line blog. I hear that they're called Twitters now.

Fill Data by Detritus · 2004-01-24 05:16 · Score: 4, Informative

Fill data is typically transmitted when the telemetry multiplexer does not have any engineering or science data to send. Due to the way synchronous communications links work, something is always being transmitted, even if there is no "real data" available.

--
Mea navis aericumbens anguillis abundat

Re:Wind river by AaronW · 2004-01-24 05:22 · Score: 5, Informative

I wouldn't brag. I've been programming VxWorks for several years now and all I can say is it's a piece of crap for a complex system.

VxWorks does not provide any memory protection (well, AE does, but it's so buggy nobody uses it).

If a task dies, it does not clean up after it. All memory is global, i.e. any task can overwrite memory for any other task.

Wind River couldn't even implement a decent malloc implementation. I had to replace it with Doug Lea's DLMalloc code (which glibc's malloc is based off of). It fragments horribly, and becomes increasingly slower the more free blocks exist.

Just by replacing malloc, I brought the time down on our box from 50 minutes to under 3 minutes and went from tens of thousands of fragments to a couple of dozen.

If you want a reliable embedded system with a lot of complexity, go with QNX or perhapse a good embedded Linux (I like Timesys Linux myself - good realtime support).

At least with QNX if there's a problem in a task, it's much easier to isolate it and not kill the entire system. As it is on the product I'm working on, if a task dies about the only way to recover is to reboot. Also, VxWorks has piss-poor built-in debugging support. Sometimes you can get a stack trace. Tracing the heap is virtually impossible (and because it's a global memory pool, you don't even know what blocks were allocated by what task or even how much memory each task has allocated). In the product I'm working on I added such support to find memory leaks and detect memory corruption.

VxWorks AE does provide memory protection. We tried to use it, but it was so buggy and slow we had to drop it and go back to standard VxWorks.

VxWorks hasn't really changed in the last few years and Wind River is losing customers like crazy to the better alternatives. They're hemmoraging money at an astronomical rate and quickly losing market share to the likes of QNX and Linux.

Even the realtime performance of VxWorks isn't that great. The finest granularity for a reliable timer is 1/2 the system tick rate (often no more than 20ms resolution).

VxWorks doesn't have a shell as such either. The commands you type in are functions with parameters to those functions. You can do things like my_global = global_a + 7

or

my_func(&my_global, 3)

on the command line, but it's not at all like a traditional command line.

Most real-time Linux implementations arn't all that great either from my research into it. Most don't deal with priority inversion, or require a completely separate set of APIs for RT tasks (i.e. RT Linux). I found Timesys Linux to solve most of these issues and it looks like our next generation will be based off of either Timesys Linux or QNX.

-Aaron

--
This post is encrypted twice with ROT-13. Documenting or attempting to crack this encryption is illegal.

Re:should NASA let Wind River write the code? by Detritus · 2004-01-24 05:23 · Score: 5, Insightful

If there is a COTS (commercial off-the-shelf) real-time operating system available that meets the system requirements, why go to the risk and expense of writing your own from scratch? Do you expect NASA to fabricate every component in the spacecraft?

--
Mea navis aericumbens anguillis abundat

Improving NASA: Get-it-right vs. get experience by G4from128k · 2004-01-24 05:45 · Score: 4, Insightful

Is it quite possible that NASA engineers simply have not mastered the art and science of designing hardware and software operable in the harshest of environments?

While I would never claim that NASA is perfect, I think you underestimate the both the engineering challenge of putting a rover on Mars and the impact of more conservative, get-it-right, policies.

Interplanetary missions are the hardest of all because the engineers never get to actually test the whole device under realistic conditions. Although they can test and analyze each subsystem under a variety of simulated or near-realistic conditions, they have no way of building a test rover, putting it in interplanetary space of months, having is aerobrake into a thin atmosphere, parachute in a thin atmosphere, and crashland at high speed, and then operate all its mechanical parts under dusty low G conditions.

Second, get-it-right == conservatism == greater cost == fewer missions == less experience. The last thing NASA should do is spend more money, take more time, and do fewer missions. The only way we will really learn how to operate in space is to go into space. I'm not saying that better engineering won't help, only that more experience (unfettered by excessive conservatism) is a crucial part of learning to operate on other planets.

--
Two wrongs don't make a right, but three lefts do.

Re:Improving NASA: Get-it-right vs. get experience by Jeremi · 2004-01-24 06:15 · Score: 4, Insightful

given the vastness of time in a cosmological sense, shouldn't NASA be considering 100-year or 1000-year timetables?

That would be ideal, but keep in mind that NASA is funded by Congress, an entity that changes its mind about everything every 2-8 years. Any NASA program that takes too long to complete is very likely to be cancelled halfway through, wasting 100% of the resources that were put into it.

--

I don't care if it's 90,000 hectares. That lake was not my doing.
Re:Improving NASA: Get-it-right vs. get experience by AllUsernamesAreGone · 2004-01-24 06:34 · Score: 4, Insightful

It seems ludicrous to me that NASA is on a 15-year time table...given the vastness of time in a cosmological sense, shouldn't NASA be considering 100-year or 1000-year timetables?

Unfortunately, if you want to look at things on the scale of cosmological time, we don't even exist. Human beings have been around for a blink of the eye of the universe, and unless we get our backsides off this damp ball of rock as soon as possible there's every chance that within another blink, we won't exist anymore. Between climate change (not even human-caused - the "comfortable" Earth we know is just a fleeting hospitible break between the planet's normal fire and ice), potential self-destruction, impact events and a dozen other risks, our continued persistence in keeping all our eggs in one basket is nothing short of asking for annihilation. How many other "intelligent" species would sit there and watch as enough rock and ice to wipe out life plunges into a planet that is, comparitively, just next door and do nothing? We did when Shoemaker-Levy 9 hit Jupiter. My only comfort is that, should the human race be wiped out while confined to Earth by its own lack of vision and sense, it'll be a service to galactic evolution.

sure would be nice.. by maelstrom · 2004-01-24 06:02 · Score: 4, Insightful

To have some actual technical discussion on a site that is supposed to be filled with nerds, instead of the same tired jokes about martians.

--
The more you know, the less you understand.

Yes, but there are other possibilities by dtmos · 2004-01-24 06:36 · Score: 5, Insightful

Since Spirit is rebooting sixty times per day, a problem that started when an electric motor moving its spectrometer "conked out", one thinks first of a hardware failure, possibly leading to software corruption.

I don't know the boot sequence of Spirit, but in most battery-powered embedded systems with which I am familiar, an elaborate state machine design is made to ensure that, when the boot sequence is complete, the system has sufficient power to perform any task that may be requested of it. Since the power supply is limited, an unexpectedly heavy load on the primary supply could cause the supply voltage to the microcomputer to fall below its specified lower limit, leading to a system reset.

Now imagine that there is a hardware failure associated with some process that runs during the boot sequence--a voltage regulator turn-on, a heating system initialization, an electric motor activation, whatever--that results in excessive current drain. When this part of the boot sequence is reached, the supply voltage falls, and the microcomputer resets. This disables the problem-causing hardware, unloading the power supply. When the supply voltage recovers, the microcomputer reboots (either automatically, with a power-on reset, via a watchdog timer, or via some other means) and, when the critical part of the boot sequence is reached, the supply voltage falls again. The system is now in a continuous loop, in which it can remain indefinitely. (Or at least 60 times per day....)

Note that this situation can also arise due to a defect in the power supply--if the output impedance of the power supply has risen for some reason, its output voltage under lightly loaded conditions can be acceptable, but it may not be able to supply heavier loads.

One expects the Spirit power supply to be complex, with separate regulators for the microcomputer, radio transceiver, and electric motors, so looking for common circuits and systems would be the first thing to do when troubleshooting for this type of failure. Looking for system conditions that can cause a system reset would be another; the JPL people have lived with their systems for years now, and would have had many design reviews to identify possible system failure scenarios--I'm not telling them anything new here. I understand that the system telemetry received yesterday indicates that the power supply is within specification, so that seems to eliminate that possiblility.

The second alternative is a soft memory failure of some kind, either caused by a supply failure as the parent suggests or perhaps by a radiation event of some kind.

Note that these problems can be multi-disciplinary; for example, the problem could be caused by some vibration when a motor runs that loosens a broken connection created by a chemical reaction to something on the surface (to take an extreme example).

Re:No BSOD Jokes, Please by Anonymous Coward · 2004-01-24 06:49 · Score: 4, Funny

How about red screen of death jokes?

Closed source project... by jasno · 2004-01-24 06:59 · Score: 4, Interesting

Is there any reason the code, schematics and CAD designs aren't available for public viewing? Its a publicly funded project, and I don't think JPL has to worry about trade secrets.

If JPL would give us more information, I bet they'd have 50% of the entire engineering brainpower on the planet checking for races, inversions, memory leaks, hardware design flaws, etc.

If there was ever a project that could benefit from so many eyeballs, its space exploration. There are thousands of some of the most talented engineers on the planet who would jump at the chance to contribute to something like this.

--

http://www.masturbateforpeace.com/

Re:got some useful data by SnowZero · 2004-01-24 07:29 · Score: 4, Informative

On Earth at least, picking bits off of radio links usually involves an adaptive threshold and a clock that syncs to the clock of the sender. Sending too many 1's or 0's in a row can interfere with that because there aren't any "bit edges" on the signal. Sending random data ensures all patterns are equally likely and your adaptive filter stays happy for when you have real data to send. Otherwise you'll miss the first part while you re-establish the threshold and sync to the signal.

My guess is the NASA rover's link follows a similar principle, though its probably using some pretty damn fancy techniques to get the data from that far. Oh and missing the first part of the data would really suck for them since a retransmit would take 20 minutes.

Re:No BSOD Jokes, Please by wash23 · 2004-01-24 08:08 · Score: 4, Interesting

You know, it occurs to me that maybe instead of having an interactive rover with a billion complicated subsystems and spectrometers and cameras... it might be a good idea to launch a package full of smaller autonomous devices carrying different instrumentation... So you'd have a base that lands on mars, opens up (like the rover bases do) and releases 20 or 30 "dumb robots" on treads or big balloon tires(I'm thinking each the size of a big R/C car), some of which would have cameras, the rest instrumentation of whatever sort.. All of the little slaves would move around randomly or according to some simple program (either mechanical or software) and relay collected information to the base, which would transmit it to earth... Some of the camera bots would be designed to just move as far as possible and take as many pictures as possible... others would just do instrumental analyses of whatever they happen to bump into or land on... You wouldn't know exactly what the instruments were looking at but you'd probably be able to collect a sizable amount of data on a particular landing region; know what minerals are present, etc. You wouldn't know that pyramid shaped rock 12B contains olivine but you'd know olivine was present.

Slashdot Mirror

Spirit Sends Debug Information to Earth

50 of 477 comments (clear)