Out-of-memory problem caused Mars rover's glitch
The rover systems are again working on the Martian surface
Computerworld - A shortage of memory on board the Spirit Mars rover is what caused it to become unresponsive on the Martian surface on Jan. 22, raising fears that the Martian mission might end almost before it began in earnest.
Mike Deliman, a technical staff member at Wind River Systems Inc., which provided the real-time embedded operating system used in the mission, said the problem has been re-created in testing on Earth and appears to be entirely memory-related.
"It's not a software bug, it's not an application bug, and it's not a hardware bug," Deliman said. "It's a system constraint that we ran up against."
The Spirit rover dedicates 32MB of its 128MB of RAM to the onboard Wind River VxWorks operating system and a host of science applications. As the mission progresses, technicians are scheduled to periodically delete old files and directories to free that memory for reuse, he said.
But amid the excitement after the Mars landing on Jan. 3, and with the rover returning data to Earth, mission technicians did not perform that step quickly enough.
"We just ran out of memory, ran out of RAM," Deliman said. "This is why we initially lost contact" with the rover. The six-wheeled vehicle runs hundreds of tasks simultaneously in normal operations, with each operation using its own chunk of RAM, he said.
The VxWorks operating system was embedded in a specially prepared, radiation-hardened 20-MHz PowerPC CPU installed on each of the rovers, along with 128MB of RAM. The hardware was cutting-edge when it was chosen in the mid-1990s, but it then had to be treated to ensure its reliability in the radiation of deep space -- a process that takes five to 10 years.
"It's like having an old Windows machine that has a very little bit of disk space [remaining]," Deliman said. "When you run up against the end of your disk, if you don't clean it up, your system becomes unstable."
For about a week, scientists worked to figure out why the rover wasn't responding to commands from Earth and feared that a hardware problem could halt the rover in its tracks.
Technicians were eventually able to correct the problem when the rover went into a diagnostic mode, Deliman said. Diagnostic commands were beamed up to the machine, and a series of files and folders were deleted from a flash-memory-based file system board, allowing the rover to resume normal operations.
The Spirit rover is now continuing to take photographs on the Martian surface and conducting experiments for NASA. A second Mars rover, Opportunity, landed on Jan. 24 and has also been operating on the surface of the red planet. A minor glitch with a heater that won't shut off on Opportunity's robotic arm is the only problem experienced by that machine so far.