USENIX LISA: Ted T'so Helps with System Crashes and Presents SystemTap

Nov 12, 2008

Linux kernel developer Ted T'so shared his know-how in a number of tutorials at the USENIX LISA conference in San Diego. One theme was getting first help for system crashes, and in the process, he took the opportunity to present SystemTap.

"Don't panic," thus the first advice from T'so in his presentation on hardware crashes. Never spare time with his second advice: "make backups!" T'so himself puts a lot of weight in scripts to secure data, but he also points to open source solutions like Amanda and Sync, or commercial variants like BRU and Arkeia. After presenting a few possible hardware catastrophe scenarios ("Linux hardware can do some pretty creative crashing"), T'so suggests looking for causes at the lowest level. "I apologize" he says, "but I have to admit that Linux error messages aren't among the most sysadmin friendly." His suggestion is to integrate kernel log messages into the search.

Kernel developer Ted Ts'o at LISA USENIX conference

His first solution is to use the image backup as a way to rescue at least what's left. T'so suggests having a good handle on the material to help in resolving problems. One area is understanding hard disk partitions and Logical Volume Manager (LVM), where T'so spent some time in an overview on filesystem monitoring. His example of an effective file recovery system is e2fsck, part of the Linux ext filesystem. His encouraging conclusion was, if you know where the problem emanated and if you take the proper care, most data recovery will be successful.

During his afternoon session, Ted T'so, also representing the Linux Foundation, launched into presenting SystemTap, a new developer tool. The software package targets technical support folks who can use it to tap into Linux kernel data noninvasively. T'so points to its effectiveness even in its current early phases. He cautions against using any of the other tools currently shipped with products, especially the enterprise versions where updates are few and far between.

Detailed information on SystemTap is available in the Sourceware wiki, which includes an FAQ and advice pages. Upcoming versions of SystemTap should include a Script Toolchest, and the wiki contains informtion on it already.

Related content

  • Ladies at LISA
  • USENIX Tips for Admins: Virtualization and HPC Don't Go Together

    At the USENIX LISA conference in San Diego, Lee Damon and Mike Ciavarella presented a foundation workship as a tutorial for admins with tips and tricks for working efficiently. The presentors didn't agree on all points, but one commonality was that they advised against running High Performance Computing (HPC) together with virtualization.

  • USENIX LISA: When 1000 Computers are too Slow

    Open source developer Tobias Oetiker, in his presentation at the USENIX LISA conference in San Diego, described how he could help one thousand Windows users at Swisscom.com speed up their computers.

  • Tracing Tools

    Programs rarely reveal what they are doing in the background, but a few clever tools, of interest to both programmers and administrators, monitor this activity and log system functions.

  • Build your own Distro Intro

    Tools such as Linux From Scratch, NimbleX Live CD Generator, Kiwi image system, and the Ubuntu Customization Kit can help you create your own custom Linux system.

Comments

  • prepositions

    Thank you for the hint, I've corrected it.
  • prepositions

    "SystemTap, a new developer tool **from** the Linux Foundation"

    I don't think so.
comments powered by Disqus
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters

Support Our Work

Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.

Learn More

News