I set up this Linux 'Watchdog' and now my system auto-reboots when it locks up

· Source: News and Advice on the World's Latest Innovations | ZDNET · Field: Technology & Digital — Cloud Computing & IT Infrastructure · Depth: Novice, short

Summary

The "Watchdog" application offers an automated solution for rebooting Linux systems that experience lock-ups, proving beneficial for both servers and remote desktops. This software operates by using a kernel module, "softdog," to establish a virtual device (`/dev/watchdog`); if this device is not "kicked" by a running process within a specified timeframe, the system automatically reboots. Installation is simple across various Linux distributions like Ubuntu, Fedora, and Arch, requiring package installation, loading the `softdog` kernel module, and configuring parameters in `/etc/watchdog.conf` such as `watchdog-timeout`. For systems equipped with specialty hardware, a more robust hardware "Watchdog" can be configured by modifying `systemd` settings within `/etc/systemd/system.conf` to ensure continuous system availability.

Key takeaway

The Linux Watchdog service automates system reboots when a machine locks up, ensuring continuous availability for critical servers and remote desktops. It operates by monitoring system responsiveness via a `softdog` kernel module, triggering a reboot if a configurable timeout (e.g., 20 seconds) is exceeded. This free, easily configurable solution prevents manual intervention and enhances system resilience for administrators.

Topics

Best for: IT Professional, DevOps Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by News and Advice on the World's Latest Innovations | ZDNET.