This is an old revision of the document!
Nagios plugins
check_disk
Check the amount of disk space available.
Warning on 20% remaining, critical on 10% remaining, looking at path “/”:
check_disk -w 20% -c 10% -p /
check_load
Check the current system load average. The load average format is the same used by “uptime” and “w”.
- -w WLOAD1,WLOAD5,WLOAD15 - Exit with WARNING status if load average exceeds WLOADn
- -c CLOAD1,CLOAD5,CLOAD15 - Exit with CRITICAL status if load average exceed CLOADn
- -r - Divide the load averages by the number of CPUs (when possible)
check_load -r -w 2,1.5,1 -c 4,3,2
check_procs
Check for running processes.
Some available arguments:
- -w [num]:[num] - warning for value in range
- -c [num]:[num] - critical warning for value in range
- -u <uid> - check for user ID
- -a <argument> - only check for processes with specified arguments
- -C <command> - only check for exact match of command, without path
OpenSSH
SSHD will have one proc for the master daemon, and one for each user login as well. So setting a minimal range of 1 will check if it is running at all:
check_procs -c 1: -u root -a /usr/sbin/sshd
check_swap
Check swap space.
Warn on 20% available remaining, critical on 10%:
check_swap -w 20% -c 10%