Nagios plugins

check_disk

Check the amount of disk space available.

Warn when less than 20% of the space is free, go critical below 10%, checking the filesystem at path “/”:

check_disk -w 20% -c 10% -p /
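
Like the other plugins on this page, check_disk reports its result through its exit status: 0 for OK, 1 for WARNING, 2 for CRITICAL and 3 for UNKNOWN. A minimal sketch of a wrapper that turns that status into a state name follows; state_name and run_check are hypothetical helper names, not part of the plugins, and treating any other status as UNKNOWN is an assumption of this sketch:

```shell
# Map a plugin exit status to its state name.
# state_name is a hypothetical helper, not part of the plugins.
state_name() {
    case "$1" in
        0) echo OK ;;
        1) echo WARNING ;;
        2) echo CRITICAL ;;
        # Assumption: anything else is reported as UNKNOWN here.
        *) echo UNKNOWN ;;
    esac
}

# Run any plugin command line, pass its output through unchanged,
# then print the state name for the exit status it returned.
run_check() {
    "$@"
    state_name "$?"
}
```

With the plugin directory on the PATH, it could be used as:

run_check check_disk -w 20% -c 10% -p /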

check_load

Check the current system load average. The load average format is the same as that used by “uptime” and “w”.

  • -w WLOAD1,WLOAD5,WLOAD15 - Exit with WARNING status if load average exceeds WLOADn
  • -c CLOAD1,CLOAD5,CLOAD15 - Exit with CRITICAL status if load average exceeds CLOADn
  • -r - Divide the load averages by the number of CPUs (when possible)

check_load -r -w 2,1.5,1 -c 4,3,2
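
To see what -r changes, the arithmetic can be sketched in plain shell. This is an illustration of the idea only, not the plugin's own code; per_cpu_load and exceeds are hypothetical helper names, and the strict "greater than" comparison is an assumption based on the word "exceeds" above:

```shell
# per_cpu_load LOAD NCPU: divide a load average by the CPU count.
per_cpu_load() {
    awk -v load="$1" -v ncpu="$2" 'BEGIN { printf "%.2f\n", load / ncpu }'
}

# exceeds VALUE THRESHOLD: succeed if VALUE is strictly greater than
# THRESHOLD (assumed comparison, see the note above).
exceeds() {
    awk -v v="$1" -v t="$2" 'BEGIN { exit !(v > t) }'
}
```

For example, a 1-minute load of 6 on a 4-CPU host becomes 1.50 per CPU, which stays under the 1-minute warning threshold of 2 used above, whereas the raw value of 6 would exceed even the critical threshold of 4.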

check_procs

Check for running processes.

Some available arguments:

  • -w [num]:[num] - warning if the value is outside the given range
  • -c [num]:[num] - critical if the value is outside the given range
  • -u <uid> - only check processes owned by the given user name or ID
  • -a <argument> - only check processes whose arguments contain the given string
  • -C <command> - only check for exact match of command, without path

Check that SSHD is running, with at least one process

sshd runs one process for the master daemon, plus one for each user login. A range of “1:” (a minimum of 1 and no maximum) therefore goes critical only when no matching process is running at all:

check_procs -c 1: -u root -a /usr/sbin/sshd
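
The range argument is easiest to read as "alert when the count falls outside it". A simplified sketch of that rule follows; outside_range is a hypothetical helper that handles only integer bounds in the forms min:max, min:, :max and a bare max, and it is not the plugin's own parser:

```shell
# outside_range VALUE RANGE: succeed (status 0) if VALUE falls outside
# RANGE, which is the case in which an alert would be raised.
# Hypothetical helper; integer bounds only.
outside_range() {
    value=$1
    range=$2
    case "$range" in
        # "min:max", "min:" or ":max"; a missing side means no bound.
        *:*) min=${range%%:*}; max=${range#*:} ;;
        # Bare "max" is taken to mean the range 0..max.
        *)   min=0; max=$range ;;
    esac
    if [ -n "$min" ] && [ "$value" -lt "$min" ]; then
        return 0
    fi
    if [ -n "$max" ] && [ "$value" -gt "$max" ]; then
        return 0
    fi
    return 1
}
```

With the range “1:” from the command above, a count of 0 is outside the range and raises the alert, while any count of 1 or more does not.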

check_swap

Check swap space.

Warn when less than 20% of swap is free, critical below 10%:

check_swap -w 20% -c 10%
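
How a pair of percentage thresholds like this maps to a state can be sketched as follows. classify_free is a hypothetical helper working on whole percentages, and the strict "less than" comparison at the boundary is an assumption of this sketch rather than something confirmed from the plugin:

```shell
# classify_free PCT_FREE WARN CRIT: print the state for a given
# percentage of free space. The critical threshold is tested first
# so that it takes priority over the warning threshold.
classify_free() {
    if [ "$1" -lt "$3" ]; then
        echo CRITICAL
    elif [ "$1" -lt "$2" ]; then
        echo WARNING
    else
        echo OK
    fi
}
```

With the thresholds above, 50% free gives OK, 15% free gives WARNING and 5% free gives CRITICAL.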