3.7 HealthCheck

Use this Knowledge Script to monitor all Dell server-related services. This Knowledge Script raises an event if any service is not running and automatically re-starts the stopped services. In addition, this Knowledge Script raises an event if SNMP is not operating or cannot get a MIB variable value.

3.7.1 Resource Objects

Dell server, any Dell Service icons

3.7.2 Default Schedule

The default interval is Every 5 minutes.

3.7.3 Setting Parameter Values

Set the following parameters as needed.

Description

How To Set It

Collect data?

Set to y to collect data for charts and reports. If set to y, this Knowledge Script records the status of Dell server-related services. The default is n.

Community

Set the community string in Security Manager for Dell OpenManage server Agent or Dell-iDRAC resources:

  • Provide the SNMP community string that is required to access Dell OpenManage or iDRAC resources. If it is empty, then the community string configured in the security manager will be used. Refer Section 2.8, “Configuring SNMP,” of Management Guide to know how to configure the security manager.

    If community string is not configured in the security manger, then by default, Public will be used as the community string.

Event severity when service down or cannot get MIB value

Set the event severity, from 1 to 40, to indicate the importance of an event in which the service is down or cannot get the MIB variable value. The default is 5.

Dell Open Manage service

 

Auto-start service?

Set to y to automatically restart the stopped services. The default is y.

Event severity when service down; restart failed

Set the event severity, from 1 to 40, to indicate the importance of an event in which a service is down and restart failed. The default is 5.

Event severity when service down; restart succeeded

Set the event severity, from 1 to 40, to indicate the importance of an event in which a service is down and restart succeeded. The default is 25.

Event severity when service down; do not restart

Set the event severity, from 1 to 40, to indicate the importance of an event in which a service is down and will not be restarted. The default is 18.