4.56 SmartPhysicalDiskStats

Use this Knowledge Script to monitor physical disk activity and response time. To measure disk performance, this Knowledge Script combines disk load with a newly introduced parameter, CPU 'Wait on IO'. CPU 'Wait on IO' (iowait) is the percentage of time the CPU has to wait on disk I/O. A consistently high value indicates that your storage device is too slow to keep up with incoming requests.

On AIX, this Knowledge Script reports only on disk load and ignores the response time parameter.
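To make the iowait percentage concrete, the following minimal sketch computes it from two snapshots of CPU time counters. It is an illustration only, not the method the Knowledge Script uses (the script relies on sar/iostat). The function names `iowait_percent` and `read_cpu_times` are hypothetical, and the counter order assumes the aggregate `cpu` line of `/proc/stat` on Linux.

```python
# Hypothetical helpers for illustration; not part of AppManager.
# Counter order assumed from the Linux /proc/stat "cpu" line:
# user, nice, system, idle, iowait, irq, softirq, steal.
IOWAIT_INDEX = 4


def iowait_percent(before, after):
    """Return the percentage of CPU time spent in iowait between two snapshots."""
    deltas = [b - a for a, b in zip(before, after)]
    total = sum(deltas)
    if total <= 0:
        # No time elapsed between snapshots (or counters reset).
        return 0.0
    return 100.0 * deltas[IOWAIT_INDEX] / total


def read_cpu_times(path="/proc/stat"):
    """Read the aggregate CPU counters (Linux only)."""
    with open(path) as f:
        fields = f.readline().split()
    # fields[0] is the label "cpu"; keep the first eight counters so that
    # guest time, which is already included in user time, is not counted twice.
    return [int(x) for x in fields[1:9]]
```

For example, if 80 of 200 elapsed clock ticks were spent in iowait, `iowait_percent` returns 40.0.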

4.56.1 Resource Objects

Physical disk folder or individual physical disks

4.56.2 Default Schedule

The default interval for this script is Every 10 minutes.

4.56.3 Setting Parameter Values

Set the following parameters as needed:

Description

How to Set It

General Settings

Event severity when job fails

Set the event severity level, from 1 to 40, to indicate the importance of an event reported when a job fails. The default is 5.

Enable debugging?

Select Yes to enable debugging. The default is unselected.

Event detail format

Select the format in which to view the event detail. The default is HTML Table.

Number of seconds between samples

Set the number of seconds, from 1 to 30, between samples. The default is 2.

Number of times sar/iostat should iterate before reporting an average value

Set the number of times, from 1 to 100, that sar/iostat should iterate before reporting an average value. The default is 2.

Raise event when AppManager fails to get metrics?

Select Yes to raise an event if AppManager fails to retrieve the metrics. The default is Yes.

Event severity

Set the event severity level, from 1 to 40, to indicate the importance of an event reported when AppManager fails to retrieve the metrics. The default is 5.

Event Settings

Event if average response time of disk operations exceeds threshold? (y/n)

Select Yes to raise an event if the average response time of the disk operations exceeds the threshold. The default is Yes.

Threshold - Maximum average response time (unavailable on AIX)

Specify the threshold for the maximum average response time that can be detected before an event is raised. The default is 200 ms (milliseconds).

Event severity

Set the event severity level, from 1 to 40, to indicate the importance of the event when the maximum average response time exceeds the threshold you set. The default is 5.

Raise event if disk activity exceeds threshold? (y/n)

Select Yes to raise an event if the disk activity exceeds the threshold. The default is Yes.

Use CPU 'Wait on IO' in determining disk load?

Select Yes to use the CPU 'Wait on IO' in determining the disk load. The default is Yes. If you select this parameter, reaching the disk activity threshold alone does not raise an event. The event is raised only when the following CPU condition is also met: iowait% > system% + user%.

Threshold - Maximum disk activity (% busy)

Specify the maximum disk activity, as a busy percentage (%), that can be detected before an event is raised. The default is 80%.

Event severity

Set the event severity level, from 1 to 40, to indicate the importance of an event when the disk activity (% busy) exceeds the threshold you set.

Collect Data Settings

Collect data for average response time of disk operations?

Select Yes to collect data for charts and reports. If enabled, data collection returns the average response time of disk operations. The default is unselected.

Collect data for disk load?

Select Yes to collect data for charts and reports. If enabled, data collection returns the disk load. The default is unselected.

Collect data for KBs read per second?

Select Yes to collect data for charts and reports. If enabled, data collection returns KBs read per second. The default is unselected.

Collect data for KBs written per second?

Select Yes to collect data for charts and reports. If enabled, data collection returns KBs written per second. The default is unselected.

Collect data for throughput in KBs per second?

Select Yes to collect data for charts and reports. If enabled, data collection returns the combined rate of disk read and write operations in KBs per second. The default is unselected.

Collect data for reads per second?

Select Yes to collect data for charts and reports. If enabled, data collection returns reads per second of the disk operations. The default is unselected.

Collect data for writes per second?

Select Yes to collect data for charts and reports. If enabled, data collection returns writes per second of the disk operations. The default is unselected.

Collect data for throughput per second?

Select Yes to collect data for charts and reports. If enabled, data collection returns throughput per second of the disk operations. The default is unselected.
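The disk activity event rule described for the Use CPU 'Wait on IO' in determining disk load? parameter can be sketched as follows. This is a minimal illustration under stated assumptions, not the Knowledge Script's actual implementation: the function name `should_raise_disk_event`, the sample dictionary keys, and the averaging of samples in Python (rather than by sar/iostat) are all hypothetical.

```python
from statistics import mean

# Hypothetical sketch; names and data layout are assumptions for illustration.
METRICS = ("busy", "iowait", "system", "user")


def should_raise_disk_event(samples, busy_threshold=80.0, use_iowait=True):
    """Decide whether a disk activity event would be raised.

    samples: list of dicts holding the percentages named in METRICS,
             one dict per sar/iostat iteration.
    busy_threshold: maximum disk activity (% busy); 80 is the documented default.
    use_iowait: mirrors the "Use CPU 'Wait on IO'" parameter (default Yes).
    """
    # Average each metric over all iterations.
    avg = {name: mean(s[name] for s in samples) for name in METRICS}

    # The event requires disk activity to exceed the threshold.
    if avg["busy"] <= busy_threshold:
        return False

    # Without the iowait check, exceeding the threshold is sufficient.
    if not use_iowait:
        return True

    # Additional condition from the parameter description:
    # iowait% > system% + user%
    return avg["iowait"] > avg["system"] + avg["user"]
```

With the iowait check enabled, a disk that is 88% busy raises an event only when the CPU spends more time waiting on I/O than doing system and user work combined. This suppresses events for disks that are busy but are not the bottleneck.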