You are viewing docs on Elastic's new documentation system, currently in technical preview. For all other Elastic docs, visit elastic.co/guide.

Host metrics

Learn about key host metrics used for host monitoring.

Learn about key host metrics displayed in the Infrastructure UI:

Hosts metrics

MetricDescription
Hosts
Number of hosts returned by your search criteria.

CPU usage metrics

MetricDescription
CPU Usage (%)

Percentage of CPU time spent in states other than Idle and IOWait, normalized by the number of CPU cores. This includes both time spent on user space and kernel space.

100% means all CPUs of the host are busy.

CPU Usage - iowait (%)
The percentage of CPU time spent in wait (on disk).
CPU Usage - irq (%)
The percentage of CPU time spent servicing and handling hardware interrupts.
CPU Usage - nice (%)
The percentage of CPU time spent on low-priority processes.
CPU Usage - softirq (%)
The percentage of CPU time spent servicing and handling software interrupts.
CPU Usage - steal (%)
The percentage of CPU time spent in involuntary wait by the virtual CPU while the hypervisor was servicing another processor. Available only on Unix.
CPU Usage - system (%)
The percentage of CPU time spent in kernel space.
CPU Usage - user (%)
The percentage of CPU time spent in user space. On multi-core systems, you can have percentages that are greater than 100%. For example, if 3 cores are at 60% use, then the system.cpu.user.pct will be 180%.
Load (1m)

1 minute load average.

Load average gives an indication of the number of threads that are runnable (either busy running on CPU, waiting to run, or waiting for a blocking IO operation to complete).

Load (5m)

5 minute load average.

Load average gives an indication of the number of threads that are runnable (either busy running on CPU, waiting to run, or waiting for a blocking IO operation to complete).

Load (15m)

15 minute load average.

Load average gives an indication of the number of threads that are runnable (either busy running on CPU, waiting to run, or waiting for a blocking IO operation to complete).

Normalized Load

1 minute load average normalized by the number of CPU cores.

Load average gives an indication of the number of threads that are runnable (either busy running on CPU, waiting to run, or waiting for a blocking IO operation to complete).

100% means the 1 minute load average is equal to the number of CPU cores of the host.

Taking the example of a 32 CPU cores host, if the 1 minute load average is 32, the value reported here is 100%. If the 1 minute load average is 48, the value reported here is 150%.

Memory metrics

MetricDescription
Memory Cache
Memory (page) cache.
Memory Free
Total available memory.
Memory Free (excluding cache)
Total available memory excluding the page cache.
Memory Total
Total memory capacity.
Memory Usage (%)

Percentage of main memory usage excluding page cache.

This includes resident memory for all processes plus memory used by the kernel structures and code apart from the page cache.

A high level indicates a situation of memory saturation for the host. For example, 100% means the main memory is entirely filled with memory that can't be reclaimed, except by swapping out.

Memory Used
Main memory usage excluding page cache.

Log metrics

MetricDescription
Log Rate
Derivative of the cumulative sum of the document count scaled to a 1 second rate. This metric relies on the same indices as the logs.

Network metrics

MetricDescription
Network Inbound (RX)
Number of bytes that have been received per second on the public interfaces of the hosts.
Network Inbound (TX)
Number of bytes that have been sent per second on the public interfaces of the hosts.

Disk metrics

MetricDescription
Disk Latency
Time spent to service disk requests.
Disk Read IOPS
Average count of read operations from the device per second.
Disk Read Throughput
Average number of bytes read from the device per second.
Disk Usage - Available (%)
Percentage of disk space available.
Disk Usage - Used (%)
Percentage of disk space used.
Disk Write IOPS
Average count of write operations from the device per second.
Disk Write Throughput
Average number of bytes written from the device per second.

On this page