Cluster Metrics
The Cluster Agent Dashboard metrics derive from the Kubernetes API, and they report information for the clusters and pods. For any defined set of namespaces, the Cluster Agent reports events on these Kubernetes and hardware resources.
Splunk AppDynamics monitors cluster health and Kubernetes objects for:
Cluster Agent
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Availability | Availability of the Cluster Agent. This metric helps in identifying if the Cluster Agent is down. Value of 100 represents that the Cluster Agent status is active, thus available. | Cluster Agent|Availability | 
Cluster Summary Metrics
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Error events count | Number of error events | Hardware Resources|Cluster|Error events count | |
| Evicted pods count | Number of evicted pods | Hardware Resources|Cluster|Evicted pods count | |
| Eviction threats count | Number of events that represent pod evictions | Hardware Resources|Cluster|Eviction threats count | |
| Image pull errors | Number of image pull errors | Hardware Resources|Cluster|Image pull errors | |
| Image pulls | Number of image pulls | Hardware Resources|Cluster|Image pulls | |
| Info events count | Number of informational events | Hardware Resources|Cluster|Info events count | |
| Pod errors | Number of errors related to pods | Hardware Resources|Cluster|Pod errors | |
| Pod Kills | Number of pods that were killed | Hardware Resources|Cluster|Pod Kills | |
| Pod restarts | Number of times the pods restarted | Hardware Resources|Cluster|Pod restarts | |
| Pods Scaledowns | Count of scaledowns; you can scale down your deployments and replica sets. | Hardware Resources|Cluster|Pods Scaledowns | |
| Pods count | Total count of pods | Hardware Resources|Cluster|Pods count | |
| Pods failed | Number of failed pods | Hardware Resources|Cluster|Pods failed | |
| Pods pending | Number of pods in a pending state. Pending status normally indicates an issue. See theKubernetes documentation. | Hardware Resources|Cluster|Pods pending | |
| Pods running | Number of pods in a running state | Hardware Resources|Cluster|Pods running | |
| Pods succeeded | Number of pods in Succeeded phase | Hardware Resources|Cluster|Pods succeeded | |
| Pods unknown | Number of pods in Unknown state | Hardware Resources|Cluster|Pods unknown | |
| Pods with Missing Dependencies - Config Maps and Secrets | If a pod is dependent on any Config Maps & Secrets, then those dependencies are missing. | Hardware Resources|Cluster|Pods With Missing Dependencies - Config Maps And Secrets (Pod Metrics for Inventory tab) | |
| Pods with Missing Dependencies - Services | If a pod is dependent on any Services, then those dependencies are missing. | 
 | Hardware Resources|Cluster|Pods With Missing Dependencies (Pod Metrics for Inventory Tab) | 
| Pods with No Limits | Number of pods with no limits (on CPU/memory) set. If you specified limits on any pod that you are starting, this metric indicates how many pods do not have a limit defined (Displays in the Inventory tab, under Pod Metrics). | Hardware Resources|Cluster|Pods With No Limits | |
| Pods With No Liveness Probe | Number of pods with no liveness probe. If you configured a probe in Kubernetes to monitor liveness, the values display in the Inventory tab, under Pod Metrics. | Hardware Resources|Cluster|Pods With No Liveness Probe | |
| Pods With No Readiness Probe | Number of pods with no readiness probe. If you configured a probe in Kubernetes to monitor readiness, the values display in the Inventory tab, under Pod Metrics. | Hardware Resources|Cluster|Pods With No Readiness Probe | |
| Privileged Pods | Number of privileged pods that run with root access (Displays in the Inventory tab, under Pod Metrics). | Hardware Resources|Cluster|Privileged Pods | |
| Storage errors | Overall number of errors related to storage for the cluster. | Hardware Resources|Cluster|Storage errors | |
| Storage quota violations | Number of storage quota violations; if someone exceeds that quota. | Hardware Resources|Cluster|Storage quota violations | 
CPU
CPU Capacity
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Total (MilliCores) | Total CPU capacity for the cluster in MilliCores | Hardware Resources|Cluster|CPU|Capacity|Total (MilliCores) | |
| Used (MilliCores) | CPU capacity already used by the cluster in MilliCores | Hardware Resources|Cluster|CPU|Capacity|Used (MilliCores) | 
CPU Quota
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Limit Used (%) | Percentage of CPU limit quota used | Hardware Resources|Cluster|CPU|Quota|Limit Used (%) | |
| Limit Used (MilliCores) | MilliCores value for CPU limit quota used | Hardware Resources|Cluster|CPU|Quota|Limit Used (MilliCores) | |
| Request Used (%) | Percentage of CPU request quota used | Hardware Resources|Cluster|CPU|Quota|Request Used (%) | |
| Request Used (MilliCores) | MilliCores value for CPU request quota used | Hardware Resources|Cluster|CPU|Quota|Request Used (Millicores) | 
CPU Utilization
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Limit (MilliCores) | Limit of CPU which can be used by the pods. Only the pods belonging to monitored namespaces are used to calculate this metric. If this value is not specified for any pod, then the value is calculated as the CPU limit of the node. For example: 
 | Hardware Resources|Cluster|CPU|Utilization|Limit (MilliCores) | |
| Request (MilliCores) | MilliCore value of CPU for which all the pods in monitored namespaces have requested. | Hardware Resources|Cluster|CPU|Utilization|Request (MilliCores) | |
| Used (MilliCores) | Actual CPU which the pods from monitored namespaces are currently using. | Hardware Resources|Cluster|CPU|Utilization|Used (MilliCores) | 
DaemonSets
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Count | Number of daemon sets that exist | HardwareResources|Cluster|DaemonSets|Count | |
| Nodes Available | Number of nodes that are running and available on the cluster | HardwareResources|Cluster|DaemonSets|Nodes Available | |
| Nodes MissScheduled | Number of nodes that are running, but should not be running | HardwareResources|Cluster|DaemonSets|Nodes MissScheduled | |
| Nodes Unavailable | Number of nodes that should be running, but are not running | HardwareResources|Cluster|DaemonSets|Nodes Unavailable | 
Deployments
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Count | Number of deployments that exist in the cluster | HardwareResources|Cluster|Deployments|Count | |
| Replicas | Number of pod replicas in the cluster that are not in a terminated state | HardwareResources|Cluster|Deployments|Replicas | |
| Replicas Unavailable | Total number of unavailable pod replicas across all deployments in the cluster | HardwareResources|Cluster|Deployments|ReplicasUnavailable | 
Endpoints
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Count | Number of endpoints in the cluster | HardwareResources|Cluster|Endpoints|Count | |
| Not Ready Address | Total number of not ready addresses for all the endpoints in the cluster | HardwareResources|Cluster|Endpoints|Not Ready Address | |
| Orphans | Total number of endpoints in the cluster which do not have any ready, nor any not ready addresses | HardwareResources|Cluster|Endpoints|Orphans | |
| Ready Address | Total number of ready addresses for all the endpoints in the cluster | HardwareResources|Cluster|Endpoints|Ready Address | 
Jobs
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Count | Total number of jobs in the cluster. | Hardware Resources|Cluster|Jobs|Count | |
| Pods Active | Total number of active pods for all the jobs in the cluster. | Hardware Resources|Cluster|Jobs|Pods Active | |
| Pods Failed | Total number of pods which reached phase Failed for all the jobs in the cluster. | Hardware Resources|Cluster|Jobs|Pods Failed | |
| Pods Succeeded | Total number of pods which reached phase Succeeded for all the jobs in the cluster. | Hardware Resources|Cluster|Jobs|Pods Succeeded | 
Memory
Memory Capacity
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Total (MB) | Total Memory capacity for the cluster in MBs. | Hardware Resources|Cluster|Memory|Capacity|Total (MB) | |
| Used (MB) | Memory capacity already used by the cluster in MBs | Hardware Resources|Cluster|Memory|Capacity|Used (MB) | 
Memory Quota
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Limit Used (%) | Percentage of Memory limit quota used | Hardware Resources|Cluster|Memory|Quota|Limit Used (%) | |
| Limit Used (MB) | MB value for Memory limit quota used | Hardware Resources|Cluster|Memory|Quota|Limit Used (MB) | |
| Request Used (%) | Percentage of Memory request quota used | Hardware Resources|Cluster|Memory|Quota|Request Used (%) | |
| Request Used (MB) | MB value for Memory request quota used | Hardware Resources|Cluster|Memory|Quota|Request Used (MB) | 
Memory Utilization
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Limit (MB) | Limit of Memory which can be used by the pods. Only the pods belonging to monitored namespaces are used to calculate this metric. If this value is not specified for any pod, then the value is calculated as the memory limit of the node. For example: 
 | Hardware Resources|Cluster|Memory|Utilization|Limit (MB) | |
| Request (MB) | MB value of Memory for which all the pods in monitored namespaces have requested. | Hardware Resources|Cluster|Memory|Utilization|Request (MB) | |
| Used (MB) | Actual Memory which the pods from monitored namespaces are currently using. | Hardware Resources|Cluster|Memory|Utilization|Used (MB) | 
Nodes
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Master Count | Number of master nodes in the cluster | Hardware Resources|Cluster|Nodes|Master Count | |
| Worker Count | Number of worker nodes in the cluster | Hardware Resources|Cluster|Nodes|Worker Count | |
| Memory Pressure Count | Number of nodes that are under memory pressure in the cluster | Hardware Resources|Cluster|Nodes|Memory Pressure Count | |
| Disk Pressure Count | Number of nodes that are under disk pressure in the cluster | Hardware Resources|Cluster|Nodes|Disk Pressure Count | 
Pods
Pods Capacity
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Total Count | Total number of pods that a cluster can support | 
 | Hardware Resources|Cluster|Pods|Capacity|Total Count | 
| Used Count | Number of pods already created in the cluster | Hardware Resources|Cluster|Pods|Capacity|Used Count | 
Pods CPU Usage
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| %Busy Scaled | This normalises the CPU usage percentage relative to the CPU limit, scaling it to a more detailed unit. This metric displays how much of the allocated CPU resources (measured in milli-cores) are being used, providing a precise view of CPU utilisation with the CPU limit of the resource. | Root|Individual Nodes|<namespace>/<pod-name>|Hardware Resources|CPU|%Busy Scaled | |
| %Busy | The percentage of the CPU used by a pod. If the CPU limit is provided for the pod, the busy % is calculated as the percentage of CPU used relative to the CPU limit of the pod. If CPU limit of the pod is not specified, this is calculated as the percentage of CPU used relative to the CPU limit of the node or cluster, whichever is available. | Root|Individual Nodes|<namespace>/<pod-name>|Hardware Resources|CPU|%Busy | 
Pods Memory Usage
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Used (MB) | The amount of memory used by a pod. | Root|Individual Nodes|<namespace>/<pod-name>|Hardware Resources|Memory|Used (MB) | 
PVC
PVC Quota
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Used | PVC quota already being used in the cluster (count) | Hardware Resources|Cluster|PVC|Quota|Used | |
| Used % | Percentage of PVC quota already being used in the cluster | Hardware Resources|Cluster|PVC|Quota|Used (%) | 
PVC Utilization
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Capacity (MB) | Total PVC available for the pods in the monitored namespaces | Hardware Resources|Cluster|PVC|Utilization|Capacity (MB) | |
| Request (MB) | Value for PVC requested by pods in monitored namespaces | Hardware Resources|Cluster|PVC|Utilization|Request (MB) | 
ReplicaSets
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Count | Number of replica set resources in the cluster | Hardware Resources|Cluster|Count | |
| Replicas | Total number of replicas for all the replica sets in the cluster | Hardware Resources|Cluster|ReplicaSets|Replicas | |
| Replicas Available | Total number of available replicas for all the replica sets in the cluster | Hardware Resources|Cluster|ReplicaSets|Replicas Available | |
| Replicas Unavailable | Total number of unavailable replicas for all the replica sets in the cluster | Hardware Resources|Cluster|ReplicaSets|Replicas Unavailable | 
Services
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Count | Total number of Kubernetes Services running in the cluster | Hardware Resources|Cluster|Services|Count | 
StatefulSets
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Count | Number of statefulsets in monitored namespaces | Hardware Resources|Cluster|StatefulSets|Count | |
| Replicas Ready | Number of replicas in a ready state across all statefulsets in monitored namespaces | Hardware Resources|Cluster|StatefulSets|Replicas Ready | |
| Replicas Desired | Number of replicas across all statefulsets in monitored namespaces which are specified as desired in statefulset spec | N/A | Hardware Resources|Cluster|StatefulSets|Replicas Desired | 
| Replicas Not Ready | Number of replicas across all statefulsets in monitored namespaces which are not ready and are yet to be created or started | Hardware Resources|Cluster|StatefulSets|Replicas Not Ready | |
| Collisions | Number of hash collisions for statefulsets across all namespaces monitored | N/A | Hardware Resources|Cluster|StatefulSets|Collisions | 
Storage Quota
| Metric Name | Description | UI Location | Metric Path | 
|---|---|---|---|
| Used (MB) | Storage quota used by the cluster in MB | Hardware Resources|Cluster|Storage|Quota|Used (MB) | |
| Used (%) | Percentage of storage quota used by the cluster | Hardware Resources|Cluster|Storage|Quota|Used (%) |