Configure the Prometheus receiver to collect NVIDIA Dynamo metrics
Send NVIDIA Dynamo metrics to Splunk Observability Cloud.
You can monitor the performance of NVIDIA Dynamo by configuring the Splunk Distribution of the OpenTelemetry Collector to send NVIDIA Dynamo metrics to Splunk Observability Cloud.
This solution uses the Prometheus receiver to collect metrics from NVIDIA Dynamo, which exposes the Prometheus-compatible /metrics endpoint.
- Deploy the Splunk Distribution of the OpenTelemetry Collector to your host or container platform:
- To manually activate the Prometheus receiver for NVIDIA Dynamo, make the following changes to your Collector
values.yamlconfiguration file. - Restart the Splunk Distribution of the OpenTelemetry Collector.
Configuration settings
To view the configuration options for the Prometheus receiver, see Settings.
Metrics
The following metrics are available for NVIDIA Dynamo. For more information, see Metrics in the NVIDIA Dynamo documentation.
These metrics are considered custom metrics in Splunk Observability Cloud.
| Metric name | Description |
|---|---|
dynamo_frontend_* |
Frontend metrics that measure request handling, token processing, and latency measurements. |
dynamo_component_* |
Component metrics that measure request counts, processing times, byte transfers, and system uptime. |
dynamo_preprocessor_* |
Component-specific metrics. |
|
Metrics related to backend engines. Backend engines expose their own metrics. |
Attributes
The following resource attributes are available for NVIDIA Dynamo.
| Attribute name | Description |
|---|---|
service.name |
Logical name of the component emitting data. |
service.instance.id |
Unique ID for one running instance (pod/replica) of that component. |
k8s.cluster.name |
Name of the Kubernetes cluster hosting the workload. |
namespace |
Kubernetes namespace where Dynamo pods run. Isolates the env/team. |
model_name |
LLM or model these metrics refer to. |
nvidia.com/dynamo-graph-deployment-name |
Label that links pods to one Dynamo graph deployment. |
Next steps
After you set up data collection, the data populates built-in dashboards that you can use to monitor and troubleshoot your instances.
For more information on using built-in dashboards in Splunk Observability Cloud, see: