Configure the Prometheus receiver to collect llm-d metrics
Send llm-d metrics to Splunk Observability Cloud.
You can monitor the performance of your llm-d stack by configuring the Splunk Distribution of the OpenTelemetry Collector to send llm-d metrics to Splunk Observability Cloud.
This solution uses the Prometheus receiver to collect metrics from llm-d, which exposes a Prometheus-compatible /metrics endpoint.
- Deploy the Splunk Distribution of the OpenTelemetry Collector to your host or container platform.
- To manually activate the Prometheus receiver for llm-d, make the following changes to your Collector values.yaml configuration file.
- Restart the Splunk Distribution of the OpenTelemetry Collector.
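The values.yaml change described in the second step might look like the following minimal sketch. It is an illustration under stated assumptions, not the official configuration: the receiver name `prometheus/llm-d`, the job name, the scrape interval, the pipeline name `metrics/llm-d`, and the target placeholder are all hypothetical, and the sketch assumes a Helm-based deployment in which the `memory_limiter`, `batch`, and `resourcedetection` processors and the `signalfx` exporter are already defined by the default agent configuration.

```yaml
agent:
  config:
    receivers:
      # Hypothetical receiver name; any "prometheus/<name>" identifier works.
      prometheus/llm-d:
        config:
          scrape_configs:
            - job_name: llm-d-metrics          # assumed job name
              metrics_path: /metrics           # endpoint exposed by llm-d
              scrape_interval: 10s             # assumed interval; tune as needed
              static_configs:
                # Placeholder: replace with the host and port of your llm-d metrics endpoint.
                - targets: ["<llm-d-host>:<metrics-port>"]
    service:
      pipelines:
        # A separate pipeline leaves the default metrics pipeline untouched.
        metrics/llm-d:
          receivers: [prometheus/llm-d]
          # Assumed to be defined in the default agent configuration.
          processors: [memory_limiter, batch, resourcedetection]
          exporters: [signalfx]
```

Defining a dedicated pipeline, rather than overriding the `receivers` list of the default `metrics` pipeline, avoids accidentally dropping the receivers that the default configuration already enables, because list values in values.yaml replace the defaults instead of merging with them.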
Configuration settings
To view the configuration options for the Prometheus receiver, see Settings.
Metrics
The following metrics are available for llm-d. For more information, see Prometheus metrics in the llm-d-inference-sim GitHub repository.
These metrics are considered custom metrics in Splunk Observability Cloud.
| Metric name | Metric type | Description |
|---|---|---|
| inference_objective_request_total | counter | Total inference model requests. |
| inference_extension_scheduler_e2e_duration_seconds | histogram | End-to-end scheduling latency. |
| inference_objective_request_error_total | counter | Total inference model request errors. |
| inference_objective_input_tokens | histogram | Input token count distribution. |
Attributes
| Attribute name | Description |
|---|---|
| model_name | The specific AI model being served. |
| service.instance.id | Unique identifier for a specific instance of the service. |
| k8s.cluster.name | Name of the Kubernetes cluster where the inference workload is running. |
| namespace | Kubernetes namespace used for logical isolation. |
| host.name | Physical or virtual machine name. |
| k8s.node.name | Kubernetes node name. |
Next steps
After you set up data collection, the data populates built-in dashboards that you can use to monitor and troubleshoot your instances.
For more information on using built-in dashboards in Splunk Observability Cloud, see: