Configure the Prometheus receiver to collect llm-d metrics

Send llm-d metrics to Splunk Observability Cloud.

You can monitor the performance of your llm-d stack by configuring the Splunk Distribution of the OpenTelemetry Collector to send llm-d metrics to Splunk Observability Cloud.

This solution uses the Prometheus receiver to collect metrics from llm-d, which exposes the Prometheus-compatible /metrics endpoint.

  1. Deploy the Splunk Distribution of the OpenTelemetry Collector to your host or container platform.
  2. To manually activate the Prometheus receiver for llm-d, make the following changes to your Collector values.yaml configuration file.
    1. Add prometheus/llm-d to the receivers section. For example:
      YAML
      agent:
        config:
          receivers:
            prometheus/llm-d:
              config:
                scrape_configs:
                - bearer_token_file: /var/run/secrets/kubernetes.io/serviceaccount/token
                  job_name: llm-d-epp
                  metrics_path: /metrics
                  scrape_interval: 30s
                  static_configs:
                  - labels:
                      job: llm-d-epp
                      namespace: llm-d # replace with your llm-d namespace
                      workload: llm-d
                    targets:
                    - '{host}:{port}'
                - job_name: llm-d-decode
                  metrics_path: /metrics
                  scrape_interval: 30s
                  static_configs:
                  - labels:
                      job: llm-d-decode
                      namespace: llm-d
                      workload: llm-d
                    targets:
                    - '{host}:{port}'
    2. Add prometheus/llm-d to the metrics pipeline of the service section. For example:
      YAML
      service:
        pipelines:
          metrics:
            receivers:
              - prometheus/llm-d
  3. Restart the Splunk Distribution of the OpenTelemetry Collector.
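Putting the receiver and pipeline changes together, the relevant portion of values.yaml might look like the following sketch. This assumes the Helm chart layout shown above; the {host}:{port} target is a placeholder for your llm-d service endpoint, and only the llm-d-epp job is shown for brevity.

```yaml
agent:
  config:
    receivers:
      prometheus/llm-d:
        config:
          scrape_configs:
          - job_name: llm-d-epp
            bearer_token_file: /var/run/secrets/kubernetes.io/serviceaccount/token
            metrics_path: /metrics
            scrape_interval: 30s
            static_configs:
            - labels:
                job: llm-d-epp
                namespace: llm-d # replace with your llm-d namespace
                workload: llm-d
              targets:
              - '{host}:{port}' # placeholder for your llm-d endpoint
    service:
      pipelines:
        metrics:
          receivers:
            - prometheus/llm-d
```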

Configuration settings

To view the configuration options for the Prometheus receiver, see Settings.
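The Prometheus receiver accepts standard Prometheus scrape_config fields. For example, if your llm-d endpoints serve HTTPS, you can set the scheme and TLS options. This is a sketch using standard Prometheus scrape_config fields; whether your llm-d deployment serves TLS depends on your setup.

```yaml
receivers:
  prometheus/llm-d:
    config:
      scrape_configs:
      - job_name: llm-d-epp
        scheme: https               # standard Prometheus scrape_config field
        tls_config:
          insecure_skip_verify: true # for self-signed certificates only; prefer ca_file
        metrics_path: /metrics
        scrape_interval: 30s
```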

Metrics

The following metrics are available for llm-d. For more information, see Prometheus metrics in the llm-d-inference-sim GitHub repository.

These metrics are considered custom metrics in Splunk Observability Cloud.

Metric name                                         Metric type  Description
inference_objective_request_total                   counter      Total inference model requests.
inference_extension_scheduler_e2e_duration_seconds  histogram    End-to-end scheduling latency.
inference_objective_request_error_total             counter      Total inference model request errors.
inference_objective_input_tokens                    histogram    Input token count distribution.
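Because these metrics are scraped in Prometheus exposition format, a counter appears as a single series, while a histogram expands into _bucket, _sum, and _count series. The following sketch shows what the scraped text might look like and how the metric types can be read from the # TYPE comment lines; the label values and numbers are illustrative only, not real llm-d output.

```python
# Hypothetical sample of Prometheus exposition text, as llm-d might expose it
# at /metrics. The model_name label and all numbers are illustrative.
SAMPLE = """\
# HELP inference_objective_request_total Total inference model requests.
# TYPE inference_objective_request_total counter
inference_objective_request_total{model_name="example-model"} 42
# HELP inference_objective_input_tokens Input token count distribution.
# TYPE inference_objective_input_tokens histogram
inference_objective_input_tokens_bucket{model_name="example-model",le="64"} 10
inference_objective_input_tokens_bucket{model_name="example-model",le="+Inf"} 15
inference_objective_input_tokens_sum{model_name="example-model"} 1200
inference_objective_input_tokens_count{model_name="example-model"} 15
"""

def metric_types(text):
    """Map metric name -> type, read from '# TYPE <name> <type>' comment lines."""
    types = {}
    for line in text.splitlines():
        if line.startswith("# TYPE "):
            _, _, name, kind = line.split()
            types[name] = kind
    return types

print(metric_types(SAMPLE))
# → {'inference_objective_request_total': 'counter',
#    'inference_objective_input_tokens': 'histogram'}
```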

Attributes

The following resource attributes are available for llm-d.
Attribute name       Description
model_name           The specific AI model being served.
service.instance.id  Unique identifier for a specific instance of the service.
k8s.cluster.name     Name of the Kubernetes cluster where the inference workload is running.
namespace            Kubernetes namespace used for logical isolation.
host.name            Physical or virtual machine name.
k8s.node.name        Kubernetes node name.