Introduction to Splunk AI Infrastructure Monitoring

Monitor and troubleshoot the performance of the AI infrastructure components used to build your AI applications.

Monitor and troubleshoot the performance of the AI infrastructure components used to build your AI applications with Splunk AI Infrastructure Monitoring.

AI Infrastructure Monitoring supports monitoring the health, availability, and usage of specialized AI components, such as large-language model (LLM) services, model-serving platforms, language frameworks, vector databases, models, infrastructure services, and microservices.

Set up AI Infrastructure Monitoring

Set up one or more of the following data integrations to collect data from AI infrastructure components. The data integrations use the Splunk Distribution of the OpenTelemetry Collector with an OpenTelemetry receiver to collect data.

What can I do with Splunk AI Infrastructure Monitoring?

After you set up data collection from supported AI infrastructure components to Splunk Observability Cloud, the data populates built-in experiences that you can use to monitor and troubleshoot your AI infrastructure.

The following table describes the tools you can use to monitor and troubleshoot your AI infrastructure components.
Do this With this tool Link to documentation
Orient and explore different layers of your AI tech stack. Built-in navigators
Assess service, endpoint, and system health at a glance. Built-in dashboards