# Share data in the AI Toolkit

## What data is collected
The AI Toolkit collects the following basic usage information:
| Component | Description |
|---|---|
| `ai_processing_time` | Time taken to process the `ai` command request. Triggered during `ai` command usage. |
| `algo_name` | Name of the algorithm used in `fit` or `apply`. |
| `app_context` | Name of the app from which the search is run. |
| `apply_time` | Time the `apply` command took. |
| `app.session.Splunk_ML_Toolkit.changeSmartAssistantStep` | User progress through an AI Toolkit Smart Assistant. |
| `app.session.Splunk_ML_Toolkit.createExperiment` | User creating an AI Toolkit Experiment. |
| `app.session.Splunk_ML_Toolkit.createExperimentAlert` | Users creating alerts for AI Toolkit Experiments. |
| `app.session.Splunk_ML_Toolkit.loadAssistant` | Number of times the user has loaded an AI Toolkit Assistant. |
| `app.session.Splunk_ML_Toolkit.saveExperiment` | Users saving their work in AI Toolkit Experiments. |
| `app.session.Splunk_ML_Toolkit.scheduleExperimentTraining` | Users scheduling model retraining for AI Toolkit Experiments. |
| `col_dimension` | Dimension of the dataset from the model schema. Triggered during `apply`. |
| `columns` | The number of columns being run through the `fit` command. |
| `command` | `fit`, `apply`, or `score`. |
| `csv_parse_time` | CSV parse time. |
| `csv_read_time` | CSV read time. |
| `csv_render_time` | CSV render time. |
| `deployment.app` | Apps installed per Splunk instance. |
| `df_shape` | Shape of the data input received from Splunk. Triggered during `apply`. |
| `example_name` | Name of the Showcase example being run. |
| `experiment_id` | ID of the `fit` and `apply` run on the Experiments page. All preprocessing steps and the final fit share the same ID. |
| `fit_time` | Amount of time it took to run the `fit` command. |
| `full_punct` | The `punct` (punctuation pattern) of the data during `fit` or `apply`. |
| `handle_time` | Time for the handler to handle the data. |
| `metrics_type` | The type of request sent. Used to differentiate model upload and model inference call flows. Contains two values. |
| `model` | The LLM model name under the specified provider while running the `ai` command. |
| `modelId` | Model ID under which the user saves their model. |
| `model_upload` | Monitors the model upload process to determine whether the model has been successfully uploaded and is ready for inference. |
| `numColumns` | Total number of columns in the dataset. |
| `numRows` | Total number of rows (events) in the dataset. |
| `num_fields` | Total number of fields. |
| `num_fields_fs` | Number of fields that have the `fs` (Field Selector) prefix. |
| `num_fields_PC` | Number of fields that have the `PC` (preprocessed) prefix. |
| `num_fields_prefixed` | Total number of preprocessed fields. |
| `num_fields_RS` | Number of fields that have the `RS` (Robust Scaler) prefix. |
| `num_fields_SS` | Number of fields that have the `SS` (Standard Scaler) prefix. |
| `num_fields_tfidf` | Number of fields that used term frequency-inverse document frequency (TF-IDF) preprocessing. |
| `onnx_input_shape` | Shape of the input data stored in the ONNX model schema. Triggered during `apply`. |
| `onnx_model_size_on_disk` | Total size in MB taken up by the model file on disk after encoding. Triggered during model upload. |
| `onnx_upload_time` | Time taken to upload an ONNX model file from the UI. Triggered during model upload. |
| `orig_sourcetype` | The original sourcetype of the machine data. |
| `params` | Optional parameters used in the `fit` step. |
| `params` | The boolean value of `supervise_split_by`. Checks whether `DecisionTreeRegressor` is used as part of `DensityFunction`. |
| `partialFit` | Whether or not the fit is a partial fit action. |
| `PID` | Process identifier associated with the command. |
| `pipeline_stage` | Each preprocessing step on the Experiments page is assigned a number starting from 0. This helps determine the order of the preprocessing steps and the length of the pipeline. |
| `provider` | The provider name while running the `ai` command. |
| `rows` | The number of rows being run through the `fit` command. |
| `rows` | The number of rows processed for a given `ai` command request. |
| `rows_processor_time` | Time taken to process the rows, in seconds, while using the `ai` command. |
| SageMaker model apply/inference event | The AWS SageMaker model apply/inference event. |
| `scoringName` | Name of the scoring operation if whitelisted. If the name is not whitelisted, logs the hash of the `scoringName`. |
| `scoringTimeSec` | Time taken by the scoring operation. |
| `UUID` | Universally unique identifier associated with the command. This is 128-bit and used to keep each `fit`/`apply` unique. |
| `container_id`, `status`, `cluster_type`, `hpa`, and memory usage of a container | Information about the container, including memory usage, cluster type, and HPA behavior, when the container is started and when the `fit` command is executed in AI Toolkit version 5.7.0. |
| `model`, `rows`, `rows_processing_time`, `column_count` | Information about the model used, including the number of rows processed, the processing time taken, and the number of columns passed to the model when the `predictai` command is executed in AI Toolkit version 5.7.0. |
| Invocation of the `ai` command using Splunk-hosted LLMs | Information about the model used, with input and output tokens. |
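As a rough illustration only, the sketch below assembles a single usage event for a `fit` command run from the field names listed above. The event structure and every value are hypothetical, invented for this example; they do not reflect the actual payload format the AI Toolkit sends.

```python
import json
import uuid

# Hypothetical usage event. Field names come from the table above;
# the values and the flat-dictionary structure are invented for illustration.
event = {
    "command": "fit",                 # fit, apply, or score
    "algo_name": "LinearRegression",  # algorithm used in fit or apply
    "rows": 10000,                    # rows run through the fit command
    "columns": 12,                    # columns run through the fit command
    "fit_time": 3.42,                 # time the fit command took
    "partialFit": False,              # whether this was a partial fit action
    "PID": 4242,                      # process identifier for the command
    "UUID": str(uuid.uuid4()),        # 128-bit ID keeping each fit/apply unique
}

# Serialize as JSON, the format used for the examples in this table.
payload = json.dumps(event, indent=2)
print(payload)
```

The `UUID` field is generated per invocation, which is what lets separate `fit`/`apply` runs be told apart even when every other field matches.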