AI service data in Splunk AI Assistant for SPL
How to opt in or out of Training and Fine-Tuning
Allowing the use of your AI Service Data is turned on by default. You can turn access off from within Splunk AI Assistant for SPL on the Settings tab of the app.
Toggle the selector next to Allow Splunk to use your AI Service Data for Training and Fine-Tuning as defined in the Splunk AI Features Specific Offerings Terms as shown in the following image:
What data is collected
Splunk AI Assistant for SPL collects different context data depending on if you opt-in to allow use of your AI Service Data and opt-in to use the Personalization feature.
Personalization data
Personalization is turned off by default. You can turn Personalization on or back off from within Splunk AI Assistant for SPL on the Settings tab of the app. Deselect the box next to Personalize results.
Collected data is stored in the vector DB, and a cleanup job runs weekly to delete this information if you decide to opt-out of Personalization at a later date.
Data retention
Data outlined in this section is retained as set forth in the Splunk Data Retention Policy.
Chat data is stored in the KVStore on the customer's stack. If you choose to delete a chat, that chat data is deleted from your local KVStore collection.
Field specific data details
| Component | Description | Example |
|---|---|---|
app.session.copy_spl_clicked |
Data collected when SPL generated using the app is copied with the "Copy" button. |
JSON
|
app.Splunk_AI_Assistant |
Information including type, tenant, query, enabled_features, and request_id. |
JSON
|
app.Splunk_AI_Assistant |
Information including tenant, and type. |
JSON
|
app.Splunk_AI_Assistant.splgen |
Collects the chat_id. |
JSON
|
app.Splunk_AI_Assistant.splgen.feedback |
Information including enabled_features, feedback_id, and query. |
JSON
|
app.Splunk_AI_Assistant_Cloud.splgen |
Logging from Splunk AI Assistant for SPL Splunk app REST handlers. |
CODE
|
app.Splunk_AI_Assistant_Cloud.splgen |
Generation time. End to end (e2e) time from request start to end. |
CODE
|
app.Splunk_AI_Assistant_Cloud.splgen.openinsearch |
When the user clicks on the "Open in Search" button for some generated SPL. |
JSON
|
app.Splunk_AI_Assistant_Cloud.splgen.usage |
Feedback submitted by users with thumbs up/thumbs down/additional details UI in app. |
JSON
|
inference_spl_generation
|
Natural language prompt entered by the user in user_prompt field and intermediate rag/metadata responses retrieved from the large language models (LLMs). |
JSON
|
saia-tenant-id |
Hashed name of the tenant or stack ID. |
JSON
|
stackID
chat_id
app_version
|
Information collected from the StackID, UserID, ChatID, and App Version fields. |
CODE
|
job_id
user
chat_id
|
Information collected from the JobID, UserKey, User, and ChatID fields. |
CODE
|
input_word_count
output_word_count
output_char_count
|
Total numbers of the word and character counts for input and output responses. |
JSON
|
source_app_id |
SourceAppID information. |
CODE
|
num_distinct_clusters
avg_fields_per_cluster min_fields_per_cluster max_fields_per_cluster
|
Information collected on distinct clusters formed for each tenant, average number of clusters formed per sourcetype, average number of field lists collected per cluster, minimum number of fields per cluster, and maximum number of fields per cluster. |
JSON
|
generate_optimized_spl |
Tracks runtime optimization decisions and user behaviors during SPL search generation. |
JSON
|
enabled_features |
Tracks the app features currently turned on by the customer. |
JSON
|
orchestration_decision |
Tracks the orchestrated intent returned for a given user input, by the intent orchestration component. |
JSON
|
data_upload_sourcetype_metadata |
Tracks description generation/metadata collection stats, such as number of sourcetypes/indexes we have collected data for. |
JSON
|
Personalization data
Personalization is turned off by default. You can turn data sharing on or back off from within Splunk AI Assistant for SPL on the Settings tab of the app. Deselect the box next to Personalize results as shown in the following image:The following context data is collected if you opt-in to use Personalization.
This data is collected using 2 saved searches bundled with the assistant. These searches are only enabled if you opt-in for Personalization:
- Splunk AI Assistant for SPL - Field Summary
- Splunk AI Assistant for SPL - Search Logs
Collected data is stored in the vector database, and a cleanup job runs weekly to delete this information if you decide to opt-out of Personalization.
| Component | Description | Example |
|---|---|---|
app.Splunk_AI_Assistant.index_metadata |
Sourcetype metadata |
JSON
|
app.Splunk_AI_Assistant.previous_searches |
Previous searches |
JSON
|
num_indexes
num_sourcetypes
num_distinct_sourcetypes
average_sourcetype_per_index
num_spls
num_distinct_spls
num_users
num_distinct_users
average_spls_per_user
|
VectorDB metrics for all the tenants who opted for the personalization feature. |
JSON
|