Create an Amazon S3 dataset for Ingest Processor pipelines
Create an Amazon S3 dataset in the Data Management app for your Ingest Processor pipelines to send data to a specific bucket.
To send data from Ingest Processor to an Amazon S3 bucket, create an Amazon S3 dataset in the Data Management app on Splunk Cloud Platform. You can then use the dataset as a pipeline destination.
You can optionally configure the dataset to also support federated searches, so that you can use the same dataset to write and read data from Amazon S3.
The dataset uses an Amazon S3 connection for authentication. You can create multiple datasets that use the same connection.
- Your Splunk Cloud Platform deployment must be on version 10.4.2604 or higher.
Note: If your Splunk Cloud Platform deployment does not meet this requirement, see Create a legacy Amazon S3 destination for Ingest Processor.
- Your user account on the Splunk Cloud Platform deployment must have the
edit_datasetsandadmin_all_objectscapabilities. For more information, see the following pages:- Manage users for the Ingest Processor solution
- Define roles on the Splunk platform with capabilities in the Splunk Cloud Platform Manage Users and Security manual
-
You have an Amazon Web Services (AWS) account and an AWS IAM role with permissions that let you attach and modify custom trust policies and permissions policies for IAM roles. Contact your AWS administrator for assistance with AWS permissions.
-
The Amazon S3 bucket that you want to send data to does not have Object Lock turned on.
Note: Object Lock cannot be turned off after it is turned on, so you might need to create a new bucket. For more information, see https://docs.aws.amazon.com/AmazonS3/latest/userguide/object-lock-configure.html in the Amazon Simple Storage Service (S3) User Guide. -
You must have an Amazon S3 connection that authenticates to the Amazon S3 bucket that you want the dataset to represent. For more information, see Create an Amazon S3 connection for Ingest Processor pipelines.
To send data from Ingest Processor to your Amazon S3 bucket, create a pipeline that uses the Amazon S3 dataset as a destination. Then, apply the pipeline to Ingest Processor. For more information, see the following pages:
For information about running federated searches on Amazon S3 datasets, see Run federated searches over Amazon S3 datasets in the Splunk Cloud Platform Federated Search manual.