Set up an Edge Processor in a Docker container
-
Have access to a functioning tenant in the Splunk Cloud Platform environment.
-
Install Docker on the host machines where you will deploy the containers. As a best practice, update your Docker image to the latest stable image. See Update to the latest Docker image.
Architecture overview
Setup and install
Perform the following steps in order to set up and install an Edge Processor on a Docker container.
Create a shared service principal
docker run --rm \
-e REGION=<region> \
-e TENANT=<tenant-id> \
-e TOKEN=<token> \
splunk/edge-processor:<image-tag> \
eptools setup > principal.yaml
TOKEN in the install script. To locate it, navigate to the Edge Processors page, select or create a processor. Select from the drop-down menu in the top right corner. Then set the Instance type to Docker. Copy the token in the resulting script.
Running a containerized instance
After provisioning a shared service principal, perform the following tasks to set up a containerized Edge Processor instance.
-
Compile a list of ingress ports your instance uses to receive ingested data. To do so, navigate to the Edge Processors page in your tenant's web UI, and select Shared Settings in the top right corner.
-
Use Docker's
-pflag to define your ports for all supported ingestion types. Map each port to the corresponding port on the host machine. -
Next, find the
GROUP_IDvalue, which associates new instances with a specific Edge Processor. Select a processor listed on the Edge Processors page of your tenant's web UI, then copy its ID field, such as431e1ead-fd5b-4af8-ac89-ccae2ae81eda. This value also appears at the end of the page's URL. -
Run the following command to mount the principal.yaml file to the /opt/splunk-edge/etc/principal.yaml path in the container, publishing the ports gathered previously, and running the image:CODE
docker run -d \ -e TENANT=<tenant-id> \ -e GROUP_ID=<edge-processor-id> \ -e TOKEN=<access-token> \ -e MACHINE_HOSTNAME=$(hostname) \ -v $(pwd)/principal.yaml:/opt/splunk-edge/etc/principal.yaml \ -p <port-1>:<port-1> ··· -p <port-n>:<port-n> \ splunk/edge-processor:<image-tag>
Running multiple containerized instances
Because the generated service principal is designed to be shared by multiple Edge Processor instances, you can start more containers by rerunning the previous docker run command:
docker run -d \
-e TENANT=<tenant-id> \
-e GROUP_ID=<edge-processor-id> \
-e TOKEN=<access-token> \
-e MACHINE_HOSTNAME=$(hostname) \
-v $(pwd)/principal.yaml:/opt/splunk-edge/etc/principal.yaml \
-p <port-1>:<port-1> ··· -p <port-n>:<port-n> \
splunk/edge-processor:<image-tag>
<start new section here - see drafts later on the page>
Modifying GROUP_ID between runs registers instances with different Edge Processors in your tenant under the same service principal.
As a best practice, restrict service principals to 1 service principal per processor to avoid unnecessary confusion and potential rate limiting. You can use a single service principal across different host machines if the same principal.yaml file is mounted in each container, but this is not a best practice.
Avoiding port conflicts on a single host
Using multiple instances on a single host affects port availability. When running multiple containers on the same host machine, port conflicts become a bottleneck since each published container port must bind to a unique port on the host.
To prevent conflicts, publish the container port without pinning it to a specific host port by using -p <container_port> instead of -p <host_port>:<container_port>. When you omit the host port, Docker selects an available, ephemeral port at run time and publishes the container port accordingly.
This approach can hinder discoverability. Because host ports are assigned dynamically, clients don't know where to route traffic ahead of time. As a result, you must query the running containers to compile a per-host inventory of published ports, then distribute traffic among them using a load balancer or client-side routing.
docker inspect $(docker ps -q) -f '
{{- $id := slice .Id 0 12 -}}
{{- range $cport, $bindings := .NetworkSettings.Ports -}}
{{- if $bindings -}}
{{- range $b := $bindings -}}
{{ printf "%s: %s -> %s:%s\n" $id $cport $b.HostIp $b.HostPort }}
{{- end -}}
{{- end -}}
{{- end -}}
'
Using statically assigned host ports
docker run -p 9001:8088 {···}
docker run -p 9002:8088 {···}
docker run -p 9003:8088 {···}
Using a single service principal per processor
Modifying GROUP_ID between runs registers instances with different Edge Processors in your tenant under the same service principal.
As a best practice, restrict service principals to 1 service principal per processor to avoid unnecessary confusion and potential rate limiting. You can use a single service principal across different host machines if the same principal.yaml file is mounted in each container, but this is not a best practice.
Uninstall a containerized Edge Processor instance
Uninstall and cleanup a containerized Edge Processor instance.
docker stop <container-id> && docker rm <container-id>
docker run --rm \
-e TENANT=<tenant-id> \
-e TOKEN=<access-token> \
-v $(pwd)/principal.yaml:/opt/splunk-edge/etc/principal.yaml \
splunk/edge-processor:<image-tag> \
eptools cleanup
TOKEN is not the same as before. Instead, it is located in the Settings page of your tenant's Cloud Console UI. More specifically, navigate to https://console.scs.splunk.com/{tenant-id}/settings and copy the provided Access Token
Update to the latest Docker image
Additional considerations
-
Port changes made using the Edge Processor Shared settings page are not automatically propagated to Docker. When modifying these values, you must manually stop all running containers still bound to the previous ports, recreate them with the new port mappings, then update all affected ingestion flows to target these new ports.
-
Containerizing Edge Processor does not provide any additional control over the deployed binary. The container image version is independent of the Edge Processor binary version, which continues to follow the standard, non-container lifecycle managed by the Edge Processor team.
-
For deployments managed by
systemd, use Docker's--restart=alwayspolicy when starting an Edge Processor container to ensure automatic recovery after failures or host restarts.