Configure the Migration Tool
config.yaml that stores all necessary settings required for a smooth migration process. You must update this file with your specific environment details before running the migration tool.
Fine Tune the Properties of Migration Tool
The migration tool applies default values for each component property from the config.yaml file. However, you can adjust these values based on the size of your data. This page explains the available parameters and how you can configure them to optimize the performance of the migration tool.
Saturation Detection
The migration tool uses the values in this section to determine if data migration from Classic On-Premises to Virtual Appliance is saturated. The saturation detection section appears in the following sections of each component:
-
services.eum_filestore -
services.synth_filestore -
services.controller -
services.eum_db -
services.events
-
convergence_threshold_seconds: This parameter defines the allowable time difference between two consecutive migration (backup+restore) cycle for corresponding services.Increasing this value may cause the respective services to reach saturation more quickly, which can lead to potential data loss.
-
min_cycles_for_saturation: The number of consecutive comparisons required to confirm that the migration state has stabilized.For example, if set to 3, the tool identifies saturation only after three consecutive differences fall within the
convergence_threshold_seconds.Increasing this value improves confidence that the data migration is saturated before the cutover.
Events Service Elasticsearch Parameters
services.events section, you can update the following values to ensure the worker threads do not starve for system resources and improve system performance:
-
backup_worker_sleep_seconds: The duration the backup worker thread waits after completing one backup before starting the next. -
restore_worker_sleep_seconds: The duration the restore worker thread waits after completing one restore before starting the next. -
monitor_worker_sleep_seconds: The interval (in seconds) between monitor worker cycles to update migration progress and evaluate convergence. -
generic_ssh_timeout_seconds: General-purpose timeout for individual Elasticsearch API calls, SSH commands, and NFS state operations. -
snapshot_poll_interval_seconds: Polling interval for shard recovery progress checks and wait loops when fixing unassigned shards. -
snapshot_create_max_wait_seconds: Specifies the maximum time limit to wait for snapshot creation. -
cluster_health_timeout_seconds: Specifies the timeout for checking cluster health during a restore. -
shard_recover_timeout_seconds: Maximum time shard recovery can remain stalled before a reroute is attempted. -
poll_restore_status_timeout_seconds: The maximum amount of time permitted to monitor the recovery of all shards for a single snapshot. If shard recovery does not complete within this time frame, the restore operation will be marked as timed out, even if shards are still making progress. -
es_index_close_timeout_seconds: Timeout for closing existing indices on the destination before restoring a snapshot over them.
-
max_bytes_per_sec: Network bandwidth on each node. You can increase this value to achieve faster restore of backed up data -
max_concurrent_snapshot_file_downloads: Concurrent file downloads for each shrad recovery. -
max_concurrent_snapshot_file_downloads_per_node: Total concurrent downloads on each node across all recoveries. -
node_concurrent_recoveries: Concurrent shard recoveries on each node.