Databases 7 min read

Elasticsearch ILM Migration Performance Optimization Strategies

To prevent IO saturation during Elasticsearch ILM hot‑to‑cold migrations, the article recommends batching migrations, fine‑tuning max_bytes_per_sec, upgrading cold‑node storage and adding hot nodes, and automating monitoring with scripts that pause or resume ILM based on real‑time utilization thresholds.

Sohu Tech Products
Sohu Tech Products
Sohu Tech Products
Elasticsearch ILM Migration Performance Optimization Strategies

This article discusses performance issues encountered during Elasticsearch Index Lifecycle Management (ILM) when migrating hot data to cold nodes, which causes IO saturation affecting read/write operations. The current solution involves manually adjusting ILM policies to pause/resume Cold phase during peak/off-peak hours, but concerns exist about potential pitfalls.

The article analyzes the problem and existing solutions, identifying three main issues: 1) IO saturation during hot-to-cold data migration, 2) Manual ILM policy adjustments increasing management complexity, and 3) Ineffective max_bytes_per_sec configuration (set to 50M) leading to the temporary migration pause solution.

Four solution approaches are proposed:

1. Batch Data Migration: Implement staged migration by adjusting ILM policies with multiple phases, similar to bulk write optimization. A detailed ILM policy configuration example is provided, showing how to set rollover thresholds and phase transitions.

2. Optimize max_bytes_per_sec Settings: Fine-tune the migration speed limit with more precise values and dynamic adjustment based on system IO utilization.

3. Hardware and Resource Optimization: Upgrade cold node storage from SATA to SSD for better IO performance, add more hot nodes to distribute write load, and use Shard Allocation Awareness for better resource isolation.

4. Monitoring and Automation: Implement automated monitoring with Python scripts that pause/resume ILM based on real-time IO utilization thresholds. A Bash script example is also provided for email alerts when IO utilization exceeds 90%.

The article concludes that implementing these measures should better manage hot-to-cold data migration and reduce impact on read/write operations.

data migrationperformance optimizationbackend developmentElasticsearchILMIO managementmonitoring automation
Sohu Tech Products
Written by

Sohu Tech Products

A knowledge-sharing platform for Sohu's technology products. As a leading Chinese internet brand with media, video, search, and gaming services and over 700 million users, Sohu continuously drives tech innovation and practice. We’ll share practical insights and tech news here.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.