Tag

Cluster Configuration

0 views collected around this technical thread.

HelloTech
HelloTech
Oct 31, 2023 · Big Data

Investigation of Data Loss in a Flink Kafka Consumer Caused by Mixed Kafka Cluster Configuration

The data loss in a Flink‑Kafka job was caused by a mis‑configured bootstrap.servers list that mixed production and pre‑release Kafka clusters, leading different subtasks to connect to different clusters, resulting in inconsistent partition discovery and offset fetching, which omitted several partitions until the list was corrected.

Cluster ConfigurationData LossElasticsearch
0 likes · 8 min read
Investigation of Data Loss in a Flink Kafka Consumer Caused by Mixed Kafka Cluster Configuration
Top Architect
Top Architect
Sep 10, 2020 · Operations

Elasticsearch Performance Tuning Guide: Configuration, System, and Usage Optimizations

This article provides a comprehensive guide to improving Elasticsearch performance and stability by covering configuration file tweaks, system‑level settings, and usage‑level optimizations such as hot‑thread analysis, pending tasks, field storage, translog handling, refresh intervals, shard management, and best practices for routing and alias usage.

Cluster ConfigurationElasticsearchPerformance Tuning
0 likes · 20 min read
Elasticsearch Performance Tuning Guide: Configuration, System, and Usage Optimizations
Architecture Digest
Architecture Digest
Sep 3, 2020 · Databases

Practical Elasticsearch Performance and Stability Tuning Guide

This article consolidates practical Elasticsearch tuning techniques—including configuration file adjustments, system‑level optimizations, and usage‑level settings—to improve cluster performance, stability, and resource efficiency for production environments.

Big DataCluster ConfigurationElasticsearch
0 likes · 15 min read
Practical Elasticsearch Performance and Stability Tuning Guide
Big Data Technology Architecture
Big Data Technology Architecture
May 15, 2020 · Big Data

Performance Tuning of Hive on Spark in YARN Mode

This article explains how to optimize Hive on Spark running on YARN, covering YARN node resource configuration, Spark executor and driver memory settings, dynamic allocation, parallelism, and key Hive parameters to achieve superior performance compared to Hive on MapReduce.

Big DataCluster ConfigurationHive
0 likes · 11 min read
Performance Tuning of Hive on Spark in YARN Mode
Big Data Technology Architecture
Big Data Technology Architecture
Aug 29, 2019 · Databases

Elasticsearch Indexing and Search Performance Tuning Guide

This guide explains how to improve Elasticsearch indexing speed, search speed, and overall cluster performance by adjusting bulk request sizes, refresh intervals, replica settings, hardware resources, mapping choices, caching strategies, and query preferences, while also offering general best‑practice recommendations.

Cluster ConfigurationElasticsearchHardware Optimization
0 likes · 18 min read
Elasticsearch Indexing and Search Performance Tuning Guide