Tag

Kafka

1 views collected around this technical thread.

Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Jun 10, 2025 · Backend Development

Why Is Your Kafka Consumer Slow? Proven Strategies to Boost Throughput

This article explains why Kafka consumers often become bottlenecks—due to complex processing, resource constraints, or sub‑optimal configuration—and provides concrete steps such as profiling Java code, simplifying logic, using background threads, scaling consumer instances, and tuning key consumer parameters.

ConsumerDocker ComposeJava
0 likes · 4 min read
Why Is Your Kafka Consumer Slow? Proven Strategies to Boost Throughput
Architect
Architect
Jun 6, 2025 · Backend Development

How to Prevent Duplicate Messages in Kafka and Pulsar: A Practical Guide

This article explains the three message delivery semantics, the common causes of duplicate messages in queue systems, and presents concrete producer‑side, broker‑side, and consumer‑side deduplication techniques for Kafka and Pulsar, including code samples and best‑practice recommendations.

KafkaMessage QueuePulsar
0 likes · 9 min read
How to Prevent Duplicate Messages in Kafka and Pulsar: A Practical Guide
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Jun 3, 2025 · Big Data

Kafka High-Concurrency Core Design Explained

This article explains how Kafka achieves high concurrency through its distributed broker cluster, partitioned topics, sequential log writes, message compression, asynchronous producer mechanisms, and OS page‑cache techniques, illustrating the combined architectural and performance optimizations that enable massive throughput.

High ConcurrencyKafkaPage Cache
0 likes · 4 min read
Kafka High-Concurrency Core Design Explained
Full-Stack Internet Architecture
Full-Stack Internet Architecture
May 27, 2025 · Big Data

Understanding Event Streaming in Kafka: Core Concepts, Architecture, and Use Cases

This article explains Kafka's event streaming concept, detailing events and streams, core components such as producers, topics, partitions, consumers, persistence, and typical real‑time data pipeline, event‑driven architecture, stream processing, and log aggregation use cases, highlighting its role as a foundational big‑data infrastructure.

Big DataEvent StreamingKafka
0 likes · 7 min read
Understanding Event Streaming in Kafka: Core Concepts, Architecture, and Use Cases
Full-Stack Internet Architecture
Full-Stack Internet Architecture
May 23, 2025 · Big Data

Step-by-Step Guide to Installing and Using Apache Kafka 3.8.1 on Linux

This tutorial walks through downloading, extracting, configuring, starting, creating topics, producing and consuming messages, and finally stopping Apache Kafka 3.8.1 on a Linux system, including all necessary command‑line instructions.

Big DataInstallationKafka
0 likes · 4 min read
Step-by-Step Guide to Installing and Using Apache Kafka 3.8.1 on Linux
Java Captain
Java Captain
May 23, 2025 · Backend Development

Common Causes of Kafka Message Loss and Mitigation Strategies

This article examines the typical reasons Kafka messages are lost across producers, brokers, and consumers, and provides detailed configuration recommendations and best‑practice solutions to significantly reduce the risk of data loss in distributed streaming systems.

BrokerConfigurationConsumer
0 likes · 15 min read
Common Causes of Kafka Message Loss and Mitigation Strategies
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
May 22, 2025 · Big Data

Optimizing Kafka Producer for High Throughput: Batching, Asynchronous Sending, Compression, and Concurrency

This article details practical techniques for boosting Kafka producer throughput, covering batch sending optimization, linger.ms tuning, asynchronous send handling, compression options, and concurrent sending strategies, while providing code examples and performance considerations for high‑throughput streaming applications.

BatchingCompressionHigh Throughput
0 likes · 4 min read
Optimizing Kafka Producer for High Throughput: Batching, Asynchronous Sending, Compression, and Concurrency
Full-Stack Internet Architecture
Full-Stack Internet Architecture
May 20, 2025 · Big Data

Why Learn Kafka? Core Benefits, Use Cases, and a Summary

This article explains why Kafka is widely adopted by top companies, outlines its high throughput, scalability, and durability, and describes key real‑time data pipeline, stream processing, and big‑data integration scenarios, concluding that mastering Kafka is essential for modern backend and data engineering roles.

Big DataData EngineeringKafka
0 likes · 4 min read
Why Learn Kafka? Core Benefits, Use Cases, and a Summary
Selected Java Interview Questions
Selected Java Interview Questions
May 15, 2025 · Backend Development

Six Common Approaches to Synchronize MySQL Data to Elasticsearch

This article reviews six mainstream solutions for keeping MySQL and Elasticsearch in sync—including synchronous double‑write, asynchronous MQ‑based double‑write, Logstash polling, Canal binlog listening, DataX batch migration, and Flink stream processing—detailing their scenarios, advantages, drawbacks, and practical code examples to guide optimal technical selection.

CanalData SynchronizationElasticsearch
0 likes · 8 min read
Six Common Approaches to Synchronize MySQL Data to Elasticsearch
Java Architecture Diary
Java Architecture Diary
May 12, 2025 · Big Data

Why Kafka 4.0’s KRaft Mode Is a Game‑Changer for Distributed Messaging

Kafka 4.0 replaces ZooKeeper with the native KRaft (Kafka Raft) consensus layer, simplifying deployment, boosting scalability, cutting latency, and introducing a new incremental consumer rebalance protocol and share‑group model that together reshape how distributed streaming applications are built and operated.

Consumer RebalanceJavaKRaft
0 likes · 9 min read
Why Kafka 4.0’s KRaft Mode Is a Game‑Changer for Distributed Messaging
Qunar Tech Salon
Qunar Tech Salon
May 9, 2025 · Operations

Kafka Production Optimization: Reducing Load and Improving Compression via Filebeat Tuning

This technical case study details how a high‑traffic Kafka logging cluster was optimized by adjusting Filebeat and Kafka parameters, increasing compression batch size, and tuning Kubernetes settings, resulting in significant reductions in request volume, network traffic, CPU usage, and overall resource consumption.

CompressionFilebeatKafka
0 likes · 10 min read
Kafka Production Optimization: Reducing Load and Improving Compression via Filebeat Tuning
Code Ape Tech Column
Code Ape Tech Column
May 9, 2025 · Databases

Efficient Strategies for Importing One Billion Records into MySQL

This article explains how to import 1 billion 1 KB log records stored in HDFS or S3 into MySQL by analyzing single‑table limits, using batch inserts, choosing storage engines, sharding, optimizing file‑reading methods, and coordinating distributed tasks with Redis, Redisson, and Zookeeper to ensure ordered, reliable, and high‑throughput data loading.

KafkaMySQLPerformance Tuning
0 likes · 19 min read
Efficient Strategies for Importing One Billion Records into MySQL
Java Tech Enthusiast
Java Tech Enthusiast
May 4, 2025 · Fundamentals

Technical Interview Q&A: TCP, Redis, Kafka, CAP Theorem, Singleton, C++ STL, and Algorithms

This guide reviews common backend interview topics, explaining TCP TIME_WAIT behavior, multi‑port listening, full page load steps, Redis data types, Kafka consumer sizing and at‑most‑once semantics, the CAP theorem, Singleton usage, C++ std::map complexity, and an O(n) doubly‑linked list reversal algorithm.

AlgorithmsC++CAP theorem
0 likes · 12 min read
Technical Interview Q&A: TCP, Redis, Kafka, CAP Theorem, Singleton, C++ STL, and Algorithms
Java Captain
Java Captain
Apr 14, 2025 · Backend Development

Project Initialization, Version Management, and Scaffold Setup for Spring Boot Backend Development

This guide walks through initializing a Spring Boot project, handling IDE setup challenges, managing version compatibility between Spring Cloud, Spring Boot, and Kafka, and provides practical code examples for global exception handling, logging, CORS configuration, Swagger integration, and useful development tools.

Exception HandlingKafkaMaven
0 likes · 12 min read
Project Initialization, Version Management, and Scaffold Setup for Spring Boot Backend Development
Efficient Ops
Efficient Ops
Apr 13, 2025 · Backend Development

Master Kafka’s Core Concepts: A Visual Guide to Topics, Partitions, and Architecture

This article visually breaks down Kafka’s fundamental concepts—including producers, consumers, topics, consumer groups, partitions, and cluster architecture—explaining how messages flow, are stored, and achieve load balancing and ordering within a distributed streaming system.

KafkaMessage StreamingTopics
0 likes · 7 min read
Master Kafka’s Core Concepts: A Visual Guide to Topics, Partitions, and Architecture
Java Tech Enthusiast
Java Tech Enthusiast
Apr 11, 2025 · Backend Development

Ensuring Message Processing Once in High-Concurrency Scenarios

The article explains how to guarantee that messages are processed only once in high‑concurrency environments by combining production‑side idempotent publishing, broker‑level deduplication with unique IDs, and consumption‑side business idempotency such as database constraints or distributed locks, while also recommending monitoring, metrics, and reconciliation as safety nets.

High ConcurrencyIdempotencyKafka
0 likes · 6 min read
Ensuring Message Processing Once in High-Concurrency Scenarios
macrozheng
macrozheng
Apr 11, 2025 · Backend Development

How to Guarantee Exactly‑Once Message Consumption in High‑Concurrency Systems

This article explains common causes of duplicate message consumption in high‑traffic systems and presents a three‑layer defense—producer idempotence, broker de‑duplication, and consumer idempotent design—plus monitoring and reconciliation strategies to achieve reliable exactly‑once processing.

KafkaMessage QueueMonitoring
0 likes · 7 min read
How to Guarantee Exactly‑Once Message Consumption in High‑Concurrency Systems
Sanyou's Java Diary
Sanyou's Java Diary
Apr 10, 2025 · Backend Development

Why RocketMQ Beats Kafka: Architecture Simplified and Features Amplified

This article explains how RocketMQ, a Chinese‑origin message queue, simplifies Kafka’s architecture while adding powerful features such as tag‑based filtering, transactional messaging, delayed and dead‑letter queues, and a unified commit‑log storage model, making delayed processing and high‑throughput scenarios easier to implement.

KafkaMessage QueueRocketMQ
0 likes · 10 min read
Why RocketMQ Beats Kafka: Architecture Simplified and Features Amplified
Architecture Digest
Architecture Digest
Apr 6, 2025 · Backend Development

Design and Implementation of a General‑Purpose Asynchronous Processing SDK for Backend Systems

This article introduces a reusable asynchronous processing SDK built on Spring, Kafka, and MySQL that leverages @AsyncExec annotations, transactional event listeners, and configurable thread pools to ensure reliable, non‑blocking execution, data consistency, and fault‑tolerant handling of business logic in backend applications.

AsyncJavaKafka
0 likes · 8 min read
Design and Implementation of a General‑Purpose Asynchronous Processing SDK for Backend Systems