Tagged articles

2124 articles

Page 4 of 22

Sep 11, 2024 · Backend Development

How a Two‑Level Cache Boosted High‑Concurrency Container Performance

By redesigning the caching layer with a two‑level architecture combining local and distributed caches, the author dramatically reduced CPU usage, lowered response times, and increased system capacity under high QPS workloads, while evaluating trade‑offs of various cache strategies, pre‑warming, refresh mechanisms, and operational considerations.

CachingDistributed SystemsPerformance Optimization

0 likes · 11 min read

How a Two‑Level Cache Boosted High‑Concurrency Container Performance

MaGe Linux Operations

Sep 10, 2024 · Backend Development

How Kafka Elects Leaders and Distributes Partitions: A Deep Dive

This article explains Kafka's leader election process, partition assignment strategy, distribution policies, file layout, and the evolution of consumer offset storage, providing a comprehensive overview of how Kafka ensures reliable and efficient message handling in a distributed environment.

Distributed SystemsKafkaPartition Assignment

0 likes · 5 min read

How Kafka Elects Leaders and Distributes Partitions: A Deep Dive

Architect

Sep 9, 2024 · Backend Development

Understanding Redisson DelayedQueue: Internal Mechanisms and Practical Usage

This article explains how Redisson's DelayedQueue works internally, covering its data structures, the basic workflow for sending and receiving delayed messages, the initialization process, and the scheduling logic that moves expired items from the delay queue to the target queue.

DelayedQueueDistributed SystemsJava

0 likes · 16 min read

Understanding Redisson DelayedQueue: Internal Mechanisms and Practical Usage

Baidu Intelligent Cloud Tech Hub

Sep 5, 2024 · Databases

How Vector Databases Power AI and RAG: Insights from Baidu’s DTCC 2024

This article reviews the 70‑year evolution of databases, explains how vector databases and Retrieval‑Augmented Generation (RAG) are reshaping AI applications, and details Baidu Intelligent Cloud's VectorDB architecture, performance advantages, real‑world use cases, and future trends in data engineering.

AIData EngineeringDatabase Architecture

0 likes · 16 min read

How Vector Databases Power AI and RAG: Insights from Baidu’s DTCC 2024

Tencent Cloud Developer

Sep 3, 2024 · Backend Development

Rate Limiting: Purpose, Algorithms, Implementation Methods, Strategies, and Considerations

Rate limiting safeguards system stability by capping request rates, employing algorithms such as fixed‑window, sliding‑window, leaky‑bucket, and token‑bucket, and can be applied at application, proxy, or hardware layers while using strategies like threshold setting, request classification, feedback, and ensuring fairness, flexibility, and transparency.

Distributed SystemsGolangRate Limiting

0 likes · 28 min read

Rate Limiting: Purpose, Algorithms, Implementation Methods, Strategies, and Considerations

JD Retail Technology

Aug 30, 2024 · Artificial Intelligence

GPU Optimization Practices for Training and Inference in JD Advertising Recommendation Systems

The article details JD Advertising's technical challenges and solutions for large‑scale sparse recommendation models, describing GPU‑focused storage, compute and I/O optimizations for both training and low‑latency inference, including distributed pipelines, heterogeneous deployment, batch aggregation, multi‑stream execution, and compiler extensions.

Distributed SystemsGPU optimizationRecommendation Systems

0 likes · 13 min read

GPU Optimization Practices for Training and Inference in JD Advertising Recommendation Systems

Xiaolei Talks DB

Aug 28, 2024 · Databases

What 15 Years of China’s DTCC Conferences Reveal About Database Evolution

The author reflects on a decade‑plus journey through China’s DTCC database conferences, describing personal growth from novice to speaker and organizer, sharing insights on Redis Cluster, distributed database selection, openGauss, time‑series databases, and the evolving themes that chart the industry's progress.

Distributed SystemsRedisTime Series Database

0 likes · 6 min read

What 15 Years of China’s DTCC Conferences Reveal About Database Evolution

Cognitive Technology Team

Aug 25, 2024 · Operations

Fault Isolation Techniques for High Availability in Distributed Systems

The article explains fault isolation as a key technique for improving distributed system availability, detailing multiple isolation levels—from data‑center to user‑level—and complementary strategies such as circuit breakers, timeouts, fast‑fail, load balancing, caching, and degradation switches.

Circuit BreakerDistributed SystemsResource Isolation

0 likes · 10 min read

Fault Isolation Techniques for High Availability in Distributed Systems

Top Architect

Aug 25, 2024 · Backend Development

Integrating Spring Boot with XXL‑Job for Distributed Task Scheduling

This article provides a comprehensive guide on integrating Spring Boot with the open‑source distributed scheduler XXL‑Job, covering installation, configuration of the admin console and executor, code examples, parameter details, comparison with other schedulers, and best practices for reliable task management.

Distributed SystemsJavaSpring Boot

0 likes · 17 min read

Integrating Spring Boot with XXL‑Job for Distributed Task Scheduling

Architect

Aug 23, 2024 · Backend Development

Understanding RocketMQ: Basic Concepts, Ordered Messages, and Transactional Messages

This article explains RocketMQ's core components, including domain models, message transmission models, reliability mechanisms, the challenges of ordered messaging, practical usage scenarios, and the design and implementation of transactional messages in both open‑source and proprietary versions.

Distributed SystemsMessage QueueOrdered Messages

0 likes · 21 min read

Understanding RocketMQ: Basic Concepts, Ordered Messages, and Transactional Messages

System Architect Go

Aug 22, 2024 · Fundamentals

Replication and Partitioning Mechanisms in Redis, Kafka, and Elasticsearch

This article examines the replication and partitioning designs of Redis, Kafka, and Elasticsearch, drawing on concepts from Designing Data‑Intensive Applications to illustrate core distributed‑system principles, common challenges, and practical configuration options.

DDIADistributed SystemsElasticsearch

0 likes · 7 min read

Replication and Partitioning Mechanisms in Redis, Kafka, and Elasticsearch

Alibaba Cloud Developer

Aug 16, 2024 · Backend Development

How to Diagnose and Fix Cache Consistency Issues in High‑Concurrency Systems

This article walks through a real‑world cache consistency bug in a high‑traffic push service, explains cache penetration, breakdown, and avalanche, compares strong and eventual consistency models, and presents practical cache update and invalidation strategies to prevent data mismatches.

CacheCache StrategiesConsistency

0 likes · 14 min read

How to Diagnose and Fix Cache Consistency Issues in High‑Concurrency Systems

DaTaobao Tech

Aug 14, 2024 · Backend Development

Deep Dive into RocketMQ Architecture and Core Components

The article thoroughly explains RocketMQ’s architecture—including stateless NameServers for routing, master‑slave Brokers with sequential CommitLog storage, stateless Producers and pull‑or‑push Consumers, message flow, indexing, and synchronous double‑write reliability—showing how these components deliver high‑throughput, scalable, fault‑tolerant messaging for distributed systems.

BrokerConsumerDistributed Systems

0 likes · 20 min read

Deep Dive into RocketMQ Architecture and Core Components

Huawei Cloud Developer Alliance

Aug 14, 2024 · Cloud Computing

How Huawei’s YuanRong Is Redefining General-Purpose Serverless Computing

Huawei Cloud unveiled YuanRong, a general‑purpose serverless platform presented at ACM SIGCOMM 2024, which tackles core challenges of function coordination, cold‑start latency, external state handling, and backend integration, and demonstrates its impact through automotive IoV and ERP case studies while outlining future AI‑driven innovations.

Distributed SystemsFunction Cold StartHuawei Cloud

0 likes · 12 min read

How Huawei’s YuanRong Is Redefining General-Purpose Serverless Computing

JD Cloud Developers

Aug 12, 2024 · Backend Development

Mastering High‑Concurrency: From Thread Pitfalls to Distributed Inventory Architecture

This article explores the three‑high requirements of modern internet systems, compares multithreading with asynchronous methods, and presents a practical, partition‑based inventory‑deduction architecture that leverages Redis caching, MySQL sharding, and asynchronous updates to achieve scalable high‑concurrency performance.

Distributed SystemsMySQLRedis

0 likes · 19 min read

Mastering High‑Concurrency: From Thread Pitfalls to Distributed Inventory Architecture

Su San Talks Tech

Aug 11, 2024 · Backend Development

Mastering Idempotency: Design Patterns and Code Examples for Reliable APIs

This article explains the concept of idempotency, outlines scenarios where it is essential, analyzes common causes of idempotency issues, and presents multiple practical solutions—including unique constraints, optimistic and pessimistic locks, distributed locks, token mechanisms, state machines, deduplication tables, and global request IDs—accompanied by concrete code examples.

Distributed SystemsIdempotencyOptimistic Lock

0 likes · 14 min read

Mastering Idempotency: Design Patterns and Code Examples for Reliable APIs

DaTaobao Tech

Aug 9, 2024 · Cloud Native

Deep Dive into RocketMQ: Storage Architecture and Consumer Processing Flow

RocketMQ stores messages in a sequential CommitLog while using a ConsumerQueue index for fast random access, and employs two looping threads—RebalanceService for load‑balancing queue assignments and PullMessageService for continuously pulling messages and dispatching them to user callbacks via a thread‑pooled consumer pipeline.

ConsumerDistributed SystemsJava

0 likes · 12 min read

Deep Dive into RocketMQ: Storage Architecture and Consumer Processing Flow

IT Services Circle

Aug 9, 2024 · Backend Development

Interview on xxl-job Task Scheduling Framework and Handling Overlapping Tasks

The interview discusses various routing and blocking strategies of the xxl-job distributed task scheduling framework, explains how it addresses task overlap, idempotency issues, and provides practical solutions such as single‑machine execution, locking mechanisms, and using a business date to avoid date‑related problems.

Distributed SystemsTask SchedulingXXL-JOB

0 likes · 10 min read

Interview on xxl-job Task Scheduling Framework and Handling Overlapping Tasks

Lobster Programming

Aug 9, 2024 · Backend Development

Why Zookeeper Is Losing Favor: Consistency vs. Availability Trade‑offs

Zookeeper, the open‑source coordination service modeled after Google’s Chubby, offers strong consistency for distributed applications, but its CP design, synchronous master‑slave replication, log‑heavy state changes, and limited health checks lead many large tech firms to favor more available alternatives, though it still fits certain use cases.

ConsistencyDistributed Systemscp

0 likes · 5 min read

Why Zookeeper Is Losing Favor: Consistency vs. Availability Trade‑offs

Tencent Cloud Developer

Aug 6, 2024 · Backend Development

Nine Common Techniques for Service Performance Optimization

The article outlines nine broadly applicable techniques—caching, parallel and batch processing, data compression, lock‑free design, sharding, eliminating unnecessary requests, and resource pooling—that together can dramatically cut service latency and improve throughput, as demonstrated by an 80% latency reduction in a real‑world project.

CachingConcurrencyDistributed Systems

0 likes · 28 min read

Nine Common Techniques for Service Performance Optimization

Rare Earth Juejin Tech Community

Aug 6, 2024 · Operations

ZooKeeper Core Concepts: Data Model, Node Types, Sessions, Cluster, Election, ZAB, Watch, ACL, and Distributed Lock Patterns

This article explains ZooKeeper's hierarchical data model, node types, session mechanism, cluster roles and election process, ZAB protocol, watch mechanism, ACL permissions, and common distributed lock implementations, providing a comprehensive overview of its core concepts and practical usage.

Cluster ManagementCoordination ServiceDistributed Systems

0 likes · 17 min read

ZooKeeper Core Concepts: Data Model, Node Types, Sessions, Cluster, Election, ZAB, Watch, ACL, and Distributed Lock Patterns

Su San Talks Tech

Aug 5, 2024 · Fundamentals

Mastering Distributed Systems: Common Pitfalls and How to Avoid Them

This article explains the core concepts of distributed systems—including the CAP theorem, BASE theory, message‑queue challenges, Redis sentinel issues, sharding strategies, unique ID generation, and distributed transaction patterns—while offering practical guidance to prevent common pitfalls and improve reliability.

CAP theoremDistributed SystemsID generation

0 likes · 27 min read

Mastering Distributed Systems: Common Pitfalls and How to Avoid Them

JD Cloud Developers

Aug 2, 2024 · Operations

How JD’s Advertising Platform Optimizes Load Balancing for Heterogeneous Clusters

Exploring the evolution of JD’s advertising online model system, this article examines the challenges of heterogeneous hardware load balancing, outlines static and dynamic strategies—including DNS, Nginx, LVS, Ribbon, and Dubbo—and presents a multi‑objective framework that improves service availability and resource utilization, achieving up to 20%+ efficiency gains.

Distributed Systemsheterogeneous hardwareload balancing

0 likes · 17 min read

How JD’s Advertising Platform Optimizes Load Balancing for Heterogeneous Clusters

JD Tech

Aug 1, 2024 · Backend Development

Overview of RPC and How to Build a Custom RPC Framework Using Netty

This article explains the fundamentals and key characteristics of Remote Procedure Call (RPC), outlines common use cases and popular frameworks, and then walks through the design and implementation of a simple custom RPC system—including a bespoke binary protocol, client‑side proxy generation, serialization, Netty‑based encoding/decoding, server‑side request handling, and result delivery—using Java and Netty.

Backend DevelopmentDistributed SystemsJava

0 likes · 21 min read

Overview of RPC and How to Build a Custom RPC Framework Using Netty

Architects' Tech Alliance

Jul 30, 2024 · Artificial Intelligence

Unlocking 10K‑GPU LLM Training: Inside MegaScale’s 55% MFU Breakthrough

This article translates and analyzes the MegaScale system—co‑developed by ByteDance and Peking University—that enables efficient, stable training of massive language models on clusters of more than 10,000 GPUs, achieving 55.2% MFU and a 1.34× speedup over Megatron‑LM.

Distributed SystemsGPU scalingLLM training

0 likes · 15 min read

Unlocking 10K‑GPU LLM Training: Inside MegaScale’s 55% MFU Breakthrough

Mike Chen's Internet Architecture

Jul 27, 2024 · Fundamentals

Understanding Distributed BASE: Basically Available, Soft State, and Eventual Consistency

This article explains the core concepts of the BASE theory—Basically Available, Soft State, and Eventual Consistency—illustrating how distributed systems trade off consistency for higher availability and performance, and provides practical examples and typical application scenarios.

AvailabilityBASEDistributed Systems

0 likes · 15 min read

Understanding Distributed BASE: Basically Available, Soft State, and Eventual Consistency

Architect

Jul 26, 2024 · Backend Development

Designing a High‑Performance Go‑Job Scheduler: Architecture, SDK & Task Flow

This article presents a comprehensive technical deep‑dive into Go‑Job, a Go‑native distributed task scheduling framework, covering its background, three‑layer architecture, service and task design, SDK modules, code examples, practical integration steps, and future enhancements for robust backend operations.

Distributed SystemsGoGo-Job

0 likes · 24 min read

Designing a High‑Performance Go‑Job Scheduler: Architecture, SDK & Task Flow

Mike Chen's Internet Architecture

Jul 26, 2024 · Databases

Comprehensive Overview of NewSQL Databases and Their Applications

This article explains what NewSQL databases are, outlines their core characteristics, reviews major products such as Spanner, CockroachDB, TiDB, OceanBase and TDSQL, and discusses typical use cases ranging from financial services to HTAP workloads.

CockroachDBDistributed SystemsNewSQL

0 likes · 8 min read

Comprehensive Overview of NewSQL Databases and Their Applications

DeWu Technology

Jul 24, 2024 · Backend Development

Mastering Unique Identifiers and Distributed Locks: From UUIDs to CAS

This article explores how various unique identifier schemes—random UUIDs, sequential numbers, hierarchical paths, and distributed ID generators—are applied across programming languages, file systems, databases, and networks, and explains when and how to enforce uniqueness using pre‑validation, database constraints, or distributed locks such as Redis and Zookeeper, while also linking these mechanisms to CAS‑based concurrency control.

Backend DevelopmentCASDistributed Systems

0 likes · 23 min read

Mastering Unique Identifiers and Distributed Locks: From UUIDs to CAS

JD Cloud Developers

Jul 24, 2024 · Operations

How JD.com’s Buffalo Scheduler Achieves High‑Performance, Scalable DAG Orchestration

Buffalo, JD.com’s in‑house distributed DAG scheduler, tackles massive task volumes and complex dependencies through a dual‑layer entity model, instance‑based execution, tiered scheduling, high‑availability architecture, event‑driven processing, in‑memory and cold‑hot data separation, delivering scalable, low‑latency ETL orchestration.

DAG schedulingDistributed SystemsETL orchestration

0 likes · 12 min read

How JD.com’s Buffalo Scheduler Achieves High‑Performance, Scalable DAG Orchestration

JD Tech Talk

Jul 15, 2024 · Backend Development

Performance Optimizations in RaftKeeper v2.1.0: Benchmark Results and Engineering Details

The article presents a detailed engineering analysis of RaftKeeper v2.1.0, highlighting benchmark‑driven performance improvements such as 11% write throughput gains, up to 198% faster read‑write mixed workloads, and multiple optimizations—including response serialization, list‑request handling, system‑call reduction, thread‑pool redesign, and asynchronous snapshot processing—validated on large ClickHouse clusters.

Distributed SystemsRaftKeeperbenchmark

0 likes · 11 min read

Performance Optimizations in RaftKeeper v2.1.0: Benchmark Results and Engineering Details

MaGe Linux Operations

Jul 13, 2024 · Operations

Unlocking Observability: A Complete Guide to OpenTelemetry Architecture and APIs

This article explains what OpenTelemetry is, its core components, key terminology, benefits, usage steps, and detailed architecture—including APIs, SDK pipelines, and the collector—providing a comprehensive overview for developers and operators seeking vendor‑neutral observability solutions.

Distributed SystemsMetricsObservability

0 likes · 13 min read

Unlocking Observability: A Complete Guide to OpenTelemetry Architecture and APIs

NetEase Cloud Music Tech Team

Jul 11, 2024 · Operations

Cloud Music Guizhou Data Center Migration: A Large-Scale Infrastructure Migration Case Study

In 2023 NetEase Cloud Music executed its largest ever data‑center migration, moving over 20,000 applications and more than one million queries per second to a new Guizhou facility while meeting zero‑downtime, strict latency and bandwidth limits through a batch‑wise, cross‑team strategy that incorporated automated upgrade platforms, standardized operations, and extensive risk‑mitigation measures.

Data Center MigrationDistributed SystemsTechnical debt

0 likes · 27 min read

Cloud Music Guizhou Data Center Migration: A Large-Scale Infrastructure Migration Case Study

DeWu Technology

Jul 8, 2024 · Game Development

StarWish Forest: Design and Implementation of a Growth-Based Interactive Game System

StarWish Forest, DeWu’s first growth‑based interactive game, lets users nurture virtual trees by collecting water droplets through daily check‑ins, team planting, PK battles and e‑commerce tasks, automatically advancing through fifteen levels and earning rewards, while its Redis‑driven architecture and modular design enable scalable social engagement and commercial conversion.

Distributed SystemsGame DevelopmentTeam Collaboration

0 likes · 16 min read

StarWish Forest: Design and Implementation of a Growth-Based Interactive Game System

IT Services Circle

Jul 6, 2024 · Backend Development

Design Techniques for High Availability in Large‑Scale Internet Architecture

This article explains the essential high‑availability design techniques for large‑scale internet systems, covering system splitting, decoupling, asynchronous processing, retry mechanisms, compensation, backup, multi‑active strategies, isolation, rate limiting, circuit breaking, and graceful degradation to ensure robust, scalable backend services.

Distributed SystemsSystem Designbackend

0 likes · 13 min read

Design Techniques for High Availability in Large‑Scale Internet Architecture

Su San Talks Tech

Jul 6, 2024 · Backend Development

Mastering High Availability: 10 Essential Design Techniques for Scalable Systems

This article explains ten core techniques—system splitting, decoupling, asynchrony, retry, compensation, backup, multi‑active strategies, isolation, rate limiting, circuit breaking, and degradation—that together enable robust, high‑availability architectures for modern backend services.

Distributed SystemsSystem Designfault tolerance

0 likes · 12 min read

Mastering High Availability: 10 Essential Design Techniques for Scalable Systems

Java Tech Enthusiast

Jul 2, 2024 · Databases

Practical Redis Use Cases and Code Examples

This guide walks backend developers through twenty real‑world Redis use cases—from basic caching, lotteries, and like/collect features to ranking, PV/UV counting, Bloom filters, sign‑in, geo‑search, rate limiting, ID generation, distributed locks, messaging, session sharing, and more—providing Spring Boot code samples, limitations, and best‑practice advice.

Backend DevelopmentCachingData Structures

0 likes · 55 min read

Practical Redis Use Cases and Code Examples

Open Source Tech Hub

Jul 2, 2024 · Backend Development

How to Build RPC Services with ThinkPHP Swoole: Step-by-Step Guide

This guide explains how to set up remote procedure calls in a ThinkPHP application using the Swoole extension, covering extension installation, Composer dependencies, server and client configuration, interface definition, hot‑reload handling, and common pitfalls such as Xdebug conflicts.

Distributed SystemsPHPRPC

0 likes · 6 min read

How to Build RPC Services with ThinkPHP Swoole: Step-by-Step Guide

dbaplus Community

Jun 23, 2024 · Databases

How Vitess Scales MySQL for YouTube: Architecture and Lessons

This article explains how Vitess was created to overcome MySQL leader‑follower replication limits at YouTube, detailing its sidecar VTTablet, stateless VTGate router, topology key‑value store, and scaling strategies that enable billions of users to be served reliably.

Database ArchitectureDistributed SystemsMySQL scaling

0 likes · 7 min read

How Vitess Scales MySQL for YouTube: Architecture and Lessons

Efficient Ops

Jun 20, 2024 · Operations

How Intelligent Ops Platforms Transform Distributed Banking Systems

This article explains how Chinese commercial banks are adopting intelligent operation platforms to collect, analyze, and visualize distributed system data in real time, enabling rapid root‑cause detection, full‑link tracing, and automated solution recommendations for complex financial services.

Distributed SystemsRoot Cause Analysisbanking

0 likes · 8 min read

How Intelligent Ops Platforms Transform Distributed Banking Systems

Ctrip Technology

Jun 20, 2024 · Backend Development

Design and Architecture of Ctrip Service Registration Center

The article explains Ctrip's service registration center architecture, including its two‑layer Data and Session design, multi‑sharding, fault‑tolerance mechanisms, Redis‑based cluster discovery, design trade‑offs such as proxy versus Smart SDK, hashing strategy, and operational considerations for burst traffic and future scaling.

Distributed SystemsRedis discoveryfault tolerance

0 likes · 16 min read

Design and Architecture of Ctrip Service Registration Center

Wukong Talks Architecture

Jun 20, 2024 · Backend Development

Seven Classic Use Cases of Message Queues

This article shares seven practical scenarios—such as asynchronous processing, traffic smoothing, message bus, delayed tasks, broadcast consumption, distributed transactions, and data hub integration—where message queues like RocketMQ, Kafka, ActiveMQ, and RabbitMQ help solve high‑concurrency challenges in modern backend systems.

Broadcast ConsumptionDelay MessagesDistributed Systems

0 likes · 11 min read

Seven Classic Use Cases of Message Queues

Aikesheng Open Source Community

Jun 11, 2024 · Databases

ClickHouse Data Recovery Procedure for a Failed Disk in a 4‑Shard 3‑Replica Cluster

This article details a step‑by‑step recovery of a ClickHouse 4‑shard, 3‑replica cluster after a node’s disks failed, covering verification of residual data, Zookeeper metadata cleanup, table reconstruction, distributed table restoration, and validation of synchronization across replicas.

Data RecoveryDatabase operationsDistributed Systems

0 likes · 8 min read

ClickHouse Data Recovery Procedure for a Failed Disk in a 4‑Shard 3‑Replica Cluster

Mike Chen's Internet Architecture

Jun 6, 2024 · Backend Development

Comprehensive Introduction to RocketMQ: Architecture, Principles, and Use Cases

This article provides a detailed overview of RocketMQ, covering its core concepts, four primary usage scenarios, architectural components such as NameServer, Broker, Producer, and Consumer, as well as message storage, ordering guarantees, high availability, fault tolerance, and transactional messaging.

Backend DevelopmentDistributed SystemsMessage Queue

0 likes · 6 min read

Comprehensive Introduction to RocketMQ: Architecture, Principles, and Use Cases

Sohu Tech Products

Jun 5, 2024 · Big Data

Why Kafka Is the Backbone of Modern Data Pipelines: Core Architecture and Use Cases

This article explains Kafka's role as a high‑throughput distributed message queue, detailing its core components, topic‑partition model, consumer groups, storage mechanisms, fault‑tolerance features, delivery guarantees, ZooKeeper coordination, and scalability strategies for building reliable real‑time data pipelines.

Big DataDistributed SystemsKafka

0 likes · 14 min read

Why Kafka Is the Backbone of Modern Data Pipelines: Core Architecture and Use Cases

Ops Development & AI Practice

Jun 5, 2024 · Fundamentals

How Paxos Guarantees Consistency in Distributed Systems – A Deep Dive

This article explains the Paxos consensus algorithm, detailing its roles, three-phase execution process, key properties such as consistency, availability and fault tolerance, and showcases its practical applications in distributed databases, file systems, and coordination services.

Consensus AlgorithmConsistencyDatabase

0 likes · 7 min read

How Paxos Guarantees Consistency in Distributed Systems – A Deep Dive

Efficient Ops

Jun 2, 2024 · Operations

Why Observability Is the Key to Reliable Distributed Systems

Observability, defined as measuring system state through logs, metrics, and tracing, enhances stability of distributed architectures by enabling rapid fault detection, deeper insight, and proactive issue resolution, distinguishing it from traditional monitoring and supporting DevOps, SRE, and business objectives.

Distributed SystemsMonitoring

0 likes · 17 min read

Why Observability Is the Key to Reliable Distributed Systems

Architect

May 31, 2024 · Backend Development

7 Common Cache Pitfalls and How to Avoid Them

This article outlines seven typical cache problems—including cache penetration, breakdown, avalanche, large keys, hot keys, hit‑rate issues, and data inconsistency—and provides practical solutions such as parameter validation, Bloom filters, null‑value caching, locking, auto‑renewal, random expiration, high‑availability setups, compression, and cache warming to improve system reliability and performance.

CachingDistributed SystemsRedis

0 likes · 22 min read

7 Common Cache Pitfalls and How to Avoid Them

Architect

May 30, 2024 · Backend Development

Inside Pulsar’s Bookie: A Deep Dive into Storage Architecture and Write/Read Paths

This article dissects Pulsar’s storage layer by examining the Bookie component, detailing its architecture, the sequential write‑ahead log, ledger management, journal handling, caching mechanisms, and the complete read/write call chains with concrete code examples and performance considerations.

Distributed SystemsJavaLedger

0 likes · 33 min read

Architect

May 28, 2024 · Backend Development

Mastering Microservice Architecture: Key Components and Best Practices

This article provides a comprehensive overview of a typical microservice architecture, detailing the roles of Nginx as the traffic entry, gateways, business services, service registries, caching, databases, messaging, logging, scheduling, and distributed storage, and offering practical guidance for high‑availability deployment.

Backend ArchitectureDistributed Systemsmicroservices

0 likes · 10 min read

Mastering Microservice Architecture: Key Components and Best Practices

DevOps Operations Practice

May 23, 2024 · Big Data

Understanding Elasticsearch: Architecture, Core Concepts, and How It Works

This article introduces Elasticsearch, an open‑source distributed search and analytics engine, explaining its architecture, core concepts such as clusters, nodes, shards, replicas, indices, inverted indexes, documents and fields, and how these components enable fast, scalable searching and data analysis.

Big DataDistributed SystemsElasticsearch

0 likes · 7 min read

Understanding Elasticsearch: Architecture, Core Concepts, and How It Works

Top Architect

May 20, 2024 · Big Data

Understanding Distributed Systems and Kafka: Concepts, Message Ordering, and Guarantees in Java

This article explains the fundamentals of distributed systems, introduces Apache Kafka’s architecture and core components, describes how Kafka ensures message ordering within partitions, and provides practical Java consumer configurations and techniques to guarantee ordered consumption of messages.

Distributed SystemsJavaMessage Ordering

0 likes · 10 min read

Understanding Distributed Systems and Kafka: Concepts, Message Ordering, and Guarantees in Java

Rare Earth Juejin Tech Community

May 20, 2024 · Big Data

Why Use Message Queues and an Introduction to Kafka with Practical Examples

This article explains the motivations for adopting message queues, outlines core concepts and protocols, compares mainstream MQ products, and provides a detailed walkthrough of Kafka architecture, cluster setup, native Java APIs, and Spring Boot integration with extensive code examples.

Big DataDistributed SystemsJava

0 likes · 23 min read

Why Use Message Queues and an Introduction to Kafka with Practical Examples

Ctrip Technology

May 17, 2024 · Backend Development

Comprehensive Log Governance and Mining Solution for Distributed Systems

This article presents a comprehensive log governance and mining solution for distributed systems, covering background challenges, usage scenarios, and detailed strategies such as distributed log integration, front‑back end traceability, standardized log management, large‑payload handling, efficient cleaning, and future plans for componentization and sampling.

Distributed SystemsLog ManagementPerformance Optimization

0 likes · 16 min read

Comprehensive Log Governance and Mining Solution for Distributed Systems

Cognitive Technology Team

May 16, 2024 · Operations

Guide to Building Stability in Distributed Systems

This guide presents comprehensive principles, best practices, and techniques for designing, deploying, and maintaining stable distributed systems, covering fault tolerance, monitoring, capacity planning, incident response, and operational reliability to help engineers achieve high availability.

Distributed SystemsOperationsreliability engineering

0 likes · 1 min read

Guide to Building Stability in Distributed Systems

Sohu Tech Products

May 15, 2024 · Backend Development

Design and Implementation of a Simplified RocketMQ SDK

To simplify RocketMQ usage, I designed an internal SDK that abstracts complex client configurations, offering single‑call ordered message sending, per‑message consumption, unified result handling, and enforced consistent subscription settings, thereby reducing developer effort and preventing common integration errors.

Backend DevelopmentDistributed SystemsJava

0 likes · 6 min read

Design and Implementation of a Simplified RocketMQ SDK

Top Architect

May 15, 2024 · Backend Development

Implementing a Simple Java RPC Framework: Architecture, Service Registration, Serialization, and Proxy Generation

This article explains the principles and implementation of a Java RPC framework, covering service registration with Zookeeper, serialization, compression, network communication via Netty, dynamic proxy generation using reflection and Javassist, and performance considerations, providing extensive code examples for each component.

Distributed SystemsJavaJavassist

0 likes · 25 min read

Implementing a Simple Java RPC Framework: Architecture, Service Registration, Serialization, and Proxy Generation

Cognitive Technology Team

May 15, 2024 · Fundamentals

The Fallacies of Distributed Systems: Understanding Common Network Assumptions

This article revisits the classic “Fallacies of Distributed Systems” introduced by Peter Deutsch, explaining why assumptions such as reliable networks, zero latency, infinite bandwidth, secure and homogeneous communication are false, and offering practical strategies like retries, caching, batching, and security‑first design to build robust distributed applications.

Distributed SystemsFallaciesLatency

0 likes · 4 min read

The Fallacies of Distributed Systems: Understanding Common Network Assumptions

Mike Chen's Internet Architecture

May 11, 2024 · Big Data

Comprehensive Introduction to Apache Kafka: Architecture, Features, and Use Cases

This article provides a detailed overview of Apache Kafka, covering its core characteristics, distributed architecture, key components such as topics, partitions, brokers, producers, consumers, ZooKeeper, and common application scenarios like log collection, event‑driven architecture, real‑time analytics, and monitoring.

Big DataDistributed SystemsKafka

0 likes · 7 min read

Architect

May 10, 2024 · Backend Development

Rate Limiting Algorithms: Fixed Window, Sliding Window, Leaky Bucket, Token Bucket, and Distributed Rate Limiting

This article explains the importance of rate limiting in microservice architectures, introduces four basic algorithms—fixed window, sliding window, leaky bucket, and token bucket—compares their advantages and disadvantages, and presents both single-machine and distributed implementations with Go code examples.

Distributed SystemsGolangRate Limiting

0 likes · 29 min read

Rate Limiting Algorithms: Fixed Window, Sliding Window, Leaky Bucket, Token Bucket, and Distributed Rate Limiting

Mike Chen's Internet Architecture

May 9, 2024 · Big Data

Understanding Apache Kafka: Features, Architecture, and Real‑World Use Cases

This article provides a comprehensive overview of Apache Kafka, covering its core features, architectural components, message flow, and common application scenarios such as log collection, decoupled messaging, activity tracking, operational monitoring, and stream processing.

Apache KafkaDistributed SystemsMessage Queue

0 likes · 6 min read

Architect

May 1, 2024 · Fundamentals

Why ULID Beats UUID: A Deep Dive into Features, Specs, and Python Usage

This article compares UUID and ULID, explains the limitations of UUID versions, details ULID's timestamp‑based, lexicographically sortable design, presents its binary layout and encoding, and shows how to generate and manipulate ULIDs in Python with concrete code examples.

Distributed SystemsPythonSorting

0 likes · 8 min read

Why ULID Beats UUID: A Deep Dive into Features, Specs, and Python Usage

Su San Talks Tech

May 1, 2024 · Backend Development

7 Common Cache Pitfalls and How to Avoid Them in Backend Systems

This article outlines seven typical cache issues—including cache penetration, breakdown, avalanche, data inconsistency, large keys, hot keys, and low hit rates—explains their causes, and provides practical solutions such as parameter validation, Bloom filters, locking, expiration randomization, key sharding, and cache warming to improve system reliability and performance.

CacheDistributed SystemsRedis

0 likes · 20 min read

7 Common Cache Pitfalls and How to Avoid Them in Backend Systems

Selected Java Interview Questions

Apr 30, 2024 · Fundamentals

Understanding ULID: A Universally Unique Lexicographically Sortable Identifier

This article explains ULID, a 128‑bit identifier that combines a millisecond‑precision timestamp with high‑entropy randomness, offering lexicographic ordering, URL‑safe Base32 encoding, and a lower collision risk compared to UUIDs, along with Python usage examples and specifications.

Distributed SystemsPythonULID

0 likes · 6 min read

Understanding ULID: A Universally Unique Lexicographically Sortable Identifier

FunTester

Apr 28, 2024 · Backend Development

Tackling Data Consistency: Master‑Slave, Master‑Master & Leaderless Architectures

The article examines why distributed systems inevitably face data‑consistency challenges and breaks down three common service‑architecture patterns—master‑slave, master‑master, and leaderless—detailing their replication mechanisms, advantages, drawbacks, and practical solutions such as synchronous, semi‑synchronous, asynchronous replication, quorum handling, node‑failure recovery, and conflict resolution strategies.

Data ConsistencyDistributed SystemsMaster‑Slave

0 likes · 14 min read

Tackling Data Consistency: Master‑Slave, Master‑Master & Leaderless Architectures

Architect's Guide

Apr 27, 2024 · Backend Development

Understanding RPC Frameworks: Concepts, Architecture, and Common Implementations

This article explains the fundamentals of Remote Procedure Call (RPC) frameworks, their underlying architecture, communication steps, role of providers, consumers, and registries, and compares popular RPC solutions with other service paradigms such as SOA, SOAP, and REST.

Backend DevelopmentDistributed SystemsREST

0 likes · 10 min read

Understanding RPC Frameworks: Concepts, Architecture, and Common Implementations

AntTech

Apr 23, 2024 · Databases

The Cloud Era of Databases: Insights from OceanBase Chief Scientist Yang Zhenkun

In his OceanBase developer conference keynote, chief scientist Yang Zhenkun explains how cloud resource pooling enables distributed databases to achieve elastic compute and storage, discusses the evolution of databases, the challenges of transaction processing, and envisions fully shared, on‑demand cloud database services.

Cloud ComputingDistributed SystemsOceanBase

0 likes · 7 min read

The Cloud Era of Databases: Insights from OceanBase Chief Scientist Yang Zhenkun

ITPUB

Apr 22, 2024 · Backend Development

How Meta Achieves Near‑Perfect Cache Consistency: Lessons from Polaris

This article explains Meta's approach to cache invalidation and consistency, detailing why ultra‑high consistency matters, how their Polaris monitoring system detects and resolves inconsistencies, and provides a simplified Python example that illustrates the underlying mechanisms and challenges.

ConsistencyDistributed SystemsMeta

0 likes · 12 min read

How Meta Achieves Near‑Perfect Cache Consistency: Lessons from Polaris

21CTO

Apr 20, 2024 · Backend Development

Why Backend Development Is Harder Today Than 20 Years Ago

Over the past two decades, backend development has shifted from simple server‑side coding to a complex, multidisciplinary field, driven by soaring user expectations, massive system scale, an overwhelming array of technologies, and the need for cloud, container, and cross‑team collaboration skills.

Distributed SystemsSoftware Architecturebackend

0 likes · 9 min read

Why Backend Development Is Harder Today Than 20 Years Ago

Architects' Tech Alliance

Apr 17, 2024 · Cloud Native

What Is Microservices? Core Concepts, Architecture, and Implementation Explained

Microservices, also known as microservice architecture, is a cloud‑native approach that breaks a single application into loosely coupled, independently deployable services, each with its own tech stack, communicating via REST APIs, event streams, or message brokers, and organized around business capabilities and bounded contexts.

Cloud NativeDistributed Systemsarchitecture

0 likes · 3 min read

What Is Microservices? Core Concepts, Architecture, and Implementation Explained

Tongcheng Travel Technology Center

Apr 17, 2024 · Backend Development

In-Depth Analysis of Apache RocketMQ Architecture, Operation Principles, and High‑Throughput Mechanisms

This article provides a comprehensive overview of Apache RocketMQ, detailing its core components, producer and consumer workflows, storage strategies, master‑slave synchronization, Raft‑based half‑write and leader election mechanisms, and best‑practice recommendations for high‑throughput, fault‑tolerant messaging systems.

Backend DevelopmentDistributed SystemsHigh Throughput

0 likes · 22 min read

In-Depth Analysis of Apache RocketMQ Architecture, Operation Principles, and High‑Throughput Mechanisms

Mike Chen's Internet Architecture

Apr 17, 2024 · Backend Development

Comprehensive Overview of RocketMQ Architecture and Core Components

This article provides a detailed introduction to RocketMQ, covering its purpose as a distributed messaging middleware, the roles of NameServer, Producer, Broker, and Consumer, and explains how each component functions within a scalable, high‑throughput architecture.

Backend DevelopmentDistributed SystemsMessage Queue

0 likes · 5 min read

Architect

Apr 9, 2024 · Backend Development

Microservice Architecture Design Patterns: Overview, Benefits, Drawbacks, and Usage Guidelines

This article provides a comprehensive overview of microservice architecture, detailing its definition, key characteristics, advantages and disadvantages, and presenting ten essential design patterns—including database per service, event sourcing, CQRS, Saga, BFF, API gateway, Strangler, circuit breaker, externalized configuration, and consumer‑driven contract testing—along with their pros, cons, and appropriate scenarios.

Backend ArchitectureDesign PatternsDistributed Systems

0 likes · 29 min read

Microservice Architecture Design Patterns: Overview, Benefits, Drawbacks, and Usage Guidelines

FunTester

Apr 7, 2024 · Backend Development

Generating Globally Unique Identifiers (GUID) for Performance Testing: UUID, Distributed Services, Snowflake Algorithm, and Thread‑Local Techniques

This article explains why globally unique identifiers are needed in performance testing, compares several common solutions such as Java UUID, Redis/Zookeeper distributed ID generators, the Snowflake algorithm, and thread‑local or shared counters, and provides complete Java code examples for each approach.

Distributed SystemsGUIDJava

0 likes · 13 min read

Generating Globally Unique Identifiers (GUID) for Performance Testing: UUID, Distributed Services, Snowflake Algorithm, and Thread‑Local Techniques

Architects' Tech Alliance

Apr 6, 2024 · Artificial Intelligence

How ByteDance Scaled LLM Training to Over 10,000 GPUs: Inside the MegaScale System

The article analyzes ByteDance and Peking University's MegaScale system that enables efficient, stable training of large language models on clusters exceeding ten thousand GPUs, detailing algorithmic tweaks, 3D parallel communication overlap, operator optimizations, data‑pipeline improvements, network tuning, and fault‑tolerance mechanisms that together achieve a 55.2% MFU on a 175B model.

Distributed SystemsGPU clustersLLM training

0 likes · 15 min read

How ByteDance Scaled LLM Training to Over 10,000 GPUs: Inside the MegaScale System

Architect

Apr 4, 2024 · Backend Development

Mastering High Availability: 9 Essential Design Techniques for Scalable Systems

The article walks through nine practical techniques—system splitting, decoupling, asynchronous processing, retry, compensation, backup, multi‑active deployment, rate limiting, circuit breaking, and degradation—explaining why each is needed, how they are implemented in real‑world microservice architectures, and what trade‑offs to consider.

Circuit BreakerDistributed SystemsRate Limiting

0 likes · 13 min read

Mastering High Availability: 9 Essential Design Techniques for Scalable Systems

Practical DevOps Architecture

Apr 4, 2024 · Databases

ClickHouse Training Course Overview and Curriculum

This article introduces a comprehensive ClickHouse training program that covers fundamental concepts, architecture, installation, distributed cluster design, data import, performance tuning, and includes a detailed list of 33 video modules and additional recommended reading resources for large‑scale data analytics.

Big DataClickHouseColumnar Database

0 likes · 4 min read

ClickHouse Training Course Overview and Curriculum

Su San Talks Tech

Mar 30, 2024 · Backend Development

7 Real-World Message Queue Patterns Every Backend Engineer Should Know

From asynchronous decoupling to distributed transactions, this article explores seven classic message‑queue use cases—such as peak‑shaving, event buses, delayed tasks, broadcast consumption, and data hub integration—illustrated with real‑world examples from e‑commerce, ride‑hailing, and lottery systems, plus RocketMQ code snippets.

Distributed SystemsKafkaMessage Queue

0 likes · 13 min read

7 Real-World Message Queue Patterns Every Backend Engineer Should Know

Mike Chen's Internet Architecture

Mar 26, 2024 · Fundamentals

Mastering Message Queues: Core Concepts, Designs, and Popular MQs

Message Queues (MQ) are essential for scalable, decoupled systems, offering asynchronous communication, reliability, buffering, and high availability; this guide explains MQ fundamentals, characteristics, architectures like point-to-point and publish/subscribe, core components, design considerations, and reviews popular solutions such as RabbitMQ, Kafka, ActiveMQ, RocketMQ, and Pulsar.

Distributed SystemsMessage QueueMessaging

0 likes · 9 min read

Mastering Message Queues: Core Concepts, Designs, and Popular MQs

Tencent Cloud Developer

Mar 26, 2024 · Databases

Data Consistency in Distributed Systems: Master‑Slave, Master‑Master, and Leaderless Architectures

The article compares master‑slave, master‑master, and leaderless distributed architectures, explaining how synchronous, semi‑synchronous, and asynchronous replication affect consistency, latency and scalability, and showing that each pattern trades write throughput, conflict‑resolution complexity, and availability against strong data correctness.

Data ConsistencyDistributed SystemsMaster‑Slave

0 likes · 15 min read

Data Consistency in Distributed Systems: Master‑Slave, Master‑Master, and Leaderless Architectures

Java Backend Technology

Mar 24, 2024 · Databases

Beyond Caching: How Redis Powers Real‑World Storage and Services

Redis is widely known as a high‑performance cache, but its rich data structures, persistence options, and distributed features enable it to serve as a primary storage engine for many internet services, from user profiles and leaderboards to rate limiting, messaging, and geo‑location, while presenting scalability and reliability challenges.

CachingData StructuresDistributed Systems

0 likes · 19 min read

Beyond Caching: How Redis Powers Real‑World Storage and Services

Mike Chen's Internet Architecture

Mar 23, 2024 · Backend Development

Unlocking Dubbo: A Visual Guide to Its 10‑Layer Architecture and Call Flow

This article explains Dubbo's role as a high‑performance RPC framework, details its ten‑layer architecture, and walks through the eleven‑step call process, providing clear diagrams and concise descriptions for developers building distributed systems.

Distributed SystemsDubboRPC

0 likes · 7 min read

Unlocking Dubbo: A Visual Guide to Its 10‑Layer Architecture and Call Flow

dbaplus Community

Mar 21, 2024 · Backend Development

Inside DeWu’s Self‑Built Redis: Architecture, Automation & High‑Availability

This article details DeWu's self‑built high‑performance distributed Redis cache system, covering its Proxy‑based architecture, core components like ConfigServer, Redis‑Proxy and Redis‑Server, the automated operations platform for deployment and scaling, as well as monitoring, alerting, stability measures and future roadmap.

Distributed SystemsRedis

0 likes · 22 min read

Inside DeWu’s Self‑Built Redis: Architecture, Automation & High‑Availability

FunTester

Mar 21, 2024 · Operations

How Dynamic Task‑Grabbing Cuts Distributed Batch Jobs from Hours to Minutes

This article presents a detailed case study of optimizing a distributed batch processing system by replacing static shard‑key concurrency with a dynamic task‑grabbing mechanism, dramatically reducing execution time from several hours to under fifteen minutes while maintaining stable resource usage.

Batch ProcessingDistributed SystemsPerformance Optimization

0 likes · 8 min read

How Dynamic Task‑Grabbing Cuts Distributed Batch Jobs from Hours to Minutes

Cognitive Technology Team

Mar 17, 2024 · Backend Development

Understanding Idempotence: Definition, Importance, Common Causes, and Practical Backend Solutions

Idempotence ensures that repeating the same operation yields the same result without side effects, a critical property in distributed and transactional systems; the article defines the concept, explains why it matters, lists common causes of non‑idempotent behavior, and presents several practical backend solutions such as unique indexes, anti‑duplicate tables, optimistic and pessimistic locks, tokens, distributed locks, and state machines.

Distributed SystemsIdempotencebackend

0 likes · 8 min read

Understanding Idempotence: Definition, Importance, Common Causes, and Practical Backend Solutions

Rare Earth Juejin Tech Community

Mar 17, 2024 · Fundamentals

Implementing Log Snapshotting in Raft: A Step‑by‑Step Guide

This article provides a comprehensive tutorial on adding log snapshotting (snapshotting) to a Raft‑based distributed key‑value store, explaining the motivation, the snapshot mechanism, and detailed Go code for generating, transferring, applying, and persisting snapshots to reduce log size and improve performance.

ConsensusDistributed SystemsGo

0 likes · 15 min read

Implementing Log Snapshotting in Raft: A Step‑by‑Step Guide

Architecture & Thinking

Mar 15, 2024 · Backend Development

Why RPC Is Essential for Building Scalable IM Clusters: A Beginner’s Guide

This article explains the fundamentals of Remote Procedure Call (RPC), why it is crucial for developing production‑grade instant‑messaging clusters, how it differs from plain HTTP, the typical RPC call flow, common frameworks such as gRPC, Thrift and Dubbo, and practical considerations for high‑throughput distributed systems.

Backend DevelopmentDistributed SystemsIM

0 likes · 12 min read

Why RPC Is Essential for Building Scalable IM Clusters: A Beginner’s Guide

dbaplus Community

Mar 12, 2024 · Databases

How Didi Scaled Log Search by Replacing Elasticsearch with ClickHouse

Facing PB‑scale daily logs and costly Elasticsearch bottlenecks, Didi redesigned its log‑search architecture by migrating to ClickHouse, detailing the challenges, storage redesign, cluster upgrades, performance optimizations, stability fixes, and the resulting cost reduction and query speed gains.

ClickHouseDistributed Systemselasticsearch migration

0 likes · 15 min read

How Didi Scaled Log Search by Replacing Elasticsearch with ClickHouse

Mike Chen's Internet Architecture

Mar 11, 2024 · Backend Development

Understanding RPC: Principles, Architecture, and Implementation Process

This article explains why remote procedure call (RPC) is essential for modern distributed services, describes the evolution from monolithic to service‑oriented architectures, and details the core components, communication steps, serialization, service discovery, and full call flow of an RPC framework.

Backend DevelopmentDistributed SystemsRPC

0 likes · 7 min read

Understanding RPC: Principles, Architecture, and Implementation Process

Linux Code Review Hub

Mar 11, 2024 · Databases

How Didi Built a Next‑Gen Log Storage System with ClickHouse

Didi migrated its massive PB‑scale log data from Elasticsearch to ClickHouse, redesigning storage with separate Log and Trace clusters, optimizing partition and sorting keys, introducing native TCP connectors, and revamping HDFS cold‑hot separation, achieving up to four‑fold query speed gains and 30% lower hardware costs.

ClickHouseDistributed SystemsFlink

0 likes · 15 min read

How Didi Built a Next‑Gen Log Storage System with ClickHouse

DataFunSummit

Mar 8, 2024 · Databases

Ant TuGraph Computing Engine Architecture and Applications

Ant TuGraph’s open‑source graph computing engine, led by Fang Zhihong, will be introduced covering its development history, architectural design, technical principles, integrated stream‑batch‑graph processing capabilities, real‑world large‑scale graph use cases, and future roadmap, offering insights into design, implementation, and value.

Big DataDistributed SystemsTuGraph

0 likes · 2 min read

Ant TuGraph Computing Engine Architecture and Applications

JavaEdge

Mar 4, 2024 · Backend Development

How a Billion‑Scale Flight Search Engine Scales with Multi‑Level Caching and Smart Load Balancing

This article explains how a high‑throughput, low‑latency flight search system handles billions of daily requests by employing multi‑level caching, distributed Redis and MongoDB stores, AI‑driven TTL optimization, and advanced load‑balancing techniques such as pooling and overload protection.

CachingDistributed SystemsMachine Learning

0 likes · 21 min read

How a Billion‑Scale Flight Search Engine Scales with Multi‑Level Caching and Smart Load Balancing

Tencent Cloud Developer

Feb 28, 2024 · Backend Development

Comprehensive Guide to Rate Limiting Algorithms and Distributed Rate Limiting Solutions

This guide explains why rate limiting is essential for micro‑service stability, outlines six design principles, details four classic algorithms—fixed window, sliding window, leaky bucket, and token bucket—and compares centralized Redis, load‑balancer cache, and coordination‑service distributed solutions.

Distributed SystemsGolangRate Limiting

0 likes · 30 min read

Comprehensive Guide to Rate Limiting Algorithms and Distributed Rate Limiting Solutions

Aikesheng Open Source Community

Feb 27, 2024 · Cloud Native

Detailed Overview of LiteIO Architecture, Components, and Volume Lifecycle

This article provides a comprehensive technical overview of LiteIO, describing its core and CSI components, the complete volume lifecycle within Kubernetes, Disk‑Agent responsibilities, common implementation pitfalls, storage‑pool construction methods, and the design of the node‑disk‑controller, scheduler, and CSI modules.

CSICloud NativeDistributed Systems

0 likes · 13 min read

Detailed Overview of LiteIO Architecture, Components, and Volume Lifecycle

Mike Chen's Internet Architecture

Feb 27, 2024 · Cloud Native

Mastering Spring Cloud: A Deep Dive into Microservices Architecture

This article explains the fundamentals of microservices, why monolithic applications become limiting as they grow, and how Spring Cloud and its components—such as Eureka, Ribbon, Feign, Hystrix, Zuul, Config, and Sleuth—provide a complete framework for building scalable, resilient distributed systems.

Circuit BreakerDistributed Systemsspring-cloud

0 likes · 7 min read

Mastering Spring Cloud: A Deep Dive into Microservices Architecture

Java Architect Essentials

Feb 25, 2024 · Backend Development

Designing a High‑Concurrency Ticket‑Seckill System: Architecture, Load Balancing, and Go Implementation

The article analyzes the extreme‑traffic challenges of the 12306 ticket‑seckill service, presents a layered load‑balancing architecture, compares inventory‑deduction strategies, and demonstrates a complete Go‑based prototype with Nginx weighted round‑robin, Redis stock management, and ApacheBench performance testing.

Distributed SystemsGohigh concurrency

0 likes · 19 min read

Designing a High‑Concurrency Ticket‑Seckill System: Architecture, Load Balancing, and Go Implementation

DataFunSummit

Feb 20, 2024 · Big Data

BitSail Open‑Source Data Integration Engine: Architecture, New Features, CDC Solutions and Future Outlook

This article introduces ByteDance's open‑source data integration engine BitSail, covering its background, layered architecture, recent feature enhancements, automated testing framework, CDC‑based full‑library synchronization solutions, and future development plans for connectors and real‑time data consistency.

Big DataCDCDistributed Systems

0 likes · 12 min read

BitSail Open‑Source Data Integration Engine: Architecture, New Features, CDC Solutions and Future Outlook

Mike Chen's Internet Architecture

Feb 20, 2024 · Backend Development

Mastering Dubbo: Deep Dive into Architecture, Components, and Best Practices

This article provides a comprehensive overview of Dubbo, a Java RPC framework, covering its necessity, core components, implementation principles, architecture layers, supported protocols, registry options, load‑balancing strategies, and typical application scenarios for building scalable micro‑services.

Backend DevelopmentDistributed SystemsDubbo

0 likes · 9 min read

Mastering Dubbo: Deep Dive into Architecture, Components, and Best Practices

DataFunTalk

Feb 12, 2024 · Databases

High‑Availability Architecture of TuGraph‑DB: Design, Planning, and Deployment

This article explains the high‑availability architecture of TuGraph‑DB, covering the concepts of HA, the Raft consensus algorithm, cluster design, server and client mechanisms, snapshot handling, and future roadmap such as witness nodes and on‑demand snapshots.

Cluster DeploymentDistributed SystemsGraph Database

0 likes · 12 min read

High‑Availability Architecture of TuGraph‑DB: Design, Planning, and Deployment

Mike Chen's Internet Architecture

Feb 8, 2024 · Backend Development

Core Principles of Message Queues: 12 Key Concepts Explained

This article provides a comprehensive overview of message queue fundamentals, covering producers, consumers, brokers, point-to-point and publish/subscribe models, ordering, ACK mechanisms, eventual consistency, transactions, persistence, high availability, and selection criteria for various MQ technologies.

Distributed SystemsMQMessage Queue

0 likes · 10 min read

Core Principles of Message Queues: 12 Key Concepts Explained