Tagged articles

31 articles

Page 1 of 1

Dec 30, 2025 · Operations

Uncovering the Split‑Lock Chaos that Crashed AMD Servers During Double‑11

A detailed post‑mortem of a high‑priority fault on AMD servers shows how a split‑lock triggered by Python UDFs caused CPI spikes, CPU overload, and bus‑lock events, and explains the investigation steps, code analysis, reproduction tests, and mitigation measures taken to restore stability.

AMDjemallocmemory allocation

0 likes · 23 min read

Uncovering the Split‑Lock Chaos that Crashed AMD Servers During Double‑11

DeWu Technology

Aug 11, 2025 · Backend Development

How We Uncovered Hidden Bottlenecks in Rust Services with Profiling

After migrating thousands of Java cores to Rust, the team used jemalloc and pprof profiling to pinpoint why a few services only improved 10%, refactored the OSS client for reuse, and achieved up to 20% CPU reduction and significant memory savings, demonstrating the power of deep performance analysis in production Rust services.

AsyncProfilingRust

0 likes · 14 min read

How We Uncovered Hidden Bottlenecks in Rust Services with Profiling

Tencent Cloud Developer

Apr 28, 2025 · Backend Development

Performance Optimization Techniques for High‑Throughput Backend Systems

The article outlines seven practical performance‑optimization techniques for high‑throughput back‑ends—including replacing protobuf with native C++ classes, adopting cache‑friendly data structures, using jemalloc/tcmalloc, implementing lock‑free double buffers, simplifying structs for specific scenarios, and leveraging profiling tools—while stressing balanced, incremental improvements.

Backend DevelopmentCCache Friendly

0 likes · 16 min read

Performance Optimization Techniques for High‑Throughput Backend Systems

360 Zhihui Cloud Developer

Apr 27, 2025 · Databases

Why MySQL Memory Stays High and How to Optimize It

This article explains MySQL's memory architecture, why memory usage often stays high after spikes, and provides practical steps—including connection checks, slow query analysis, workload scaling, and switching to jemalloc—to diagnose and reduce memory consumption on 360's internal cloud platform.

Connection ManagementDatabase PerformanceMemory Optimization

0 likes · 7 min read

Why MySQL Memory Stays High and How to Optimize It

Huolala Tech

Mar 4, 2025 · Backend Development

Debugging Rust Memory Leaks in Frontend Services: Tools, Techniques, and Real-World Fixes

This article walks through a real-world investigation of memory leaks in a Rust-based frontend service, detailing common leak scenarios, profiling tools like tokio-console and jemalloc, load testing with k6, and the step-by-step analysis that uncovered regex misuse and cache bugs, ultimately stabilizing memory usage.

K6Memory LeakProfiling

0 likes · 12 min read

Debugging Rust Memory Leaks in Frontend Services: Tools, Techniques, and Real-World Fixes

Deepin Linux

Feb 14, 2025 · Fundamentals

Understanding Jemalloc: Principles, Comparisons, and Optimization Practices

This article provides a comprehensive overview of Jemalloc, covering its architecture, memory allocation fundamentals, performance comparison with ptmalloc and tcmalloc, practical optimization cases across web, database, and big‑data workloads, and detailed configuration guidelines to improve memory efficiency and multithreaded performance.

Performance Optimizationfragmentationjemalloc

0 likes · 31 min read

Understanding Jemalloc: Principles, Comparisons, and Optimization Practices

vivo Internet Technology

Dec 11, 2024 · Databases

RocksDB Memory Usage Analysis and Optimization: Troubleshooting Excessive Memory Consumption in Production

The article examines a production RocksDB memory‑usage problem where two instances consumed 59 GB on a 32‑CPU, 64‑GB server, identifies glibc ptmalloc’s unreclaimed free memory as the main cause, and shows that switching to jemalloc cuts usage by roughly 25 % while improving I/O and CPU efficiency.

Linux Memory ManagementRocksDBglibc

0 likes · 11 min read

RocksDB Memory Usage Analysis and Optimization: Troubleshooting Excessive Memory Consumption in Production

vivo Internet Technology

Nov 6, 2024 · Fundamentals

Analysis of glibc Memory Management and Solutions to an Online Memory Incident

The article examines a real‑world memory alarm in a Vivo service, explains how glibc’s ptmalloc allocator manages heap memory via brk, sbrk, and mmap, shows why freed chunks stay in bins, and recommends limiting heap growth or switching to jemalloc for faster reclamation.

Backend DevelopmentC runtimeglibc

0 likes · 20 min read

Analysis of glibc Memory Management and Solutions to an Online Memory Incident

Rare Earth Juejin Tech Community

Jul 8, 2024 · Backend Development

Understanding Netty's Pooled Memory Allocation Mechanism Based on jemalloc4

This article explains Netty's memory pool architecture after switching from jemalloc3 to jemalloc4, detailing the new memory size classes, core components such as PoolArena, PoolChunkList, PoolChunk, and PoolSubpage, and provides Java code snippets to illustrate their implementations.

Nettyjemallocperformance

0 likes · 10 min read

Understanding Netty's Pooled Memory Allocation Mechanism Based on jemalloc4

MaGe Linux Operations

May 4, 2024 · Databases

Unlocking Redis Memory: How Its Internal Model Impacts Performance

This article explains Redis's memory model—including memory statistics, allocation, internal data structures, object types, and encoding—while showing practical examples for estimating usage, optimizing consumption, and troubleshooting fragmentation in high‑concurrency environments.

Data StructuresMemory ManagementOptimization

0 likes · 34 min read

Unlocking Redis Memory: How Its Internal Model Impacts Performance

FunTester

Apr 12, 2024 · Backend Development

Performance Optimization Techniques for Backend Systems: Replacing Protobuf with C++ Classes, Cache‑Friendly Structures, jemalloc, and Lock‑Free Data Structures

The article presents practical backend performance optimization methods—including substituting Protobuf with native C++ classes, employing cache‑friendly data structures, integrating jemalloc/tcmalloc, using lock‑free double‑buffer designs, and tailoring data formats—to achieve up to three‑fold speed improvements and significant latency reductions.

Backend DevelopmentCCache Friendly

0 likes · 15 min read

Performance Optimization Techniques for Backend Systems: Replacing Protobuf with C++ Classes, Cache‑Friendly Structures, jemalloc, and Lock‑Free Data Structures

Architect

Mar 12, 2024 · Backend Development

Boost C++ Service Performance: 3× Faster with Classes, Cache‑Friendly Structures, jemalloc and Lock‑Free Designs

This article walks through a series‑by‑step performance‑tuning process for high‑throughput C++ services, replacing Protobuf with plain classes, adopting cache‑friendly hash tables, switching to jemalloc, implementing a double‑buffer lock‑free data structure, and tailoring data formats, each backed by concrete code examples, benchmark results, and analysis of trade‑offs.

CCache FriendlyPerformance Optimization

0 likes · 20 min read

Boost C++ Service Performance: 3× Faster with Classes, Cache‑Friendly Structures, jemalloc and Lock‑Free Designs

High Availability Architecture

Mar 6, 2024 · Backend Development

Performance Optimization Techniques: Replacing Protobuf with C++ Classes, Cache‑Friendly Structures, jemalloc, and Lock‑Free Designs

This article presents practical performance‑optimization strategies for high‑throughput C++ services, including replacing Protobuf with hand‑written classes, adopting cache‑friendly data structures, using jemalloc/tcmalloc instead of the default allocator, employing lock‑free double‑buffer designs, tailoring data formats for specific workloads, and leveraging profiling tools to measure gains.

CCache FriendlyOptimization

0 likes · 17 min read

Performance Optimization Techniques: Replacing Protobuf with C++ Classes, Cache‑Friendly Structures, jemalloc, and Lock‑Free Designs

Tencent Cloud Developer

Feb 29, 2024 · Backend Development

Performance Optimization Strategies for High‑Throughput Backend Services

The article outlines practical, continuous performance‑optimization tactics for high‑throughput back‑end services—replacing Protobuf with lightweight C++ classes, using cache‑friendly data structures, adopting jemalloc/tcmalloc, employing lock‑free double buffers, tailoring data formats, and leveraging profiling tools—to achieve multi‑fold speedups while balancing maintainability.

C++Cache FriendlyPerformance Optimization

0 likes · 18 min read

Performance Optimization Strategies for High‑Throughput Backend Services

Aikesheng Open Source Community

Jan 16, 2024 · Databases

Resolving MySQL OOM Issues Caused by Full‑Text Indexes: Tcmalloc vs. Jemalloc Memory Management

This article analyzes a MySQL 5.7 OOM incident triggered by a heavy full‑text query, examines memory usage patterns with Tcmalloc, demonstrates how switching to Jemalloc releases memory, and provides step‑by‑step commands and observations to prevent similar outages.

Full-Text IndexMemory ManagementMySQL

0 likes · 11 min read

Resolving MySQL OOM Issues Caused by Full‑Text Indexes: Tcmalloc vs. Jemalloc Memory Management

政采云技术

Sep 7, 2023 · Backend Development

Understanding Netty's Memory Management and Allocation Strategies

This article explains how Netty implements memory management by borrowing concepts from Jemalloc and Tcmalloc, detailing the hierarchy of arenas, chunks, pages and sub‑pages, the allocation algorithms for both large and small buffers, and the role of thread‑local caches in reducing fragmentation and improving performance.

Backend DevelopmentJavaMemory Management

0 likes · 24 min read

Understanding Netty's Memory Management and Allocation Strategies

iQIYI Technical Product Team

Aug 11, 2023 · Artificial Intelligence

Debugging Random OOM Issues in PyTorch Distributed Training on A100 Clusters

The iQIYI backend team traced random OOM crashes in PyTorch Distributed Data Parallel on an A100 cluster to a malformed DDP message injected by a security scan, which forced a near‑terabyte allocation; using jemalloc for diagnostics, they mitigated the issue by adjusting scan policies and collaborating with PyTorch to harden the protocol.

Memory DebuggingOOMPyTorch

0 likes · 9 min read

Debugging Random OOM Issues in PyTorch Distributed Training on A100 Clusters

ByteDance SYS Tech

May 26, 2023 · Fundamentals

Unlock Faster C++ Performance: Practical Jemalloc Optimization Techniques

This article explains the fundamentals of Linux memory allocation, introduces Jemalloc’s core algorithms and data structures, and provides concrete optimization steps—including arena tuning, tcache configuration, and slab size adjustments—to achieve measurable CPU savings in high‑concurrency C++ services.

ArenaC++jemalloc

0 likes · 19 min read

Unlock Faster C++ Performance: Practical Jemalloc Optimization Techniques

Ctrip Technology

Mar 9, 2023 · Backend Development

Optimizing Hotel Query Service Memory Usage: GC Tuning, Native Memory Management, and Migration to jemalloc

This article details the systematic reduction of memory consumption for Ctrip's hotel query service by halving container memory, evaluating and tuning modern garbage collectors, diagnosing off‑heap leaks, and ultimately replacing the default ptmalloc allocator with jemalloc to achieve stable performance and lower resource costs.

Backend PerformanceContainerizationGarbage Collection

0 likes · 22 min read

Optimizing Hotel Query Service Memory Usage: GC Tuning, Native Memory Management, and Migration to jemalloc

Qunar Tech Salon

Jan 31, 2023 · Operations

Root Cause Analysis and Mitigation of JVM GC‑Induced OOM and Memory Fragmentation in a Containerized Hotel Pricing Service

This article details how long JVM garbage‑collection pauses and glibc ptmalloc memory‑fragmentation caused container OOM kills in a hotel‑pricing system, and explains the step‑by‑step diagnosis, JVM tuning, Kubernetes health‑check adjustments, and the replacement of ptmalloc with jemalloc to eliminate the issue.

JVMKubernetesMemoryFragmentation

0 likes · 9 min read

Root Cause Analysis and Mitigation of JVM GC‑Induced OOM and Memory Fragmentation in a Containerized Hotel Pricing Service

ByteDance Web Infra

Aug 19, 2022 · Fundamentals

In‑Depth Analysis of dlmalloc, jemalloc, Scudo, and PartitionAlloc for Virtual‑Machine Memory Management

This article examines the design goals, key implementation details, strengths and weaknesses of four widely used memory allocators—dlmalloc, jemalloc, Scudo, and PartitionAlloc—highlighting how they address fragmentation, performance, and security in virtual‑machine runtimes and offering guidance for building efficient, safe allocators.

Scudodlmallocjemalloc

0 likes · 27 min read

In‑Depth Analysis of dlmalloc, jemalloc, Scudo, and PartitionAlloc for Virtual‑Machine Memory Management

iQIYI Technical Product Team

Nov 27, 2020 · Artificial Intelligence

Optimizing TensorFlow Serving Model Hot‑Update to Eliminate Latency Spikes in CTR Recommendation Systems

By adding model warm‑up files, separating load/unload threads, switching to the Jemalloc allocator, and isolating TensorFlow’s parameter memory from RPC request buffers, iQIYI’s engineers reduced TensorFlow Serving hot‑update latency spikes in high‑throughput CTR recommendation services from over 120 ms to about 2 ms, eliminating jitter.

Model Hot UpdateTensorFlow ServingWarmup

0 likes · 11 min read

Optimizing TensorFlow Serving Model Hot‑Update to Eliminate Latency Spikes in CTR Recommendation Systems

58 Tech

Oct 21, 2020 · Backend Development

Understanding Java Memory Pools: Netty’s Implementation and Underlying Theory

This article revisits memory allocation and reclamation concepts by examining Java's Netty memory pool implementation, its theoretical basis in jemalloc, and practical design choices such as arena allocation, thread‑local caches, pool chunks, sub‑pages, and multi‑threaded performance considerations.

Garbage CollectionJavaNetty

0 likes · 21 min read

Understanding Java Memory Pools: Netty’s Implementation and Underlying Theory

Architecture Digest

Mar 27, 2020 · Databases

Understanding Redis Memory Model: Objects, Allocation, and Internal Encoding

This article explains Redis's memory model by describing how to query memory usage, the roles of used_memory, used_memory_rss, mem_fragmentation_ratio, and mem_allocator, and then dives into the internal structures such as redisObject, SDS, jemalloc, and the encoding strategies for strings, lists, hashes, sets, and sorted sets.

Data StructuresMemory ManagementRedis

0 likes · 28 min read

Understanding Redis Memory Model: Objects, Allocation, and Internal Encoding

Huawei Cloud Developer Alliance

Dec 16, 2019 · Backend Development

Boost Kunpeng Server Apps: 7 Proven Performance Tuning Techniques

This guide walks you through seven practical optimization methods for Kunpeng‑based servers—including compiler flags, buffer selection, result caching, memory‑copy reduction, lock refinement, jemalloc integration, and cache‑line alignment—to fully exploit the hardware’s capabilities.

Compiler FlagsKunpengMemory

0 likes · 14 min read

Boost Kunpeng Server Apps: 7 Proven Performance Tuning Techniques

MaGe Linux Operations

Apr 20, 2019 · Databases

Unlocking Redis Memory: Deep Dive into Its Internal Model and Optimization

This article explains Redis's memory model—including memory statistics, allocation, object structures, internal encodings, and practical optimization techniques—so developers can accurately estimate memory usage, reduce fragmentation, and improve performance of high‑concurrency applications.

Memory ModelOptimizationRedis

0 likes · 32 min read

Unlocking Redis Memory: Deep Dive into Its Internal Model and Optimization

MaGe Linux Operations

Dec 25, 2018 · Databases

Unlocking Redis: How Its Memory Model Impacts Performance and Cost

This article explains Redis's memory model—including memory statistics, allocation strategies, object types, internal encodings, and practical optimization techniques—so developers can accurately estimate memory usage, reduce fragmentation, and improve overall system efficiency.

Memory ModelOptimizationRedis

0 likes · 31 min read

Unlocking Redis: How Its Memory Model Impacts Performance and Cost

Ctrip Technology

Oct 17, 2018 · Databases

Root Cause Analysis and Resolution of Intermittent Redis Connection Failures

This article presents a detailed investigation of occasional Redis connection errors in a large‑scale production environment, analyzing network packets, TCP backlog behavior, Redis internal client‑cron logic, jemalloc memory reclamation, and ultimately resolving the issue by adjusting query‑buffer handling and upgrading Redis to a newer version.

Performance tuningconnection timeoutjemalloc

0 likes · 19 min read

Root Cause Analysis and Resolution of Intermittent Redis Connection Failures

Efficient Ops

Aug 26, 2018 · Backend Development

Unlocking Redis Memory: How to Measure, Understand, and Optimize Its Usage

This article explains how to monitor Redis memory usage with the INFO command, interprets key metrics such as used_memory, used_memory_rss, and mem_fragmentation_ratio, and dives into Redis's internal memory layout, including allocators, redisObject, SDS, and the various object types and their encodings.

Backend DevelopmentData StructuresRedis

0 likes · 25 min read

Unlocking Redis Memory: How to Measure, Understand, and Optimize Its Usage

MaGe Linux Operations

Apr 29, 2018 · Backend Development

Unlock Redis Performance: Deep Dive into Its Memory Model and Optimization

This article explains Redis's memory model—including memory statistics, allocation, object structures, internal encoding, and practical optimization techniques—so developers can accurately estimate memory usage, reduce fragmentation, and choose the most efficient data representations for high‑performance applications.

Memory ModelOptimizationRedis

0 likes · 32 min read

Unlock Redis Performance: Deep Dive into Its Memory Model and Optimization

Java Backend Technology

Apr 18, 2018 · Databases

Unlocking Redis: Deep Dive into Its Memory Model and Optimization Techniques

This article explains Redis’s memory model—including memory statistics, allocation, object structures, internal encodings, and practical optimization strategies—providing detailed insights into how Redis stores data, manages memory fragmentation, and how developers can estimate and reduce memory usage for high‑performance deployments.

Data StructuresMemory ModelOptimization

0 likes · 30 min read

Unlocking Redis: Deep Dive into Its Memory Model and Optimization Techniques