Databases 19 min read

Baidu's High-Performance KV Storage System Practice: Cloud-Native Optimization and Software-Hardware Co-Design

Baidu’s UNDB KV storage system, built with cloud‑native containerization and software‑hardware co‑design, separates key‑value processing to cut write amplification below 1.5×, leverages an OpenChannel SSD engine for over 30% performance gains, scales to 10,000+ servers and 100 PB, handles more than a trillion daily requests across search and feed, and cuts operating costs by nearly half.

Baidu Geek Talk

Sep 27, 2021

Baidu's High-Performance KV Storage System Practice: Cloud-Native Optimization and Software-Hardware Co-Design

This article introduces Baidu's practice of building high-performance KV storage systems using cloud-native, full-user-state, and software-hardware协同 technologies. Since 2016, Baidu's "Search + Information Feed" dual-engine驱动构建内容生态, with KV storage serving as the widely-used online storage service in search and recommendation middleware, facing dual tests of storage scale and access traffic.

By early 2018, Baidu had deployed over 10,000 servers for various KV storage services, with data scale exceeding 100PB, handling nearly 100 billion daily access requests. The challenges included: 1) Performance challenges - balancing read/write performance, latency, and space utilization in engine design; 2) Cloud-native challenges - containerization transformation requiring fine-grained control of memory and storage I/O, and dynamic management requiring fault tolerance and elastic scaling; 3) Meeting specialized business requirements while maintaining common capabilities.

The solution, UNDB (United NoSQL Database), implements: Key-Value separation engine reducing write amplification to <1.5x with space amplification <1.6x; OpenChannel SSD-based software-hardware协同 engine achieving >30% performance improvement with overall amplification <1.1x; Cloud-native architecture with Operator-based global data scheduling; Multi-model storage architecture with pluggable layers for interfaces, data models, data synchronization, and engines.

Results: The system covers Baidu's main information feed and search scenarios with hundreds of thousands of instances, handling over 1 trillion daily access requests, reducing costs by nearly 50%.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

cloud-native NoSQL KV storage distributed storage OpenChannel SSD RocksDB optimization software-hardware co-design UNDB

Written by

Baidu Geek Talk

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.