Tag

bitmap indexing


DataFunSummit
May 27, 2024 · Big Data

Design and Optimization of Zhihu's Bridge Platform for DMP/CDP: Architecture, Challenges, and Solutions

This article presents a comprehensive case study of Zhihu's Bridge platform, detailing its background, five core modules, unified architecture built on Spark and Flink, bitmap‑based tagging, and performance optimizations that address query speed, write latency, and high‑QPS online checks while outlining future directions with Doris 2.0 and large language models.

CDP · DMP · Flink
27 min read
政采云技术
Sep 19, 2023 · Big Data

Techniques for Processing Massive Data: Sorting, Querying, Top‑K, and Deduplication

This article explains core concepts and practical solutions for handling massive datasets that cannot fit into memory, covering batch processing, distributed sorting, bitmap indexing, hash‑based lookups, top‑K extraction, and deduplication techniques with code examples and multi‑machine strategies.

Deduplication · big data · bitmap indexing
18 min read
DataFunTalk
May 9, 2021 · Big Data

User Segmentation and Growth Practices for Mini‑Programs Based on Doris

This article is a case study of how Zhao Yuyang, a senior R&D engineer at Baidu, built a Doris‑based user‑segmentation system for mini‑programs. It covers the product's fine‑grained private‑domain operation capabilities, four technical challenges, and the architecture and solutions (global dictionaries, bitmap storage, partitioning, tag optimization, dynamic and static query handling, and rapid user‑package generation), and closes with the future roadmap.

Data Engineering · Doris · big data
20 min read