Bilibili Tech
Jan 31, 2023 · Big Data
Design and Optimization of Real-Time Data Quality Control (DQC) Platform on Bilibili's Big Data System
Bilibili redesigned its real-time data-quality control platform by replacing per-rule Flink jobs with a unified, dynamically-configured architecture that classifies Kafka topics, aggregates via InfluxDB full-table and continuous queries, mitigates data inflation, adds a high-performance proxy, and implements robust monitoring and recovery to ensure scalable, reliable data quality for its big-data services.
Big DataDQCFlink
0 likes · 22 min read