Baidu Tech Salon
Feb 28, 2024 · Big Data
Design, Optimization, and Practice of Baidu's Fusion Compute Engine for Data Warehouse
Baidu's Fusion Compute Engine is built on Spark and uses a single-layer wide-table model. It combines data skipping, push-down, code generation, vectorization, and extensive tuning to cut ad-hoc query latency to seconds, shrink storage by about 30%, and speed up ETL jobs while staying stable under massive data-warehouse workloads.
Baidu · Fusion Compute Engine · Spark
10 min read