Jan 28, 2026 · Databases

How We Fixed Minute‑Level Data Rollbacks by Replacing Impala with Apache Doris

Facing mysterious minute‑level data rollbacks caused by Impala's metadata cache, a team migrated from a T+1 Hive‑Impala stack to Apache Doris, achieving real‑time consistency, higher performance, simplified ETL, and reduced operational complexity across their points‑based loyalty system.

Apache Doris · Data Warehouse · Impala

0 likes · 9 min read


Tencent Architect

Feb 23, 2021 · Artificial Intelligence

Analysis and Optimization of CephFS I/O Performance for AI Training on the Xingchen Compute Platform

This article investigates why AI training tasks on Tencent's Xingchen compute platform suffer severe I/O slowdowns when using CephFS, analyzes the underlying Ceph‑FUSE and MDS mechanisms, and proposes metadata‑caching and file‑caching optimizations that can speed up training by three to four times.

AI training · Ceph-FUSE · CephFS

0 likes · 21 min read
