DataFunTalk
Mar 3, 2021 · Big Data
Kwai Scheduler: Scaling YARN for Ultra‑Large Clusters at Kuaishou
This article outlines the challenges Kuaishou faces in large-scale offline computing and describes how the team customized YARN and built the Kwai Scheduler, a multi-threaded, pluggable resource scheduler for clusters of tens of thousands of nodes. The scheduler supports diverse workloads, including ETL, ad-hoc queries, machine-learning training, and real-time Flink jobs.
Cluster Optimization · Distributed Systems · Kwai Scheduler
15 min read