DaTaobao Tech
Mar 29, 2022 · Artificial Intelligence
Dynamic Weight Averaging and Gradient Normalization for Multi‑Task Recommendation Models
To improve multi‑task recommendation in the “每平每屋” system, the team augments an MMoE ranking model with dynamic weight averaging, dynamic task prioritization, and GradNorm gradient normalization, stabilizing loss convergence across CTR, CVR, and fav tasks and delivering 3–4% online metric gains.
A/B testingDynamic Weight AveragingGradient Normalization
0 likes · 10 min read