Tag

dual-tower DNN

0 views collected around this technical thread.

DataFunTalk
DataFunTalk
Feb 27, 2021 · Artificial Intelligence

Optimizing Coarse Ranking Models for Short Video Recommendation: From GBDT to Dual‑Tower DNN and Cascading

This article details the practical upgrades of iQIYI's short‑video recommendation coarse‑ranking pipeline, moving from a GBDT model to a dual‑tower DNN, applying knowledge distillation, embedding compression, inference optimizations, and finally a cascade architecture to align with the fine‑ranking model while reducing resource consumption.

cascading modelcoarse rankingdual-tower DNN
0 likes · 12 min read
Optimizing Coarse Ranking Models for Short Video Recommendation: From GBDT to Dual‑Tower DNN and Cascading
iQIYI Technical Product Team
iQIYI Technical Product Team
Feb 26, 2021 · Artificial Intelligence

Optimization of Coarse Ranking Models for Short‑Video Recommendation at iQIYI

iQIYI’s short‑video recommendation team replaced a GBDT coarse‑ranking model with a lightweight dual‑tower DNN, applied knowledge distillation, sparse‑aware embedding optimization, and inference merging, then introduced a cascade MMOE architecture, achieving comparable accuracy with half the memory, ~19 ms latency reduction, and measurable gains in watch time, CTR and engagement.

Online Inferencecascade modelcoarse ranking
0 likes · 15 min read
Optimization of Coarse Ranking Models for Short‑Video Recommendation at iQIYI