Dec 6, 2023 · Artificial Intelligence

Real-time Controllable Multi-Objective Re-ranking Models for Taobao Feed Recommendation

The paper introduces a real‑time controllable, multi‑objective re‑ranking framework for Taobao’s feed recommendation that combines actor‑critic reinforcement learning with hypernetworks to instantly adjust objective weights, handling diverse media and cold‑start constraints while delivering higher click‑through, diversity, and cold‑start ratios with only 20‑25 ms latency.

AlibabaReal-time ControlRecommendation Systems

0 likes · 34 min read

Real-time Controllable Multi-Objective Re-ranking Models for Taobao Feed Recommendation

DataFunTalk

Nov 14, 2023 · Artificial Intelligence

Real-Time Controllable Multi-Objective Re‑ranking for Taobao Feed

This article presents a comprehensive study of a controllable multi‑objective re‑ranking model for Taobao's information‑flow recommendation, detailing the challenges of complex feed scenarios, three modeling paradigms (V1‑V3), an actor‑critic reinforcement learning framework with hypernet‑generated weights, and extensive online evaluation results.

Real-time ControlRecommendation SystemsRe‑ranking

0 likes · 31 min read