Artificial Intelligence 11 min read

Evolution of Alibaba's Advertising CTR Prediction Models: From Linear Methods to Deep Interest Evolution Networks

The article reviews the characteristics of e‑commerce personalized prediction, outlines Alibaba's model iteration from large‑scale linear regression to deep learning architectures such as DIN, CrossMedia, and Deep Interest Evolution, and discusses future directions like disentangled representation and white‑box modeling.

DataFunTalk
DataFunTalk
DataFunTalk
Evolution of Alibaba's Advertising CTR Prediction Models: From Linear Methods to Deep Interest Evolution Networks

The talk begins by describing the unique traits of e‑commerce data personalization, where ads are displayed based on inferred user intent rather than explicit queries, and outlines three key dimensions—explicit content, account‑level IDs, and final feedback—to characterize the data.

It then traces Alibaba Mama's model evolution since 2012, starting with large‑scale feature‑rich linear regression (MLR) and its non‑linear extension (MLR with piecewise linear learning), followed by the first generation of deep CTR models that combine embedding layers with simple MLPs, achieving significant performance gains.

To address the diversity of user interests, the Deep Interest Network (DIN) introduces an attention‑based activation unit that dynamically selects relevant historical behaviors, improving CTR by 10%, CVR by 3.3%, and GPM by 12.6%.

Subsequent work incorporates visual features through the CrossMedia network, handling massive image data by storing compressed embeddings on remote servers.

Recognizing the limitations of sequential RNNs for heterogeneous interests, the Deep Interest Evolution (DIE) framework adds an interest extraction layer (attention‑augmented GRU) and an interest evolution layer, enabling the model to capture evolving user preferences over time.

To balance model complexity and online latency, a teacher‑student distillation approach is employed, featuring collaborative training, parameter sharing, and gradient blocking to compress the model while preserving performance.

The presentation concludes with two future research directions: developing disentangled representations tailored to e‑commerce items, and building more interpretable white‑box models that expose the concepts influencing user decisions.

Author bio: Zhou Guorui, Alibaba algorithm expert, Ph.D. candidate at Beijing University of Posts and Telecommunications, focuses on large‑scale machine learning, NLP, computational advertising, and recommendation systems; core developer of Alibaba's XDL deep learning framework.

E-commerceDeep LearningCTR predictionRecommendation systemsattention mechanismmodel evolution
DataFunTalk
Written by

DataFunTalk

Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.