Alimama Tech
Author

Alimama Tech

Official Alimama tech channel, showcasing all of Alimama's technical innovations.

247
Articles
0
Likes
769
Views
0
Comments
Recent Articles

Latest from Alimama Tech

100 recent articles max
Alimama Tech
Alimama Tech
May 7, 2026 · Artificial Intelligence

Dual‑Phase RL‑LLM Framework DARA for Few‑Shot Online Advertising Budget Allocation

The DARA framework splits online advertising budget allocation into a few‑shot LLM reasoning stage and a fine‑grained optimizer stage, enhanced by a dynamically updated RL‑fine‑tuning algorithm (GRPO‑Adaptive), achieving significantly lower ROI variance than traditional baselines in both real and simulated environments.

LLMbudget allocationfew-shot learning
0 likes · 16 min read
Dual‑Phase RL‑LLM Framework DARA for Few‑Shot Online Advertising Budget Allocation
Alimama Tech
Alimama Tech
Apr 9, 2026 · Artificial Intelligence

How LLM‑Powered AI Transforms Taobao Product Selection: From DeepSearch to Agentic RL

This article analyzes the challenges of traditional product selection on Taobao and presents an LLM‑driven solution that combines multi‑round online search, DeepSearch vs. WideSearch strategies, sample construction, SFT and RL training, and shows experimental results that improve relevance, diversity, and efficiency of the selected product set.

LLMe-commerceproduct selection
0 likes · 20 min read
How LLM‑Powered AI Transforms Taobao Product Selection: From DeepSearch to Agentic RL
Alimama Tech
Alimama Tech
Mar 26, 2026 · Industry Insights

How Alibaba’s Large User Model (LUM) Boosted CTR by 4.5% and Scaled to Billions of Parameters

The article analyzes the evolution from traditional modular recommendation models to a generative Large User Model (LUM), detailing its three‑stage paradigm, tokenization, training objectives, scaling‑law findings, offline and online experiments, and the AI‑infra innovations that enabled a 4.5% CTR lift in production.

CTR predictionGenerative ModelingRecommendation Systems
0 likes · 18 min read
How Alibaba’s Large User Model (LUM) Boosted CTR by 4.5% and Scaled to Billions of Parameters
Alimama Tech
Alimama Tech
Mar 5, 2026 · Artificial Intelligence

How BiCB Revolutionizes Real‑Time Bidding for Live Ads with Light‑Weight Optimization

This article presents the BiCB (Binary Constrained Bidding) algorithm, a lightweight auto‑bidding solution for live advertising that combines optimal bidding formulas derived from linear‑programming dual analysis with traffic‑prediction statistics to achieve near‑optimal performance under budget and CPC constraints.

Online OptimizationTraffic Predictionauto-bidding
0 likes · 17 min read
How BiCB Revolutionizes Real‑Time Bidding for Live Ads with Light‑Weight Optimization
Alimama Tech
Alimama Tech
Feb 5, 2026 · Artificial Intelligence

Can Few-Shot Reinforcement Learning Supercharge Budget-Constrained Auto-Bidding?

This paper introduces ABPlanner, a few‑shot, context‑aware budget planner that enhances budget‑constrained auto‑bidding in online advertising by hierarchically allocating budgets across short‑term stages and training a sequential decision‑maker with deep reinforcement learning, achieving significant gains in simulated and real‑world A/B tests.

auto-biddingbudget allocationfew-shot learning
0 likes · 13 min read
Can Few-Shot Reinforcement Learning Supercharge Budget-Constrained Auto-Bidding?
Alimama Tech
Alimama Tech
Jan 27, 2026 · Artificial Intelligence

How Alibaba’s “Yuanfang” Agent System Enables Reliable AI Coding for Complex Ad Engines

This article details Alibaba‑Mama’s AI‑coding initiative, describing how a decoupled CommonAds architecture and a multi‑agent framework called “Yuanfang” transform large‑scale advertising engine development by integrating AI‑friendly design, context‑driven specifications, and automated code‑generation, testing, and validation pipelines.

AI codingCommonAdsIFLOW‑CLI
0 likes · 16 min read
How Alibaba’s “Yuanfang” Agent System Enables Reliable AI Coding for Complex Ad Engines
Alimama Tech
Alimama Tech
Jan 7, 2026 · Artificial Intelligence

Can Text‑Driven Vibe Coding Tame Complex AI Infra? A Deep Dive into GPU Time‑Sharing for Agentic RL

This article examines the limitations of Vibe Coding for large AI infrastructure, proposes a text‑driven, document‑centric workflow, and presents a time‑multiplexed GPU scheduling solution that dramatically improves rollout throughput and reduces timeouts in large‑scale Agentic RL training.

Design DocumentsGPU schedulingTime Multiplexing
0 likes · 21 min read
Can Text‑Driven Vibe Coding Tame Complex AI Infra? A Deep Dive into GPU Time‑Sharing for Agentic RL