Alimama Tech
Author

Alimama Tech

Official Alimama tech channel, showcasing all of Alimama's technical innovations.

247
Articles
0
Likes
770
Views
0
Comments
Recent Articles

Latest from Alimama Tech

100 recent articles max
Alimama Tech
Alimama Tech
Jul 23, 2025 · Artificial Intelligence

How Differentiable Solver Search Accelerates Diffusion Model Sampling

This article presents a differentiable solver search method that quickly finds high‑quality sampling paths for diffusion models, demonstrating significant FID improvements across Rectified‑Flow, DDPM/VP, and text‑to‑image models while requiring no model parameter changes.

AIdifferentiable solverdiffusion models
0 likes · 20 min read
How Differentiable Solver Search Accelerates Diffusion Model Sampling
Alimama Tech
Alimama Tech
Jul 17, 2025 · Artificial Intelligence

How to Build a High‑Scoring AI Werewolf Agent: Strategies, Prompt Engineering, and Code

This article details the author's experience designing a top‑performing AI Werewolf agent for the Taotian Group's AI Werewolf Challenge, covering game rules, core challenges, prompt engineering, caching, concurrent requests, model selection, reinforcement‑learning‑style tuning, and tactical strategies for each role, with code examples.

AI AgentLLMPrompt Engineering
0 likes · 25 min read
How to Build a High‑Scoring AI Werewolf Agent: Strategies, Prompt Engineering, and Code
Alimama Tech
Alimama Tech
Jul 11, 2025 · Artificial Intelligence

Inside Alibaba’s Taotian AI Team: From Werewolf Games to RecGPT

This article explores Alibaba’s Taotian technology team, revealing how its young engineers blend cutting‑edge AI research with playful projects like AI‑driven Werewolf games, develop large‑scale recommendation models such as RecGPT, and foster a culture of curiosity, rapid experimentation, and flat, tech‑first management.

AIGCArtificial IntelligenceR&D Management
0 likes · 24 min read
Inside Alibaba’s Taotian AI Team: From Werewolf Games to RecGPT
Alimama Tech
Alimama Tech
Jul 9, 2025 · Artificial Intelligence

How to Make LLMs Recognize and Resolve Their Own Uncertainty

This article introduces ConfuseBench, a benchmark that classifies LLM uncertainty into document‑missing, ability‑limited, and ambiguous types, and presents methods—including retrieval, chain‑of‑thought, and clarification—to detect and actively resolve uncertainty, improving answer quality across diverse tasks.

ClarificationInquiryLLM
0 likes · 17 min read
How to Make LLMs Recognize and Resolve Their Own Uncertainty
Alimama Tech
Alimama Tech
Jun 25, 2025 · Artificial Intelligence

Introducing ROLL: A Scalable, User‑Friendly RL Framework for Large‑Scale LLM Training

ROLL is an open‑source reinforcement‑learning framework designed for large language model post‑training that combines multi‑task RL, agentic support, flexible algorithm configuration, elastic resource scheduling, and rich observability, delivering significant accuracy gains across benchmarks while remaining easy to use for researchers, product developers, and infrastructure engineers.

AI FrameworkLarge Language ModelsOpen Source
0 likes · 11 min read
Introducing ROLL: A Scalable, User‑Friendly RL Framework for Large‑Scale LLM Training
Alimama Tech
Alimama Tech
May 14, 2025 · Artificial Intelligence

Deep Research‑Driven Risk Root‑Cause Analysis with Domain Graph Constraints for Large‑Scale Advertising Traffic

This article presents a large‑scale advertising risk‑control solution that combines deep‑research paradigms, domain‑graph constraints, and large language models to enable explainable, responsible, and high‑precision fraud detection, detailing system architecture, challenges, demo workflow, and future directions.

AILarge Language Modeladvertising fraud
0 likes · 11 min read
Deep Research‑Driven Risk Root‑Cause Analysis with Domain Graph Constraints for Large‑Scale Advertising Traffic
Alimama Tech
Alimama Tech
May 12, 2025 · Artificial Intelligence

Universal Recommendation Model (URM): A General Large‑Model Recall System for Advertising

The article presents the Universal Recommendation Model (URM), a large‑language‑model‑based recall framework that integrates world knowledge and e‑commerce expertise through knowledge injection and prompt‑driven alignment, achieving significant offline recall gains and a 3.1% increase in ad consumption while meeting high‑QPS, low‑latency production constraints.

AdvertisingLarge Language ModelMultimodal
0 likes · 17 min read
Universal Recommendation Model (URM): A General Large‑Model Recall System for Advertising
Alimama Tech
Alimama Tech
Apr 23, 2025 · Artificial Intelligence

How AI Agents Outsmart Humans in the “Who Is Spy” Campus Challenge

The campus AI Agent competition showcased how large‑language‑model‑powered agents can reason, deceive, and collaborate in a social deduction game, revealing model performance trends, participant insights, and future directions for multi‑agent AI research.

AIAgent CompetitionLarge Language Models
0 likes · 6 min read
How AI Agents Outsmart Humans in the “Who Is Spy” Campus Challenge
Alimama Tech
Alimama Tech
Apr 23, 2025 · Artificial Intelligence

Distribution-aware Graph Prompt Tuning (DAGPrompT) for Heterophilic Graphs

Distribution‑aware Graph Prompt Tuning (DAGPrompT) tackles the pre‑training/downstream mismatch on heterophilic graphs by jointly applying low‑rank GLoRA adaptation and hop‑specific prompts that recast tasks as link‑prediction, yielding up to 4.79% accuracy gains and an average 2.43% improvement in few‑shot node classification.

Prompt Tuningdistribution-awarefew-shot learning
0 likes · 9 min read
Distribution-aware Graph Prompt Tuning (DAGPrompT) for Heterophilic Graphs
Alimama Tech
Alimama Tech
Apr 23, 2025 · Artificial Intelligence

Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning

The paper introduces an explainable LLM framework (ELLM‑rele) that uses chain‑of‑thought reasoning and a multi‑dimensional knowledge distillation pipeline to compress large‑model relevance judgments into lightweight student models, achieving superior offline relevance scores and online click‑through and conversion improvements in Taobao’s search advertising.

Knowledge DistillationLLMchain-of-thought
0 likes · 17 min read
Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning