Tagged articles
3 articles
PaperAgent
Feb 2, 2026 · Artificial Intelligence

How Kimi K2.5 Achieves Multimodal Mastery with Joint Training and Agent Swarms

The Kimi K2.5 technical report reveals how a Chinese team combined joint text‑vision training, a novel Zero‑Vision SFT method, and a parallel agent‑swarm architecture to deliver top‑ranked multimodal performance, dramatically faster inference, and open‑source access for broader AI research.

AI research, Agent Swarm, Kimi-K2.5
AI Frontier Lectures
Jul 30, 2025 · Artificial Intelligence

DualReal: Seamless Identity and Motion Customization for Video Generation

DualReal introduces a novel adaptive joint training framework that simultaneously customizes subject identity and motion dynamics in video generation, overcoming the conflicts of traditional isolated approaches by using a dual-domain perception adapter and stage-fusion controller, achieving up to 31.8% improvement on CLIP‑I and DINO‑I metrics.

Video Generation, diffusion models, dual-domain adaptation
Xiaohongshu Tech REDtech
Sep 23, 2024 · Artificial Intelligence

AlignRec: A Joint Training Framework for Aligning Multimodal Representations with Personalized Recommendation

AlignRec is a joint‑training framework that synchronizes multimodal encoders with personalized recommendation models through a staged alignment strategy and three specialized loss functions. It preserves both content and ID signals, achieves state‑of‑the‑art performance on multiple datasets, and releases improved multimodal features for Amazon datasets.

AI, evaluation metrics, joint training