Tagged articles
3 articles
Page 1 of 1
AI Engineer Programming
AI Engineer Programming
Apr 5, 2026 · Artificial Intelligence

How Kimi, Cursor, and Chroma Use Reinforcement Learning to Train Agent Models

The article analyzes three recent technical reports—Moonshot AI's Kimi K2.5, Cursor's Composer 2, and Chroma's Context‑1—detailing how each system trains agent models with reinforcement learning, parallel orchestration, self‑summarization, and self‑editing, and highlights shared methodological themes and performance gains.

Chroma Context-1Cursor ComposerKimi
0 likes · 19 min read
How Kimi, Cursor, and Chroma Use Reinforcement Learning to Train Agent Models
PaperAgent
PaperAgent
Mar 4, 2026 · Artificial Intelligence

How Doubao-Seed-2.0 Redefines Native Multimodal Agents and Coding

Doubao-Seed-2.0 showcases a native multimodal architecture that unifies vision and language, delivers state‑of‑the‑art visual‑language performance, and dramatically improves code generation for front‑end, bug‑fixing, and research‑assistant tasks, illustrating the shift toward truly functional AI agents.

AI Research AssistantDoubaoagent models
0 likes · 9 min read
How Doubao-Seed-2.0 Redefines Native Multimodal Agents and Coding
Old Meng AI Explorer
Old Meng AI Explorer
Jan 23, 2026 · Artificial Intelligence

How a 4B‑Parameter AgentCPM‑Explore Beats 30B Models in Long‑Range Tasks

AgentCPM‑Explore, a 4‑billion‑parameter open‑source agent model, breaks the conventional belief that larger models always perform better by achieving state‑of‑the‑art results on eight long‑duration benchmarks, surpassing many 8B and even some 30B models while enabling efficient edge deployment.

AIEdge AIPerformance Evaluation
0 likes · 12 min read
How a 4B‑Parameter AgentCPM‑Explore Beats 30B Models in Long‑Range Tasks