Tagged articles
2 articles
Page 1 of 1
Machine Heart
Machine Heart
May 30, 2026 · Artificial Intelligence

Solving AdamW & Muon Instability: Pion Optimizer Updates Large Models on an Iso‑Spectral Manifold

The Pion optimizer leverages iso‑spectral manifold updates to preserve the spectral norm of weight matrices, eliminating additive‑update instability and enabling stable, efficient training of billion‑parameter LLMs across pre‑training, fine‑tuning, and reinforcement‑learning stages, outperforming AdamW and Muon.

AdamWLarge Language ModelsMuon
0 likes · 14 min read
Solving AdamW & Muon Instability: Pion Optimizer Updates Large Models on an Iso‑Spectral Manifold
Machine Heart
Machine Heart
Apr 25, 2026 · Artificial Intelligence

How DeepSeek and Kimi’s Open‑Source Collaboration Is Redefining China’s AI Landscape

The article analyses DeepSeek V4’s technical report, revealing repeated “encounters” between DeepSeek and Kimi—shared MLA attention, Muon optimizer, and divergent long‑context strategies—while highlighting their open‑source releases, hardware adaptations, and ecosystem impact that dramatically lower deployment costs for Chinese AI.

AIDeepSeekKimi
0 likes · 10 min read
How DeepSeek and Kimi’s Open‑Source Collaboration Is Redefining China’s AI Landscape