Tagged articles
1 article
Baobao Algorithm Notes
Oct 10, 2024 · Artificial Intelligence

How MCTS Powers Inference in OpenAI’s o1: A Deep Dive with rStar

This article explains how the inference component of OpenAI’s o1 model can be implemented using Monte‑Carlo Tree Search, detailing the action space, rollout process, UCT scoring, and best‑path selection, with a concrete walkthrough of Microsoft’s open‑source rStar code.

Large Language Models · MCTS · OpenAI o1
26 min read