Tagged articles
5 articles
Machine Learning Algorithms & Natural Language Processing
May 23, 2026 · Artificial Intelligence

10M‑Parameter Model Solves ARC and Sudoku – Bengio Team Bets on Multi‑Trajectory Reasoning

A 10‑million‑parameter GRAM model from Bengio, KAIST, Mila and NYU achieves 97% accuracy on Sudoku‑Extreme and competitive scores on ARC‑AGI tasks by replacing deterministic recursive updates with a probabilistic multi‑trajectory process. Extensive ablations show that both random guidance and depth‑supervised training are essential to its performance.

ARC‑AGI · GRAM · Generative Recursive Reasoning
9 min read
Machine Learning Algorithms & Natural Language Processing
May 22, 2026 · Artificial Intelligence

How a 10M‑Parameter Model Beats Large Models on Sudoku and ARC with Multi‑Trajectory Reasoning

The GRAM model introduced by Yoshua Bengio’s team replaces deterministic recursive updates with probabilistic multi‑trajectory sampling. This lets a 10M‑parameter network reach 97% accuracy on Sudoku‑Extreme, 52%/11% on ARC‑AGI, and near‑perfect results on N‑Queens and graph coloring, while also supporting unconditional generation tasks.

ARC‑AGI · GRAM · Sudoku
9 min read
Baobao Algorithm Notes
Mar 16, 2025 · Artificial Intelligence

Can a 7B LLM Master Sudoku From Scratch Using Reinforcement Learning?

This article details how a 7B‑parameter language model, fine‑tuned with DeepSeek's GRPO reinforcement‑learning algorithm and a carefully crafted multi‑component reward system, learned to solve Sudoku puzzles without any cold‑start data. It outperformed a comparable 3B model, and the results offer key insights for structured reasoning tasks.

AI training · GRPO · Qwen
15 min read