Tag

Critique Fine-Tuning

1 views collected around this technical thread.

DataFunTalk
DataFunTalk
Mar 9, 2025 · Artificial Intelligence

Critique Fine-Tuning (CFT): Boosting Large Language Model Reasoning with Minimal Data

The paper introduces Critique Fine-Tuning (CFT), a method that replaces simple imitation in supervised fine‑tuning with critique‑based learning, achieving superior reasoning performance on mathematical benchmarks using only 50 K samples, outperforming traditional reinforcement‑learning approaches that require millions of examples.

AI reasoningCritique Fine-TuningMathematical Benchmarks
0 likes · 7 min read
Critique Fine-Tuning (CFT): Boosting Large Language Model Reasoning with Minimal Data