Tag

grokking

0 views collected around this technical thread.

Architect
Architect
Apr 19, 2023 · Artificial Intelligence

Emergence in Large Language Models: Phenomena, Explanations, and Implications

This article reviews the emergence phenomena observed in large language models, explains how model scale, in‑context learning and chain‑of‑thought prompting contribute to sudden performance gains, discusses small‑model alternatives, and explores the relationship between emergence and the training‑time Grokking effect.

AI researchChain-of-ThoughtIn-Context Learning
0 likes · 13 min read
Emergence in Large Language Models: Phenomena, Explanations, and Implications