Machine Heart
May 29, 2026 · Artificial Intelligence
When a Celebrity Name Stumped LLMs: The Year‑Old Insight Behind Low‑Frequency Token Degradation
A fan's test involving the idol Ma Jiaqi exposed a large language model's inability to generate his name. The analysis links that failure to low-frequency token degradation, reviews academic papers on frequency-aware prompting and training, and points to a tokenizer change by Anthropic that confirms the diagnosis.
Anthropic · EMNLP · ACL
