Teaching 7,000 Languages: How LASA’s Semantic Bottleneck Enables Multilingual LLM Safety
The paper identifies a language-agnostic "semantic bottleneck" layer inside large language models and introduces LASA, a three-step framework that (1) locates this layer, (2) extracts safety signals from it with a lightweight interpreter, and (3) injects those signals during training via a Kahneman-Tversky Optimization (KTO) loss. The authors report substantially improved multilingual safety without per-language data collection.
