KeSpeech: A Large-Scale Chinese Mandarin Dialect Speech Benchmark Presented at NeurIPS 2021
KeSpeech is a benchmark jointly released by Beike AI and Tsinghua University at NeurIPS 2021. It provides a large-scale Chinese Mandarin dialect speech dataset covering 30,000 speakers from 34 cities and supports speech recognition, speaker verification, dialect identification, and voice conversion tasks. It also includes multi-scenario and parallel corpora for advanced research.