Didi Tech
Jun 1, 2018 · Artificial Intelligence
Didi's Attention-Based End-to-End Mandarin Speech Recognition: A Detailed Review
Didi’s attention‑based end‑to‑end Mandarin speech recognizer, built on the Listen‑Attend‑Spell architecture and modeling roughly 5,000 common characters, delivers 15‑25% relative accuracy gains over its prior LSTM‑CTC system while cutting model size, latency and server requirements and simplifying training by eliminating separate acoustic, pronunciation and language components.
AttentionEnd-to-EndLAS
0 likes · 14 min read