DaTaobao Tech
Mar 31, 2025 · Artificial Intelligence
AI Audio Generation and Voice Synthesis Practices at Taobao
The article surveys Taobao’s AI‑generated audio pipeline, detailing eight technical papers on image‑to‑video, OpenAI o1, multimodal video, and large‑model voice synthesis, while highlighting advances like VALL‑E, CosyVoice, F5‑TTS, data‑cleaning methods, and e‑commerce applications such as voice‑cloned live streams, multilingual TTS, AI video‑audio integration, and audiobook production.
AI audioTTSdata cleaning
0 likes · 11 min read