Nine Institutions Unveil Comprehensive Survey of Audio‑Visual Intelligence in the Large‑Model Era
A joint survey by nine leading research groups maps a decade of progress in audio‑visual intelligence (AVI). It presents an evolution tree, a unified taxonomy, three core strands, and six future research axes that together chart the role of AVI in the era of large foundation models.

