Multimodal Content Understanding and Cold-Start Practices in NetEase Cloud Music Community Recommendation System

This article details how NetEase Cloud Music uses multimodal content understanding (audio models such as MusicCLIP and Audio MAE, plus image‑text fusion via FLAVA) to improve recommendations for new content and new users. It covers the system architecture, cold‑start solutions, and future AI‑driven directions.

AI models · Multimodal Learning · audio representation

0 likes · 15 min read

DataFunSummit

Jul 18, 2024 · Artificial Intelligence

Tencent Music Tianqin Lab’s Practice and Applications of Audio Representation Large Models

This article reviews Tencent Music Tianqin Lab’s research on large audio representation models. It covers the background, the evolution of audio features, self‑supervised methods such as SimCLR, BYOL, MAE, and MLM, benchmark results, multimodal extensions, and real‑world applications such as song authenticity detection and search ranking.

Tencent Music · audio representation · multimodal AI

0 likes · 20 min read

DataFunTalk

Jun 30, 2024 · Artificial Intelligence

Application and Exploration of Large Audio Representation Models for Cold-Start Songs in QQ Music

This article gives a technical overview of how large‑scale audio representation models are fine‑tuned on item‑to‑item (I2I) co‑occurrence data and user‑to‑item (U2I) interaction data to improve cold‑start song recommendation on QQ Music. It describes the challenges, methodology, deployment scenarios, and experimental results.

I2I fine-tuning · U2I fine-tuning · audio representation

0 likes · 17 min read
