Tag

audio representation

0 views collected around this technical thread.

DataFunSummit
DataFunSummit
Sep 16, 2024 · Artificial Intelligence

Multimodal Content Understanding and Cold-Start Practices in NetEase Cloud Music Community Recommendation System

This article details how NetEase Cloud Music leverages multimodal content understanding—using audio models like MusicCLIP and Audio MAE and image‑text fusion via FLAVA—to improve recommendation performance for new content and new users, covering system architecture, cold‑start solutions, and future AI‑driven directions.

AI modelsCold Startaudio representation
0 likes · 15 min read
Multimodal Content Understanding and Cold-Start Practices in NetEase Cloud Music Community Recommendation System
DataFunSummit
DataFunSummit
Jul 18, 2024 · Artificial Intelligence

Tencent Music Tianqin Lab’s Practice and Applications of Audio Representation Large Models

This article reviews Tencent Music Tianqin Lab’s research on audio representation large models, covering background, the evolution of audio features, self‑supervised methods such as SimCLR, BYOL, MAE, MLM, benchmark results, multimodal extensions, and real‑world applications like song authenticity detection and search ranking.

Tencent Musicaudio representationlarge models
0 likes · 20 min read
Tencent Music Tianqin Lab’s Practice and Applications of Audio Representation Large Models
DataFunTalk
DataFunTalk
Jun 30, 2024 · Artificial Intelligence

Application and Exploration of Large Audio Representation Models for Cold-Start Songs in QQ Music

This article presents a technical overview of how large‑scale audio representation models are fine‑tuned with I2I co‑occurrence and U2I interaction data to improve cold‑start song recommendation on QQ Music, describing the challenges, methodology, deployment scenarios, and experimental results.

I2I fine-tuningU2I fine-tuningaudio representation
0 likes · 17 min read
Application and Exploration of Large Audio Representation Models for Cold-Start Songs in QQ Music