Tag

visual-language

0 views collected around this technical thread.

DataFunSummit
DataFunSummit
Jan 10, 2024 · Artificial Intelligence

Baidu Commercial Multimodal Understanding and AIGC Innovation Practices

This article presents Baidu's commercial multimodal understanding and AIGC innovations, detailing rich‑media multimodal perception, a unified large‑scale representation framework, scenario‑specific fine‑tuning, and practical applications such as marketing copy, digital‑human video, and poster generation.

AIGCBaiduMultimodal
0 likes · 12 min read
Baidu Commercial Multimodal Understanding and AIGC Innovation Practices
DataFunTalk
DataFunTalk
Sep 5, 2023 · Artificial Intelligence

Baidu Commercial Multimodal Understanding and AIGC Innovation Practices

This article presents Baidu's commercial multimodal understanding framework and AIGC innovations, detailing rich-media multimodal perception, the VICAN‑12B multimodal representation‑generation model, scenario‑specific fine‑tuning, feature quantization for ranking, and practical applications such as marketing content generation, digital‑human video creation, and poster synthesis.

AIGCBaiduMultimodal
0 likes · 12 min read
Baidu Commercial Multimodal Understanding and AIGC Innovation Practices