Tagged articles
2 articles
Xiaomi Tech
Mar 3, 2026 · Artificial Intelligence

Xiaomi Scores 14 Papers at CVPR 2026, Showcasing Breakthroughs in Large Models and Autonomous Driving

CVPR 2026 accepted 14 Xiaomi papers spanning long-video understanding, multimodal reasoning, GUI agents, and autonomous driving. Each paper comes with arXiv and GitHub links, and the papers introduce frameworks such as REVISOR, EMO-R3, TimeViper, MSJoE, SafeGRPO, GUI-CEval, ProactiveMobile, ParkGaussian, UFO, TraqPoint, SimScale, MeanFuser, and DVGT.

Autonomous Driving · CVPR 2026 · Long Video Understanding
JD Tech Talk
Mar 13, 2025 · Artificial Intelligence

CTR-Driven Advertising Image Generation with Multimodal Large Language Models

This paper proposes CAIG, a method for generating advertising images with high click-through rates (CTR) using multimodal large language models. It combines reinforcement learning and preference optimization to align the generated content with product features.

CTR Prediction · Multimodal Large Language Models · Preference Optimization