Tagged articles
3 articles
Page 1 of 1
Machine Heart
Machine Heart
Apr 30, 2026 · Artificial Intelligence

Can Internet Videos Replace 3D Annotations? Introducing SceneVerse++ – the Largest Real‑World 3D Scene Dataset

The BIGAI team presents SceneVerse++, a massive real‑world indoor 3D scene dataset built from unlabelled internet videos via an automated pipeline, and demonstrates substantial zero‑shot and fine‑tuned performance gains on 3D detection, spatial VQA, and vision‑language navigation tasks.

3D scene understandingSceneVerse++automated data pipeline
0 likes · 18 min read
Can Internet Videos Replace 3D Annotations? Introducing SceneVerse++ – the Largest Real‑World 3D Scene Dataset
Amap Tech
Amap Tech
Oct 4, 2025 · Artificial Intelligence

How JanusVLN Redefines Vision‑Language Navigation with Dual Implicit Memory

JanusVLN presents a groundbreaking Vision‑and‑Language Navigation framework that decouples semantic understanding from spatial geometry using dual implicit memory, eliminates explicit memory overhead, achieves state‑of‑the‑art performance with only RGB video input, and dramatically improves efficiency and generalization across VLN benchmarks.

3D spatial reasoningDual Implicit Memorymultimodal LLM
0 likes · 10 min read
How JanusVLN Redefines Vision‑Language Navigation with Dual Implicit Memory
Meituan Technology Team
Meituan Technology Team
Jun 15, 2023 · Artificial Intelligence

Meituan Technical Team's 8 CVPR 2023 Papers: Overview and Insights

This article reviews eight CVPR 2023 papers selected by Meituan’s technology team, covering self‑supervised learning, domain adaptation, federated learning, object detection, 3D reconstruction, GAN‑based pre‑training, RGB‑T tracking, vision‑language navigation, and visual‑textual layout generation, highlighting each work’s methodology, experiments, and reported performance gains.

3D Object DetectionCVPR 2023GaN
0 likes · 15 min read
Meituan Technical Team's 8 CVPR 2023 Papers: Overview and Insights