End-to-End BEV+Transformer Perception and Modeling for High-Definition Map Production
The proposed BEV+Transformer framework fuses LiDAR point clouds and camera images into a unified bird's-eye-view (BEV) space, then applies Transformer-based perception, multi-sensor fusion, and graph-diffusion modeling to automatically detect and smooth ground-level line features and signs for high-definition maps. It reaches centimeter-level accuracy while boosting production efficiency and reducing cost.
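
To make the idea of a unified BEV space concrete, the following is a minimal sketch of the LiDAR half only: it rasterizes points into a top-down intensity grid on which painted ground markings tend to stand out. The function name `lidar_to_bev`, the `[x, y, z, intensity]` point layout, the ranges, and the cell size are all illustrative assumptions and are not taken from the framework described here. Camera features would additionally have to be projected into the same grid using sensor calibration, which this sketch does not cover.

```python
import numpy as np


def lidar_to_bev(points, x_range=(0.0, 40.0), y_range=(-20.0, 20.0), cell=0.1):
    """Rasterize LiDAR points into a BEV grid holding the max intensity per cell.

    points: (N, 4) array with columns [x, y, z, intensity] (assumed layout).
    x_range, y_range: metric extent of the grid in the vehicle frame (assumed).
    cell: edge length of one grid cell in meters (assumed).
    """
    width = int(round((x_range[1] - x_range[0]) / cell))
    height = int(round((y_range[1] - y_range[0]) / cell))
    bev = np.zeros((height, width), dtype=np.float32)

    x, y, intensity = points[:, 0], points[:, 1], points[:, 3]

    # Drop points that fall outside the grid extent.
    keep = (x >= x_range[0]) & (x < x_range[1]) & (y >= y_range[0]) & (y < y_range[1])

    # Convert metric coordinates to integer cell indices, clipped for safety.
    cols = np.clip(((x[keep] - x_range[0]) / cell).astype(np.int64), 0, width - 1)
    rows = np.clip(((y[keep] - y_range[0]) / cell).astype(np.int64), 0, height - 1)

    # Unbuffered in-place maximum, so several points in one cell keep the largest value.
    np.maximum.at(bev, (rows, cols), intensity[keep].astype(np.float32))
    return bev


if __name__ == "__main__":
    pts = np.array([
        [1.0, 0.0, 0.0, 0.50],    # two returns landing in the same cell
        [1.0, 0.0, 0.0, 0.75],
        [100.0, 0.0, 0.0, 1.00],  # outside x_range, so it is discarded
    ])
    grid = lidar_to_bev(pts, cell=0.5)
    print(grid.shape, grid.max())
```

A grid like this is the kind of input a BEV perception model could consume. Keeping the maximum intensity per cell is one simple choice among several; height or point-density channels are common alternatives.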