Administrator by China Associction for Science and Technology
Sponsored by China Society of Automotive Engineers
Published by AUTO FAN Magazine Co. Ltd.

Automotive Engineering ›› 2025, Vol. 47 ›› Issue (6): 1198-1206.doi: 10.19562/j.chinasae.qcgc.2025.06.018

Previous Articles    

PolarSparse4D: Polar Parametrization for Vision-Based Surround-View Temporal Sparse 3D Object Detection

Chao Wei1,2,Shuxin Sui1(),Luxing Li1   

  1. 1.The Special Vehicle Research Institute,Beijing Institute of Technology,Beijing 100081
    2.National Key Laboratory of Special Vehicle Design and Manufacturing Integration Technology,Beijing 100081
  • Received:2024-10-16 Revised:2024-12-20 Online:2025-06-25 Published:2025-06-20
  • Contact: Shuxin Sui E-mail:13021028800@163.com

Abstract:

To address the trade-off between accuracy and real-time performance in vision-based surround-view 3D object detection for autonomous vehicles, PolarSparse4D, a sparse query-based method using polar parametrization, is proposed. The model consists of an image encoder, a 3D anchor decoder and an auxiliary quality assessment branch for training. Firstly, to avoid the detection distance limitation caused by parameter normalization, a feature encoding method that decouples the center distance and azimuth angle of the 3D anchor boxes is designed. Secondly, by designing an anchor spatial information interaction self-attention module and a temporal information fusion module, the spatiotemporal information fusion process of anchors is completed efficiently and accurately. Finally, an anchor box parameter quality assessment branch is established to improve the detection accuracy and model convergence speed significantly. The experimental results on the nuScenes validation set show that the proposed model achieves 41.3% and 52.5% on mAP and NDS, respectively, with a speed of 19.2 FPS, demonstrating high accuracy and real-time capability.

Key words: 3D object detection, surround-view camera, polar parametrization, autonomous vehicle