Administrator by China Associction for Science and Technology
Sponsored by China Society of Automotive Engineers
Published by AUTO FAN Magazine Co. Ltd.

Automotive Engineering ›› 2025, Vol. 47 ›› Issue (6): 1122-1132.doi: 10.19562/j.chinasae.qcgc.2025.06.011

Previous Articles    

MSF-Diffuser: A Multi-sensor Adaptive Fusion Autonomous Driving Method Based on Diffusion Model Under BEV

Mingchen Wang1,Hai Wang1(),Yingfeng Cai2,Long Chen2,Yicheng Li2   

  1. 1.School of Automotive and Traffic Engineering,Jiangsu University,Zhenjiang 212013
    2.Institute of Automotive Engineering,Jiangsu University,Zhenjiang 212013
  • Received:2024-10-29 Revised:2025-02-03 Online:2025-06-25 Published:2025-06-20
  • Contact: Hai Wang E-mail:wanghai1019@163.com

Abstract:

Autonomous driving algorithms are a major research focus in the field of intelligent vehicles. Currently, to achieve panoramic autonomous driving, most domestic approaches use multi-sensor fusion. However, existing solutions face problems such as low sensor utilization and unreasonable fusion strategies. For these problems, in this paper, an autonomous driving framework based on multi-sensor fusion (camera+LiDAR+Radar) under a bird's-eye view (BEV) is proposed. In this framework, dual encoding based on point and velocity is used, coupled with feature interaction to extract millimeter-wave radar point cloud features, thereby enhancing the utilization of millimeter-wave radar information and facilitating subsequent fusion. In the fusion module, LSTM is used to store the features from multiple modalities as well as the fused BEV features, which allows for the calculation of feature consistency loss between different modalities and continuity loss for the fused BEV features and historical frames, leading to smoother and more precise feature fusion. Finally, the diffusion model is introduced and the Multi-modal U-Net is proposed for denoising, which improves the robustness of trajectory planning. Extensive experiments are conducted using the CARLA simulator on the authoritative Longest-06 benchmark and Town-05 Long benchmark, getting a DS (Driving Score) of 73.80±1.01 and 73.7±1.3 respectively. The results show that the proposed approach achieves better panoramic autonomous driving with superior performance and flexibility compared to existing methods.

Key words: autonomous driving, multi-sensor fusion, feature interaction, diffusion model