Administrator by China Associction for Science and Technology
Sponsored by China Society of Automotive Engineers
Published by AUTO FAN Magazine Co. Ltd.

Automotive Engineering ›› 2025, Vol. 47 ›› Issue (3): 412-417.doi: 10.19562/j.chinasae.qcgc.2025.03.003

Previous Articles    

Research on Multimodal Rejection Model of Cockpit Based on ChatGLM2 Large Model

Qiang Zhang1,4,Qin Shi1,2,3,Teng Cheng1,2,3(),Hao Ni1,2,3   

  1. 1.School of Automotive and Transportation Engineering,Hefei University of Technology,Hefei 230009
    2.Key Laboratory for Automated Vehicle Safety Technology of Anhui Province,Hefei 230009
    3.Engineering Research Center for Intelligent Transportation and Cooperative Vehicle-Infrastructure of Anhui Province,Hefei 230009
    4.Chery Automobile Co. ,Ltd. ,Wuhu 241000
  • Received:2024-05-20 Revised:2024-07-23 Online:2025-03-25 Published:2025-03-21
  • Contact: Teng Cheng E-mail:cht616@hfut.edu.cn

Abstract:

In the field of intelligent connected vehicles, the recognition accuracy of in-car systems for non-command voice input in complex environment (the proportion of correct voice input recognition by the system) is of great significance. To address this challenge, in this paper a multimodal rejection model is proposed. The model is based on the open-source ChatGLM2-6B large language model and has undergone exclusive rejection dataset construction and model fine-tuning for the in-vehicle interaction scenario. The rejection dataset is collected from real driving scenarios, integrating voice information with the driver's facial orientation, gestures, and emotion, and other non-verbal signals to provide richer interaction information, effectively overcoming the limitation of pure language recognition mechanisms in complex environment. Through experiments, it is found that the multimodal rejection model shows higher recognition accuracy (ACC) and lower false rejection rate (FRR) on the test set compared to the pure language rejection model.

Key words: intelligent connected vehicles, in-vehicle speech interaction, rejection, large model