Administrator by China Associction for Science and Technology
Sponsored by China Society of Automotive Engineers
Published by AUTO FAN Magazine Co. Ltd.

Automotive Engineering ›› 2025, Vol. 47 ›› Issue (8): 1479-1489.doi: 10.19562/j.chinasae.qcgc.2025.08.005

Previous Articles    

Driver Behavior Recognition Method via MobileViT Model and Optical Flow Fusion

Huizhi Xu(),Jianzhao Zhang,Xiancai Jiang,Chengju Song   

  1. School of Civil Engineering and Transportation,Northeast Forestry University,Harbin 150040
  • Received:2025-01-07 Revised:2025-04-17 Online:2025-08-25 Published:2025-08-18
  • Contact: Huizhi Xu E-mail:stedu@126.com

Abstract:

Based on the MobileViT algorithm, a novel driver behavior recognition model of Mse-MViT model is proposed in this paper, which integrates Convolutional Neural Networks (CNNs) with Transformers. The model uses the optical flow algorithm for recursive image processing, enabling the extraction of key frame sequences from the initial frame to the apex frame of a video clip to effectively capture driver motion information. A self-constructed Driver-vior dataset is introduced. Through multi-scale feature fusion, an SE attention mechanism, and dual-branch architecture, the model achieves comprehensive integration of motion cues with global and local image features. The experimental results show that the Mse-MViT model achieves a driver behavior recognition accuracy of 95.83%, exhibiting superior performance and robustness. Furthermore, comparative experiments conducted on the State Farm dataset show a 2.5% improvement in accuracy, validating the generalization capability and effectiveness of the proposed method.

Key words: driver behavior recognition, optical flow algorithm, MobileViT, multi-scale feature fusion