Administrator by China Associction for Science and Technology
Sponsored by China Society of Automotive Engineers
Published by AUTO FAN Magazine Co. Ltd.

Automotive Engineering ›› 2023, Vol. 45 ›› Issue (6): 974-988.doi: 10.19562/j.chinasae.qcgc.2023.06.008

Special Issue: 智能网联汽车技术专题-感知&HMI&测评2023年

Previous Articles     Next Articles

Real-Time Detection of Distracted Driving Behavior Based on Deep Convolution-Tokens Dimensionality Reduction Optimized Visual Transformer Model

Xia Zhao,Zhao Li,Rui Fu,Zhenzhen Ge(),Chang Wang   

  1. School of Automobile of Chang’an University,Xi’an 710064
  • Received:2022-11-18 Revised:2023-01-17 Online:2023-06-25 Published:2023-06-16
  • Contact: Zhenzhen Ge E-mail:gezhenzhen@chd.edu.cn

Abstract:

To address the problems that the end-to-end Deep Convolutional Neural Network (DCNN) based driving behavior detection model lacks global feature extraction ability, and the Vision Transformer (ViT) model is not good at capturing underlying features with a large number of model parameters, this paper proposes a ViT model that combines deep convolution and Tokens downscaled optimization for real-time detection of driver distraction behavior. Comparison experiments with other models, ablation experiments and visualization experiments of the models’ attention region are carried out to fully validate the superiority of the proposed model. The mean accuracy and precision of the proposed model are 96.93% and 96.95%, respectively. The number of the model parameters is 21.22 M; and the online inference speed based on the real vehicle platform is 23.32 fps, indicating that the proposed model can achieve real-time distracted behavior detection. The result of the study is beneficial to the control strategy development and distraction warning of human-machine co-driving system.

Key words: automotive engineering, distracted behavior detection model, vision Transformer, multi-headed attention mechanism, convolutional neural network, Tokens dimensionality reduction