A Metric Space Optimized Method for Driver Distraction Recognition Model Training

Automotive Engineering, 2022, Vol. 44, Issue (2): 225-232. doi: 10.19562/j.chinasae.qcgc.2022.02.009

Special topic: Intelligent and Connected Vehicle Technology (Perception, HMI and Evaluation), 2022

Bin Zhang¹, Junyi Fu², Jinxiang Xia¹

1. School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610051
2. School of Economics and Management, China University of Geosciences (Wuhan), Wuhan 430000

Received: 2021-10-14  Revised: 2021-11-08  Online: 2022-02-25  Published: 2022-02-24
Corresponding author: Jinxiang Xia  E-mail: jxxia@uestc.edu.cn
Funding: Open Fund of the Department-City Co-built Intelligent Terminal Key Laboratory of Sichuan Province (SCITLAB-0012)

Abstract

Driver distraction recognition can be regarded as a fine-grained image classification task, in which the features contained in a small region of an image determine its category. For example, whether an image shows normal driving or chatting with the front-seat passenger is determined entirely by the orientation of the driver's face. For categories whose images differ only slightly, models trained with conventional image classification methods cannot distinguish them with high accuracy. To address this problem, a training method for driver distraction recognition models based on inter-class distance (metric space) optimization is proposed. By increasing the Euclidean distance between the feature vectors that the model extracts from images of different categories, the method drives the model to learn the subtle features that separate visually similar categories, which in turn improves classification accuracy on those categories. The method supports end-to-end training, does not increase inference latency, and introduces no additional supervision information. Experiments on the State Farm dataset show that, compared with conventional image classification training, the proposed method effectively improves model accuracy.

Key words: driver distraction recognition, metric space optimization, feature vector, image classification
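
The core idea of the abstract, pushing apart the feature vectors of images from different categories during training, can be illustrated with a minimal NumPy sketch. This is not the authors' loss function, whose exact form is not given on this page. It assumes a hinge-style penalty on pairs of samples from different classes, added to ordinary cross-entropy. The function names and the `margin` and `weight` parameters are hypothetical and chosen only for illustration.

```python
import numpy as np

def pairwise_euclidean(features):
    """Return the (n, n) matrix of Euclidean distances between row vectors."""
    diff = features[:, None, :] - features[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def inter_class_margin_penalty(features, labels, margin=1.0):
    """Mean hinge penalty over ordered pairs of samples from different classes.

    Each such pair contributes max(0, margin - distance), so the penalty
    shrinks as feature vectors of different classes move apart and is zero
    once every such pair is at least `margin` apart. (Assumed form.)
    """
    dist = pairwise_euclidean(features)
    different = labels[:, None] != labels[None, :]
    if not different.any():
        return 0.0  # batch contains a single class: nothing to separate
    return float(np.maximum(0.0, margin - dist[different]).mean())

def cross_entropy(logits, labels):
    """Mean softmax cross-entropy with integer class labels."""
    shifted = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return float(-log_probs[np.arange(len(labels)), labels].mean())

def total_loss(logits, features, labels, weight=0.5, margin=1.0):
    """Classification loss plus the weighted inter-class distance penalty."""
    penalty = inter_class_margin_penalty(features, labels, margin)
    return cross_entropy(logits, labels) + weight * penalty

if __name__ == "__main__":
    labels = np.array([0, 1])
    logits = np.array([[2.0, 0.0], [0.0, 2.0]])
    close = np.array([[0.0, 0.0], [0.1, 0.0]])  # different classes, nearly identical features
    far = np.array([[0.0, 0.0], [2.0, 0.0]])    # different classes, well separated
    print(total_loss(logits, close, labels), total_loss(logits, far, labels))
```

Because the penalty in this sketch uses only the class labels already needed for classification and is evaluated only during training, it is consistent with the abstract's claims that no extra supervision is introduced and inference latency is unchanged.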

Bin Zhang, Junyi Fu, Jinxiang Xia. A Metric Space Optimized Method for Driver Distraction Recognition Model Training[J]. Automotive Engineering, 2022, 44(2): 225-232.



Dataset	Number of classes	Samples per class	Total samples
Training set A	10	200	2 000
Training set B	10	400	4 000
Training set C	10	1 000	10 000
Test set	10	1 000	10 000
