Lightweight YOLOv7-R Algorithm for Road-Side View Target Detection

doi:10.19562/j.chinasae.qcgc.2023.10.006

Abstract:

To address the difficulty of deploying models on roadside perception units in V2X, the multi-scale appearance of the detected targets, and occlusion between targets, a lightweight detection algorithm based on YOLOv7, named YOLOv7-R, is proposed. First, the backbone of YOLOv7 is rebuilt with an improved EfficientNetv2-s, which reduces the number of model parameters and increases inference speed. Second, the coordinate attention (CA) mechanism is adopted to retain precise position information and strengthen the detection of multi-scale targets; at the same time, the Focal-EIoU loss function is used to improve accuracy. Finally, GridMask data augmentation is applied in the pre-processing stage to improve the algorithm's ability to learn occluded targets. Experimental results show that, compared with the baseline YOLOv7, the proposed algorithm improves mAP@0.5 and mAP@0.5:0.95 on the DAIR-V2X-I dataset by 3.0 and 4.8 percentage points, respectively, and reaches a detection rate of 96.3 $f / s$ (frames per second). It therefore meets the lightweight requirement while achieving higher detection accuracy, and effectively accomplishes the detection of traffic participants by roadside units.

Key words: deep learning, roadside perception, YOLOv7, lightweight, attention mechanism, vehicle-infrastructure cooperation (IVICS)
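
The Focal-EIoU loss named in the abstract can be illustrated for a single pair of boxes. The sketch below assumes the commonly published form of the loss (an IoU term plus centre-distance, width, and height penalties normalised by the smallest enclosing box, re-weighted by IoU raised to a power `gamma`). The function name, the `(x1, y1, x2, y2)` box format, and the default `gamma=0.5` are assumptions for illustration; this page does not state how YOLOv7-R configures the loss.

```python
def focal_eiou_loss(box, gt, gamma=0.5, eps=1e-9):
    """Focal-EIoU loss for two axis-aligned boxes given as (x1, y1, x2, y2).

    Hypothetical helper for illustration; gamma=0.5 is an assumed default.
    """
    # Widths, heights and centres of the predicted and ground-truth boxes.
    w, h = box[2] - box[0], box[3] - box[1]
    w_gt, h_gt = gt[2] - gt[0], gt[3] - gt[1]
    cx, cy = (box[0] + box[2]) / 2, (box[1] + box[3]) / 2
    cx_gt, cy_gt = (gt[0] + gt[2]) / 2, (gt[1] + gt[3]) / 2

    # Intersection over union.
    inter_w = max(0.0, min(box[2], gt[2]) - max(box[0], gt[0]))
    inter_h = max(0.0, min(box[3], gt[3]) - max(box[1], gt[1]))
    inter = inter_w * inter_h
    union = w * h + w_gt * h_gt - inter
    iou = inter / (union + eps)

    # Width and height of the smallest box enclosing both boxes.
    enc_w = max(box[2], gt[2]) - min(box[0], gt[0])
    enc_h = max(box[3], gt[3]) - min(box[1], gt[1])

    # EIoU = (1 - IoU) + centre-distance term + width term + height term.
    centre_term = ((cx - cx_gt) ** 2 + (cy - cy_gt) ** 2) / (enc_w ** 2 + enc_h ** 2 + eps)
    width_term = (w - w_gt) ** 2 / (enc_w ** 2 + eps)
    height_term = (h - h_gt) ** 2 / (enc_h ** 2 + eps)
    eiou = 1.0 - iou + centre_term + width_term + height_term

    # Focal re-weighting: boxes with higher IoU contribute more to the loss.
    return (iou ** gamma) * eiou


if __name__ == "__main__":
    print(focal_eiou_loss((0, 0, 2, 2), (1, 1, 3, 3)))
```

Setting `gamma=0` in this sketch removes the re-weighting and returns the plain EIoU value. With `gamma > 0`, a pair of boxes that does not overlap at all receives zero weight.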

Xiaojun Zhang, Jingzhe Xi, Yanlei Shi, Anlu Yuan. Lightweight YOLOv7-R Algorithm for Road-Side View Target Detection[J]. Automotive Engineering (汽车工程), 2023, 45(10): 1833-1844.
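
The GridMask augmentation mentioned in the abstract hides a regular grid of square regions so that the detector sees partially occluded objects during training. The sketch below is a simplified, dependency-free illustration: the function names, the grid unit `d`, the keep ratio `ratio`, and the offsets `dx`/`dy` are assumptions chosen here, and the random sampling of these values and the rotation used by fuller implementations are omitted. A real pipeline would more likely operate on array images than on nested lists.

```python
def gridmask(height, width, d=32, ratio=0.5, dx=0, dy=0):
    """Return a binary mask (1 = keep, 0 = drop) as a list of rows.

    Each d x d grid unit keeps a band of `keep` pixels along both axes and
    drops the remaining square. All parameter names are illustrative.
    """
    keep = int(d * ratio + 0.5)  # kept band length inside every grid unit
    return [
        [
            # A pixel is dropped only if it lies outside the kept band
            # in both the vertical and the horizontal direction.
            0 if ((y + dy) % d >= keep and (x + dx) % d >= keep) else 1
            for x in range(width)
        ]
        for y in range(height)
    ]


def apply_gridmask(image, mask):
    """Zero out the dropped pixels of a 2-D grayscale image (nested lists)."""
    return [
        [pixel * m for pixel, m in zip(image_row, mask_row)]
        for image_row, mask_row in zip(image, mask)
    ]


if __name__ == "__main__":
    mask = gridmask(64, 64, d=32, ratio=0.5)
    image = [[255] * 64 for _ in range(64)]
    masked = apply_gridmask(image, mask)
    kept = sum(sum(row) for row in mask)
    print(f"kept {kept} of {64 * 64} pixels")
```

With `ratio=0.5`, each grid unit in this sketch drops a quarter of its area, so three quarters of the pixels survive when the image size is a multiple of `d`.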

Table 1. Ablation experiment results

Model	EfficientNetv2-s	Coordinate Attention	Focal-EIoU Loss	mAP@0.5/%	mAP@0.5:0.95/%	AP_small	AP_medium	Params/M	FPS/($f \cdot s^{-1}$)
(1)	-	-	-	89.3	59.1	23.7	58.6	36.9	77.3
(2)	√	-	-	87.7	56.9	22.9	58.1	17.1	95.6
(3)	√	√	-	90.0	59.6	25.3	63.5	17.4	92.5
(4)	-	-	√	90.4	62.1	23.8	59.0	37.2	80.9
Ours	√	√	√	92.3	63.9	25.7	64.3	17.6	96.3
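
The headline figures in the abstract can be checked against the ablation table with a few lines of arithmetic. The sketch below copies the first and last rows of Table 1 and computes the accuracy gains in percentage points, the relative parameter reduction, and the speed-up. Row (1) is assumed to be the unmodified YOLOv7 baseline, since none of the three modifications is ticked; the dictionary keys and the `summarize` helper are names introduced here for illustration.

```python
# Values copied from Table 1; row (1) is assumed to be the YOLOv7 baseline.
baseline = {"map50": 89.3, "map50_95": 59.1, "params_m": 36.9, "fps": 77.3}
ours = {"map50": 92.3, "map50_95": 63.9, "params_m": 17.6, "fps": 96.3}


def summarize(base, new):
    """Compare two table rows: absolute mAP gains and relative size/speed."""
    return {
        # mAP differences are absolute, i.e. percentage points.
        "map50_gain_pts": round(new["map50"] - base["map50"], 1),
        "map50_95_gain_pts": round(new["map50_95"] - base["map50_95"], 1),
        # Relative reduction in parameter count, in percent.
        "param_reduction_pct": round(100 * (1 - new["params_m"] / base["params_m"]), 1),
        # Ratio of frame rates.
        "fps_speedup": round(new["fps"] / base["fps"], 2),
    }


summary = summarize(baseline, ours)

if __name__ == "__main__":
    for name, value in summary.items():
        print(f"{name}: {value}")
```

The gains of 3.0 and 4.8 are differences between percentages, so they are percentage points rather than relative improvements. On these numbers the parameter count falls by roughly half while the frame rate rises by about a quarter.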

Table 2. Comparison of different algorithms

Model	mAP@0.5/%	AP_small	Params/M	FPS/($f \cdot s^{-1}$)
SSD	72.1	18.1	25.0	56.6
EfficientDet	86.3	18.5	18.2	88.5
YOLOv4	86.6	19.4	50.5	55.2
YOLOv5-m	88.0	21.0	18.4	74.0
YOLOv5-s	85.3	19.1	9.5	86.3
YOLOv7	89.3	23.7	36.9	77.3
YOLOX-s	86.1	21.2	10.1	61.0
YOLOv8	90.7	23.7	25.9	79.2
Ours	92.3	25.7	17.6	96.3
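
One way to read the comparison table is to ask which models are not beaten on accuracy, size, and speed simultaneously. The sketch below applies a simple Pareto-dominance check to the mAP@0.5, Params, and FPS columns of Table 2. This analysis is an illustration added here, not a method from the article, and the `dominates` and `pareto_front` helpers are hypothetical names.

```python
# (model, mAP@0.5 in %, parameters in millions, frames per second) from Table 2.
ROWS = [
    ("SSD", 72.1, 25.0, 56.6),
    ("EfficientDet", 86.3, 18.2, 88.5),
    ("YOLOv4", 86.6, 50.5, 55.2),
    ("YOLOv5-m", 88.0, 18.4, 74.0),
    ("YOLOv5-s", 85.3, 9.5, 86.3),
    ("YOLOv7", 89.3, 36.9, 77.3),
    ("YOLOX-s", 86.1, 10.1, 61.0),
    ("YOLOv8", 90.7, 25.9, 79.2),
    ("Ours", 92.3, 17.6, 96.3),
]


def dominates(a, b):
    """True if a is no worse than b on every metric and better on at least one.

    Higher mAP and FPS are better; fewer parameters are better.
    """
    no_worse = a[1] >= b[1] and a[2] <= b[2] and a[3] >= b[3]
    strictly_better = a[1] > b[1] or a[2] < b[2] or a[3] > b[3]
    return no_worse and strictly_better


def pareto_front(rows):
    """Names of the models that no other model dominates, in table order."""
    return [r[0] for r in rows if not any(dominates(o, r) for o in rows)]


front = pareto_front(ROWS)

if __name__ == "__main__":
    print(front)
```

Under this check the proposed model dominates every larger detector in the table; only the two smaller models with around 10 M parameters remain undominated, because they trade accuracy for a lower parameter count.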
