汽车工程 (Automotive Engineering) ›› 2023, Vol. 45 ›› Issue (8): 1343-1352. doi: 10.19562/j.chinasae.qcgc.2023.08.005

Special Topic: Intelligent and Connected Vehicle Technology - Planning & Decision-Making, 2023

Research on End-to-End Vehicle Motion Planning Method Based on Deep Learning

Weiguo Liu1,2, Zhiyu Xiang1, Rui Liu2, Guodong Li3, Zixu Wang2

  1. College of Information Science & Electronic Engineering, Zhejiang University, Hangzhou 310058
  2. National Innovation Center of Intelligent and Connected Vehicles, Beijing 100160
  3. School of Vehicle Engineering, Chongqing University of Technology, Chongqing 400054
  • Received: 2023-04-28  Revised: 2023-06-18  Online: 2023-08-25  Published: 2023-08-17
  • Corresponding author: Zhiyu Xiang  E-mail: xiangzy@zyu.edu.cn
  • Funding: Project of the National New Generation Artificial Intelligence Open Innovation Platform for Autonomous Driving (2020AAA0103702)

Abstract:

In existing end-to-end deep learning frameworks for autonomous driving, planning and control predictions often suffer from low accuracy, largely because the input data come from a single source and the models cannot account for both spatial and temporal information. To better reflect how the historical interaction between the ego vehicle, the environment, and other traffic participants affects the decision at the current moment in virtual simulation testing, this paper designs a multi-level spatiotemporal attention long short-term memory (LSTM) network for vehicle motion planning in an autonomous driving simulation environment. The algorithm extracts and represents deep abstract information about the driving environment and realizes end-to-end vehicle motion control in the simulation platform. First, the historical sequence of consecutive RGB video frames acquired by the forward-facing camera model is taken as input, and a convolutional module extracts the spatial features of each single-moment image. Second, an LSTM module fuses the spatial information across historical moments to obtain temporal context features. In addition, to strengthen the extraction of key spatiotemporal information and accelerate network convergence, a spatiotemporal attention mechanism is applied in the fusion of the multi-level spatiotemporal features. The proposed method is tested and validated on the Carla simulation platform. The experimental results show that, compared with algorithms that use only single spatial or temporal information, the proposed method imitates human driving decision-making behavior more accurately.

Key words: vehicle motion planning, end-to-end, spatiotemporal attention, deep learning, simulation, LSTM
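
As an illustration of the pipeline described in the abstract, the following is a minimal PyTorch sketch, not the authors' released implementation: it assumes a small stand-in convolutional backbone, an LSTM over per-frame features, a simple softmax attention over the time axis, and a two-dimensional control output (e.g. steering and throttle). The SpatioTemporalPlanner class, its layer sizes, and the output dimensions are all hypothetical.

# Illustrative sketch only: CNN per-frame features -> LSTM temporal fusion ->
# attention-weighted pooling -> control regression, as outlined in the abstract.
import torch
import torch.nn as nn


class SpatioTemporalPlanner(nn.Module):
    def __init__(self, feat_dim: int = 256, hidden_dim: int = 128):
        super().__init__()
        # Per-frame spatial feature extractor (stand-in for the paper's convolutional module).
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 32, 5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, feat_dim), nn.ReLU(),
        )
        # Temporal fusion of the per-frame spatial features.
        self.lstm = nn.LSTM(feat_dim, hidden_dim, batch_first=True)
        # One attention score per time step, normalized over the time axis.
        self.attn = nn.Linear(hidden_dim, 1)
        # Regression head for the control outputs (e.g. steering, throttle).
        self.head = nn.Linear(hidden_dim, 2)

    def forward(self, frames: torch.Tensor) -> torch.Tensor:
        # frames: (batch, time, 3, H, W) history of front-camera RGB frames.
        b, t = frames.shape[:2]
        feats = self.cnn(frames.flatten(0, 1)).view(b, t, -1)   # (b, t, feat_dim)
        hidden, _ = self.lstm(feats)                            # (b, t, hidden_dim)
        weights = torch.softmax(self.attn(hidden), dim=1)       # (b, t, 1)
        context = (weights * hidden).sum(dim=1)                 # attention-weighted fusion
        return self.head(context)                               # (b, 2) control commands


if __name__ == "__main__":
    model = SpatioTemporalPlanner()
    dummy = torch.randn(2, 8, 3, 128, 128)   # 2 sequences of 8 frames each
    print(model(dummy).shape)                 # torch.Size([2, 2])

A model defined this way maps a (batch, time, channel, height, width) tensor of past camera frames to one control vector per sequence; the softmax attention weights make explicit how much each historical frame contributes to the current decision, which is the role the abstract assigns to the spatiotemporal attention mechanism.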