基于DDPG的无人车智能避障方法研究*

doi:10.19562/j.chinasae.qcgc.2019.02.013

汽车工程 ›› 2019, Vol. 41 ›› Issue (2): 206-212.doi: 10.19562/j.chinasae.qcgc.2019.02.013

基于DDPG的无人车智能避障方法研究^*

徐国艳, 宗孝鹏, 余贵珍, 苏鸿杰

北京航空航天大学交通科学与工程学院,北京 100191

收稿日期:2017-11-17 出版日期:2019-02-25 发布日期:2019-02-25
通讯作者: 徐国艳,副教授,博士,E-mail:xuguoyan@buaa.edu.cn
基金资助:
国家自然科学基金(51775016)资助

A Research on Intelligent Obstacle Avoidance of Unmanned Vehicle Based on DDPG Algorithm

Xu Guoyan, Zong Xiaopeng, Yu Guizhen, Su Hongjie

School of Transportation Science and Engineering, Beihang University, Beijing 100191

Received:2017-11-17 Online:2019-02-25 Published:2019-02-25

摘要/Abstract

摘要： 本文中提出一种基于强化学习的无人车智能避障方法。鉴于无人车运动必须满足内外约束,包括汽车动力学约束和交通规则约束,且动作输出必须连续,而传统强化学习无法应对连续动作空间问题,提出了一种改进的DDPG算法,解决连续动作空间问题,实现转向盘转角和加速度的连续输出;采取多源传感器数据融合,满足无人车避障算法的状态输入;增加车辆内外约束条件,使输出动作更合理有效。最后,在开源仿真平台TORCS进行仿真,验证了算法的有效性和鲁棒性。

关键词: 无人车, 避障, 强化学习, TORCS

Abstract: An intelligent obstacle avoidance scheme for unmanned vehicle based on reinforcement learning is proposed in this paper. In view of that the movement of unmanned vehicle must meet both interior and exterior constraints, including vehicle dynamics constraints and traffic rule constraints and its output must be continuous, which the traditional reinforcement learning cannot assure, an improved deep deterministic policy gradient algorithm is proposed to tackle continuous motion space issue and achieve the continuous output of steering wheel angle and acceleration. Multi-source sensor data fusion is adopted to fulfill the state input of unmanned vehicle obstacle avoidance algorithm and both interior and exterior constraints are added to make output motion more reasonable and effective. Finally a simulation is conducted on the open-source simulation platform TORCS and the effectiveness and robustness of the algorithm verified

Key words: unmanned vehicle, obstacle avoidance, reinforcement learning, TORCS

徐国艳, 宗孝鹏, 余贵珍, 苏鸿杰. 基于DDPG的无人车智能避障方法研究^*[J]. 汽车工程, 2019, 41(2): 206-212.

Xu Guoyan, Zong Xiaopeng, Yu Guizhen, Su Hongjie. A Research on Intelligent Obstacle Avoidance of Unmanned Vehicle Based on DDPG Algorithm[J]. , 2019, 41(2): 206-212.

[1]	刘卫国,项志宇,刘伟平,齐道新,王子旭. 基于分布式强化学习的车辆控制算法研究[J]. 汽车工程, 2023, 45(9): 1637-1645.
[2]	高锋,冯德福,胡秋霞. 面向NMPC运动规划系统的数值优化加速技术[J]. 汽车工程, 2023, 45(8): 1438-1447.
[3]	李军, 周伟, 唐爽. 基于自适应拟合的智能车换道避障轨迹规划[J]. 汽车工程, 2023, 45(7): 1174-1183.
[4]	金立生,韩广德,谢宪毅,郭柏苍,刘国峰,朱文涛. 基于强化学习的自动驾驶决策研究综述[J]. 汽车工程, 2023, 45(4): 527-540.
[5]	严凉,吴晓东,胡川. 面向密集障碍规避的人车共享转向控制系统[J]. 汽车工程, 2023, 45(12): 2222-2233.
[6]	胡丹丹,尹鹏飞,牛国臣,赵金聚. 非结构化道路下离轴式拖挂车辆主动避障控制研究[J]. 汽车工程, 2023, 45(12): 2318-2329.
[7]	李捷,吴晓东,许敏,刘永刚. 基于强化学习的城市场景多目标生态驾驶策略[J]. 汽车工程, 2023, 45(10): 1791-1802.
[8]	齐春阳,宋传学,宋世欣,靳立强,王达,肖峰. 基于逆强化学习的混合动力汽车能量管理策略研究[J]. 汽车工程, 2023, 45(10): 1954-1964.
[9]	赵越,胡纪滨,吴维,魏超. 无人车全轮蟹行转向稳定性鲁棒控制与试验验证[J]. 汽车工程, 2022, 44(8): 1126-1135.
[10]	高振海,闫相同,高菲. 基于逆向强化学习的纵向自动驾驶决策方法[J]. 汽车工程, 2022, 44(7): 969-975.
[11]	李江坤,邓伟文,任秉韬,王文奇,丁娟. 基于场景动力学和强化学习的自动驾驶边缘测试场景生成方法[J]. 汽车工程, 2022, 44(7): 976-986.
[12]	唐斌,许占祥,江浩斌,蔡英凤,胡子添,杨铮奕. 基于分段优化的车辆换道避障轨迹规划[J]. 汽车工程, 2022, 44(6): 831-841.
[13]	宋东鉴,朱冰,赵健,韩嘉懿,刘彦辰. 基于驾驶行为生成机制的智能汽车类人行为决策[J]. 汽车工程, 2022, 44(12): 1797-1808.
[14]	王宏伟,刘晨宇,李磊,张昊天. 基于高效NMPC算法的无人车轨迹跟踪控制研究[J]. 汽车工程, 2022, 44(10): 1494-1502.
[15]	王海,李洋,蔡英凤,孙恺,陈龙. 基于激光雷达的3D实时车辆跟踪[J]. 汽车工程, 2021, 43(7): 1013-1021.

基于DDPG的无人车智能避障方法研究^*

A Research on Intelligent Obstacle Avoidance of Unmanned Vehicle Based on DDPG Algorithm

PDF (PC)

赞

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

Metrics

本文评价

推荐阅读 10