汽车工程 ›› 2019, Vol. 41 ›› Issue (2): 206-212.doi: 10.19562/j.chinasae.qcgc.2019.02.013

• • 上一篇    下一篇

基于DDPG的无人车智能避障方法研究*

徐国艳, 宗孝鹏, 余贵珍, 苏鸿杰   

  1. 北京航空航天大学交通科学与工程学院,北京 100191
  • 收稿日期:2017-11-17 出版日期:2019-02-25 发布日期:2019-02-25
  • 通讯作者: 徐国艳,副教授,博士,E-mail:xuguoyan@buaa.edu.cn
  • 基金资助:
    国家自然科学基金(51775016)资助

A Research on Intelligent Obstacle Avoidance of Unmanned Vehicle Based on DDPG Algorithm

Xu Guoyan, Zong Xiaopeng, Yu Guizhen, Su Hongjie   

  1. School of Transportation Science and Engineering, Beihang University, Beijing 100191
  • Received:2017-11-17 Online:2019-02-25 Published:2019-02-25

摘要: 本文中提出一种基于强化学习的无人车智能避障方法。鉴于无人车运动必须满足内外约束,包括汽车动力学约束和交通规则约束,且动作输出必须连续,而传统强化学习无法应对连续动作空间问题,提出了一种改进的DDPG算法,解决连续动作空间问题,实现转向盘转角和加速度的连续输出;采取多源传感器数据融合,满足无人车避障算法的状态输入;增加车辆内外约束条件,使输出动作更合理有效。最后,在开源仿真平台TORCS进行仿真,验证了算法的有效性和鲁棒性。

关键词: 无人车, 避障, 强化学习, TORCS

Abstract: An intelligent obstacle avoidance scheme for unmanned vehicle based on reinforcement learning is proposed in this paper. In view of that the movement of unmanned vehicle must meet both interior and exterior constraints, including vehicle dynamics constraints and traffic rule constraints and its output must be continuous, which the traditional reinforcement learning cannot assure, an improved deep deterministic policy gradient algorithm is proposed to tackle continuous motion space issue and achieve the continuous output of steering wheel angle and acceleration. Multi-source sensor data fusion is adopted to fulfill the state input of unmanned vehicle obstacle avoidance algorithm and both interior and exterior constraints are added to make output motion more reasonable and effective. Finally a simulation is conducted on the open-source simulation platform TORCS and the effectiveness and robustness of the algorithm verified

Key words: unmanned vehicle, obstacle avoidance, reinforcement learning, TORCS