汽车工程 ›› 2022, Vol. 44 ›› Issue (12): 1797-1808.doi: 10.19562/j.chinasae.qcgc.2022.12.001

所属专题: 智能网联汽车技术专题-感知&HMI&测评2022年

• •    下一篇

基于驾驶行为生成机制的智能汽车类人行为决策

宋东鉴,朱冰,赵健(),韩嘉懿,刘彦辰   

  1. 吉林大学,汽车仿真与控制国家重点实验室,长春  130022
  • 收稿日期:2022-04-29 修回日期:2022-06-06 出版日期:2022-12-25 发布日期:2022-12-22
  • 通讯作者: 赵健 E-mail:zhaojian@jlu.edu.cn
  • 基金资助:
    国家自然科学基金(52172386)

Human-Like Behavior Decision-Making of Intelligent Vehicles Based on Driving Behavior Generation Mechanism

Dongjian Song,Bing Zhu,Jian Zhao(),Jiayi Han,Yanchen Liu   

  1. Jilin University,State Key Laboratory of Automotive Simulation and Control,Changchun  130022
  • Received:2022-04-29 Revised:2022-06-06 Online:2022-12-25 Published:2022-12-22
  • Contact: Jian Zhao E-mail:zhaojian@jlu.edu.cn

摘要:

本文通过分析驾驶人驾驶行为生成机制,构建了类人行为决策策略(HBDS)。它具有匹配驾驶行为生成机制的策略框架,通过最大熵逆强化学习得到类人奖励函数,并采用玻尔兹曼理性噪声模型建立行为概率与累积奖励的映射关系。通过预期轨迹空间的离散化处理,避免了连续高维空间积分中的维数灾难,并基于统计学规律和安全约束对预期轨迹空间进行压缩和修剪,提升了HBDS采样效率。HBDS在NGSIM数据集上进行训练和测试的结果表明,HBDS能做出符合驾驶人个性化认知特性和行为特征的行为决策。

关键词: 智能汽车, 类人驾驶, 行为决策, 逆强化学习

Abstract:

In this paper, a human-like behavior decision-making strategy (HBDS) is established by analyzing drivers’ driving behavior generation mechanism. HBDS has a framework that matches the driving behavior generation mechanism, obtains the human-like reward function through maximum entropy inverse reinforcement learning, and adopts the Boltzman noisily-rational model to build the mapping relationship between behavior probability and its cumulative reward. By discretizing the expected trajectory space, the curse of dimensionality in the integration of continuous high-dimensional space is avoided, and based on statistical law and safety constraint, the expected trajectory space is compressed and pruned, enhancing the sampling efficiency of HBDS. The strategy is trained and tested on NGSIM dataset, and the results show that HBDS can make behavior decisions that conform to the driver’s personalized cognitive and behavioral characteristics.

Key words: intelligent vehicles, human-like driving, behavior decision-making, inverse reinforcement learning