Administrator by China Associction for Science and Technology
Sponsored by China Society of Automotive Engineers
Published by AUTO FAN Magazine Co. Ltd.

Automotive Engineering ›› 2023, Vol. 45 ›› Issue (10): 1954-1964.doi: 10.19562/j.chinasae.qcgc.2023.10.016

Special Issue: 新能源汽车技术-电驱动&能量管理2023年

Previous Articles     Next Articles

Research on Energy Management Strategy for Hybrid Electric Vehicles Based on Inverse Reinforcement Learning

Chunyang Qi1,Chuanxue Song2,Shixin Song3,Liqiang Jin2,Da Wang3,Feng Xiao1()   

  1. 1.Jilin University,State Key Laboratory of Automotive Simulation and Control,Changchun  130022
    2.College of Automotive Engineering,Jilin University,Changchun  130022
    3.School of Mechanical and Aerospace Engineering,Jilin University,Changchun  130022
  • Received:2023-03-26 Revised:2023-05-12 Online:2023-10-25 Published:2023-10-23
  • Contact: Feng Xiao E-mail:xiaofengjl@jlu.edu.cn

Abstract:

Energy management strategy is one of the key technologies for hybrid vehicles. With the continuous upgrading of computing power and hardware devices, more and more scholars have gradually carried out research on learning-based energy management strategies. In the study of reinforcement learning-based energy management strategies for hybrid electric vehicles, the orientation of the interaction between the intelligent agent and the environment is determined by the reward function. However, most of the current reward function design is subjectively determined or based on experience, which is difficult to objectively describe the expert's intention, so in that condition there is no guarantee that the intelligent body will learn the optimal driving strategy for a given reward function. To address these problems, an energy management strategy based on inverse reinforcement learning is proposed in this paper to obtain the reward function weights under the expert trajectory by means of inverse reinforcement learning and use them to guide the behavior of the engine and battery intelligent agents. Then, the modified weights are input again into the positive reinforcement learning training. The fuel consumption value, SOC variation curve, reward training process and power source torque are used to verify the accuracy of the weight value and its advantage in terms of fuel saving capability. In summary, the algorithm has improved the fuel saving effect by 5%~10%.

Key words: hybrid electric vehicle, maximum entropy reverse reinforcement learning, energy management strategy, positive reinforcement learning