Administrator by China Associction for Science and Technology
Sponsored by China Society of Automotive Engineers
Published by AUTO FAN Magazine Co. Ltd.

Automotive Engineering ›› 2025, Vol. 47 ›› Issue (11): 2070-2082.doi: 10.19562/j.chinasae.qcgc.2025.11.002

Previous Articles    

An Efficient Learning Method for Multi-Modal Task Path Planning of Flying Vehicles

Jing Zhao1,2,Chao Yang1,2(),Weida Wang1,2,Ying Li1,2,Changle Xiang1,2   

  1. 1.School of Mechanical Engineering,Beijing Institute of Technology,Beijing 100081
    2.Hefei Unmanned Intelligent Equipment Research Institute,Beijing Institute of Technology,Hefei 230041
  • Received:2025-04-11 Revised:2025-05-31 Online:2025-11-25 Published:2025-11-28
  • Contact: Chao Yang E-mail:cyang@bit.edu.cn

Abstract:

Flying vehicles have attracted significant attention in urban traffic, rescue transportation, and other operational fields. Efficient multi-modal task path planning effectively improves their operational efficiency in these fields. Therefore, an efficient learning method for multi-modal task path planning of flying vehicles is proposed. Firstly, the action space of the flying vehicle is optimized, retaining the actions of take-off, landing, and moving towards the target position. Simultaneously, a probability selection mechanism for non-target direction actions is designed. Secondly, considering the air-ground coordination characteristics of the flying vehicle, a novel reward function of Q-learning is designed. And a reward enhancement mechanism based on historical optimal path experience is proposed. Finally, a path smoothing method is proposed to obtain a smooth and continuous path for the air-ground cooperative task. Compared with the multi-modal paths planned by A*, Q-learning, and D* Lite, the multi-modal path planned by this method successively reduces the running distance by 10.35, 126.75, and 162.10 m, respectively. In terms of learning efficiency, the method reduces the learning time by 45.97% compared to Q-learning.

Key words: flying vehicles, multi-modal task path planning, action space, reward function, path smoothing