Welcome to Journal of Automotive Safety and Energy,

Journal of Automotive Safety and Energy ›› 2024, Vol. 15 ›› Issue (5): 702-710.DOI: 10.3969/j.issn.1674-8484.2024.05.008

• Intelligent Driving and Intelligent Transportation • Previous Articles     Next Articles

Vehicle longitudinal speed planning based on deep reinforcement learning CLPER-DDPG

LIU Peng1(), ZHAO Kegang1,*(), LIANG Zhihao1, YE Jie2   

  1. 1. South China University of Technology, School of Mechanical & Automotive Engineering, Guangzhou 510641, China
    2. Foshan University, School of Mechatronic Engineering and Automation, Foshan 528225, China
  • Received:2024-05-15 Revised:2024-10-09 Online:2024-10-31 Published:2024-11-07

Abstract:

To solve the problems of planner convergence difficulty in vehicle longitudinal speed planning and stability issues during scenario transitions, a planner was designed using a multilayer perceptron, incorporating the Deep Deterministic Policy Gradient (DDPG) algorithm with Prioritized Experience Replay (PER) and Curriculum Learning (CL). The simulation scenarios were designed for model training and testing, as well as comparative experiments among the three algorithms of DDPG, DDPG with Prioritized Experience Replay (PER-DDPG), and DDPG with both Prioritized Experience Replay and Curriculum Learning (CLPER-DDPG). Real-vehicle experiments were also carried out on actual roads within the Park. The results show that the CLPER-DDPG algorithm, comparing with the DDPG algorithm, the convergence speed of the planner is improved by 56.45%, the mean distance error is reduced by 16.61%, the mean speed error is decreased by 15.25%, and the mean jerk is lowered by 18.96%. Furthermore, when the parameters of environmental conditions and sensor hardware in the experimental scenarios are changed, the model could ensure that the longitudinal speed planning task will be completed safely.

Key words: autonomous driving, longitudinal velocity planning, deep deterministic policy gradient (DDPG) algorithm, curriculum learning mechanism, prioritized experience replay mechanism

CLC Number: