Journal of Automotive Safety and Energy, 2023, Vol. 14, Issue 2: 202-211. DOI: 10.3969/j.issn.1674-8484.2023.02.007
HAN Ling, ZHANG Hui, FANG Ruoyu, LIU Guopeng, ZHU Changsheng, CHI Ruifeng
Received: 2022-09-14
Revised: 2022-11-21
Online: 2023-04-30
Published: 2023-04-27
HAN Ling, ZHANG Hui, FANG Ruoyu, LIU Guopeng, ZHU Changsheng, CHI Ruifeng. Global path planning strategy based on an improved deep reinforcement learning[J]. Journal of Automotive Safety and Energy, 2023, 14(2): 202-211.
URL: https://www.journalase.com/EN/10.3969/j.issn.1674-8484.2023.02.007
| Scenario | DQN distance / km | SQDQN distance / km | Dijkstra distance / km |
|---|---|---|---|
| 1 | 4.879 | 4.879 | 4.879 |
| 2 | 5.760 | 5.760 | 5.760 |
| 3 | 6.320 | 6.320 | 6.320 |
| 4 | 3.850 | 3.850 | 3.850 |
| 5 | 4.329 | 4.329 | 4.329 |
| 6 | 6.783 | 6.415 | 6.415 |
| 7 | 3.987 | 3.987 | 3.987 |
| 8 | 5.310 | 5.310 | 5.310 |
| 9 | 4.982 | 4.982 | 4.982 |
| 10 | 5.567 | 5.567 | 5.567 |
| 11 | 6.587 | 6.587 | 6.587 |
| 12 | 4.872 | 4.872 | 4.872 |
| 13 | 6.606 | 6.350 | 6.350 |
| 14 | 5.876 | 5.876 | 5.876 |
| 15 | 4.469 | 4.469 | 4.469 |
| 16 | 5.524 | 5.310 | 5.310 |
| 17 | 4.860 | 4.860 | 4.860 |
| 18 | 4.187 | 4.187 | 4.187 |
| 19 | 4.621 | 4.621 | 4.621 |
| 20 | 4.897 | 4.897 | 4.897 |
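
The comparison in the table above can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not taken from the paper: the names `ROWS` and `excess_over_dijkstra` are hypothetical, and the only inputs are the distances copied from the table. It reports, for each learned planner, the scenarios in which its route is longer than the Dijkstra route and by what percentage.

```python
# Distances in km copied from the table above: (scenario, DQN, SQDQN, Dijkstra).
ROWS = [
    (1, 4.879, 4.879, 4.879), (2, 5.760, 5.760, 5.760),
    (3, 6.320, 6.320, 6.320), (4, 3.850, 3.850, 3.850),
    (5, 4.329, 4.329, 4.329), (6, 6.783, 6.415, 6.415),
    (7, 3.987, 3.987, 3.987), (8, 5.310, 5.310, 5.310),
    (9, 4.982, 4.982, 4.982), (10, 5.567, 5.567, 5.567),
    (11, 6.587, 6.587, 6.587), (12, 4.872, 4.872, 4.872),
    (13, 6.606, 6.350, 6.350), (14, 5.876, 5.876, 5.876),
    (15, 4.469, 4.469, 4.469), (16, 5.524, 5.310, 5.310),
    (17, 4.860, 4.860, 4.860), (18, 4.187, 4.187, 4.187),
    (19, 4.621, 4.621, 4.621), (20, 4.897, 4.897, 4.897),
]

DQN, SQDQN, DIJKSTRA = 1, 2, 3  # column positions inside each row (hypothetical names)


def excess_over_dijkstra(rows, column):
    """Map scenario -> percent by which the chosen planner's route exceeds Dijkstra's."""
    excess = {}
    for row in rows:
        if row[column] > row[DIJKSTRA]:
            gap = row[column] - row[DIJKSTRA]
            excess[row[0]] = round(100.0 * gap / row[DIJKSTRA], 2)
    return excess


dqn_excess = excess_over_dijkstra(ROWS, DQN)
sqdqn_excess = excess_over_dijkstra(ROWS, SQDQN)
print("DQN longer than Dijkstra in:", dqn_excess)
print("SQDQN longer than Dijkstra in:", sqdqn_excess)
```

On these values, DQN exceeds the Dijkstra distance only in scenarios 6, 13 and 16, by roughly 4 to 6 percent, while SQDQN matches Dijkstra in all 20 scenarios.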
| Scenario | DQN distance / km | SQDQN distance / km | Dijkstra distance / km | DQN mean Q-value | SQDQN mean Q-value |
|---|---|---|---|---|---|
| 1 | 4.962 | 4.962 | 4.879 | 17.09 | 15.59 |
| 2 | 5.837 | 5.837 | 5.760 | 18.68 | 15.30 |
| 3 | 6.423 | 6.423 | 6.320 | 17.40 | 15.67 |
| 4 | 3.933 | 3.933 | 3.850 | 16.96 | 15.15 |
| 5 | 4.419 | 4.419 | 4.329 | 17.96 | 15.06 |
| 6 | 5.420 | 5.420 | 4.469 | 20.79 | 18.03 |
| 7 | 4.072 | 4.072 | 3.987 | 16.45 | 15.54 |
| 8 | 5.403 | 5.403 | 5.310 | 18.50 | 15.71 |
| 9 | 5.651 | 5.387 | 5.310 | 22.85 | 15.48 |
| 10 | 5.657 | 5.657 | 5.567 | 18.83 | 16.00 |
| 11 | 6.673 | 6.673 | 6.587 | 17.14 | 15.79 |
| 12 | 4.965 | 4.965 | 4.872 | 18.61 | 15.49 |
| 13 | 6.705 | 6.420 | 6.350 | 21.97 | 16.12 |
| 14 | 5.978 | 5.978 | 5.876 | 18.98 | 16.01 |
| 15 | 6.903 | 6.492 | 6.415 | 20.24 | 15.58 |
| 16 | 5.074 | 5.074 | 4.982 | 17.49 | 15.18 |
| 17 | 4.952 | 4.952 | 4.860 | 18.26 | 15.39 |
| 18 | 4.320 | 4.320 | 4.187 | 17.53 | 15.69 |
| 19 | 4.701 | 4.701 | 4.621 | 18.14 | 15.38 |
| 20 | 5.008 | 5.008 | 4.897 | 17.50 | 14.92 |
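
The Q-value columns of the table above can be summarized in the same spirit. This is a minimal sketch rather than anything from the paper: `Q_ROWS` and the variable names are hypothetical, and the numbers are simply the per-scenario mean Q-values copied from the table. It averages the two columns over the 20 scenarios and checks whether the SQDQN value is the lower one in every row.

```python
from statistics import mean

# Mean Q-values copied from the table above: (scenario, DQN, SQDQN).
Q_ROWS = [
    (1, 17.09, 15.59), (2, 18.68, 15.30), (3, 17.40, 15.67),
    (4, 16.96, 15.15), (5, 17.96, 15.06), (6, 20.79, 18.03),
    (7, 16.45, 15.54), (8, 18.50, 15.71), (9, 22.85, 15.48),
    (10, 18.83, 16.00), (11, 17.14, 15.79), (12, 18.61, 15.49),
    (13, 21.97, 16.12), (14, 18.98, 16.01), (15, 20.24, 15.58),
    (16, 17.49, 15.18), (17, 18.26, 15.39), (18, 17.53, 15.69),
    (19, 18.14, 15.38), (20, 17.50, 14.92),
]

# Average each column over all scenarios.
mean_q_dqn = mean(q_dqn for _, q_dqn, _ in Q_ROWS)
mean_q_sqdqn = mean(q_sqdqn for _, _, q_sqdqn in Q_ROWS)

# True only if SQDQN reports the smaller value in every single scenario.
sqdqn_lower_everywhere = all(q_sqdqn < q_dqn for _, q_dqn, q_sqdqn in Q_ROWS)

print(f"DQN average of mean Q-values:   {mean_q_dqn:.2f}")
print(f"SQDQN average of mean Q-values: {mean_q_sqdqn:.2f}")
print("SQDQN lower in every scenario:", sqdqn_lower_everywhere)
```

On these values the averages come out near 18.57 for DQN and 15.65 for SQDQN, and the SQDQN value is the lower one in every scenario.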