Journal of Automotive Safety and Energy ›› 2023, Vol. 14 ›› Issue (2): 202-211.DOI: 10.3969/j.issn.1674-8484.2023.02.007
Previous Articles Next Articles
HAN Ling(
), ZHANG Hui, FANG Ruoyu, LIU Guopeng, ZHU Changsheng, CHI Ruifeng
Received:2022-09-14
Revised:2022-11-21
Online:2023-04-30
Published:2023-04-27
CLC Number:
HAN Ling, ZHANG Hui, FANG Ruoyu, LIU Guopeng, ZHU Changsheng, CHI Ruifeng. Global path planning strategy based on an improved deep reinforcement learning[J]. Journal of Automotive Safety and Energy, 2023, 14(2): 202-211.
Add to citation manager EndNote|Ris|BibTeX
URL: https://www.journalase.com/EN/10.3969/j.issn.1674-8484.2023.02.007
| 场景 | 通行距离 / km | ||
|---|---|---|---|
| DQN | SQDQN | Dijkstra | |
| 1 | 4.879 | 4.879 | 4.879 |
| 2 | 5.760 | 5.760 | 5.760 |
| 3 | 6.320 | 6.320 | 6.320 |
| 4 | 3.850 | 3.850 | 3.850 |
| 5 | 4.329 | 4.329 | 4.329 |
| 6 | 6.783 | 6.415 | 6.415 |
| 7 | 3.987 | 3.987 | 3.987 |
| 8 | 5.310 | 5.310 | 5.310 |
| 9 | 4.982 | 4.982 | 4.982 |
| 10 | 5.567 | 5.567 | 5.567 |
| 11 | 6.587 | 6.587 | 6.587 |
| 12 | 4.872 | 4.872 | 4.872 |
| 13 | 6.606 | 6.350 | 6.350 |
| 14 | 5.876 | 5.876 | 5.876 |
| 15 | 4.469 | 4.469 | 4.469 |
| 16 | 5.524 | 5.310 | 5.310 |
| 17 | 4.860 | 4.860 | 4.860 |
| 18 | 4.187 | 4.187 | 4.187 |
| 19 | 4.621 | 4.621 | 4.621 |
| 20 | 4.897 | 4.897 | 4.897 |
| 场景 | 通行距离 / km | ||
|---|---|---|---|
| DQN | SQDQN | Dijkstra | |
| 1 | 4.879 | 4.879 | 4.879 |
| 2 | 5.760 | 5.760 | 5.760 |
| 3 | 6.320 | 6.320 | 6.320 |
| 4 | 3.850 | 3.850 | 3.850 |
| 5 | 4.329 | 4.329 | 4.329 |
| 6 | 6.783 | 6.415 | 6.415 |
| 7 | 3.987 | 3.987 | 3.987 |
| 8 | 5.310 | 5.310 | 5.310 |
| 9 | 4.982 | 4.982 | 4.982 |
| 10 | 5.567 | 5.567 | 5.567 |
| 11 | 6.587 | 6.587 | 6.587 |
| 12 | 4.872 | 4.872 | 4.872 |
| 13 | 6.606 | 6.350 | 6.350 |
| 14 | 5.876 | 5.876 | 5.876 |
| 15 | 4.469 | 4.469 | 4.469 |
| 16 | 5.524 | 5.310 | 5.310 |
| 17 | 4.860 | 4.860 | 4.860 |
| 18 | 4.187 | 4.187 | 4.187 |
| 19 | 4.621 | 4.621 | 4.621 |
| 20 | 4.897 | 4.897 | 4.897 |
| 场景 | 通行距离 / km | 平均Q值 | ||||
|---|---|---|---|---|---|---|
| DQN | SQDQN | Dijkstra | DQN | SQDQN | ||
| 1 | 4.962 | 4.962 | 4.879 | 17.09 | 15.59 | |
| 2 | 5.837 | 5.837 | 5.760 | 18.68 | 15.30 | |
| 3 | 6.423 | 6.423 | 6.320 | 17.40 | 15.67 | |
| 4 | 3.933 | 3.933 | 3.850 | 16.96 | 15.15 | |
| 5 | 4.419 | 4.419 | 4.329 | 17.96 | 15.06 | |
| 6 | 5.420 | 5.420 | 4.469 | 20.79 | 18.03 | |
| 7 | 4.072 | 4.072 | 3.987 | 16.45 | 15.54 | |
| 8 | 5.403 | 5.403 | 5.310 | 18.50 | 15.71 | |
| 9 | 5.651 | 5.387 | 5.310 | 22.85 | 15.48 | |
| 10 | 5.657 | 5.657 | 5.567 | 18.83 | 16.00 | |
| 11 | 6.673 | 6.673 | 6.587 | 17.14 | 15.79 | |
| 12 | 4.965 | 4.965 | 4.872 | 18.61 | 15.49 | |
| 13 | 6.705 | 6.420 | 6.350 | 21.97 | 16.12 | |
| 14 | 5.978 | 5.978 | 5.876 | 18.98 | 16.01 | |
| 15 | 6.903 | 6.492 | 6.415 | 20.24 | 15.58 | |
| 16 | 5.074 | 5.074 | 4.982 | 17.49 | 15.18 | |
| 17 | 4.952 | 4.952 | 4.860 | 18.26 | 15.39 | |
| 18 | 4.320 | 4.320 | 4.187 | 17.53 | 15.69 | |
| 19 | 4.701 | 4.701 | 4.621 | 18.14 | 15.38 | |
| 20 | 5.008 | 5.008 | 4.897 | 17.50 | 14.92 | |
| 场景 | 通行距离 / km | 平均Q值 | ||||
|---|---|---|---|---|---|---|
| DQN | SQDQN | Dijkstra | DQN | SQDQN | ||
| 1 | 4.962 | 4.962 | 4.879 | 17.09 | 15.59 | |
| 2 | 5.837 | 5.837 | 5.760 | 18.68 | 15.30 | |
| 3 | 6.423 | 6.423 | 6.320 | 17.40 | 15.67 | |
| 4 | 3.933 | 3.933 | 3.850 | 16.96 | 15.15 | |
| 5 | 4.419 | 4.419 | 4.329 | 17.96 | 15.06 | |
| 6 | 5.420 | 5.420 | 4.469 | 20.79 | 18.03 | |
| 7 | 4.072 | 4.072 | 3.987 | 16.45 | 15.54 | |
| 8 | 5.403 | 5.403 | 5.310 | 18.50 | 15.71 | |
| 9 | 5.651 | 5.387 | 5.310 | 22.85 | 15.48 | |
| 10 | 5.657 | 5.657 | 5.567 | 18.83 | 16.00 | |
| 11 | 6.673 | 6.673 | 6.587 | 17.14 | 15.79 | |
| 12 | 4.965 | 4.965 | 4.872 | 18.61 | 15.49 | |
| 13 | 6.705 | 6.420 | 6.350 | 21.97 | 16.12 | |
| 14 | 5.978 | 5.978 | 5.876 | 18.98 | 16.01 | |
| 15 | 6.903 | 6.492 | 6.415 | 20.24 | 15.58 | |
| 16 | 5.074 | 5.074 | 4.982 | 17.49 | 15.18 | |
| 17 | 4.952 | 4.952 | 4.860 | 18.26 | 15.39 | |
| 18 | 4.320 | 4.320 | 4.187 | 17.53 | 15.69 | |
| 19 | 4.701 | 4.701 | 4.621 | 18.14 | 15.38 | |
| 20 | 5.008 | 5.008 | 4.897 | 17.50 | 14.92 | |
| [1] | Ibarra-Rojas O J, Delgado F. Planning, operation, and control of bus transport systems: a literature review[J]. Transp Res Part B:Methodolog, 2015, 77: 38-75. |
| [2] | KE Li, RAO Xuan, PANG Xiaobing, et al. Route Search and Planning: a survey[J]. Big Data Res, 2021, 26: 1-11. |
| [3] |
Dudeja C, Kumar P. An improved weighted sum-fuzzy Dijkstra’s algorithm for shortest path problem[J]. Soft Comput, 2022, 26(7): 3217-3226.
doi: 10.1007/s00500-022-06871-w |
| [4] |
Senbiswas R, Pal A, Werho T, et al. A graph theoretic approach to power system vulnerability identification[J]. IEEE Trans Power Syst, 2021, 36(2): 923-935.
doi: 10.1109/TPWRS.59 URL |
| [5] |
CHEN Yijing. Application of Improved dijkstra algorithm in coastal tourism route planning[J]. J Coastal Res, 2020, 106: 251-254.
doi: 10.2112/SI106-059.1 URL |
| [6] | XU Minghua, Liu Yuqing, HUANG Qilin, et al. An improved dijkstra’s shortest path algorithm for sparse network[J]. Appl Math Compu, 2007, 185(1): 247-254. |
| [7] | ZHANG Yan, LI Lingling, LIN Xiongzheng, et al. Development of path planning approach using improved a-star algorithm in agv system[J]. J Internet Tech, 2019, 20(3): 915-924. |
| [8] | 杜茂, 杨林. 基于交通时空特征的车辆全局路径规划算法[J]. 汽车安全与节能学报, 2021, 12(1): 52-61. |
| DU Mao, YANG Lin. Vehicle global path planning algorithm based on traffic space-time characteristics[J]. J Auto Safe Energ, 2021, 12(1): 52-61. (in Chinese) | |
| [9] |
SONG Rui, LIU Yuanchang, Bucknall R. Smoothed A* algorithm for practical unmanned surface vehicle path planning[J]. Appl Ocean Res, 2019, 83: 9-20.
doi: 10.1016/j.apor.2018.12.001 URL |
| [10] | Pereira F, Brasil P, Cuadros M, et al. Analysis of local trajectory planners for mobile robot with robot operating system[J]. IEEE Latin Ame Trans, 2022, 20(1): 92-99. |
| [11] | QIAN Xiaohui, ZHONG Xiaopeng. Optimal individualized multimedia tourism route planning based on ant colony algorithms and large data hidden mining[J]. Multimed Tools, 2019, 78(15): 22099-22108. |
| [12] | LI Changgeng, HUANG Xia, DING Jun, et al. Global path planning based on a bidirectional alternating search A* algorithm for mobile robots[J]. Compu Indu Eng, 2022, 168: 1-17. |
| [13] | 张瑞鑫, 王伟, 田泽, 等. 基于模型约束A*算法的无人机三维航迹规划[J]. 国外电子测量技术, 2022, 41(9): 163-169. |
| ZHANG Ruixin, WANG Wei, TIAN Ze, et al. Three dimensional path planning of uav based on model constrained A * algorithm[J]. Fore Electro Meas Tech, 2022, 41(9): 163-169. (in Chinese) | |
| [14] | JIANG Chunyan, Fu Jingfang, LIU Weiyan. Research on vehicle routing planning based on adaptive ant colony and particle swarm optimization algorithm[J]. Int’l J Intell Transp Syst Res, 2021, 19(1): 83-91. |
| [15] | 肖金壮, 余雪乐, 周刚, 等. 一种面向室内AGV路径规划的改进蚁群算法[J]. 仪器仪表学报, 2022, 43(3): 277-285. |
| XIAO Jinzhuang, YU Xuele, ZHOU Gang, et al. An improved ant colony algorithm for indoor agv path planning[J]. J Instru, 2022, 43(3): 277-285. (in Chinese) | |
| [16] | 杨立炜, 付丽霞, 王倩, 等. 多层优化蚁群算法的移动机器人路径规划研究[J]. 电子测量与仪器学报, 2021, 35(9): 10-18. |
| YANG Liwei, FU Lixia, WANG Qian, et al. Research on path planning of mobile robot based on multi-level optimization ant colony algorithm[J]. J Electro Meas Instr, 2021, 35(9): 10-18. (in Chinese) | |
| [17] | LI Xiaojing, YU Dongman. Study on an optimal path planning for a robot based on an improved ant colony algorithm[J]. Automatic Contr Compu Sci, 2019, 53(3): 236-243. |
| [18] | LOU Ping, XU Kun, JIANG Xuemei, et al. Path planning in an unknown environment based on deep reinforcement learning with prior knowledge[J]. J Intell Fuzzy Syst, 2021, 41(6): 5773-5789. |
| [19] | PAN Jie, WANG Xuesong, CHENG Yuhu, et al. Multisource transfer double dqn based on actor learning[J]. IEEE Trans Neur Networks Learn Syst, 2018 29(6): 2227-2238. |
| [20] | 李文礼, 张友松. 基于深度强化学习的车辆自主避撞决策控制模型[J]. 汽车安全与节能学报, 2021, 12(2): 201-209. |
| LI Wenli, ZHANG Yousong. Vehicle autonomous collision avoidance decision control model based on deep reinforcement learning[J]. J Auto Safe Energy 2021, 12(2): 201-209. (in Chinese) | |
| [21] |
LI Jianxin, CHEN Yiting, ZHAO Xiuniao, et al. An improved DQN path planning algorithm[J]. J Supercomput, 2022, 78(1): 616-639.
doi: 10.1007/s11227-021-03878-2 |
| [22] |
PENG Ningyezi, XI Yuliang, RAO Jinmeng. Urban multiple route planning model using dynamic programming in reinforcement learning[J]. IEEE Trans Intel Transp Syst, 2021, 23(7): 8037-8047.
doi: 10.1109/TITS.2021.3075221 URL |
| [23] | Watkins C J C H. Learning from delayed rewards[D]. Cambridge: University of Cambridge, 1989. |
| [24] | Van Hasselt H. Double Q¬learning[C]// 23rd Adv Neur Info Proc Syst (NeurIPS). Canada, Vancouver, British Columbia, 2010: 1-19. |
| [25] | Martin M L, Carro B, Esguevillas A. Application of deep reinforcement learning to intrusion detection for supervised problems[J]. Expert Syst Appl, 2020, 141: 1-15. |
| [26] | 黄琰, 张锦. 基于深度强化学习的车辆路径问题求解方法[J]. 交通运输工程与信息学报, 2022, 20(3): 114-127. |
| HUANG Yan, ZHANG Jin. Vehicle routing problem solving method based on deep reinforce-ment learning[J]. J Transp Eng Info, 2022, 20(3): 114-127. (in Chinese) |
| [1] | HONG Ying, SHA Yuchen, DING Fei, CHEN Zhu, ZHANG Dengyin. Resource allocation optimization and performance evaluation for 5G cellular vehicle-to-everything (C-V2X) [J]. Journal of Automotive Safety and Energy, 2023, 14(3): 346-354. |
| [2] | ZHANG Hong, YU Hailiang, ZHENG Zan, YUAN Shengdong, XIONG Guoqiang. Modeling and application of road intersections based on digital twins [J]. Journal of Automotive Safety and Energy, 2023, 14(1): 55-61. |
| [3] | LI Pingfei, JIN Siyu, HU Wenhao, GAO Li, CHE Yaoyu, TAN Zhengping, DONG Xiaofei. Complexity evaluation of vehicle-vehicle accident scenarios for autonomous driving simulation tests [J]. Journal of Automotive Safety and Energy, 2022, 13(4): 697-704. |
| [4] | LI Wenli, XIAO Kaiwen, REN Yongpeng, LI Chao, Yi Fan. Path planning and control method for vehicle obstacle avoidance in pedestrian crossing scenes [J]. Journal of Automotive Safety and Energy, 2022, 13(3): 489-501. |
| [5] | ZHENG Yangjun, HE Shuai, SHUAI Zhibin, LI Jianqiu, GAI Jiangtao, LI Yong, ZHANG Ying, LI Guohui. Lateral velocity estimation for four-wheel-independent-drive electric vehicles based on deep reinforcement learning [J]. Journal of Automotive Safety and Energy, 2022, 13(2): 309-316. |
| [6] | LI Yaohua, FAN Jikang, LIU Yang, HE Jie, LI Zetian, PAN Shaofei. Path planning and path tracking control for autonomous vehicle based on MPC with adaptive dual-horizon-parameters [J]. Journal of Automotive Safety and Energy, 2021, 12(4): 528-539. |
| [7] | DU Mao, YANG Lin, JIN Yue, TU Jiayu. Vehicle global path planning algorithm based on spatio-temporal characteristics of traffic [J]. Journal of Automotive Safety and Energy, 2021, 12(1): 52-61. |
| [8] | LI Guofa,CHEN Yaoyu,Lü Chen,TAO Da,CAO Dongpu,CHENG Bo. Key techniques of semantic analysis of driving behavior in decision making of autonomous vehicles [J]. Journal Of Automotive Safety And Energy, 2019, 10(4): 391-412. |
| [9] | ZHOU Qing, JI Peijun, HUANG Yi, et al. Challenges and opportunities of smart occupant protection against motor vehicle collision accidents in future traffic environment [J]. Journal Of Automotive Safety And Energy, 2017, 08(04): 333-350. |
| [10] | ZHU Xichan, LIU Zhichao, LI Lin. Evasive manoeuvre for emergency steering based on typical vehicle-pedestrian use case [J]. Journal Of Automotive Safety And Energy, 2015, 6(03): 217-223. |
| [11] | ZHOU Jing, PENG Hui. String Stability of Adaptive Cruise Control Systems with Servo-loop Dynamics [J]. Journal of Automotive Safety and Energy, 2010, 1(1): 30-39. |
| Viewed | ||||||
|
Full text |
|
|||||
|
Abstract |
|
|||||