Unmanned vehicle path planning based on deep Q learning in real environment

doi:10.6040/j.issn.1672-3961.0.2020.247

Journal of Shandong University (Engineering Science), 2021, Vol. 51, Issue (1): 100-107.

Hao XIAO¹, Zhuhua LIAO¹,²,*, Yizhi LIU¹,², Silin LIU¹, Jianxun LIU¹,²

1. School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan 411201, Hunan, China
2. Hunan Provincial Key Laboratory of Knowledge Processing and Networked Manufacturing, Xiangtan 411201, Hunan, China

Received: 2020-06-28 Online: 2021-02-20 Published: 2021-03-01
Corresponding author: Zhuhua LIAO. E-mail: xiaohao1217@foxmail.com; zhliao@hnust.edu.cn
About the first author: Hao XIAO (born 1995), male, from Chengdu, Sichuan; master's student; main research interests are machine learning and path planning. E-mail: xiaohao1217@foxmail.com
Funding: National Natural Science Foundation of China (61370227); Natural Science Foundation of Hunan Province (2017JJ2081, 2018JJ4052); Key Project of the Education Department of Hunan Province (17A070, 19A172, 19A174); Scientific Research Project (17C0646, 19C0755)

Abstract

Planning an optimal path in a real traffic environment is a key problem for the intelligent navigation of unmanned vehicles. Existing research on global path planning for unmanned vehicles focuses mainly on speeding up algorithms in simulated environments. Most methods optimize only the path distance or consider only the current condition of the road itself, and ignore other factors in the real environment and how they will change in the future. To handle the complex task of global path planning for unmanned vehicles in a dynamic road network, this study proposes a framework for an unmanned vehicle driving system in real environments, based on the idea of predicting first and then planning. It also combines deep Q learning with a deep prediction network to obtain a fast global path planning method (deep prediction network and deep Q network, DP-DQN). The method uses road feature data such as spatio-temporal and weather information to predict future traffic conditions and then solves for the globally optimal path. Experiments and evaluation on public datasets show that, compared with algorithms such as Dijkstra and A*, the proposed method reduces driving time by up to 17.97%.

Key words: global path planning, traffic environment, urban road network, deep Q learning, deep prediction network
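The "predict first, then plan" idea in the abstract can be illustrated with a very small sketch. This is not the authors' DP-DQN: the road network, the speeds, and the `predict_speed` function are all made up, the deep prediction network is replaced by a lookup table scaled by weather, and the deep Q network is replaced by tabular Q-value iteration. It only shows how predicted travel times, rather than distances, can drive the choice of route.

```python
# Toy road network: node -> {neighbor: road length in km}. All values are invented.
ROADS = {
    "A": {"B": 2.0, "C": 2.5},
    "B": {"D": 2.0},
    "C": {"D": 2.0},
    "D": {},
}


def predict_speed(edge, weather):
    """Hypothetical stand-in for a prediction network: speed in km/h per edge."""
    base = {("A", "B"): 30.0, ("B", "D"): 20.0, ("A", "C"): 50.0, ("C", "D"): 45.0}
    factor = 0.8 if weather == "rain" else 1.0  # assumed weather effect
    return base[edge] * factor


def predicted_minutes(weather):
    """Step 1 (predict): travel time in minutes for every edge."""
    return {
        (u, v): 60.0 * km / predict_speed((u, v), weather)
        for u, neighbors in ROADS.items()
        for v, km in neighbors.items()
    }


def q_value_iteration(cost, goal, sweeps=50):
    """Step 2 (plan): tabular Q iteration with reward = -predicted travel time."""
    q = {edge: 0.0 for edge in cost}
    for _ in range(sweeps):
        for (u, v) in cost:
            if v == goal:
                future = 0.0
            else:
                # Best value reachable from v; dead ends get -inf.
                future = max((q[(v, w)] for w in ROADS[v]), default=float("-inf"))
            q[(u, v)] = -cost[(u, v)] + future
    return q


def greedy_path(q, start, goal):
    """Follow the action with the highest Q value (assumes an acyclic network)."""
    path = [start]
    while path[-1] != goal:
        u = path[-1]
        path.append(max(ROADS[u], key=lambda v: q[(u, v)]))
    return path


cost = predicted_minutes("rain")
q = q_value_iteration(cost, goal="D")
print(greedy_path(q, "A", "D"))
```

In this toy network the route through B is shorter in distance (4.0 km against 4.5 km), but the route through C has the lower predicted travel time, so a planner that rewards time picks C.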

CLC number: TP311

Cite this article: Hao XIAO, Zhuhua LIAO, Yizhi LIU, Silin LIU, Jianxun LIU. Unmanned vehicle path planning based on deep Q learning in real environment[J]. Journal of Shandong University (Engineering Science), 2021, 51(1): 100-107.

Figures and tables: 11 (Figures 1-8, Tables 1-3)

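Field	Description	Sample data
obj_id	TTI object id	841
batch_time	Timestamp	2018-01-01T00:00:00
tti	Traffic index data	1.18665
speed	Average speed	47.3983
geom	Geometric extent of the TTI object	MULTILINESTRING((104.13743 30.60591, 104.13832 30.60538), …)
access_time	Travel time	10.6576
temperature	Temperature	5
weather	Weather	Cloudy
wind	Wind	Light breeze (no sustained direction)
AQI	Air quality index	115 (light pollution)
POI longitude/latitude	Longitude and latitude of a map point of interest (GCJ-02 coordinate system)	104.02022, 30.7022
Clicked	2: clicked; 1: not clicked	2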
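
As a rough illustration of how one row of the field table above might be loaded before it is fed to a prediction model, the sketch below parses the sample values into a typed record. The class name `TrafficRecord`, the helper `to_features`, and the choice of features are hypothetical and not taken from the paper; only the field names and sample values come from the table.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class TrafficRecord:
    """One traffic sample; field names follow the table, types are assumed."""
    obj_id: int
    batch_time: datetime
    tti: float
    speed: float
    access_time: float
    temperature: float
    weather: str


def parse_record(raw: dict) -> TrafficRecord:
    """Convert a dict of strings (e.g. one CSV row) into a TrafficRecord."""
    return TrafficRecord(
        obj_id=int(raw["obj_id"]),
        batch_time=datetime.fromisoformat(raw["batch_time"]),
        tti=float(raw["tti"]),
        speed=float(raw["speed"]),
        access_time=float(raw["access_time"]),
        temperature=float(raw["temperature"]),
        weather=raw["weather"],
    )


def to_features(rec: TrafficRecord) -> list:
    """Hypothetical feature vector: hour, weekday, temperature, tti, speed."""
    return [rec.batch_time.hour, rec.batch_time.weekday(),
            rec.temperature, rec.tti, rec.speed]


sample = {
    "obj_id": "841",
    "batch_time": "2018-01-01T00:00:00",
    "tti": "1.18665",
    "speed": "47.3983",
    "access_time": "10.6576",
    "temperature": "5",
    "weather": "cloudy",
}
record = parse_record(sample)
print(to_features(record))
```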

Network type	MAE	MSE	Time/μs
DNN	5.64	7.74	17
RNN	6.97	7.85	911
GRU	5.52	6.28	2000
LSTM	5.79	8.16	1911
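
For reference, the error metrics named in the table above are conventionally defined as in the sketch below. The sample values are invented and the function names are only illustrative. Note that for any one dataset RMSE is at least MAE, so MSE is at least MAE squared; the tabulated MSE values are smaller than the squared MAE values, which may mean the column reports a root or otherwise rescaled error. The page does not say which.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error, in the same unit as the data."""
    return math.sqrt(mse(y_true, y_pred))


# Invented travel times (observed vs. predicted), for illustration only.
observed = [10.0, 12.0, 9.0]
predicted = [11.0, 10.0, 9.0]

print(mae(observed, predicted), mse(observed, predicted), rmse(observed, predicted))
```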

Algorithm	1 km O(n)/ms	1 km H/min	5 km O(n)/ms	5 km H/min	10 km O(n)/ms	10 km H/min	15 km O(n)/ms	15 km H/min	20 km O(n)/ms	20 km H/min	25 km O(n)/ms	25 km H/min
A*	0.49	6.12	1.56	14.89	7.32	18.37	18.34	25.10	45.58	36.01	102.04	39.51
Dijkstra	3.63	6.23	6.48	14.75	26.26	18.64	44.87	25.78	111.35	36.49	232.64	39.88
DQN	0.31	6.59	1.03	14.12	4.14	18.90	6.87	25.56	8.74	36.13	10.89	39.10
DP-DQN	0.46	5.11	1.42	13.24	5.64	16.17	7.19	21.26	10.26	32.03	18.76	37.13
ATL	0.27	6.61	1.13	13.73	3.28	19.14	9.84	25.17	12.87	35.25	25.57	38.33
DATL	1.17	5.12	3.23	13.84	6.74	15.81	13.87	21.22	17.49	32.24	31.46	37.51
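
The relative reduction in driving time can be recomputed from the table above with a few lines of arithmetic. The sketch below assumes that the H/min columns are driving times in minutes, which the page does not state explicitly. Under that reading, the largest reduction of DP-DQN relative to Dijkstra occurs at 1 km and is about 17.98%, which appears to correspond to the 17.97% quoted in the abstract.

```python
# H/min values copied from the table; reading them as driving time in minutes
# is an assumption.
DISTANCES_KM = [1, 5, 10, 15, 20, 25]
DRIVE_MIN = {
    "A*": [6.12, 14.89, 18.37, 25.10, 36.01, 39.51],
    "Dijkstra": [6.23, 14.75, 18.64, 25.78, 36.49, 39.88],
    "DP-DQN": [5.11, 13.24, 16.17, 21.26, 32.03, 37.13],
}


def reductions(baseline, method="DP-DQN"):
    """Relative driving-time reduction of `method` against `baseline` per distance."""
    return {
        km: (b - m) / b
        for km, b, m in zip(DISTANCES_KM, DRIVE_MIN[baseline], DRIVE_MIN[method])
    }


def best_reduction(baseline):
    """Return (distance in km, reduction) where the reduction is largest."""
    table = reductions(baseline)
    km = max(table, key=table.get)
    return km, table[km]


for name in ("Dijkstra", "A*"):
    km, value = best_reduction(name)
    print(f"vs {name}: largest reduction {value:.2%} at {km} km")
```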
