基于改进YOLOv3的复杂场景车辆分类与跟踪

doi:10.6040/j.issn.1672-3961.0.2019.412

摘要/Abstract

摘要：

针对天气条件和车辆间相互遮挡对车辆分类与跟踪准确性和稳定性的影响,提出一种基于改进YOLOv3与匹配跟踪的混合模型。改进的YOLOv3网络参照密集连接卷积网络的设计思想,将网络中的残差层替换为密集卷积块并改变网络的设计结构,利用Softmax分类器将密集卷积块与卷积层中融合的特征进行分类。根据单帧图像的检测结果,设计目标匹配函数解决视频序列中车辆的跟踪问题。在KITTI数据集的测试中,改进算法的平均准确率为93.01%,帧率达到48.98帧/s,在自建的数据集中平均识别率为95.79%。试验结果表明,本研究方法在复杂场景中能够有效的区分车辆种类且准确性更高,车辆跟踪的算法具有较高准确性和鲁棒性。

关键词: 图像处理, 车辆分类, 卷积神经网络, YOLOv3, 匹配跟踪

Abstract:

Aiming at the influence of weather conditions and mutual occlusion of vehicles on vehicle classification and tracking accuracy and stability, a hybrid model based on improved YOLOv3 and matching tracking was proposed. The improved YOLOv3 network refered to DenseNet′s design idea, replaced the residual layer in the network with a dense convolution block and changed the design structure of the network. The fused features of dense convolution blocks and convolution layers were classified by using Softmax classifier. According to the detection result of single frame image, the target matching function was designed to solve the vehicle tracking problem in video sequence. In the KITTI dataset test, the improved algorithm achieved an average precision of 93.01%, the number of frames per second reached 48.98, and the average recognition rate in the self-built dataset was 95.79%. The experimental results showed that the proposed method could effectively distinguish the types of vehicles in complex scenes with higher accuracy. At the same time, the method had higher accuracy and robustness in vehicle tracking.

Key words: image processing, vehicle classification, convolutional neural network, YOLOv3, match tracking

中图分类号:

TP391

宋士奇,朴燕,蒋泽新. 基于改进YOLOv3的复杂场景车辆分类与跟踪[J]. 山东大学学报 (工学版), 2020, 50(2): 27-33.

Shiqi SONG,Yan PIAO,Zexin JIANG. Vehicle classification and tracking for complex scenes based on improved YOLOv3[J]. Journal of Shandong University(Engineering Science), 2020, 50(2): 27-33.

图/表 9

图1

图2

图3

表1

图4

图5

表2

图6

表3

参考文献 16

1	赵娜, 袁家斌, 徐晗. 智能交通系统综述[J]. 计算机科学, 2014, 41 (11): 7- 11. doi: 10.11896/j.issn.1002-137X.2014.11.002
	ZHAO Na , YUAN Jiabin , XU Han . Overview of intelligent transportation systems[J]. Computer Science, 2014, 41 (11): 7- 11. doi: 10.11896/j.issn.1002-137X.2014.11.002
2	DUANMU F , MA Z , WANG Y . Fast mode and partition decision using machine learning for intra-frame coding in HEVC screen content coding extension[J]. IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 2016, 6 (4): 517- 531. doi: 10.1109/JETCAS.2016.2597698
3	SARIPAN K, NUTHONG C. Tree-based vehicle classification system[C]//2017 14th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON). Phuket, Thailand: IEEE, 2017: 439-442.
4	PUROHIT N, ISRANI D.Vehicle classification and surveillance using machine learning technique[C]//2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT). Bengaluru, India: IEEE, 2017: 910-914.
5	KAFAI M , BHANU B . Dynamic Bayesian networks for vehicle classification in video[J]. IEEE Transactions on Industrial Informatics, 2012, 8 (1): 100- 109. doi: 10.1109/TII.2011.2173203
6	GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//2014 IEEE Conference on Computer Vision and Pattern Recognition(CVPR). Columbus, USA: IEEE, 2014: 580-587.
7	REN S, HE K, Girshick R, et al. Faster R-cnn: towards real-time object detection with region proposal networks[C]//Advances in Neural Information Processing Systems. Quebec, Canada: Curran Associates Inc., 2015: 91-99.
8	REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition(CVPR). Las Vegas, USA: IEEE, 2016: 779-788.
9	LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot multibox detector[C]//European Conference on Computer Vision. Amsterdam, Netherlands: Springer, 2016: 21-37.
10	REDMON J, FARHADI A. Yolov3: an incremental improvement[EB/OL].(2018-04-01) [2019-07-22]. https://www.bibsonomy.org/bibtex/bbdec3df168e9809-d9e61423d4b4e062.arXiv:1804.02767v1, 2018.
11	HUANG G, LIU Z, VAN Der Maaten L, et al. Densely connected convolutional networks[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR). Hawaii, USA: IEEE, 2017: 4700-4708.
12	LIN T Y, DOLLAR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR). Hawaii, USA: IEEE, 2017: 2117-2125.
13	GEIGER A, LENZ P, URTASUN R. Are we ready for autonomous driving? the kitti vision benchmark suite[C]//2012 IEEE Conference on Computer Vision and Pattern Recognition(CVPR). Rhode Island, USA: IEEE, 2012: 3354-3361.
14	KALAL Z, MATAS J, MIKOLAJCZYK K. Pn learning: Bootstrapping binary classifiers by structural constraints[C]//2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR). California, USA: IEEE, 2010: 49-56.
15	WANG N, YEUNG D Y. Learning a deep compact image representation for visual tracking[C]//Advances in Neural Information Processing Systems. Lake Tahoe, USA: Curran Associates Inc, 2013: 809-817.
16	ZHONG W, LU H, YANG M H. Robust object tracking via sparsity-based collaborative model[C]//2012 IEEE Conference on Computer Vision and Pattern Recognition(CVPR). Rhode Island, USA: IEEE, 2012: 1838-1845.

相关文章 15

[1]	董明书,陈俐企,马川义,张珠皓,孙仁娟,管延华,庄培芝. 沥青路面内部裂缝雷达图像智能判识算法研究[J]. 山东大学学报 (工学版), 2025, 55(3): 72-79.
[2]	李伟豪,王苹苹,许万博,魏本征. 结构先验引导的多模态腰椎MRI图像分割算法[J]. 山东大学学报 (工学版), 2025, 55(1): 66-76.
[3]	鲁志恒,霍延强,韩汶,杜聪,刘轶鹏,张宏博. 基于图像数据和碎石集料级配与用量的碎石集料空隙率快速检测方法[J]. 山东大学学报 (工学版), 2024, 54(6): 89-99.
[4]	马翔悦,徐金东,倪梦莹. 基于多尺度特征模糊卷积神经网络的遥感图像分割[J]. 山东大学学报 (工学版), 2024, 54(3): 44-54.
[5]	岳仁峰,张嘉琦,刘勇,范学忠,李琮琮,孔令鑫. 基于颜色和纹理特征的立体车库锈蚀检测技术[J]. 山东大学学报 (工学版), 2024, 54(3): 64-69.
[6]	迟云浩,杨璐,郭杰,郝凡昌,聂秀山. 基于注意力特征融合网络的手指静脉图像质量评价方法[J]. 山东大学学报 (工学版), 2023, 53(6): 56-62.
[7]	那绪博,张莹,李沐阳,陈元畅,华云鹏. 基于ODCG的网约车需求预测模型[J]. 山东大学学报 (工学版), 2023, 53(5): 48-56.
[8]	范海雯,郝旭东,赵康,邢法财,蒋哲,李常刚. 基于卷积神经网络的含分布式光伏配电网静态等值[J]. 山东大学学报 (工学版), 2023, 53(4): 140-148.
[9]	王智伟,徐海超,郭相阳,马炯,褚云龙,陈前昌,卢治. 基于卷积神经网络和层次分析的新能源电源调频能力智能预测方法[J]. 山东大学学报 (工学版), 2022, 52(5): 70-76.
[10]	韩天雨,路长厚,李建美,尹昂,侯秋林. 利用图像处理技术测量丝杠螺距的机器视觉系统[J]. 山东大学学报 (工学版), 2022, 52(3): 80-85.
[11]	张学思,张婷,刘兆英,江天鹏. 基于轻量型卷积神经网络的海面红外显著性目标检测方法[J]. 山东大学学报 (工学版), 2022, 52(2): 41-49.
[12]	王心哲,邓棋文,王际潮,范剑超. 深度语义分割MRF模型的海洋筏式养殖信息提取[J]. 山东大学学报 (工学版), 2022, 52(2): 89-98.
[13]	尹旭,刘兆英,张婷,李玉鑑. 基于弱监督和半监督学习的红外舰船分割方法[J]. 山东大学学报 (工学版), 2022, 52(2): 99-106.
[14]	宋怀雷, 邬忠虎, 李利平, 娄义黎, 孙文吉斌, 刘镐, 左宇军. 基于数字图像的微观尺度下方解石脉对页岩各向异性的影响[J]. 山东大学学报 (工学版), 2021, 51(5): 91-99.
[15]	陶亮,刘宝宁,梁玮. 基于CNN-LSTM 混合模型的心律失常自动检测[J]. 山东大学学报 (工学版), 2021, 51(3): 30-36.

多维度评价

Viewed

Full text

Abstract

Cited

Shared

Discussed

方法	平均准确率均值/%	传输速率/(帧·s^-1)
R-CNN	52.76	0.38
Fast R-CNN	61.53	0.54
SSD	80.01	51.26
YOLO	56.78	67.43
YOLOv3	90.76	48.99
本文	93.01	48.98

方法	场景	目标分类率	误检率	漏检率
Fast R-CNN	白天	77.78	11.56	10.66
YOLO	白天	73.26	19.75	6.99
YOLOv3	白天	92.41	6.46	1.13
本文	白天	97.03	2.91	0.06
Fast R-CNN	夜晚	71.67	13.15	15.18
YOLO	夜晚	68.50	17.28	14.22
YOLOv3	夜晚	87.98	7.63	4.39
文本	夜晚	94.76	4.09	1.15
Fast R-CNN	雨天	77.54	11.98	10.48
YOLO	雨天	72.33	19.47	8.20
YOLOv3	雨天	91.89	6.78	1.33
本文	雨天	96.73	2.03	1.24
Fast R-CNN	雾天	75.02	12.83	12.15
YOLO	雾天	71.32	16.84	11.84
YOLOv3	雾天	89.57	7.55	2.88
本文	雾天	94.71	3.69	1.60
Fast R-CNN	雪天	75.54	11.98	12.48
YOLO	雪天	70.33	18.47	11.20
YOLOv3	雪天	90.89	7.78	1.33
本文	雪天	95.73	3.03	1.24

方法	成功率	精确率
CSK	54.5	39.8
TLD	55.8	43.7
ALSA	53.2	42.4
本文	55.5	45.6