您的位置:山东大学 -> 科技期刊社 -> 《山东大学学报(工学版)》

山东大学学报 (工学版) ›› 2020, Vol. 50 ›› Issue (4): 22-27.doi: 10.6040/j.issn.1672-3961.0.2019.416

• • 上一篇    下一篇

基于卷积神经网络的深度线段分类算法

赵宁宁,唐雪嵩*,赵鸣博   

  1. 东华大学信息科学与技术学院, 上海 201620
  • 发布日期:2020-08-13
  • 作者简介:赵宁宁(1995— ),女, 山东菏泽人, 硕士研究生, 主要研究方向为人工智能与图像处理. E-mail:2171363@mail.dhu. edu.cn. *通信作者简介:唐雪嵩(1985— ),男,湖南长沙人,讲师,博士,主要研究方向为人工智能与图像处理. E-mail:tangxs@dhu.edu.cn
  • 基金资助:
    国家自然科学基金资助项目(61601112);中央高校基本科研业务费专项资金和东华大学励志计划资助项目

Depth segment classification algorithm based on convolutional neural network

ZHAO Ningning, TANG Xuesong*, ZHAO Mingbo   

  1. College of Information Science and Technology, Donghua University, Shanghai 201620, China
  • Published:2020-08-13

摘要: 为解决单目图像中冗余像素点不利于深度神经网络快速完成深度信息检测的问题,提出一种基于卷积神经网络的深度线段分类算法。对NYU-Depth数据集使用线段检测算法进行线段检测得到原始图像的线段特征图,通过数据预处理结合深度数据得到表征深度信息的线段集合及其标签,提出适用于线段特征的卷积神经网络,实现单目图像中深度线段的分类。通过在不同线段数目上进行多次多组对比试验,深度线段分类准确率达到73.50%。试验结果证明了利用卷积神经网络实现深度线段分类的可实施性,有助于更好的利用图像几何特征解决深度估计问题。

关键词: 单目图像, 深度估计, 卷积网络, 深度线段, 分类算法

Abstract: In order to solve the problem that redundant pixels in monocular images influenced depth information detection, a depth segment classification algorithm based on the convolutional neural network was proposed. We used NYU-Depth dataset to detect the segment-based features. Afterward, depth information was represented by line segments and its labels by the data preprocessing. The convolutional neural network was designed for considering the characteristics of the segments, and the classification of depth segments in monocular images was realized. By conducting several multi-group comparison experiments on different hyper-parameters, the accuracy of depth segment classification reached 73.50%. This experimental results proved the implement ability of the depth segment classification based on convolutional neural network, which was helpful to deep estimation using geometric features of images.

Key words: monocular image, depth estimation, convolutional network, depth segment, classification algorithm

中图分类号: 

  • TP183
[1] ZHONG Y F, FFENG F, ZHANG L P. Large patch convolutional neural networks for the scene classification of high spatial resolution imagery[J]. Journal of Applied Remote Sensing, 2016, 10(2): 025006.
[2] MASSON G, YANG D, et al. Version and vergence eye movements in humans: open-loop dynamics determined by monocular rather than binocular image speed[J]. Vision Research, 2002, 42(26): 2853-2867.
[3] SHOTTON J, FITZGIBBON A, COOK M, et al. Real-time human pose recognition in parts from single depth images[J]. Communications of the ACM, 2013, 56(1): 116-124.
[4] KANEKO A M, YAMAMOTO K. Two-view monocular depth estimation by optic-flow-weighted fusion[J]. IEEE Robotics and Automation Letters, 2019, 4(2): 830-837.
[5] AURISANO A, RADOVIC A, ROCCO D, et al. A convolutional neural network neutrino event classifier[J]. Journal of Instrumentation, 2016, 11(9): P09001.
[6] LIU F, SHEN C, LIN G. Deep convolutional neural fields for depth estimation from a single image[C] //Computer Vision and Pattern Recognition(CVPR). Boston, USA: IEEE Computer Society, 2015: 5162-5170.
[7] BO L, DAI Y, HE M. Monocular depth estimation with hierarchical fusion of dilated CNNs and soft-weighted-sum inference[J]. Pattern Recognition, 2017, 83: 328-339.
[8] GRIGOREV A, JIANG F, RHO S, et al. Depth estimation from single monocular images using deep hybrid network[J]. Multimedia Tools and Applications, 2017, 76(18): 18585-18604.
[9] DAN X, RICCI E, OUYANG W, et al. Monocular depth estimation using multi-scale continuous CRFs as sequential deep networks[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2019, 41(6): 1426-1440.
[10] LI J, KLEIN R, YAO A, et al. A two-streamed network for estimating fine-scaled depth maps from single RGB images[J]. Computer Vision and Image Understanding, 2019, 186:25-36.
[11] WANG X, HOU C, PU L, et al. A depth estimating method from a single image using FoE CRF[J]. Multimedia Tools & Applications, 2015, 74(21): 9491-9506.
[12] XU H, JIANG M, LI F. Depth estimation algorithm based on data-driven approach and depth cues for stereo conversion in three-dimensional displays[J]. Optical Engineering, 2016, 55(12): 12106-1-12106-11.
[13] LI B, DAI Y, HE M. Monocular depth estimation with hierarchical fusion of dilated CNNs and soft-weighted-sum inference[J]. Pattern Recognition, 2018, 83: 328-339.
[14] CHEN L, PAPANDREOU G, KOKKINOS I, et al. Semantic image segmentation with deep convolutional nets and fully connected CRFs[J]. Computer Science, 2018, 40(4): 357-361.
[15] CAO Y, SHEN C, SHEN H. Exploiting depth from single monocular images for object detection and semantic segmentation[J]. IEEE Transactions on Image Processing, 2017, 26(2): 836-846.
[16] CAO Y, WU Z, SHEN C.Estimating depth from monocular images as classification using deep fully convolutional residual Networks[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2016, 38(10): 1-11.
[17] CHOI S, MIN D, HAM B, et al. Depth analogy: data-driven approach for single image depth estimation using gradient samples[J]. IEEE Transactions on Image Processing, 2015, 24(12): 5953-5966.
[18] QIN H, LI X, WANG Y, et al. Depth estimation by parameter transfer with a lightweight model for single still images[J]. IEEE Transactions on Circuits & Systems for Video Technology, 2017, 27(4): 748-759.
[19] 覃勋辉,马戎.一种基于梯度的直线段检测算法[J].光子学报, 2012,41(2):205-209. RONG X, MA R. A line segments detection algorithm based on grad[J]. Photonics Journal, 2012, 41(2): 205-209.
[20] GIOI R, MOREL J,et al. LSD: a fast line segment detector with a false detection control[J]. IEEE Transactions on Software Engineering, 2010, 32(4): 722-732.
[21] PATON K. Line detection by local methods[J]. Computer Graphics and Image Processing, 1979, 9(4): 316-332.
[22] BACON J, KING-SMITH P E. The detection of line segments[J]. Perception, 1977, 6(2): 125-131.
[1] 邹正标,刘毅志,廖祝华,赵肄江. 动态交通流量预测的时空注意力图卷积网络[J]. 山东大学学报 (工学版), 2024, 54(5): 50-61.
[2] 武新章,梁祥宇,朱虹谕,张冬冬. 基于CEEMDAN-GRA-PCC-ATCN的短期风电功率预测[J]. 山东大学学报 (工学版), 2022, 52(6): 146-156.
[3] 黄彩云,陈德武,何吉福,胡艺,王楠,陈沛. 基于改进双路径网络的上肢肌肉骨骼异常检测[J]. 山东大学学报 (工学版), 2022, 52(3): 25-33.
[4] 周杨浩,刘一帆,李瑮. 一种自动读取指针式仪表读数的方法[J]. 山东大学学报 (工学版), 2019, 49(4): 1-7.
[5] 王婷婷,翟俊海,张明阳,郝璞. 基于HBase和SimHash的大数据K-近邻算法[J]. 山东大学学报(工学版), 2018, 48(3): 54-59.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] 张永花,王安玲,刘福平 . 低频非均匀电磁波在导电界面的反射相角[J]. 山东大学学报(工学版), 2006, 36(2): 22 -25 .
[2] 孔祥臻,刘延俊,王勇,赵秀华 . 气动比例阀的死区补偿与仿真[J]. 山东大学学报(工学版), 2006, 36(1): 99 -102 .
[3] 来翔 . 用胞映射方法讨论一类MKdV方程[J]. 山东大学学报(工学版), 2006, 36(1): 87 -92 .
[4] 余嘉元1 , 田金亭1 , 朱强忠2 . 计算智能在心理学中的应用[J]. 山东大学学报(工学版), 2009, 39(1): 1 -5 .
[5] 季涛,高旭,孙同景,薛永端,徐丙垠 . 铁路10 kV自闭/贯通线路故障行波特征分析[J]. 山东大学学报(工学版), 2006, 36(2): 111 -116 .
[6] 秦通,孙丰荣*,王丽梅,王庆浩,李新彩. 基于极大圆盘引导的形状插值实现三维表面重建[J]. 山东大学学报(工学版), 2010, 40(3): 1 -5 .
[7] 孙殿柱,朱昌志,李延瑞 . 散乱点云边界特征快速提取算法[J]. 山东大学学报(工学版), 2009, 39(1): 84 -86 .
[8] 孙玉利,李法德,左敦稳,戚美 . 直立分室式流体连续通电加热系统的升温特性[J]. 山东大学学报(工学版), 2006, 36(6): 19 -23 .
[9] 胡天亮,李鹏,张承瑞,左毅 . 基于VHDL的正交编码脉冲电路解码计数器设计[J]. 山东大学学报(工学版), 2008, 38(3): 10 -13 .
[10] 卜德云 张道强. 自适应谱聚类算法研究[J]. 山东大学学报(工学版), 2009, 39(5): 22 -26 .