基于弱监督和半监督学习的红外舰船分割方法

doi:10.6040/j.issn.1672-3961.0.2021.329

摘要/Abstract

摘要： 为降低获取像素级标签的成本,提出一种基于弱监督和半监督学习的红外舰船分割方法,在残差网络(residual network, ResNet)的基础上,设计一个自适应定位模块,并使用相似损失、前景损失和背景损失训练自适应定位模块,生成舰船定位图;利用少量像素级标签数据和大量定位图数据交替训练显著性网络生成显著图;用条件随机场优化显著图,并结合图像级标签生成伪标签图像,使用伪标签图像训练分割网络,得到红外舰船的分割结果。在红外舰船数据集上的平均交并比为71.18%,与当前其他先进方法进行对比,平均交并比提高了9.47%,试验结果表明自适应定位模块能够有效定位红外舰船,交替训练方法可以使红外舰船的边缘更准确。

关键词: 红外舰船, 语义分割, 弱监督学习, 半监督学习, 卷积神经网络

中图分类号:

尹旭,刘兆英,张婷,李玉鑑. 基于弱监督和半监督学习的红外舰船分割方法[J]. 山东大学学报 (工学版), 2022, 52(2): 99-106.

YIN Xu, LIU Zhaoying, ZHANG Ting, LI Yujian. Infrared ship segmentation method based on weakly-supervised and semi-supervised learning[J]. Journal of Shandong University(Engineering Science), 2022, 52(2): 99-106.

参考文献

[1] LONG J, SHELHAMER E, DARRELL T. Fully convolutional networks for semantic segmentation[C] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2015: 3431-3440.
[2] RONNEBERGER O, FISCHER P, BROX T. U-net: convolutional networks for biomedical image segmentation[C] //Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham, Switzerland: Springer, 2015: 234-241.
[3] BADRINARAYANAN V, KENDALL A, CIPOLLA R. Segnet: a deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12): 2481-2495.
[4] LIN G, MILAN A, SHEN C, et al. Refinenet: multi-path refinement networks for high-resolution semantic segmentation[C] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2017: 1925-1934.
[5] CHEN L C, PAPANDREOU G, KOKKINOS I, et al. Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 40(4): 834-848.
[6] ZHAO H, SHI J, QI X, et al. Pyramid scene parsing network[C] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2017: 2881-2890.
[7] CHEN L C, ZHU Y, PAPANDREOU G, et al. Encoder-decoder with atrous separable convolution for semantic image segmentation[C] //Proceedings of the European Conference on Computer Vision. Cham, Switzerland: Spri-nger, 2018: 801-818.
[8] 陈倩. 基于局部区域生长和Faster R-CNN的弱监督图像语义分割[D].合肥: 安徽大学, 2020. CHEN Qian. Weakly supervised image semantic segmentation based on local region growth and faster R-CNN[D]. Hefei: Anhui University, 2020.
[9] ZHOU B, KHOSLA A, LAPEDRIZA A, et al. Learning deep features for discriminative localization[C] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2016: 2921-2929.
[10] SINGH K K, LEE Y J. Hide-and-seek: forcing a network to be meticulous for weakly-supervised object and action localization[C] //Proceedings of the 2017 IEEE International Conference on Computer Vision. New York, USA: IEEE, 2017: 3544-3553.
[11] WEI Y, FENG J, LIANG X, et al. Object region mining with adversarial erasing: a simple classification to semantic segmentation approach[C] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2017: 1568-1576.
[12] MAI J, YANG M, LUO W. Erasing integrated learning: a simple yet effective approach for weakly supervised object localization[C] //Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2020: 8766-8775.
[13] ZHANG X, WEI Y, FENG J, et al. Adversarial complementary learning for weakly supervised object localization[C] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2018: 1325-1334.
[14] HOU Q, JIANG P T, WEI Y, et al. Self-erasing network for integral object attention[J]. Advances in Neural Information Processing Systems, 2018, 31: 549-559.
[15] KIM D, CHO D, YOO D, et al. Two-phase learning for weakly supervised object localization[C] //Proceedings of the IEEE International Conference on Computer Vision. New York, USA: IEEE, 2017: 3534-3543
[16] WANG X, SHRIVASTAVA A, GUPTA A. A-fast-rcnn: hard positive generation via adversary for object detection[C] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2017: 2606-2615.
[17] WEI Y, XIAO H, SHI H, et al. Revisiting dilated convolution: a simple approach for weakly- and semi-supervised semantic segmentation[C] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2018: 7268-7277.
[18] FAN J, ZHANG Z, TAN T, et al. Cian: cross-image affinity net for weakly supervised semantic segmentation[C] //Proceedings of the AAAI Conference on Arti-ficial Intelligence. Palo Alto, USA: AAAI, 2020, 10762-10769.
[19] FAN R, HOU Q, CHENG M M, et al. Associating inter-image salient instances for weakly supervised semantic segmentation[C] //Proceedings of the European Conference on Computer Vision. Berlin, Germany: Springer, 2018: 367-383.
[20] KOLESNIKOV A, LAMPERT C H. Seed, expand and constrain: three principles for weakly-supervised image segmentation[C] //Proceedings of the European Conference on Computer Vision. Cham, Switzerland: Springer, 2016: 695-711.
[21] HUANG Z, WANG X, WANG J, et al. Weakly-supervised semantic segmentation network with deep seeded region growing[C] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2018: 7014-7023.
[22] WANG X, YOU S, LI X, et al. Weakly-supervised semantic segmentation by iteratively mining common object features[C] //Proceedings of the IEEE Confe-rence on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2018: 1354-1362.
[23] AHN J, KWAK S. Learning pixel-level semantic affinity with image-level supervision for weakly supervised semantic segmentation[C] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2018: 4981-4990.
[24] SELVARAJU R R, COGSWELL M, DAS A, et al. Grad-cam: visual explanations from deep networks via gradient-based localization[C] //Proceedings of the IEEE International Conference on Computer Vision. New York, USA: IEEE, 2017: 618-626.
[25] SOULY N, SPAMPINATO C, SHAH M. Semi supervised semantic segmentation using generative adversarial network[C] //Proceedings of the IEEE International Conference on Computer Vision. New York, USA: IEEE, 2017: 5688-5696.
[26] AI J, CHEN S, DENG P, et al. CycleGANs for semi-supervised defects segmentation[C] //Proceedings of the 2020 International Conference on Sensing, Measurement & Data Analytics in the Era of Artificial Intel-ligence(ICSMD). Xi'an, China: IEEE, 2020: 611-616.
[27] HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2016: 770-778.
[28] WU Z, SU L, HUANG Q. Cascaded partial decoder for fast and accurate salient object detection[C] //Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2019: 3907-3916.
[29] KINGMA D, BA J. Adam: a method for stochastic optimization[C] //Proceedings of the 3rd International Conference on Learning Representations. CA, USA: ICLR, 2015: 1-13.
[30] KRÄHENB(¨overU)HL P, KOLTUN V. Efficient inference in fully connected crfs with gaussian edge potentials[J]. Advances in Neural Information Processing Systems, 2011, 24(24): 109-117.

相关文章 15

[1]	吴秋兰,尚素雅,张家辉,孙守鑫,张峰,周波,高峥,史文宠. 基于多尺度特征融合的马铃薯疮痂病图像语义分割方法[J]. 山东大学学报 (工学版), 2025, 55(4): 1-8.
[2]	董明书,陈俐企,马川义,张珠皓,孙仁娟,管延华,庄培芝. 沥青路面内部裂缝雷达图像智能判识算法研究[J]. 山东大学学报 (工学版), 2025, 55(3): 72-79.
[3]	李伟豪,王苹苹,许万博,魏本征. 结构先验引导的多模态腰椎MRI图像分割算法[J]. 山东大学学报 (工学版), 2025, 55(1): 66-76.
[4]	马翔悦,徐金东,倪梦莹. 基于多尺度特征模糊卷积神经网络的遥感图像分割[J]. 山东大学学报 (工学版), 2024, 54(3): 44-54.
[5]	迟云浩,杨璐,郭杰,郝凡昌,聂秀山. 基于注意力特征融合网络的手指静脉图像质量评价方法[J]. 山东大学学报 (工学版), 2023, 53(6): 56-62.
[6]	王碧瑶,韩毅,崔航滨,刘毅超,任铭然,高维勇,陈姝廷,刘嘉巍,崔洋. 基于图像的道路语义分割检测方法[J]. 山东大学学报 (工学版), 2023, 53(5): 37-47.
[7]	那绪博,张莹,李沐阳,陈元畅,华云鹏. 基于ODCG的网约车需求预测模型[J]. 山东大学学报 (工学版), 2023, 53(5): 48-56.
[8]	范海雯,郝旭东,赵康,邢法财,蒋哲,李常刚. 基于卷积神经网络的含分布式光伏配电网静态等值[J]. 山东大学学报 (工学版), 2023, 53(4): 140-148.
[9]	李旭涛,杨寒玉,卢业飞,张玮. 基于深度学习的遥感图像道路分割[J]. 山东大学学报 (工学版), 2022, 52(6): 139-145.
[10]	王智伟,徐海超,郭相阳,马炯,褚云龙,陈前昌,卢治. 基于卷积神经网络和层次分析的新能源电源调频能力智能预测方法[J]. 山东大学学报 (工学版), 2022, 52(5): 70-76.
[11]	张学思,张婷,刘兆英,江天鹏. 基于轻量型卷积神经网络的海面红外显著性目标检测方法[J]. 山东大学学报 (工学版), 2022, 52(2): 41-49.
[12]	龚楷伦,翟婷婷,唐鸿成. 一种面向多标签分类的在线主动学习算法[J]. 山东大学学报 (工学版), 2022, 52(2): 80-88.
[13]	王心哲,邓棋文,王际潮,范剑超. 深度语义分割MRF模型的海洋筏式养殖信息提取[J]. 山东大学学报 (工学版), 2022, 52(2): 89-98.
[14]	朱恒东, 马盈仓, 代雪珍. 自适应半监督邻域聚类算法[J]. 山东大学学报 (工学版), 2021, 51(4): 24-34.
[15]	陶亮,刘宝宁,梁玮. 基于CNN-LSTM 混合模型的心律失常自动检测[J]. 山东大学学报 (工学版), 2021, 51(3): 30-36.

多维度评价

Viewed

Full text

Abstract

Cited

Shared

Discussed