Images auto-encoding algorithm based on deep convolution neural network

doi:10.6040/j.issn.1672-3961.0.2017.432

Journal of Shandong University (Engineering Science), 2019, Vol. 49, Issue 2: 61-66.

Yijiang HE¹, Junping DU¹,*, Feifei KOU¹, Meiyu LIANG¹, Wei WANG², Ang LUO²

1. School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, China
2. Sina.com Technology (China) Corporation, Beijing 100876, China

Received: 2017-05-05 Online: 2019-04-20 Published: 2019-04-19
Contact: Junping DU E-mail: he66024748@163.com; junpingdu@126.com
About the author: Yijiang HE (born 1994), male, from Daoxian, Hunan; master's degree; main research interests are machine learning and data mining. E-mail: he66024748@163.com
Supported by: Key Program of the National Natural Science Foundation of China (61532006); International Cooperation Program of the National Natural Science Foundation of China (61320106006); Young Scientists Fund of the National Natural Science Foundation of China (61502042)

Abstract:

Existing image coding research concentrates on preserving information losslessly and does not capture how distinguishable social network images are from one another. This paper proposes a novel social network image auto-encoding algorithm based on a deep convolutional neural network. The algorithm combines the feature extraction ability of deep convolutional neural networks with the characteristics of images in social networks to obtain image codes that perform well. Drawing on the characteristics of social network images and a clustering algorithm, the images are first clustered to obtain distance information. A deep convolutional neural network is then trained to learn this distance information, and a fully connected layer of the network is extracted as the image code. These steps are repeated to obtain the final image codes. Experimental results show that the proposed algorithm outperforms the other algorithms compared in image search and is better suited to social network image search.

Key words: deep convolution, neural network, social network pictures, image auto-encoding, image search
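The loop described in the abstract (cluster, derive distance information, train a network on it, take a hidden layer as the code, repeat) can be sketched roughly as follows. This is a toy illustration under stated assumptions, not the authors' implementation: a one-hidden-layer network trained with squared error stands in for the deep convolutional network, plain k-means stands in for the unspecified clustering algorithm, and the names `kmeans`, `train_encoder`, and `encode_images` as well as all hyperparameters are hypothetical.

```python
import numpy as np


def kmeans(x, k, n_iter=20, seed=0):
    """Plain k-means; returns cluster labels and the distances to every centroid."""
    rng = np.random.default_rng(seed)
    centers = x[rng.choice(len(x), size=k, replace=False)]  # fancy indexing copies
    for _ in range(n_iter):
        dist = np.linalg.norm(x[:, None, :] - centers[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):  # leave an empty cluster's centroid unchanged
                centers[j] = x[labels == j].mean(axis=0)
    dist = np.linalg.norm(x[:, None, :] - centers[None, :, :], axis=2)
    return dist.argmin(axis=1), dist


def train_encoder(x, targets, code_dim, n_epochs=200, lr=0.01, seed=0):
    """Stand-in for the CNN (assumption): x -> tanh hidden layer -> targets.

    The hidden layer plays the role of the fully connected layer that is
    extracted as the image code.
    """
    rng = np.random.default_rng(seed)
    w1 = rng.normal(scale=0.1, size=(x.shape[1], code_dim))
    w2 = rng.normal(scale=0.1, size=(code_dim, targets.shape[1]))
    for _ in range(n_epochs):
        code = np.tanh(x @ w1)
        err = code @ w2 - targets                    # gradient of 0.5 * squared error
        grad_w2 = code.T @ err / len(x)
        grad_code = (err @ w2.T) * (1.0 - code ** 2)  # back through tanh
        grad_w1 = x.T @ grad_code / len(x)
        w1 -= lr * grad_w1
        w2 -= lr * grad_w2
    return np.tanh(x @ w1)


def encode_images(features, k=4, code_dim=16, n_rounds=2):
    """Cluster, learn the distance information, extract codes, and repeat."""
    codes = features
    for _ in range(n_rounds):
        _, dist = kmeans(codes, k)                        # distance information
        codes = train_encoder(features, dist, code_dim)   # hidden layer as code
    return codes
```

Calling `encode_images(features)` on an `(n_images, n_features)` array returns an `(n_images, code_dim)` array of codes. Re-clustering the codes from the previous round is one reading of "these steps are repeated"; the source does not spell out the details.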

CLC number: TP37

Cite this article:

Yijiang HE, Junping DU, Feifei KOU, Meiyu LIANG, Wei WANG, Ang LUO. Images auto-encoding algorithm based on deep convolution neural network[J]. Journal of Shandong University (Engineering Science), 2019, 49(2): 61-66.

Figures/Tables (8): Figures 1-5, Tables 1-3

Layer	Function
1	Input
2	Conv3-64
3	Conv3-64
4	Maxpool
5	Conv3-128
6	Conv3-128
7	Maxpool
8	Conv3-256
9	Conv3-256
10	Conv3-256
11	Maxpool
12	Conv3-512
13	Conv3-512
14	Conv3-512
15	Maxpool
16	Conv3-512
17	Conv3-512
18	Conv3-512
19	FC-4096
20	FC-2048
21	FC-512
22	Softmax
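To make the layer table easier to check, the sketch below traces output shapes and parameter counts through the listed layers. Only the layer names come from the table. The 224 x 224 RGB input, stride-1 convolutions with padding 1, and 2 x 2 max pooling with stride 2 are assumptions borrowed from the usual VGG convention. The Softmax layer is left out because its number of classes is not given, and `trace` is a hypothetical helper name.

```python
# Layers 2-21 from the table; Input is modelled by the function arguments.
LAYERS = [
    "Conv3-64", "Conv3-64", "Maxpool",
    "Conv3-128", "Conv3-128", "Maxpool",
    "Conv3-256", "Conv3-256", "Conv3-256", "Maxpool",
    "Conv3-512", "Conv3-512", "Conv3-512", "Maxpool",
    "Conv3-512", "Conv3-512", "Conv3-512",
    "FC-4096", "FC-2048", "FC-512",
]


def trace(layers, height=224, width=224, channels=3):
    """Return a list of (layer name, output shape, parameter count).

    Assumes 3x3 convolutions with stride 1 and padding 1 (spatial size kept)
    and 2x2 max pooling with stride 2 (spatial size halved).
    """
    rows = []
    shape = (height, width, channels)
    for name in layers:
        kind, _, arg = name.partition("-")
        if kind == "Conv3":
            out_channels = int(arg)
            params = 3 * 3 * shape[2] * out_channels + out_channels  # weights + biases
            shape = (shape[0], shape[1], out_channels)
        elif kind == "Maxpool":
            params = 0
            shape = (shape[0] // 2, shape[1] // 2, shape[2])
        elif kind == "FC":
            n_in = 1
            for dim in shape:  # flatten whatever shape comes in
                n_in *= dim
            n_out = int(arg)
            params = n_in * n_out + n_out
            shape = (n_out,)
        else:
            raise ValueError(f"unknown layer type: {name}")
        rows.append((name, shape, params))
    return rows


if __name__ == "__main__":
    for name, shape, params in trace(LAYERS):
        print(f"{name:10s} {str(shape):18s} {params:>12,d}")
```

Under these assumptions the last FC-512 output is a 512-dimensional vector. Because the table lists no pooling layer between the final convolution block and FC-4096, the first fully connected layer dominates the parameter count.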

Algorithm	16 bits	32 bits	64 bits
DA	0.538	0.532	0.584
AEVB	0.554	0.422	0.356
DCNNSE-1	0.330	0.566	0.590
DCNNSE-2	0.572	0.606	0.596

Algorithm	16 bits	32 bits	64 bits
DA	0.180	0.202	0.218
AEVB	0.200	0.176	0.162
DCNNSE-1	0.138	0.204	0.209
DCNNSE-2	0.212	0.244	0.212
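The two results tables can be compared column by column with a few lines of Python. The numbers below are copied from the tables. The metric is not named in this extract, so the sketch assumes that a higher score is better, in line with the abstract's claim. The names `FIRST_TABLE`, `SECOND_TABLE`, and `best_per_length` are hypothetical.

```python
CODE_LENGTHS = (16, 32, 64)  # code length in bits, one per table column

FIRST_TABLE = {
    "DA": (0.538, 0.532, 0.584),
    "AEVB": (0.554, 0.422, 0.356),
    "DCNNSE-1": (0.330, 0.566, 0.590),
    "DCNNSE-2": (0.572, 0.606, 0.596),
}

SECOND_TABLE = {
    "DA": (0.180, 0.202, 0.218),
    "AEVB": (0.200, 0.176, 0.162),
    "DCNNSE-1": (0.138, 0.204, 0.209),
    "DCNNSE-2": (0.212, 0.244, 0.212),
}


def best_per_length(table):
    """Map each code length to the algorithm with the highest score (assumed better)."""
    return {
        bits: max(table, key=lambda name: table[name][column])
        for column, bits in enumerate(CODE_LENGTHS)
    }


if __name__ == "__main__":
    print(best_per_length(FIRST_TABLE))
    print(best_per_length(SECOND_TABLE))
```

On these numbers DCNNSE-2 has the highest score at every code length in the first table and at 16 and 32 bits in the second, while DA is slightly ahead at 64 bits in the second table (0.218 against 0.212).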
