Journal of Shandong University (Engineering Science) ›› 2019, Vol. 49 ›› Issue (2): 61-66. doi: 10.6040/j.issn.1672-3961.0.2017.432

• Machine Learning & Data Mining •

Images auto-encoding algorithm based on deep convolution neural network

Yijiang HE1, Junping DU1,*, Feifei KOU1, Meiyu LIANG1, Wei WANG2, Ang LUO2

  1. School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, China
    2. Sina. Com Technology (China) Corporation, Beijing 100876, China
  • Received:2017-05-05 Online:2019-04-20 Published:2019-04-19
  • Contact: Junping DU E-mail:he66024748@163.com;junpingdu@126.com
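  • Supported by:
    Key Program of the National Natural Science Foundation of China (61532006); International Cooperation Project of the National Natural Science Foundation of China (61320106006); Young Scientists Fund of the National Natural Science Foundation of China (61502042)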

Abstract:

Existing image coding research focuses on preserving information without loss and does not reflect the differences among social network images. A novel auto-encoding algorithm for social network images based on a deep convolutional neural network was proposed. The algorithm combined the feature extraction ability of deep convolutional neural networks with the characteristics of images in social networks. First, it clustered the social network images according to their characteristics and obtained distance information. Next, a deep convolutional neural network was trained to learn the distance information of these images, and the output of a fully connected layer of the network was extracted as the image code. These steps were repeated to obtain the final image codes. The experimental results showed that the proposed algorithm outperformed the compared algorithms in image search and was better adapted to social network image search.
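
The clustering step described above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say which clustering algorithm or features were used, nor how "distance information" is defined. The sketch assumes plain k-means with a deterministic farthest-first initialisation, and assumes that the distance information is the Euclidean distance from each image's feature vector to every cluster centre. The names `kmeans`, `distance_information`, and the toy `features` are hypothetical.

```python
import math


def kmeans(points, k, iters=20):
    """Lloyd's k-means with farthest-first initialisation (an assumption)."""
    # Start from the first point, then repeatedly add the point that is
    # farthest from all centroids chosen so far.
    centroids = [list(points[0])]
    while len(centroids) < k:
        far = max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        centroids.append(list(far))
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest centroid by Euclidean distance.
        labels = [min(range(k), key=lambda c: math.dist(p, centroids[c]))
                  for p in points]
        # Update step: mean of the members; an empty cluster keeps its centroid.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, labels


def distance_information(points, centroids):
    """Distance from each feature vector to every cluster centre (assumed definition)."""
    return [[math.dist(p, c) for c in centroids] for p in points]


# Hypothetical 2-D features standing in for six social network images.
features = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.2),
            (5.0, 5.1), (5.2, 4.9), (4.9, 5.0)]
centroids, labels = kmeans(features, k=2)
dist_info = distance_information(features, centroids)
```

Under these assumptions, `labels` and `dist_info` are the kind of targets a network could then be trained on; how the paper actually feeds them to the network is not stated in the abstract.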

Key words: deep convolution, neural network, social network pictures, image auto-encoding, image search

CLC Number: 

  • TP37

Fig.1

The general framework of the auto-encoding algorithm for social network images based on deep convolution neural network

Fig.2

Distance information extraction

Fig.3

Deep convolution neural network coding learning and generation process

Table 1

Deep convolution neural network architecture

Layer  Function
1      Input
2      Conv3-64
3      Conv3-64
4      Maxpool
5      Conv3-128
6      Conv3-128
7      Maxpool
8      Conv3-256
9      Conv3-256
10     Conv3-256
11     Maxpool
12     Conv3-512
13     Conv3-512
14     Conv3-512
15     Maxpool
16     Conv3-512
17     Conv3-512
18     Conv3-512
19     FC-4096
20     FC-2048
21     FC-512
22     Softmax
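
To make the layer stack in Table 1 easier to follow, the sketch below traces the tensor shape through each layer. Several things are assumptions not given in the table: a 224×224×3 input, 3×3 convolutions with stride 1 and padding 1 (so spatial size is preserved), and 2×2 max pooling with stride 2. The table gives no output size for the Softmax layer, so the sketch leaves the shape unchanged there. The function name `trace_shapes` is hypothetical.

```python
# Layers 2-22 transcribed from Table 1 (layer 1 is the input itself).
LAYERS = [
    "Conv3-64", "Conv3-64", "Maxpool",
    "Conv3-128", "Conv3-128", "Maxpool",
    "Conv3-256", "Conv3-256", "Conv3-256", "Maxpool",
    "Conv3-512", "Conv3-512", "Conv3-512", "Maxpool",
    "Conv3-512", "Conv3-512", "Conv3-512",
    "FC-4096", "FC-2048", "FC-512", "Softmax",
]


def trace_shapes(layers, height=224, width=224, channels=3):
    """Return a list of (layer name, output shape) pairs.

    Assumes 'same'-padded 3x3 convolutions and 2x2/stride-2 pooling;
    the input size is an assumption as well.
    """
    shape = (channels, height, width)
    shapes = []
    for name in layers:
        if name.startswith("Conv3-"):
            # Convolution changes only the channel count.
            shape = (int(name.split("-")[1]), shape[1], shape[2])
        elif name == "Maxpool":
            # Pooling halves both spatial dimensions.
            shape = (shape[0], shape[1] // 2, shape[2] // 2)
        elif name.startswith("FC-"):
            # Fully connected layers flatten their input to a vector.
            shape = (int(name.split("-")[1]),)
        # Softmax: shape left unchanged (output size not given in Table 1).
        shapes.append((name, shape))
    return shapes


shapes = trace_shapes(LAYERS)
for name, shape in shapes:
    print(f"{name:10s} {shape}")
```

Under these assumptions the last convolutional block outputs 512×14×14, because Table 1 lists only four pooling layers, and the three fully connected layers reduce this to a 512-dimensional vector.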

Table 2

Comparison of the Precision@5 of different algorithms on social network images

Algorithm  16-bit  32-bit  64-bit
DA         0.538   0.532   0.584
AEVB       0.554   0.422   0.356
DCNNSE-1   0.330   0.566   0.590
DCNNSE-2   0.572   0.606   0.596
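
For readers unfamiliar with the metric, the sketch below shows one common way to compute Precision@5 for code-based image search. It assumes that images are represented as binary codes stored in Python integers and that candidates are ranked by Hamming distance to the query; the page does not state how the paper ranks results or judges relevance, so these are illustrative assumptions. All names and the toy data are hypothetical.

```python
def hamming(a, b):
    """Number of differing bits between two codes stored as integers."""
    return bin(a ^ b).count("1")


def rank_by_hamming(query_code, database_codes):
    """Database indices sorted by Hamming distance to the query (stable for ties)."""
    return sorted(range(len(database_codes)),
                  key=lambda i: hamming(query_code, database_codes[i]))


def precision_at_k(ranked_ids, relevant_ids, k=5):
    """Fraction of the top-k results that are relevant."""
    hits = sum(1 for i in ranked_ids[:k] if i in relevant_ids)
    return hits / k


# Hypothetical 4-bit codes for six database images and one query.
database = [0b1010, 0b1011, 0b0101, 0b1000, 0b0000, 0b1110]
query = 0b1010
relevant = {0, 1, 2, 5}  # hypothetical ground truth for this query

ranked = rank_by_hamming(query, database)
p_at_5 = precision_at_k(ranked, relevant, k=5)
```

In this toy case three of the five nearest codes are relevant, so `p_at_5` is 0.6. Values like those in Table 2 would be the average of this quantity over all queries.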

Fig.4

The Precision@5 histogram of different algorithms on social network images

Table 3

Comparison of the MAP@5 of different algorithms on social network images

Algorithm  16-bit  32-bit  64-bit
DA         0.180   0.202   0.218
AEVB       0.200   0.176   0.162
DCNNSE-1   0.138   0.204   0.209
DCNNSE-2   0.212   0.244   0.212
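
MAP@5 rewards rankings that place relevant images early in the top five. The sketch below uses one common definition: average precision at k is the sum of the precision at each rank where a relevant item appears, divided by min(k, number of relevant items), and MAP@k is the mean over queries. Other normalisations exist and the page does not say which one the paper uses, so treat the denominator as an assumption. The function names and toy rankings are hypothetical.

```python
def average_precision_at_k(ranked_ids, relevant_ids, k=5):
    """AP@k normalised by min(k, number of relevant items) (assumed variant)."""
    hits = 0
    score = 0.0
    for rank, item in enumerate(ranked_ids[:k], start=1):
        if item in relevant_ids:
            hits += 1
            score += hits / rank  # precision at this rank
    denom = min(k, len(relevant_ids))
    return score / denom if denom else 0.0


def mean_average_precision_at_k(all_ranked, all_relevant, k=5):
    """Mean of AP@k over all queries."""
    aps = [average_precision_at_k(r, rel, k)
           for r, rel in zip(all_ranked, all_relevant)]
    return sum(aps) / len(aps)


# Two hypothetical queries: ranked result ids and their relevant ids.
rankings = [[0, 1, 3, 5, 4, 2], [1, 0]]
relevants = [{0, 1, 2, 5}, {1}]

ap_first = average_precision_at_k(rankings[0], relevants[0])
map_at_5 = mean_average_precision_at_k(rankings, relevants)
```

For the first query the relevant hits fall at ranks 1, 2 and 4, giving (1 + 1 + 0.75) / 4 = 0.6875; the second query scores 1.0, so the mean is 0.84375.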

Fig.5

The MAP@5 histogram of different algorithms on social network images
