Journal of Shandong University (Engineering Science) ›› 2019, Vol. 49 ›› Issue (1): 107-113. doi: 10.6040/j.issn.1672-3961.0.2017.385

• Electrical Engineering •

Algorithm of underwater target recognition based on CNN features with BOF

Wenwen QUAN, Mingxing LIN*

  1. School of Mechanical Engineering, Shandong University, Jinan 250061, Shandong, China
  • Received: 2017-08-03 Online: 2019-02-01 Published: 2019-03-01
  • Contact: Mingxing LIN E-mail: qw13541179302@126.com; mxlin2000@163.com

Abstract:

Scale-invariant feature transform (SIFT) matching is a low-level representation that lacks sufficient features and is therefore prone to false matches. To reduce these false matches, an improved bag-of-features (BOF) algorithm combined with convolutional neural network (CNN) features was proposed; the CNN features have stronger semantic discrimination ability, which raises the recognition rates. A target image database was built from the LifeCLEF fish videos on the ImageCLEF website. A CNN with the AlexNet architecture was trained in Caffe, and features were extracted from both the database images and the query images. The extracted CNN features were then processed in MATLAB, where the Hamming distance was calculated to verify the matching effect. In addition, the Hamming distance threshold was varied to test its effect on the target recognition results. Experiments on the self-built image database showed that fusing deep learning features effectively improves the underwater target recognition rates of the BOF algorithm, and that the Hamming distance threshold must be chosen to suit the actual application.

Key words: underwater target recognition, bag of features, scale invariant feature transform matching, convolution neural networks, Hamming distance

CLC Number: 

  • TP391
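
The matching check described in the abstract (compare CNN features of a query image and a database image by Hamming distance, then apply a threshold T) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: binarizing each feature vector at its own mean, normalizing the distance by the vector length, accepting a match when the distance is at most T, and the function names `binarize`, `normalized_hamming`, and `keep_match` are all assumptions. The default threshold 0.6 mirrors the value T=0.600 given in the Table 2 caption.

```python
from typing import List, Sequence


def binarize(features: Sequence[float]) -> List[int]:
    """Turn a real-valued feature vector into bits.

    Assumed rule: a component becomes 1 if it exceeds the vector's mean,
    otherwise 0. The source page does not state the binarization rule.
    """
    mean = sum(features) / len(features)
    return [1 if value > mean else 0 for value in features]


def normalized_hamming(a: Sequence[int], b: Sequence[int]) -> float:
    """Fraction of positions where the two bit vectors differ (0.0 to 1.0)."""
    if len(a) != len(b):
        raise ValueError("bit vectors must have the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def keep_match(query_feat: Sequence[float],
               db_feat: Sequence[float],
               threshold: float = 0.6) -> bool:
    """Accept a candidate match only if the CNN features are close enough.

    Assumption: a match is kept when the normalized Hamming distance
    between the binarized features is at most the threshold T.
    """
    distance = normalized_hamming(binarize(query_feat), binarize(db_feat))
    return distance <= threshold


# Toy feature vectors, made up purely for illustration.
query = [0.9, 0.1, 0.8, 0.2]
candidate = [0.7, 0.3, 0.1, 0.9]
print(normalized_hamming(binarize(query), binarize(candidate)))
print(keep_match(query, candidate, threshold=0.6))
```

Under these assumptions, lowering `threshold` rejects more candidate matches and raising it accepts more, which is the trade-off the abstract says must be tuned to the actual application.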

Fig.1 The framework of the underwater target recognition method with CNN features

Fig.2 Fish species in the new image databases

Table 1 The number of images for training, validation and testing

Species                     Training  Validation  Test   Training + validation
Chromis chrysura            2 310     289         288    2 599
Chaetodon lunulatus         926       116         116    1 042
Plectroglyphidodon dickii   2 300     288         288    2 588
Dascyllus aruanus           1 516     190         189    1 706
Dascyllus reticulatus       2 303     288         288    2 591
Zebrasoma scopas            650       82          82     732
Total                       10 005    1 253       1 251  11 258

Table 2 The recognition rates of six kinds of fish (T=0.600), in %

Species / Method            BOF    CNN    BOF+Conv5  BOF+Fc6  BOF+Fc7
Chromis chrysura            74.0   74.3   86.8       86.7     76.0
Chaetodon lunulatus         70.7   73.8   85.5       85.3     72.8
Plectroglyphidodon dickii   71.6   72.6   83.1       86.2     74.6
Dascyllus aruanus           76.5   75.8   87.3       87.0     76.8
Dascyllus reticulatus       69.7   74.2   82.8       84.5     71.3
Zebrasoma scopas            69.5   72.4   82.1       84.8     74.2
Average recognition rate    72.2   73.9   84.7       85.9     74.3
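
The average recognition rates in the last row of Table 2 are not plain means of the six per-species values (for BOF the plain mean is 72.0, not 72.2). They do match a mean weighted by the number of test images per species in Table 1. The page does not state this, so the weighting is an inference from the numbers, and the names `test_counts`, `rates`, and `weighted_mean` below are illustrative only.

```python
from typing import Dict, List, Sequence

# Test-image counts per species, in Table 1 row order.
test_counts: List[int] = [288, 116, 288, 189, 288, 82]

# Per-species recognition rates (%) from Table 2, one list per method,
# in the same species order as test_counts.
rates: Dict[str, List[float]] = {
    "BOF":       [74.0, 70.7, 71.6, 76.5, 69.7, 69.5],
    "CNN":       [74.3, 73.8, 72.6, 75.8, 74.2, 72.4],
    "BOF+Conv5": [86.8, 85.5, 83.1, 87.3, 82.8, 82.1],
    "BOF+Fc6":   [86.7, 85.3, 86.2, 87.0, 84.5, 84.8],
    "BOF+Fc7":   [76.0, 72.8, 74.6, 76.8, 71.3, 74.2],
}


def weighted_mean(values: Sequence[float], weights: Sequence[int]) -> float:
    """Mean of values where each value counts in proportion to its weight."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


# Weighted average per method, rounded to one decimal as in the table.
averages: Dict[str, float] = {
    method: round(weighted_mean(values, test_counts), 1)
    for method, values in rates.items()
}

for method, average in averages.items():
    print(f"{method}: {average}")
```

Read this way, the averages reflect the share of correctly recognized test images overall, so species with more test images (for example the three with 288 each) influence the average more than Zebrasoma scopas with 82.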

Fig.3 Distribution of Hamming distance

Fig.4 The impact of parameter T on recognition rates
