测绘学报 ›› 2021, Vol. 50 ›› Issue (2): 260-269.doi: 10.11947/j.AGCS.2021.20200187

• 海洋测量学 • 上一篇    下一篇

侧扫声呐识别沉船影像的迁移学习卷积神经网络法

汤寓麟, 金绍华, 边刚, 张永厚, 李凡   

  1. 海军大连舰艇学院军事海洋与测绘系, 辽宁 大连 116018
  • 收稿日期:2020-05-12 修回日期:2020-10-14 发布日期:2021-03-03
  • 通讯作者: 金绍华 E-mail:jsh_1978@163.com
  • 作者简介:汤寓麟(1996-),男,硕士生,研究方向为侧扫声纳图像处理与计算机视觉。E-mail:494592292@qq.com
  • 基金资助:
    国家自然科学基金(41876103;41576105)

The transfer learning with convolutional neural network method of side-scan sonar to identify wreck images

TANG Yulin, JIN Shaohua, BIAN Gang, ZHANG Yonghou, LI Fan   

  1. Department of Hydrography and Cartography, Dalian Naval Academy, Dalian 116018, China
  • Received:2020-05-12 Revised:2020-10-14 Published:2021-03-03
  • Supported by:
    The National Natural Science Foundation of China (Nos. 41876103;41576105)

摘要: 侧扫声呐海底沉船图像识别是水下障碍物核查和失事船只搜救中的一项重要工作。针对传统侧扫声呐图像人工判读存在效率低、耗时长、资源消耗大及主观不确定性强和过分依赖经验等问题,本文尝试引入卷积神经网络的方法,同时考虑到侧扫声呐沉船图像属于小样本数据集,提出一种基于迁移学习的卷积神经网络侧扫声呐沉船图像自动识别方法。通过归一化处理、图像增强等方式扩充样本数据,并以4∶1的比例划分训练集和测试集,同时参照经典VGG-16模型,根据侧扫声呐沉船数据集特点设计了改进的模型,然后将在ImageNet图像数据集上训练好的改进模型在小样本侧扫声呐沉船数据集上采用冻结和训练、微调两种迁移学习方式进行学习和试验,并与全新学习进行比较分析,结果表明,3种方法对侧扫声呐沉船图像识别的准确率分别为93.71%、84.49%和90.58%,其中第1种迁移学习方法准确率最高,模型收敛速度最快,且AP值最高为92.45%,分别比第2种迁移学习方法和全新学习高了8.06%和3.06%,在提高模型的识别能力和训练效率方面效果更佳,验证了该方法的有效性与可行性,具有一定实际指导意义。

关键词: 侧扫声呐海底沉船, 图像识别, 迁移学习, 卷积神经网络, VGG-16

Abstract: The Side-scan sonar image automatic recognition is an important part of verification for underwater obstacle and wreck search and rescue, in view of the traditional artificial interpretation of side-scan sonar image is inefficient, time consuming and resource consumption and strong subjective uncertainty and excessive reliance on experience. This paper attempts to introduce the method of convolutional neural network, considering that the side-scan sonar shipwreck image belongs to a small sample data set, and an automatic recognition method of side-scan sonar shipwreck image based on transfer learning is proposed.The sample data were expanded by means of normalization and image enhancement, the training set and testing set were divided into 4∶1 proportions, and an improved model was designed according to the characteristics of the side-scan sonar wreck data set by referring to the classical VGG-16 model, then, the improved model trained on the ImageNet image data set is used to learn and experiment on the small sample side-scan sonar shipwreck data set using two transfer learning methods: freeze and train and fine-tuning, and compared with new learning. The results show that the accuracy of the three methods for the recognition of side-scan sonar shipwreck images is 93.71%, 84.49% and 90.58%, respectively. The first transfer learning method has the highest accuracy rate, the fastest model convergence speed, and the highest AP value 92.45%, which is 8.06% and 3.06% higher than the second transfer learning and the new learning method, respectively,and has a better effect in improving the model’s recognition ability and training efficiency. which verifies the effectiveness and feasibility of this method and has certain practical guiding significance.

Key words: side-scan sonar wreck image, image recognition, transfer learning, convolutional neural network, VGG-16

中图分类号: