测绘学报 ›› 2026, Vol. 55 ›› Issue (3): 502-514.doi: 10.11947/j.AGCS.2026.20250362

• 海洋测绘 • 上一篇    下一篇

侧扫声呐图像目标识别模型类增量更新方法

于永灿1,2(), 赵建虎1,2(), 李冰墨1,2, 贺子扬1,2   

  1. 1.武汉大学测绘学院,湖北 武汉 430079
    2.武汉大学海洋研究院,湖北 武汉 430079
  • 收稿日期:2025-09-04 修回日期:2026-02-25 出版日期:2026-04-16 发布日期:2026-04-16
  • 通讯作者: 赵建虎 E-mail:2019202140056@whu.edu.cn;jhzhao@sgg.whu.edu.cn
  • 作者简介:于永灿(1998—),男,博士生,研究方向为水下目标检测、声呐图像处理。E-mail:2019202140056@whu.edu.cn
  • 基金资助:
    国家自然科学基金(42476179)

Class-incremental update method for target recognition models in sidescan sonar images

Yongcan YU1,2(), Jianhu ZHAO1,2(), Bingmo LI1,2, Ziyang HE1,2   

  1. 1.School of Geodesy and Geomatics, Wuhan University, Wuhan 430079, China
    2.Institute of Marine Science and Technology, Wuhan University, Wuhan 430079, China
  • Received:2025-09-04 Revised:2026-02-25 Online:2026-04-16 Published:2026-04-16
  • Contact: Jianhu ZHAO E-mail:2019202140056@whu.edu.cn;jhzhao@sgg.whu.edu.cn
  • About author:YU Yongcan (1998—), male, PhD candidate, majors in underwater target recognition and sonar image processing. E-mail: 2019202140056@whu.edu.cn
  • Supported by:
    The National Natural Science Foundation of China(42476179)

摘要:

侧扫声呐目标图像的类间相似性在识别模型类增量更新中易引发特征偏置,加剧灾难性遗忘。针对这一问题,本文以MEMO算法为基线,提出一种阶段渐进的双尺度动态注意力模块,由阶段内注意力与阶段间注意力组成:前者作用于阶段专用块特征,通过全局池化和通道重加权增强模型表示能力,缓解阶段内相似类别的混淆;后者作用于跨阶段串联特征,减少新类别主导的特征偏置。结合最近邻分类器,本文方法进一步强化了模型抗遗忘能力。在构建的SSS图像目标识别模型类增量更新框架中,本文方法取得了86.79%的平均识别准确率和80.94%的最后识别准确率,分别较基线提升10.88和11.43个百分点,性能优于主流类增量学习算法。在开源前视声呐数据上的拓展试验亦证明其泛化能力,平均识别准确率提升了2.65个百分点。本文方法仅引入1.41%的额外参数,并具备轻量级的更新开销与高效的推理速度。试验表明,本文方法能有效抑制类间相似性导致的特征干扰,提升模型在动态类别扩展中的稳定性,为侧扫声呐图像水下目标识别的持续学习提供了高效解决方案,对移动端部署和智能化、无人化水下测绘任务具有重要意义。

关键词: 水下目标识别, 模型类增量更新, 动态注意力, 类间相似性, 侧扫声呐图像

Abstract:

The inter-class similarity of side-scan sonar (SSS) target images can induce feature bias during class-incremental updates of recognition models, exacerbating catastrophic forgetting. To address this issue, this paper proposes a stage-progressive dual-scale dynamic attention module based on MEMO algorithm, which composes of intra-stage and inter-stage attention. The intra-stage attention employs global pooling and channel reweighting on stage-specific block features to enhance the model's representational capability and alleviate confusion among similar categories within the same stage. The inter-stage attention reweights the concatenated features based on the cross-stage information to mitigate feature bias dominated by new categories. Combined with a nearest-neighbor classifier, the overall approach further strengthens the model's anti-forgetting capability. Under the proposed class-incremental model update framework for SSS object recognition, our method achieves an average accuracy (Avg) of 86.79% and a last-stage accuracy of 80.94%, surpassing the baseline by 10.88 and 11.43 percentage, respectively, and outperforming mainstream class-incremental learning methods. Extended experiments on a large-scale open-source forward-looking sonar dataset further demonstrate the method's generalization ability, yielding an Avg improvement of 2.65 percentage. Our method introduces only 1.41% additional parameters while maintaining lightweight update overhead and efficient inference speed. Experimental results show that our method effectively suppresses feature interference caused by inter-class similarity, improving stability and robustness during dynamic category expansion. This provides an efficient solution for continual recognition of underwater sonar targets and holds significant value for mobile deployment and intelligent, unmanned underwater mapping tasks.

Key words: underwater target recognition, model class-incremental update, dynamic attention, inter-class similarity, side-scan sonar images

中图分类号: