Acta Geodaetica et Cartographica Sinica ›› 2026, Vol. 55 ›› Issue (1): 101-113.doi: 10.11947/j.AGCS.2026.20250303

• Photogrammetry and Remote Sensing • Previous Articles    

Water body segmentation network for SAR images combining dual-encoder and adaptive feature fuse

Bin HAN1(), Xin HUANG1, Fengyi LI1, Xiaozhen LU2()   

  1. 1.School of Communication and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
    2.School of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
  • Received:2025-07-28 Revised:2026-01-04 Published:2026-02-13
  • Contact: Xiaozhen LU E-mail:njupt.bh@foxmail.com;luxiaozhen@nuaa.edu.cn
  • About author:HAN Bin (1990—), male, PhD, associate professor, majors in remote sensing image processing. E-mail: njupt.bh@foxmail.com
  • Supported by:
    The National Natural Science Foundation of China(62201281; 62571240; U22B2062);The Natural Science Foundation of Jiangsu Province(BK20220392);The Natural Science Research Initiation Fund for Talents Introduced to Nanjing University of Posts and Telecommunications(NY222004)

Abstract:

Synthetic aperture radar (SAR) image water body segmentation is widely applied in critical fields such as disaster response, water resource management, and environmental monitoring, holding significant practical importance. To address the issue of low segmentation accuracy for water bodies in complex-background SAR images, a dual-encoder adaptive feature fusion network (DEAFFNet) is proposed to achieve accurate water body segmentation. First, the model employs a lightweight residual network and Swin Transformer to construct a dual-encoder architecture, collaboratively extracting local detail information and global contextual information to mitigate the problem of insufficient information representation capability in complex backgrounds. Second, a feature fusion module based on cross-attention and adaptive weight learning is designed. This module utilizes cross-attention for interaction between local and global information and achieves hybrid feature fusion through adaptive weight learning, thereby enhancing the model's perception of water body structures. Then, a multi-scale convolutional pooling module is integrated into the decoder to reinforce multi-scale contextual information, combined with a lightweight content-aware upsampling method to alleviate feature distortion caused by upsampling. Finally, a composite loss function combining focal loss and active contour loss is adopted to strengthen the model's constraints on sample balance and water body boundary information. Water body segmentation experiments conducted on the ALOS PALSAR and Sen-1SAR datasets demonstrate that DEAFFNet outperforms existing methods across multiple evaluation metrics, achieving more accurate water body segmentation.

Key words: SAR image water body segmentation, dual-encoder, cross attention, adaptive weight learning, composite loss

CLC Number: