Acta Geodaetica et Cartographica Sinica ›› 2025, Vol. 54 ›› Issue (8): 1476-1488.doi: 10.11947/j.AGCS.2025.20240457

• Photogrammetry and Remote Sensing • Previous Articles     Next Articles

Spatio-temporal fusion algorithm based on adaptive reference feature incorporation and multi-scale feature aggregation

Shuai FANG1,2(), Jiaen LIU1, Jing ZHANG1   

  1. 1.School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230000, China
    2.Anhui Province Key Laboratory of Industry Safety and Emergency Technology, Hefei 230000, China
  • Received:2024-11-11 Revised:2025-06-19 Online:2025-09-16 Published:2025-09-16
  • About author:FANG Shuai (1978—), female, PhD, professor, majors in image restoration and visual inspection. E-mail: fangshuai@hfut.edu.cn
  • Supported by:
    The National Natural Science Foundation of China(61175033)

Abstract:

The purpose of spatio-temporal fusion algorithm is to generate dense time series images with high spatial resolution, which is very important for monitoring fine dynamic-changes of the surface. Most of the spatio-temporal fusion algorithms help to predict the target fine image by reference fine image in the adjacent time, which makes the purpose of spatio-temporal fusion algorithm is to generate dense time series images with high spatial resolution, which is very important for monitoring fine dynamic-changes of the surface. However, the existing spatio-temporal fusion algorithms are easily misled by the reference image in the area with land cover change, and the reconstruction of heterogeneous areas composed of small targets is more difficult. To this end, this paper proposes a spatio-temporal fusion algorithm based on adaptive reference feature incorporation and multi-scale feature aggregation. In the encoding stage, an adaptive reference feature incorporation module is designed. According to the change information provided by the time series coarse image pair and the gating structure, the adaptive introduction of the reference fine image is realized, which not only improves the prediction accuracy by using the reference information, but also suppresses the misleading of the reference information to the change area. In the decoder stage, a multi-scale feature aggregation strategy is designed to aggregate information of different scales for each layer of the decoder, and the channel attention mechanism is combined to filter information with important features to improve the reconstruction accuracy of heterogeneous areas. Finally, the focal frequency loss term is introduced into the loss function. From the perspective of frequency distribution, it enhances the authenticity of the generated image and focuses on the reconstruction of difficult frequency bands to make up for the deficiency of spatial spectrum loss. The experimental results on LGC, CIA and Wuhan datasets show that the proposed algorithm has better fusion results than the other six algorithms.

Key words: spatio-temporal fusion, deep learning, adaptive reference feature incorporation, multi-scale feature aggregation, focal frequency loss

CLC Number: