Acta Geodaetica et Cartographica Sinica ›› 2023, Vol. 52 ›› Issue (12): 2103-2114.doi: 10.11947/j.AGCS.2023.20220567

• Photogrammetry and Remote Sensing • Previous Articles     Next Articles

Deep learning method for large scene DSM generation of GF-7 imagery

HE Sheng1, ZHANG Jiankai2, CHEN Feng2, LI Shenhong1, JIANG Wanshou1   

  1. 1. State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China;
    2. Beijing Geoway Spatial Information Co., Ltd., Beijing 100040, China
  • Received:2022-10-09 Revised:2023-06-06 Published:2024-01-03
  • Supported by:
    The High-Resolution Remote Sensing Application Demonstration System for Urban Fine Management (the second stage of the project) (No. 06-Y30F04-9001-20/22)

Abstract: Stereo matching is an important step to generate DSM from satellite imageries. Recently, studies have shown that deep learning-based methods have better performance. However, due to the fixed and limited disparity range predicted by models and the lack of training data, deep learning is rarely directly applied to the stereo matching of satellite images in large scenes. In this paper, a hierarchical dynamic matching strategy is proposed to dynamically determine the region of image blocks of the current level according to the matching results of the previous level, so that the disparity between the left and right epipolar image blocks is relatively small, which is conducive to the prediction of deep learning models. Besides, a scheme for the production of samples is introduced, and a GF-7 dataset is constructed by using manually edited DSM or LiDAR point clouds to obtain ground truth disparity values. In the experiment, this dataset, together with an existing dataset, is used to train Stereo-Net and DSM-Net, and based on the hierarchical matching strategy, the generation of high-quality DSM from Gaofen-7 imagery combined with deep learning technology is achieved for the first time. Experiments in imageries from three cities show that the average endpoint error is about 1 pixel, and the fraction of erroneous pixels is less than 3.8%. The quality of the generated DSMs is better than that of the traditional method.

Key words: satellite imagery, stereo matching, DSM, deep learning, GF-7

CLC Number: