Acta Geodaetica et Cartographica Sinica ›› 2021, Vol. 50 ›› Issue (5): 652-663.doi: 10.11947/j.AGCS.2021.20200190

• Photogrammetry and Remote Sensing • Previous Articles     Next Articles

Scene classification of high-resolution remote sensing imagery based on deep transfer deformable convolutional neural networks

SHI Huihui1,2, XU Yannan1,2, TENG Wenxiu3, WANG Ni4,5   

  1. 1. Co-Innovation Center for the Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, China;
    2. College of Forest, Nanjing Forestry University, Nanjing 210037, China;
    3. Department of Geosciences, University of Massachusetts, Massachusetts 01003, USA;
    4. Anhui Engineering Laboratory of Geographical Information Intelligent Sensor and Service, Chuzhou 239000, China;
    5. School of Geographic Information and Tourism, Chuzhou University, Chuzhou 239000, China
  • Received:2020-05-18 Revised:2021-02-02 Published:2021-06-03
  • Supported by:
    The National Key Research and Development Program of China (No. 2019YFD1100404)

Abstract: In recent years, scene classification of high-resolution remote sensing images based on deep convolutional neural networks has become the focus of attention. Because of the existing deep convolution neural network is not robust to the geometric deformation of remote sensing scene image, we proposed a novel scene classification method for high-resolution remote sensing image, based on the deep transfer deformable convolutional neural networks (DTDCNN). Specifically, the depth features of remote sensing image are extracted by using the trained depth model on the large-scale natural scene dataset (ImageNet), then, the deformable convolution layer is introduced to learn the depth features which are robust to the geometric deformation of remote sensing scene.The results show that:the accuracy of DTDCNN on AID, UC-Merced and NWPU-RESISC45 datasets is improved by 4.25%, 1.9% and 4.83% after adding the deformable convolution, respectively. By the adaptive adjustment of the receptive field for different objects in the scene, DTDCNN enhances the ability of spatial sampling position, and, effectively improves the accuracy of remote sensing scene classification.

Key words: remote sensing, scene classification, convolutional neural networks, deformable convolutional, transfer learning

CLC Number: