Acta Geodaetica et Cartographica Sinica ›› 2019, Vol. 48 ›› Issue (10): 1266-1274.doi: 10.11947/j.AGCS.2019.20180398

• Photogrammetry and Remote Sensing • Previous Articles     Next Articles

Aircraft detection in remote sensing imagery with multi-scale feature fusion convolutional neural networks

YAO Qunli1,2, HU Xian1,2, Lei Hong1   

  1. 1. Department of Space Microwave Remote Sensing Systems, Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China;
    2. School of Electronics, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2018-08-26 Revised:2019-05-05 Online:2019-10-20 Published:2019-10-24
  • Supported by:
    The National Natural Science Foundation of China (Nos. 61422113;61601437);The National Key Research and Development Program of China (No. 2017YFB0502700)

Abstract: Aircraft detection in remote sensing images (RSIs) is a meaningful task. There are many problems in current detection methods, such as low accuracy in complex background and dense aircraft area, especially for small-scale aircraft. To solve these problems, an end-to-end aircraft detection method named MultDet is proposed in this paper. Based on single shot multibox detector (SSD), a lightweight baseline Network is used to extract multi-scale features for its powerful ability in feature extraction. To obtain the feature maps with enriched representation power, then the multi-scale deconvolution feature fusion block is designed. We add the high-level features with rich semantic information to the low-level features via deconvolution fusion block. In order to locate aircraft of various scales more accurately, a series of aspect ratios of default boxes are set to better match aircraft shapes and combine predictions deduced from feature maps of different layers. The quantitative comparison analysis are carried out on the challenging UCAS-AOD data set. The experimental results demonstrate that the proposed method is accurate and robust for multi-scale aircraft detection, and achieves 94.8% AP(average precision) at the speed of 0.050 0 s/img with the input size 512×512 using a single Nvidia Titan Xp GPU.

Key words: remote sensing images, aircraft detection, feature fusion, multi-scale features

CLC Number: