Acta Geodaetica et Cartographica Sinica ›› 2024, Vol. 53 ›› Issue (7): 1384-1400.doi: 10.11947/j.AGCS.2024.20230455

• Photogrammetry and Remote Sensing • Previous Articles     Next Articles

Multi-scale entropy neural architecture search for object detection in remote sensing images

Jun YANG1,2(), Hengjing XIE1, Hongchao FAN3, Haowen YAN1   

  1. 1.Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou 730070, China
    2.School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou 730070, China
    3.Department of Civil and Environmental Engineering, Norwegian University of Science and Technology, Trondheim 7491, Norway
  • Received:2023-10-07 Published:2024-08-12
  • About author:YANG Jun (1973—), male, PhD, professor, PhD supervisor, majors in 3D model spatial analysis, intelligent interpretation of remotely sensed big data, deep learning. E-mail: yangj@mail.lzjtu.cn
  • Supported by:
    The National Natural Science Foundation of China(42261067)

Abstract:

Aiming at the traditional neural architecture search requires an enormous amount of time for supernet training, search efficiency is suboptimal, and the searched model can not efficiently solve the problem of multi-scale object detection difficulty and high background complexity in remote sensing images. This paper proposes a multi-scale entropy neural architecture search method for object detection in remote sensing images. At first, the feature separation convolution is added to the base block of the search space instead of the regular convolution in the residual block, which reduces the information redundancy in remote sensing images due to the high background complexity, and improves the detection performance of the network modeluner the complex background. Next, the maximum entropy principle is introduced to calculate the multi-scale entropy of each candidate network in the search space, and the multi-scale entropy is combined with the feature pyramid network to balance the detection of large, medium and small objects in remote sensing images. Finally, the network model with maximum multi-scale entropy is obtained by searching without parameter training using progressive evolutionary algorithm for the object detection task. The model ensures detection accuracy while improving the search efficiency. The proposed algorithm achieves a mean average precision of 93.1%, 75.5% and 73.6% on the RSOD, DIOR and DOTA datasets, respectively, with a network search time of 8.1 hours. The experimental results demonstrate that the proposed algorithm can significantly improve the search efficiency of the network, combine the features at different scales better and solve the problem of high image background complexity in the object detection task compared with the current benchmark methods.

Key words: remote sensing, neural architecture search, object detection, feature separation convolution, maximum entropy, multi-scale entropy, progressive evolution

CLC Number: