Acta Geodaetica et Cartographica Sinica ›› 2024, Vol. 53 ›› Issue (4): 677-688.doi: 10.11947/j.AGCS.2024.20230125

• Photogrammetry and Remote Sensing • Previous Articles     Next Articles

Knowledge graph-guided deep network for high-resolution remote sensing image scene classification

Yansheng LI(), Minlang WU, Yongjun ZHANG   

  1. School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China
  • Received:2023-04-23 Revised:2024-01-15 Published:2024-05-13
  • About author:LI Yansheng (1987—), male, PhD, professor, majors in remote sensing spatio-temporal knowledge graph, multi-modal remote sensing foundation model, and intelligent interpretation of remote sensing big data. E-mail: yansheng.li@whu.edu.cn
  • Supported by:
    The National Natural Science Foundation of China(42030102)

Abstract:

Thanks to the rapid development of deep network theory and methods, deep networks have gradually become the mainstream technology for remote sensing image scene classification tasks. However, existing deep network-based remote sensing image scene classification methods are highly dependent on a large number of manually labeled training samples and cannot effectively integrate and utilize the rich prior knowledge in the remote sensing field. In order to improve the utilization of domain knowledge while reducing the dependence on labeled samples, this paper proposes a knowledge graph-guided deep network learning method for high-resolution remote sensing image scene classification. First, this paper constructs a land cover concept knowledge graph that includes various sources of knowledge in the field to more flexibly and conveniently apply domain prior knowledge. Furthermore, through the knowledge graph representation learning method, the semantic categories of remote sensing scenes in the land cover concept knowledge graph are expressed as semantic vectors to form a semantic benchmark for remote sensing scene categories. In the knowledge-guided learning stage, the cross-modal alignment constraint between the scene category semantic vector and the shallow visual feature vector of the deep network is applied to guide the shallow part of the deep network to more effectively learn shared features of different categories of remote sensing image scenes, while in the deep part of the deep network, it is still guided by scene category labels to learn discriminative features of different remote sensing scenes. In the testing stage, the optimized deep network model can complete high-precision remote sensing image scene classification without relying on any prior knowledge. The experimental results on the currently largest publicly available remote sensing image scene classification dataset show that the proposed knowledge-guided learning method can obtain optimal classification performance at different training sample ratios such as 10%, 30%, and 50% compared with existing methods. Under the condition of 10% sample ratio, our proposed method can achieve an improvement of 5.11% in overall accuracy (OA) compared with baseline deep networks.

Key words: remote sensing image scene classification, land cover concept knowledge graph, knowledge graph representation learning, knowledge graph-guided deep network

CLC Number: