Acta Geodaetica et Cartographica Sinica ›› 2025, Vol. 54 ›› Issue (1): 123-135.doi: 10.11947/j.AGCS.2025.20230439

• Photogrammetry and Remote Sensing • Previous Articles    

Remote sensing scene retrieval method based on scene graph

Jiayi TANG(), Xiaochong TONG(), Chunping QIU, Yaxian LEI, Yi LEI, Haoshuai SONG   

  1. Institute of Geospatial Information, Information Engineering University, Zhengzhou 450001, China
  • Received:2023-10-07 Revised:2024-12-17 Published:2025-02-17
  • Contact: Xiaochong TONG E-mail:tangjiayi113769@163.com;txchr@aliyun.com
  • About author:TANG Jiayi (2000—), female, PhD candidate, majors in geospatial intelligence. E-mail: tangjiayi113769@163.com
  • Supported by:
    The National Key Research and Development Program of China(2024YFF1400804);Program of Songshan Laboratory(221100211000-03);The Excellent Youth Foundation of Henan Municipal Natural Science Foundation(212300410096);The National Natural Science Foundation of China(42201513)

Abstract:

Currently, most remote sensing scene retrieval is based on deep feature similarity matching of remote sensing images, which makes it difficult to directly represent the relationship information between scene entities and lacks a way to directly express spatial structure and semantics. Therefore, it cannot meet the complex retrieval requirements of users for remote sensing scene. This paper proposes a remote sensing scene retrieval method based on scene graph, which uses a graph neural network to map the scene graph data corresponding to the remote sensing scene to graph level feature vectors. The matching results of the graph feature vectors are used to reverse the remote sensing scene retrieval results. To train the graph neural network, this paper has created a dataset of 2380 pairs of remote sensing scene graphs, including 24 types of entities, 8 types of topological spatial relationships, and 9 types of directional relationships that have a structured representation of spatial relationships in remote sensing scenes. The spatial topological and orientational information is complete. The experiment shows that the remote sensing scene retrieval results based on scene graphs have high retrieval accuracy in entity categories, topological relationships, and orientation relationships. Especially compared with several representative international remote sensing scene retrieval methods, the scene retrieval accuracy indicators in topological and orientation relationships obtained by this method have a great improvement.

Key words: scene graph, remote sensing scene retrieval, knowledge graph, graph neural networks

CLC Number: