Acta Geodaetica et Cartographica Sinica ›› 2024, Vol. 53 ›› Issue (10): 1967-1980.doi: 10.11947/j.AGCS.2024.20240053.

• Remote Sensing Large Model • Previous Articles    

Research progress and trend of intelligent remote sensing large model

Qin YAN1,2,(), Haiyan GU1,2, Yi YANG1,2(), Haitao LI1,2, Hengtong SHEN1,2, Shiqi LIU1,2   

  1. 1.Chinese Academy of Surveying and Mapping, Beijing 100830, China
    2.Key Laboratory of Geospatial Technology for the Surveying and Mapping Sciences of the Ministry of Natural Resources, Beijing 100830, China
  • Received:2024-01-31 Published:2024-11-26
  • Contact: Yi YANG E-mail:yanqin@casm.ac.cn;yangyi@casm.ac.cn
  • About author:YAN Qin (1968—), female, PhD, researcher, majors in natural resource surveying and monitoring, territorial spatial planning and land use control, and aerospace remote sensing mapping. E-mail: yanqin@casm.ac.cn
  • Supported by:
    The National Key Research and Development Program of China(2023YFB3907600);Basic Research Funds for Central Public Walfare Research Institute(AR2420)

Abstract:

AI large models, with their advantages in generalization, universality, and high accuracy, have become the cornerstone of various AI applications such as computer vision, natural language processing. Based on the analysis of the development process, value, and challenges of AI large models, this article first discusses the research progress of remote sensing large models from three perspectives: data, model, and downstream tasks. At the data level, there is a transition from single modality to multi-modality; at the model level, there is a shift from small models to large models; and at the downstream task level, there is a development from single-task to multi-task. Next, the article explores three key development directions for remote sensing large models: multi-modal remote sensing large models, interpretable remote sensing large models, and reinforcement learning from human feedback(RLHF). Furthermore, it realizes a construction approach for remote sensing large models, namely “construction of unlabeled dataset-self-supervised model learning-downstream transfer application”. Technical experiments have been conducted to validate the significant advantages of remote sensing large models. Finally, the article concludes and provides prospects, emphasizing the need to focus on application tasks and combine theoretical methods, engineering technology, and iterative applications to achieve low-cost training, efficient and fast inference, lightweight deployment, and engineering-based applications for remote sensing large models.

Key words: remote sensing large models, artificial intelligence, multi-modal, interpretable, reinforcement learning from human feedback, self-supervised learning

CLC Number: