Acta Geodaetica et Cartographica Sinica ›› 2024, Vol. 53 ›› Issue (10): 1955-1966.doi: 10.11947/j.AGCS.2024.20240068.

• Remote Sensing Large Model • Previous Articles    

Large models enabling intelligent photogrammetry: status, challenges and prospects

Mi WANG1,(), Xu CHENG1(), Jun PAN1, Yingdong PI1, Jing XIAO2   

  1. 1.State Key Laboratory of Surveying, Mapping and Remote Sensing Information Engineering, Wuhan University, Wuhan 430079, China
    2.School of Computer Science, Wuhan University, Wuhan 430072, China
  • Received:2024-02-18 Published:2024-11-26
  • Contact: Xu CHENG E-mail:wangmi@whu.edu.cn;xucheng@whu.edu.cn
  • About author:WANG Mi (1974—), male, PhD, professor, PhD supervisor, majors in high precision satellite remote sensing technology. E-mail: wangmi@whu.edu.cn
  • Supported by:
    The National Key Research and Development Program of China(2022YFB3902804);The National Natural Science Foundation of China(62425102)

Abstract:

Developed from deep learning and transfer learning techniques, large models leverage vast training datasets and immense parameter capacities to create scale effects, thus inspiring the model's emergent capabilities and demonstrating strong generalization and adaptability in numerous downstream tasks. Large models, represented by ChatGPT and SAM, signify the arrival of the era of general artificial intelligence, providing new theories and techniques for the automation and intelligence of Earth's spatial information processing. To further explore the methods and pathways for large models to empower the field of photogrammetry, this paper reviews the basic problems and mission tasks in the field of photogrammetry, summarizes the research achievements of deep learning methods in intelligent photogrammetric processing, analyzes the advantages and limitations of supervised pre-training methods aimed at specific tasks; Besides, we elaborates on the characteristics and research progress of general artificial intelligence large models, focusing on the generalizability of large models in basic visual tasks and the potential in three-dimensional representation; Finally, this paper explores the current challenges and future trends of large model technologies in the field of photogrammetry, from the perspectives of training data, model fine-tuning strategies, and heterogeneous multi-modal data fusion strategies.

Key words: large models, intelligent photogrammetry, deep learning, multi-modal

CLC Number: