Acta Geodaetica et Cartographica Sinica ›› 2025, Vol. 54 ›› Issue (10): 1826-1840.doi: 10.11947/j.AGCS.2025.20250139

• Geodesy and Navigation • Previous Articles     Next Articles

A Stacking-SHAP ensemble method for landslide susceptibility prediction with high accuracy and interpretability

Xin HUANG1,2(), Jian YE1(), Chengbing LIU1, Qiuyu ZENG1, Wanxin GUO1,3, Zhikai GUO1   

  1. 1.Faculty of Geosciences and Engineering, Southwest Jiaotong University, Chengdu 611756, China
    2.Chongqing Institute of Surveying and Mapping, Ministry of Natural Resources, Chongqing 401120, China
    3.Powerchina Jiangxi Electric Power Engineering Co., Ltd., Nanchang 330096, China
  • Received:2025-03-31 Revised:2025-06-19 Online:2025-11-14 Published:2025-11-14
  • Contact: Jian YE E-mail:xinhuang@my.swjtu.edu.cn;yejian518@swjtu.edu.cn
  • About author:HUANG Xin (2001—), male, master, majors in landslide susceptibility modeling and analysis. E-mail: xinhuang@my.swjtu.edu.cn
  • Supported by:
    The National Natural Science Foundation of China(41771419)

Abstract:

Landslide susceptibility prediction and triggering factor analysis are crucial for developing scientifically effective landslide disaster prevention and control strategies. However, there is still a lack of landslide prediction models that can achieve both high prediction accuracy and interpretability. To address this, the present study proposes an interpretable enhanced ensemble learning method by constructing the Stacking-SHAP model, aimed at improving the accuracy of landslide susceptibility prediction and the reliability of triggering factor analysis. This model employs a Stacking ensemble framework, integrating various machine learning classifiers, such as XGBoost, CatBoost, LightGBM, logistic regression (LR), and random forest (RF), while ensuring prediction accuracy. Additionally, the shapley additive explanations (SHAP) algorithm is incorporated to enhance model interpretability. Experimental results demonstrate that the AUC value of the Stacking-SHAP model reaches 0.920, significantly outperforming individual classifier models, such as XGBoost (0.893), CatBoost (0.894), LightGBM (0.879), RF (0.859), and LR (0.794). More importantly, compared to SHAP-integrated single machine learning models, the Stacking-SHAP interpretable enhanced ensemble model shows superior overall performance in landslide triggering factor analysis, thereby improving the credibility of landslide causative factor analysis. Overall, this model combines high-accuracy prediction with highly reliable interpretation, offering an innovative approach to landslide susceptibility prediction and triggering factor analysis, with significant theoretical and practical value in landslide prevention and disaster reduction.

Key words: landslide susceptibility, geographical big data, Stacking algorithm, SHAP algorithm, landslide triggering factor analysis

CLC Number: