Interpretability Analysis of Data Augmented Convolutional Neural Network in Mineral Prospectivity Mapping Using Black-Box Visualization Tools

被引:0
|
作者
Liu, Yue [1 ,2 ]
Sun, Tao [1 ,2 ]
Wu, Kaixing [1 ,2 ]
Xiang, Wenyuan [3 ]
Zhang, Jingwei [2 ]
Zhang, Hongwei [2 ]
Feng, Mei [2 ]
机构
[1] Jiangxi Prov Key Lab Low Carbon Proc & Utilizat St, Ganzhou 341000, Peoples R China
[2] Jiangxi Univ Sci & Technol, Sch Resources & Environm Engn, Ganzhou 341000, Peoples R China
[3] China Chem Geol & Mine Bur Hunan Geol Explorat Ins, Changsha 410000, Peoples R China
基金
中国国家自然科学基金;
关键词
Interpretability; Convolutional neural network; Mineral prospectivity mapping; SMOTE; Black-box visualization; RANDOM FOREST; DISTRICT; PREDICTION; SELECTION; PROVINCE; MODELS; PLOT; AI;
D O I
10.1007/s11053-025-10462-5
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Machine learning is becoming a popular and appealing tool in mineral prospectivity mapping (MPM); however, it has always been challenged by some essential limitations, such as scarcity of training samples, overfitting, and uncertainties. Data augmentation has been proven to be effective in addressing these issues and improving the performance of artificial intelligence models, but its mechanism regarding how augmented data influences predictive modeling processes, improves model performance, and alleviates overfitting has yet to be elucidated due to the black-box nature of machine learning modeling. In this study, the synthetic minority oversampling technique (SMOTE), proven to perform best among five commonly used data augmentation methods, was selected and utilized to enhance the training data and improve model performance. The results indicate that the convolutional neural network (CNN) model trained by rational-feature ordering and SMOTE-augmented data achieved better performance, with higher test accuracy (0.9306), recall (0.9167), F1-score (0.9296), and alleviated overfitting (0.0215), compared with the model trained on original data. A set of black-box visualization tools, including filter weight visualization, individual conditional expectation (ICE) plots, derivative ICE (d-ICE) plots, partial dependence plots (PDPs), and Shapley additive explanations (SHAP), were employed to explore the beneficial mechanism of SMOTE when applied to enhance the predictive capabilities of CNN in MPM. The visualization of the weight filters reveals that the optimal model activates favorable excitations of W anomalies, Mn anomalies and proximity to Yanshanian intrusions, which are associated with tungsten mineralization, thus optimizing feature extraction, refining convolutional operation, and improving model performance. The ICE and d-ICE analyses reveal that the SMOTE-augmented model exhibites a more consistent decision trend in key ore-associated features and reduces variability in derivative estimates, particularly beyond decision thresholds, leading to stabler predictions. The PDP results show that SMOTE-augmented data increase the decision boundary difference between positive and negative samples, suggesting a broader decision width that favored more accurate classification. The SHAP analyses indicate that the SMOTE-augmented data boost the recognition ability of the CNN model by clearly separating feature values of key ore-associated factors with contrasting SHAP values and help the model make more convergent decision paths, especially for samples with top probabilities. Our findings provide a straightforward view for explaining how a superior algorithm can benefit model predictions through black-box modeling processes, and contribute to understanding the decision-making mechanism of machine learning in MPM.
引用
收藏
页码:759 / 783
页数:25
相关论文
共 50 条
  • [21] Geological symbol recognition on geological map using convolutional recurrent neural network with augmented data
    Qiu, Qinjun
    Tan, Yongjian
    Ma, Kai
    Tian, Miao
    Xie, Zhong
    Tao, Liufeng
    ORE GEOLOGY REVIEWS, 2023, 153
  • [22] Using sensitivity analysis and visualization techniques to open black box data mining models
    Cortez, Paulo
    Embrechts, Mark J.
    INFORMATION SCIENCES, 2013, 225 : 1 - 17
  • [23] Detection of COVID-19 in X-ray Images Using Densely Connected Squeeze Convolutional Neural Network (DCSCNN): Focusing on Interpretability and Explainability of the Black Box Model
    Ali, Sikandar
    Hussain, Ali
    Bhattacharjee, Subrata
    Athar, Ali
    Abdullah, Abdullah
    Kim, Hee-Cheol
    SENSORS, 2022, 22 (24)
  • [24] A convolutional neural network approach to classifying urban spaces using generative tools for data augmentation
    Medel-Vera, Carlos
    Vidal-Estevez, Pelayo
    Madler, Thomas
    INTERNATIONAL JOURNAL OF ARCHITECTURAL COMPUTING, 2024, 22 (03) : 392 - 411
  • [25] SIS-CAM: An Interpretability Analysis Method for the Security of Convolutional Neural Network Models Based on Image Big Data
    Xu, Fang
    Zhang, Yuquan
    Ma, Yi
    Zhang, Yan
    Khan, Umer Sadiq
    Li, Zhimin
    Liu, Zhen
    Yang, Na
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2025, : 871 - 886
  • [26] RETRACTED: Adoption of Convolutional Neural Network Algorithm Combined with Augmented Reality in Building Data Visualization and Intelligent Detection (Retracted Article)
    Wei, Minghui
    Tang, Jingjing
    Tang, Haotian
    Zhao, Rui
    Gai, Xiaohui
    Lin, Renying
    COMPLEXITY, 2021, 2021
  • [27] Lithological Mapping Using a Convolutional Neural Network based on Stream Sediment Geochemical Survey Data
    Wang, Xueping
    Zuo, Renguang
    Wang, Ziye
    NATURAL RESOURCES RESEARCH, 2022, 31 (05) : 2397 - 2412
  • [28] Deep Convolutional Neural Network for Flood Extent Mapping Using Unmanned Aerial Vehicles Data
    Gebrehiwot, Asmamaw
    Hashemi-Beni, Leila
    Thompson, Gary
    Kordjamshidi, Parisa
    Langan, Thomas E.
    SENSORS, 2019, 19 (07)
  • [29] Geological Mapping Using Direct Sampling and a Convolutional Neural Network Based on Geochemical Survey Data
    Ziye Wang
    Renguang Zuo
    Fanfan Yang
    Mathematical Geosciences, 2023, 55 : 1035 - 1058
  • [30] Seq2Image: Sequence Analysis using Visualization and Deep Convolutional Neural Network
    Tavakoli, Neda
    2020 IEEE 44TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2020), 2020, : 1332 - 1337