Advancing interpretability of machine-learning prediction models

被引:1
|
作者
Trenary, Laurie [1 ,2 ]
DelSole, Timothy [1 ,2 ]
机构
[1] George Mason Univ, Dept Atmospher Ocean & Earth Sci, Fairfax, VA 22030 USA
[2] George Mason Univ, Ctr Ocean Land Atmosphere Studies, Fairfax, VA 22030 USA
来源
基金
美国国家科学基金会;
关键词
machine learning; model interpretation; subseasonal prediction;
D O I
10.1017/eds.2022.13
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper proposes an approach to diagnosing the skill of a machine-learning prediction model based on finding combinations of variables that minimize the normalized mean square error of the predictions. This technique is attractive because it compresses the positive skill of a forecast model into the smallest number of components. The resulting components can then be analyzed much like principal components, including the construction of regression maps for investigating sources of skill. The technique is illustrated with a machine-learning model of week 3-4 predictions of western US wintertime surface temperatures. The technique reveals at least two patterns of large-scale temperature variations that are skillfully predicted. The predictability of these patterns is generally consistent between climate model simulations and observations. The predictability is determined largely by sea surface temperature variations in the Pacific, particularly the region associated with the El Nino-Southern Oscillation. This result is not surprising, but the fact that it emerges naturally from the technique demonstrates that the technique can be helpful in "explaining" the source of predictability in machine-learning models. Impact Statement Machine learning has emerged as a powerful tool for climate prediction, but the resulting models often are too complex to interpret. Methods for extracting meaningful knowledge from machine-learning models have been developed (e.g., explainable AI), but most of these methods apply only to low-dimensional outputs. In contrast, many climate applications require predicting spatial fields. This paper proposes an approach to reducing the dimension of the output by finding components with the most skill. This technique is illustrated by training separate machine-learning models at hundreds of spatial locations, and then using this technique to show that only a few patterns are predicted with significant skill. Individual patterns can then be analyzed using regression techniques to diagnose the source of the skill.
引用
下载
收藏
页数:10
相关论文
共 50 条
  • [1] The Importance of Interpretability and Validations of Machine-Learning Models
    Yamasawa, Daisuke
    Ozawa, Hideki
    Goto, Shinichi
    CIRCULATION JOURNAL, 2024, 88 (01) : 157 - 158
  • [2] Interpretability of machine learning-based prediction models in healthcare
    Stiglic, Gregor
    Kocbek, Primoz
    Fijacko, Nino
    Zitnik, Marinka
    Verbert, Katrien
    Cilar, Leona
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 10 (05)
  • [3] An investigation on machine-learning models for the prediction of cyanobacteria growth
    Giere, Johannes
    Riley, Derek
    Nowling, R. J.
    McComack, Joshua
    Sander, Hedda
    FUNDAMENTAL AND APPLIED LIMNOLOGY, 2020, 194 (02) : 85 - 94
  • [4] Machine-learning models for prediction of sepsis patients mortality
    Bao, C.
    Deng, F.
    Zhao, S.
    MEDICINA INTENSIVA, 2023, 47 (06) : 315 - 325
  • [5] INTERPRETABILITY OF MACHINE LEARNING MODELS: APPLICATION FOR LAWSUITS PREDICTION IN THE ENERGY SECTOR
    Cavalcante, Andre Borges
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 27TH EDITION, 2020, : 17 - 17
  • [6] Evaluating molecular representations in machine learning models for drug response prediction and interpretability
    Baptista, Delora
    Correia, Joao
    Pereira, Bruno
    Rocha, Miguel
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2022, 19 (03)
  • [7] Certified Machine-Learning Models
    Damiani, Ernesto
    Ardagna, Claudio A.
    SOFSEM 2020: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2020, 12011 : 3 - 15
  • [8] Hit Dexter 2.0: Machine-Learning Models for the Prediction of Frequent Hitters
    Stork, Conrad
    Chen, Ya
    Sicho, Martin
    Kirchmair, Johannes
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (03) : 1030 - 1043
  • [9] Machine-learning models for prediction of sepsis patients mortality: A needed consideration
    Fernandez, Marcos Valiente
    Moya, Francisco de Paula Delgado
    de Aledo, Amanda Lesmes Gonzalez
    Badia, Isaias Martin
    MEDICINA INTENSIVA, 2023, 47 (07) : 416 - 417
  • [10] Ensemble Machine-Learning Models for Accurate Prediction of Solar Irradiation in Bangladesh
    Alam, Md Shafiul
    Al-Ismail, Fahad Saleh
    Hossain, Md Sarowar
    Rahman, Syed Masiur
    PROCESSES, 2023, 11 (03)