Interpretable and explainable machine learning: A methods-centric overview with concrete examples

Cited by: 47
Authors
Marcinkevics, Ricards [1]
Vogt, Julia E. [1]
Affiliations
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
Keywords
explainability; interpretability; machine learning; neural networks; FALSE DISCOVERY RATE; BLACK-BOX; CLASSIFICATION; EXPLANATIONS; REGRESSION
DOI
10.1002/widm.1493
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Interpretability and explainability are crucial for machine learning (ML) and statistical applications in medicine, economics, law, and natural sciences and form an essential principle for ML model design and development. Although interpretability and explainability have escaped a precise and universal definition, many models and techniques motivated by these properties have been developed over the last 30 years, with the focus currently shifting toward deep learning. We will consider concrete examples of state-of-the-art, including specially tailored rule-based, sparse, and additive classification models, interpretable representation learning, and methods for explaining black-box models post hoc. The discussion will emphasize the need for and relevance of interpretability and explainability, the divide between them, and the inductive biases behind the presented "zoo" of interpretable models and explanation methods.
This article is categorized under: Fundamental Concepts of Data and Knowledge > Explainable AI; Technologies > Machine Learning; Commercial, Legal, and Ethical Issues > Social Considerations
Pages: 32
Related papers
50 in total
  • [1] Explainable and interpretable machine learning and data mining
    Atzmueller, Martin
    Fuernkranz, Johannes
    Kliegr, Tomas
    Schmid, Ute
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (05) : 2571 - 2595
  • [2] An overview of AI and Machine Learning methods: motivations, concepts, and examples
    Assouline, D.
    Le Pogam, M-A
    Pittet, V.
    EUROPEAN JOURNAL OF PUBLIC HEALTH, 2021, 31
  • [3] Editorial: Interpretable and explainable machine learning models in oncology
    Hrinivich, William Thomas
    Wang, Tonghe
    Wang, Chunhao
    FRONTIERS IN ONCOLOGY, 2023, 13
  • [4] Interpretable and Explainable Machine Learning for Ultrasonic Defect Sizing
    Pyle, Richard J.
    Hughes, Robert R.
    Wilcox, Paul D.
    IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2023, 70 (04) : 277 - 290
  • [5] The coming of age of interpretable and explainable machine learning models
    Lisboa, P. J. G.
    Saralajew, S.
    Vellido, A.
    Fernandez-Domenech, R.
    Villmann, T.
    NEUROCOMPUTING, 2023, 535 : 25 - 39
  • [6] Interpretable and Explainable Machine Learning for Materials Science and Chemistry
    Oviedo, Felipe
    Ferres, Juan Lavista
    Buonassisi, Tonio
    Butler, Keith T.
    ACCOUNTS OF MATERIALS RESEARCH, 2022, 3 (06) : 597 - 607
  • [7] Explainable and Interpretable Machine Learning for Antimicrobial Stewardship: Opportunities and Challenges
    Giacobbe, Daniele Roberto
    Marelli, Cristina
    Guastavino, Sabrina
    Mora, Sara
    Rosso, Nicola
    Signori, Alessio
    Campi, Cristina
    Giacomini, Mauro
    Bassetti, Matteo
    CLINICAL THERAPEUTICS, 2024, 46 (06) : 474 - 480
  • [8] A spectrum of explainable and interpretable machine learning approaches for genomic studies
    Conard, Ashley Mae
    DenAdel, Alan
    Crawford, Lorin
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2023, 15 (05)
  • [9] A Survey of Interpretable Machine Learning Methods
    Wang, Yan
    Tuerhong, Gulanbaier
    2022 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY, HUMAN-COMPUTER INTERACTION AND ARTIFICIAL INTELLIGENCE, VRHCIAI, 2022 : 232 - 237
  • [10] Explainable artificial intelligence and interpretable machine learning for agricultural data analysis
    Ryo, Masahiro
    ARTIFICIAL INTELLIGENCE IN AGRICULTURE, 2022, 6 : 257 - 265