Against Interpretability: a Critical Examination of the Interpretability Problem in Machine Learning

Author
Krishnan M. [1]
Affiliation
[1] All Souls College, High Street, Oxford, OX1 4AL, Oxfordshire
Keywords
Algorithms; Black box; Explicability; Interpretability; Machine learning
DOI
10.1007/s13347-019-00372-9
Abstract
The usefulness of machine learning algorithms has led to their widespread adoption prior to the development of a conceptual framework for making sense of them. One common response to this situation is to say that machine learning suffers from a “black box problem.” That is, machine learning algorithms are “opaque” to human users, failing to be “interpretable” or “explicable” in terms that would render categorization procedures “understandable.” The purpose of this paper is to challenge the widespread agreement about the existence and importance of a black box problem. The first section argues that “interpretability” and cognates lack precise meanings when applied to algorithms. This makes the concepts difficult to use when trying to solve the problems that have motivated the call for interpretability (etc.). Furthermore, since there is no adequate account of the concepts themselves, it is not possible to assess whether particular technical features supply formal definitions of those concepts. The second section argues that there are ways of being a responsible user of these algorithms that do not require interpretability (etc.). In many cases in which a black box problem is cited, interpretability is a means to a further end such as justification or non-discrimination. Since addressing these problems need not involve something that looks like an “interpretation” (etc.) of an algorithm, the focus on interpretability artificially constrains the solution space by characterizing one possible solution as the problem itself. Where possible, discussion should be reformulated in terms of the ends of interpretability. © 2019, The Author(s).
Pages: 487-502
Page count: 15