Why did AI get this one wrong? - Tree-based explanations of machine learning model predictions

Cited by: 24
Authors
Parimbelli, Enea [1, 3, 5]
Buonocore, Tommaso Mario [1]
Nicora, Giovanna [1, 2]
Michalowski, Wojtek [3]
Wilk, Szymon [4]
Bellazzi, Riccardo [1]
Affiliations
[1] Univ Pavia, Dept Elect Comp & Biomed Engn, Pavia, Italy
[2] enGenome srl, Pavia, Italy
[3] Univ Ottawa, Telfer Sch Management, Ottawa, ON, Canada
[4] Poznan Univ Tech, Inst Comp Sci, Div Intelligent Decis Support Syst, Poznan, Poland
[5] Dept Elect Comp & Biomed Engn, Via Ferrata 5, I-27100 Pavia, Italy
Funding
European Union Horizon 2020;
Keywords
XAI; Black-box; Explanation; Local explanation; Interpretable; Explainable; Fidelity; Reliability; Post-hoc; Model agnostic; Surrogate model; Dataset shift;
DOI
10.1016/j.artmed.2022.102471
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Increasingly complex learning methods such as boosting, bagging and deep learning have made ML models more accurate but harder to interpret and explain, culminating in black-box machine learning models. Model developers and users alike are often presented with a trade-off between performance and intelligibility, especially in high-stakes applications like medicine. In the present article we propose a novel methodological approach for generating explanations for the predictions of a generic machine learning model, given a specific instance for which the prediction has been made. The method, named AraucanaXAI, is based on surrogate, locally-fitted classification and regression trees that are used to provide post-hoc explanations of the prediction of a generic machine learning model. Advantages of the proposed XAI approach include superior fidelity to the original model, the ability to deal with non-linear decision boundaries, and native support for both classification and regression problems. We provide a packaged, open-source implementation of the AraucanaXAI method and evaluate its behaviour in a number of different settings that are commonly encountered in medical applications of AI. These include potential disagreement between the model prediction and the physician's expert opinion, and low reliability of the prediction due to data scarcity.
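The general idea of a locally fitted surrogate tree described in the abstract can be illustrated with a minimal Python sketch. This is not the AraucanaXAI package or its API: the function name `explain_locally`, the neighbourhood size, the tree depth, the random-forest black box and the synthetic data are all assumptions chosen for illustration, and scikit-learn is assumed to be installed. The sketch selects the training points closest to one instance, relabels them with the black-box predictions, fits a shallow decision tree on that neighbourhood, and reports how often the tree agrees with the black box there (local fidelity).

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.neighbors import NearestNeighbors
from sklearn.tree import DecisionTreeClassifier, export_text


def explain_locally(black_box, X_train, instance, n_neighbors=100, max_depth=3):
    """Fit a shallow tree that mimics `black_box` around `instance`.

    Hypothetical helper for illustration only; not the authors' implementation.
    """
    # 1. Select the training points closest to the instance being explained.
    nn = NearestNeighbors(n_neighbors=n_neighbors).fit(X_train)
    _, idx = nn.kneighbors(instance.reshape(1, -1))
    X_local = np.vstack([X_train[idx[0]], instance])

    # 2. Label the neighbourhood with the black-box predictions, not the true labels,
    #    because the surrogate should explain the model rather than the data.
    y_local = black_box.predict(X_local)

    # 3. Fit an interpretable surrogate tree on the relabelled neighbourhood.
    surrogate = DecisionTreeClassifier(max_depth=max_depth, random_state=0)
    surrogate.fit(X_local, y_local)

    # 4. Local fidelity: fraction of neighbourhood points where both models agree.
    fidelity = float(np.mean(surrogate.predict(X_local) == y_local))
    return surrogate, fidelity


# Synthetic data and an arbitrary black-box model (both assumptions).
X, y = make_classification(n_samples=500, n_features=6, n_informative=4, random_state=0)
black_box = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

surrogate, fidelity = explain_locally(black_box, X, X[0])
print(export_text(surrogate, feature_names=[f"x{i}" for i in range(6)]))
print(f"local fidelity: {fidelity:.2f}")
```

The printed tree rules serve as the explanation for the chosen instance. For a regression problem, a `DecisionTreeRegressor` with an error-based fidelity measure would play the same role.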
Pages: 9