Exploring accuracy and interpretability trade-off in tabular learning with novel attention-based models

被引:0
|
作者
Kodjo Mawuena Amekoe [1 ]
Hanane Azzag [3 ]
Zaineb Chelly Dagdia [1 ]
Mustapha Lebbah [2 ]
Gregoire Jaffre [2 ]
机构
[1] Université Sorbonne Paris Nord,
[2] LIPN CNRS UMR,undefined
[3] Université Paris-Saclay,undefined
[4] DAVID Lab,undefined
[5] UVSQ,undefined
[6] Groupe BPCE,undefined
关键词
Tabular data; Interpretability; Attention; Robust explanation;
D O I
10.1007/s00521-024-10163-9
中图分类号
学科分类号
摘要
Apart from high accuracy, what interests many researchers and practitioners in real-life tabular learning problems (e.g., fraud detection and credit scoring) is uncovering hidden patterns in the data and/or providing meaningful justification of decisions made by machine learning models. In this concern, an important question arises: should one use inherently interpretable models or explain full-complexity models such as XGBoost, Random Forest with post hoc tools? Opting for the second choice is typically supported by the accuracy metric, but it is not always evident that the performance gap is sufficiently significant, especially considering the current trend of accurate and inherently interpretable models, as well as accounting for other real-life evaluation metrics such as faithfulness, stability, and computational cost of explanations. In this work, we show through benchmarking on 45 datasets that the relative accuracy loss is less than 4% in average when using intelligible models such as explainable boosting machine. Furthermore, we propose a simple use of model ensembling to improve the expressiveness of TabSRALinear, a novel attention-based inherently interpretable solution, and demonstrate both theoretically and empirically that it is a viable option for (1) generating stable or robust explanations and (2) incorporating human knowledge during the training phase. Source code is available at https://github.com/anselmeamekoe/TabSRA.
引用
收藏
页码:18583 / 18611
页数:28
相关论文
共 50 条
  • [31] On the automaticity and flexibility of covert attention: A speed-accuracy trade-off analysis
    Giordano, Anna Marie
    McElree, Brian
    Carrasco, Marisa
    JOURNAL OF VISION, 2009, 9 (03):
  • [32] On the interaction of sustained and transient attention: A speed-accuracy trade-off analysis
    Carrasco, M.
    Giordano, A. M.
    PERCEPTION, 2007, 36 : 116 - 116
  • [33] Trade-off between approximation accuracy and complexity for TS fuzzy models
    Baranyi, P
    Korondi, P
    Patton, RJ
    Hashimoto, H
    ASIAN JOURNAL OF CONTROL, 2004, 6 (01) : 21 - 33
  • [34] Concept Embedding Models: Beyond the Accuracy-Explainability Trade-Off
    Zarlenga, Mateo Espinosa
    Barbiero, Pietro
    Ciravegna, Gabriele
    Marra, Giuseppe
    Giannini, Francesco
    Diligenti, Michelangelo
    Shams, Zohreh
    Precioso, Frederic
    Melacci, Stefano
    Weller, Adrian
    Lio, Pietro
    Jamnik, Mateja
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [35] A Synergistic Approach to Enhance the Accuracy-interpretability Trade-off of the NECLASS Classifier for Skewed Data Distribution
    Yousefi, Jamileh
    Hamilton-Wright, Andrew
    Obimbo, Charlie
    IJCCI: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2019, : 325 - 334
  • [36] A Review on the Interpretability-Accuracy Trade-Off in Evolutionary Multi-Objective Fuzzy Systems (EMOFS)
    Shukla, Praveen Kumar
    Tripathi, Surya Prakash
    INFORMATION, 2012, 3 (03) : 256 - 277
  • [37] A modified NEFCLASS classifier with enhanced accuracy-interpretability trade-off for datasets with skewed feature values
    Yousefi, Jamileh
    FUZZY SETS AND SYSTEMS, 2021, 413 : 99 - 113
  • [38] Benchmarking Attention-Based Interpretability of Deep Learning in Multivariate Time Series Predictions
    Baric, Domjan
    Fumic, Petar
    Horvatic, Davor
    Lipic, Tomislav
    ENTROPY, 2021, 23 (02) : 1 - 23
  • [39] ECG Time Series Classification via Genetic-Fuzzy Approach Based on Accuracy-Interpretability Trade-Off Optimization
    Gorzalczany, Marian B.
    Rudzinski, Filip
    2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,
  • [40] Predicting supply chain risks using machine learning: The trade-off between performance and interpretability
    Baryannis, George
    Dani, Samir
    Antoniou, Grigoris
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 101 : 993 - 1004