Exploring accuracy and interpretability trade-off in tabular learning with novel attention-based models

被引:0
|
作者
Kodjo Mawuena Amekoe [1 ]
Hanane Azzag [3 ]
Zaineb Chelly Dagdia [1 ]
Mustapha Lebbah [2 ]
Gregoire Jaffre [2 ]
机构
[1] Université Sorbonne Paris Nord,
[2] LIPN CNRS UMR,undefined
[3] Université Paris-Saclay,undefined
[4] DAVID Lab,undefined
[5] UVSQ,undefined
[6] Groupe BPCE,undefined
关键词
Tabular data; Interpretability; Attention; Robust explanation;
D O I
10.1007/s00521-024-10163-9
中图分类号
学科分类号
摘要
Apart from high accuracy, what interests many researchers and practitioners in real-life tabular learning problems (e.g., fraud detection and credit scoring) is uncovering hidden patterns in the data and/or providing meaningful justification of decisions made by machine learning models. In this concern, an important question arises: should one use inherently interpretable models or explain full-complexity models such as XGBoost, Random Forest with post hoc tools? Opting for the second choice is typically supported by the accuracy metric, but it is not always evident that the performance gap is sufficiently significant, especially considering the current trend of accurate and inherently interpretable models, as well as accounting for other real-life evaluation metrics such as faithfulness, stability, and computational cost of explanations. In this work, we show through benchmarking on 45 datasets that the relative accuracy loss is less than 4% in average when using intelligible models such as explainable boosting machine. Furthermore, we propose a simple use of model ensembling to improve the expressiveness of TabSRALinear, a novel attention-based inherently interpretable solution, and demonstrate both theoretically and empirically that it is a viable option for (1) generating stable or robust explanations and (2) incorporating human knowledge during the training phase. Source code is available at https://github.com/anselmeamekoe/TabSRA.
引用
收藏
页码:18583 / 18611
页数:28
相关论文
共 50 条
  • [41] Optimizing Speed and Accuracy Trade-off in Machine Learning Models via Stochastic Gradient Descent Approximation
    Catapang, Jasper Kyle
    2022 9TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI, 2022, : 124 - 128
  • [42] Enhancing financial risk prediction with symbolic classifiers: addressing class imbalance and the accuracy-interpretability trade-off
    Mena, Luis J.
    Garcia, Vicente
    Felix, Vanessa G.
    Ostos, Rodolfo
    Martinez-Pelaez, Rafael
    Ochoa-Brust, Alberto
    Velarde-Alvarado, Pablo
    HUMANITIES & SOCIAL SCIENCES COMMUNICATIONS, 2024, 11 (01):
  • [43] On Exploring Attention-based Explanation for Transformer Models in Text Classification
    Liu, Shengzhong
    Le, Franck
    Chakraborty, Supriyo
    Abdelzaher, Tarek
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 1193 - 1203
  • [44] Multi-objective based Fuzzy Rule Based Systems (FRBSs) for trade-off improvement in accuracy and interpretability: A rule relevance point of view
    Rey, M. I.
    Galende, M.
    Fuente, M. J.
    Sainz-Palmeroc, G. I.
    KNOWLEDGE-BASED SYSTEMS, 2017, 127 : 67 - 84
  • [45] Accuracy vs. complexity: A trade-off in visual question answering models
    Farazi, Moshiur
    Khan, Salman
    Barnes, Nick
    PATTERN RECOGNITION, 2021, 120 (120)
  • [46] A genetic programming approach for real-time crash prediction to solve trade-off between interpretability and accuracy
    Ma, Xiaochi
    Lu, Jian
    Liu, Xian
    Qu, Weibin
    JOURNAL OF TRANSPORTATION SAFETY & SECURITY, 2023, 15 (04) : 421 - 443
  • [47] The Trade-Off Between Accuracy and Precision in Latent Variable Models of Mediation Processes
    Ledgerwood, Alison
    Shrout, Patrick E.
    JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 2011, 101 (06) : 1174 - 1188
  • [48] Representations of protein structure for exploring the conformational space: A speed-accuracy trade-off
    Postic, Guillaume
    Janel, Nathalie
    Moroy, Gautier
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 2618 - 2625
  • [49] Enhancing Accuracy-Privacy Trade-Off in Differentially Private Split Learning
    Pham, Ngoc Duy
    Phan, Khoa T.
    Chilamkurti, Naveen
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025, 9 (01): : 988 - 1000
  • [50] Using Machine Learning to Calibrate Automated Performance Assessment in a Virtual Laboratory: Exploring the Trade-Off between Accuracy and Explainability
    Zafeiropoulos, Vasilis
    Kalles, Dimitris
    APPLIED SCIENCES-BASEL, 2024, 14 (17):