Evaluation of post-hoc interpretability methods in time-series classification

被引:0
|
作者
Hugues Turbé
Mina Bjelogrlic
Christian Lovis
Gianmarco Mengaldo
机构
[1] University Hospitals of Geneva,Division of Medical Information Sciences
[2] University of Geneva,Department of Radiology and Medical Informatics
[3] National University of Singapore,Department of Mechanical Engineering, College of Design and Engineering
[4] Imperial College London,Honorary Research Fellow, Department of Aeronautics
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Post-hoc interpretability methods are critical tools to explain neural-network results. Several post-hoc methods have emerged in recent years but they produce different results when applied to a given task, raising the question of which method is the most suitable to provide accurate post-hoc interpretability. To understand the performance of each method, quantitative evaluation of interpretability methods is essential; however, currently available frameworks have several drawbacks that hinder the adoption of post-hoc interpretability methods, especially in high-risk sectors. In this work we propose a framework with quantitative metrics to assess the performance of existing post-hoc interpretability methods, particularly in time-series classification. We show that several drawbacks identified in the literature are addressed, namely, the dependence on human judgement, retraining and the shift in the data distribution when occluding samples. We also design a synthetic dataset with known discriminative features and tunable complexity. The proposed methodology and quantitative metrics can be used to understand the reliability of interpretability methods results obtained in practical applications. In turn, they can be embedded within operational workflows in critical fields that require accurate interpretability results for, example, regulatory policies.
引用
收藏
页码:250 / 260
页数:10
相关论文
共 50 条
  • [1] Evaluation of post-hoc interpretability methods in time-series classification
    Turbe, Hugues
    Bjelogrlic, Mina
    Lovis, Christian
    Mengaldo, Gianmarco
    [J]. NATURE MACHINE INTELLIGENCE, 2023, 5 (03) : 250 - +
  • [2] Evaluation of Post-hoc Interpretability Methods in Breast Cancer Histopathological Image Classification
    Wagas, Muhammad
    Maul, Tomas
    Ahmed, Amr
    Liao, Iman Yi
    [J]. ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, BICS 2023, 2024, 14374 : 95 - 104
  • [3] Post-hoc Interpretability for Neural NLP: A Survey
    Madsen, Andreas
    Reddy, Siva
    Chandar, Sarath
    [J]. ACM COMPUTING SURVEYS, 2023, 55 (08)
  • [4] The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations
    Laugel, Thibault
    Lesot, Marie-Jeanne
    Marsala, Christophe
    Renard, Xavier
    Detyniecki, Marcin
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2801 - 2807
  • [5] Listen to Interpret: Post-hoc Interpretability for Audio Networks with NMF
    Parekh, Jayneel
    Parekh, Sanjeel
    Mozharovskyi, Pavlo
    d'Alche-Buc, Florence
    Richard, Gael
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [6] Evaluation of interpretability methods for multivariate time series forecasting
    Ozan Ozyegen
    Igor Ilic
    Mucahit Cevik
    [J]. Applied Intelligence, 2022, 52 : 4727 - 4743
  • [7] Evaluation of interpretability methods for multivariate time series forecasting
    Ozyegen, Ozan
    Ilic, Igor
    Cevik, Mucahit
    [J]. APPLIED INTELLIGENCE, 2022, 52 (05) : 4727 - 4743
  • [8] On the post-hoc explainability of deep echo state networks for time series forecasting, image and video classification
    Alejandro Barredo Arrieta
    Sergio Gil-Lopez
    Ibai Laña
    Miren Nekane Bilbao
    Javier Del Ser
    [J]. Neural Computing and Applications, 2022, 34 : 10257 - 10277
  • [9] On the post-hoc explainability of deep echo state networks for time series forecasting, image and video classification
    Barredo Arrieta, Alejandro
    Gil-Lopez, Sergio
    Lana, Ibai
    Bilbao, Miren Nekane
    Del Ser, Javier
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (13): : 10257 - 10277
  • [10] An Evaluation of Classification Methods for 3D Printing Time-Series Data
    Mahato, Vivek
    Obeidi, Muhannad Ahmed
    Brabazon, Dermot
    Cunningham, Padraig
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 8211 - 8216