Improving deep neural network generalization and robustness to background bias via layer-wise relevance propagation optimization

Cited by: 0
Authors
Pedro R. A. S. Bassi
Sergio S. J. Dertkigil
Andrea Cavalli
Affiliations
[1] Alma Mater Studiorum - University of Bologna, Center for Biomolecular Nanotechnologies
[2] Istituto Italiano di Tecnologia, School of Medical Sciences
[3] University of Campinas (UNICAMP)
[4] Istituto Italiano di Tecnologia
Source
Keywords
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Features in images’ backgrounds can spuriously correlate with the images’ classes, representing background bias. They can influence the classifier’s decisions, causing shortcut learning (Clever Hans effect). The phenomenon generates deep neural networks (DNNs) that perform well on standard evaluation datasets but generalize poorly to real-world data. Layer-wise Relevance Propagation (LRP) explains DNNs’ decisions. Here, we show that the optimization of LRP heatmaps can minimize the background bias influence on deep classifiers, hindering shortcut learning. By not increasing run-time computational cost, the approach is light and fast. Furthermore, it applies to virtually any classification architecture. After injecting synthetic bias in images’ backgrounds, we compared our approach (dubbed ISNet) to eight state-of-the-art DNNs, quantitatively demonstrating its superior robustness to background bias. Mixed datasets are common for COVID-19 and tuberculosis classification with chest X-rays, fostering background bias. By focusing on the lungs, the ISNet reduced shortcut learning. Thus, its generalization performance on external (out-of-distribution) test databases significantly surpassed all implemented benchmark models.
Related papers
48 in total
  • [1] Improving deep neural network generalization and robustness to background bias via layer-wise relevance propagation optimization
    Bassi, Pedro R. A. S.
    Dertkigil, Sergio S. J.
    Cavalli, Andrea
    [J]. NATURE COMMUNICATIONS, 2024, 15 (01)
  • [2] Explaining Deep Neural Network using Layer-wise Relevance Propagation and Integrated Gradients
    Cik, Ivan
    Rasamoelina, Andrindrasana David
    Mach, Marian
    Sincak, Peter
    [J]. 2021 IEEE 19TH WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI 2021), 2021, : 381 - 386
  • [3] LAYER-WISE DEEP NEURAL NETWORK PRUNING VIA ITERATIVELY REWEIGHTED OPTIMIZATION
    Jiang, Tao
    Yang, Xiangyu
    Shi, Yuanming
    Wang, Hao
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5606 - 5610
  • [4] Interpreting Convolutional Neural Networks via Layer-Wise Relevance Propagation
    Jia, Wohuan
    Zhang, Shaoshuai
    Jiang, Yue
    Xu, Li
    [J]. ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I, 2022, 13338 : 457 - 467
  • [5] Hierarchical Neural Network with Layer-wise Relevance Propagation for Interpretable Multiclass Neural State Classification
    Ellis, Charles A.
    Sendi, Mohammad S. E.
    Willie, Jon T.
    Mahmoudi, Babak
    [J]. 2021 10TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING (NER), 2021, : 351 - 354
  • [6] Deep Neural Network Quantization via Layer-Wise Optimization Using Limited Training Data
    Chen, Shangyu
    Wang, Wenya
    Pan, Sinno Jialin
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3329 - 3336
  • [7] Explaining Therapy Predictions with Layer-wise Relevance Propagation in Neural Networks
    Yang, Yinchong
    Tresp, Volker
    Wunderle, Marius
    Fasching, Peter A.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2018, : 152 - 162
  • [8] Layer-Wise Relevance Propagation for Neural Networks with Local Renormalization Layers
    Binder, Alexander
    Montavon, Gregoire
    Lapuschkin, Sebastian
    Mueller, Klaus-Robert
    Samek, Wojciech
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 63 - 71
  • [9] Interpretable Convolutional Neural Network Through Layer-wise Relevance Propagation for Machine Fault Diagnosis
    Grezmak, John
    Zhang, Jianjing
    Wang, Peng
    Loparo, Kenneth A.
    Gao, Robert X.
    [J]. IEEE SENSORS JOURNAL, 2020, 20 (06) : 3172 - 3181
  • [10] Layer-Wise Relevance Propagation for Explaining Deep Neural Network Decisions in MRI-Based Alzheimer's Disease Classification
    Boehle, Moritz
    Eitel, Fabian
    Weygandt, Martin
    Ritter, Kerstin
    [J]. FRONTIERS IN AGING NEUROSCIENCE, 2019, 11