Improving deep neural network generalization and robustness to background bias via layer-wise relevance propagation optimization

被引:0
|
作者
Pedro R. A. S. Bassi
Sergio S. J. Dertkigil
Andrea Cavalli
机构
[1] Alma Mater Studiorum - University of Bologna,Center for Biomolecular Nanotechnologies
[2] Istituto Italiano di Tecnologia,School of Medical Sciences
[3] University of Campinas (UNICAMP),undefined
[4] Istituto Italiano di Tecnologia,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Features in images’ backgrounds can spuriously correlate with the images’ classes, representing background bias. They can influence the classifier’s decisions, causing shortcut learning (Clever Hans effect). The phenomenon generates deep neural networks (DNNs) that perform well on standard evaluation datasets but generalize poorly to real-world data. Layer-wise Relevance Propagation (LRP) explains DNNs’ decisions. Here, we show that the optimization of LRP heatmaps can minimize the background bias influence on deep classifiers, hindering shortcut learning. By not increasing run-time computational cost, the approach is light and fast. Furthermore, it applies to virtually any classification architecture. After injecting synthetic bias in images’ backgrounds, we compared our approach (dubbed ISNet) to eight state-of-the-art DNNs, quantitatively demonstrating its superior robustness to background bias. Mixed datasets are common for COVID-19 and tuberculosis classification with chest X-rays, fostering background bias. By focusing on the lungs, the ISNet reduced shortcut learning. Thus, its generalization performance on external (out-of-distribution) test databases significantly surpassed all implemented benchmark models.
引用
收藏
相关论文
共 49 条
  • [21] Explaining Deep Learning Models for Tabular Data Using Layer-Wise Relevance Propagation
    Ullah, Ihsan
    Rios, Andre
    Gala, Vaibhav
    Mckeever, Susan
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (01):
  • [22] Layer-wise learning based stochastic gradient descent method for the optimization of deep convolutional neural network
    Zheng, Qinghe
    Tian, Xinyu
    Jiang, Nan
    Yang, Mingqiang
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 5641 - 5654
  • [23] Uncovering convolutional neural network decisions for diagnosing multiple sclerosis on conventional MRI using layer-wise relevance propagation
    Eitel, Fabian
    Soehler, Emily
    Bellmann-Strobl, Judith
    Brandt, Alexander U.
    Ruprecht, Klemens
    Giess, Rene M.
    Kuchling, Joseph
    Asseyer, Susanna
    Weygandt, Martin
    Haynes, John-Dylan
    Scheel, Michael
    Paul, Friedemann
    Ritter, Kerstin
    [J]. NEUROIMAGE-CLINICAL, 2019, 24
  • [24] Explainability of deep reinforcement learning algorithms in robotic domains by using Layer-wise Relevance Propagation
    Taghian, Mehran
    Miwa, Shotaro
    Mitsuka, Yoshihiro
    Gunther, Johannes
    Golestan, Shadan
    Zaiane, Osmar
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
  • [25] Accelerate Cooperative Deep Inference via Layer-wise Processing Schedule Optimization
    Wang, Ning
    Duan, Yubin
    Wu, Jie
    [J]. 30TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN 2021), 2021,
  • [26] Stable feature selection utilizing Graph Convolutional Neural Network and Layer-wise Relevance Propagation for biomarker discovery in breast cancer
    Chereda, Hryhorii
    Leha, Andreas
    Beissbarth, Tim
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 151
  • [27] A neural network-based control chart for monitoring and interpreting autocorrelated multivariate processes using layer-wise relevance propagation
    Sun, Jinwen
    Zhou, Shiyu
    Veeramani, Dharmaraj
    [J]. QUALITY ENGINEERING, 2023, 35 (01) : 33 - 47
  • [28] An explainable brain tumor detection and classification model using deep learning and layer-wise relevance propagation
    Saurabh Mandloi
    Mohd Zuber
    Rajeev Kumar Gupta
    [J]. Multimedia Tools and Applications, 2024, 83 : 33753 - 33783
  • [29] An explainable brain tumor detection and classification model using deep learning and layer-wise relevance propagation
    Mandloi, Saurabh
    Zuber, Mohd
    Gupta, Rajeev Kumar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 33753 - 33783
  • [30] Beyond saliency: Understanding convolutional neural networks from saliency prediction on layer-wise relevance propagation
    Li, Heyi
    Tian, Yunke
    Mueller, Klaus
    Chen, Xin
    [J]. IMAGE AND VISION COMPUTING, 2019, 83-84 : 70 - 86