Improving deep neural network generalization and robustness to background bias via layer-wise relevance propagation optimization

被引:0
|
作者
Pedro R. A. S. Bassi
Sergio S. J. Dertkigil
Andrea Cavalli
机构
[1] Alma Mater Studiorum - University of Bologna,Center for Biomolecular Nanotechnologies
[2] Istituto Italiano di Tecnologia,School of Medical Sciences
[3] University of Campinas (UNICAMP),undefined
[4] Istituto Italiano di Tecnologia,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Features in images’ backgrounds can spuriously correlate with the images’ classes, representing background bias. They can influence the classifier’s decisions, causing shortcut learning (Clever Hans effect). The phenomenon generates deep neural networks (DNNs) that perform well on standard evaluation datasets but generalize poorly to real-world data. Layer-wise Relevance Propagation (LRP) explains DNNs’ decisions. Here, we show that the optimization of LRP heatmaps can minimize the background bias influence on deep classifiers, hindering shortcut learning. By not increasing run-time computational cost, the approach is light and fast. Furthermore, it applies to virtually any classification architecture. After injecting synthetic bias in images’ backgrounds, we compared our approach (dubbed ISNet) to eight state-of-the-art DNNs, quantitatively demonstrating its superior robustness to background bias. Mixed datasets are common for COVID-19 and tuberculosis classification with chest X-rays, fostering background bias. By focusing on the lungs, the ISNet reduced shortcut learning. Thus, its generalization performance on external (out-of-distribution) test databases significantly surpassed all implemented benchmark models.
引用
收藏
相关论文
共 49 条
  • [31] ALWANN: Automatic Layer-Wise Approximation of Deep Neural Network Accelerators without Retraining
    Mrazek, Vojtech
    Vasicek, Zdenek
    Sekanina, Lukas
    Hanif, Muhammad Abdullah
    Shafique, Muhammad
    [J]. 2019 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2019,
  • [32] LRP2A: Layer-wise Relevance Propagation based Adversarial attacking for Graph Neural Networks
    Liu, Li
    Du, Yong
    Wang, Ye
    Cheung, William K.
    Zhang, Youmin
    Liu, Qun
    Wang, Guoyin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 256
  • [33] Learning to Prune Deep Neural Networks via Layer-wise Optimal Brain Surgeon
    Dong, Xin
    Chen, Shangyu
    Pan, Sinno Jialin
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [34] ULAN: A Universal Local Adversarial Network for SAR Target Recognition Based on Layer-Wise Relevance Propagation
    Du, Meng
    Bi, Daping
    Du, Mingyang
    Xu, Xinsong
    Wu, Zilong
    [J]. REMOTE SENSING, 2023, 15 (01)
  • [35] Mixed-Precision Neural Network Quantization via Learned Layer-Wise Importance
    Tang, Chen
    Ouyang, Kai
    Wang, Zhi
    Zhu, Yifei
    Ji, Wen
    Wang, Yaowei
    Zhu, Wenwu
    [J]. COMPUTER VISION, ECCV 2022, PT XI, 2022, 13671 : 259 - 275
  • [36] Interpretation of convolutional neural network-based building HVAC fault diagnosis model using improved layer-wise relevance propagation
    Li, Guannan
    Wang, Luhan
    Shen, Limei
    Chen, Liang
    Cheng, Hengda
    Xu, Chengliang
    Li, Fan
    [J]. ENERGY AND BUILDINGS, 2023, 286
  • [37] A Dynamic Layer-Wise Gradient Sparsity and Gradient Merging Optimization Method for Deep Neural Networks
    Ju, Tao
    Kang, Heting
    Liu, Shuai
    Huo, Jiuyuan
    [J]. Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2024, 58 (09): : 105 - 116
  • [38] Automated layer-wise solution for ensemble deep randomized feed-forward neural network
    Hu, Minghui
    Gao, Ruobin
    Suganthan, Ponnuthurai N.
    Tanveer, M.
    [J]. NEUROCOMPUTING, 2022, 514 : 137 - 147
  • [39] Modelling and identification of characteristic kinematic features preceding freezing of gait with convolutional neural networks and layer-wise relevance propagation
    Filtjens, Benjamin
    Ginis, Pieter
    Nieuwboer, Alice
    Afzal, Muhammad Raheel
    Spildooren, Joke
    Vanrumste, Bart
    Slaets, Peter
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
  • [40] Modelling and identification of characteristic kinematic features preceding freezing of gait with convolutional neural networks and layer-wise relevance propagation
    Benjamin Filtjens
    Pieter Ginis
    Alice Nieuwboer
    Muhammad Raheel Afzal
    Joke Spildooren
    Bart Vanrumste
    Peter Slaets
    [J]. BMC Medical Informatics and Decision Making, 21