Improving deep neural network generalization and robustness to background bias via layer-wise relevance propagation optimization

Cited by: 0

Authors:

Pedro R. A. S. Bassi

Sergio S. J. Dertkigil

Andrea Cavalli

Affiliations:

[1] Alma Mater Studiorum - University of Bologna, Center for Biomolecular Nanotechnologies

[2] Istituto Italiano di Tecnologia, School of Medical Sciences

[3] University of Campinas (UNICAMP)

[4] Istituto Italiano di Tecnologia

Source:

Nature Communications, vol. 15

DOI: not available

Abstract:

Features in images’ backgrounds can spuriously correlate with the images’ classes, representing background bias. They can influence the classifier’s decisions, causing shortcut learning (Clever Hans effect). The phenomenon generates deep neural networks (DNNs) that perform well on standard evaluation datasets but generalize poorly to real-world data. Layer-wise Relevance Propagation (LRP) explains DNNs’ decisions. Here, we show that the optimization of LRP heatmaps can minimize the background bias influence on deep classifiers, hindering shortcut learning. By not increasing run-time computational cost, the approach is light and fast. Furthermore, it applies to virtually any classification architecture. After injecting synthetic bias in images’ backgrounds, we compared our approach (dubbed ISNet) to eight state-of-the-art DNNs, quantitatively demonstrating its superior robustness to background bias. Mixed datasets are common for COVID-19 and tuberculosis classification with chest X-rays, fostering background bias. By focusing on the lungs, the ISNet reduced shortcut learning. Thus, its generalization performance on external (out-of-distribution) test databases significantly surpassed all implemented benchmark models.

