Improving deep neural network generalization and robustness to background bias via layer-wise relevance propagation optimization

被引：0

作者：

Pedro R. A. S. Bassi

Sergio S. J. Dertkigil

Andrea Cavalli

机构：

[1] Alma Mater Studiorum - University of Bologna,Center for Biomolecular Nanotechnologies

[2] Istituto Italiano di Tecnologia,School of Medical Sciences

[3] University of Campinas (UNICAMP),undefined

[4] Istituto Italiano di Tecnologia,undefined

来源：

Nature Communications | / 15卷

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Features in images’ backgrounds can spuriously correlate with the images’ classes, representing background bias. They can influence the classifier’s decisions, causing shortcut learning (Clever Hans effect). The phenomenon generates deep neural networks (DNNs) that perform well on standard evaluation datasets but generalize poorly to real-world data. Layer-wise Relevance Propagation (LRP) explains DNNs’ decisions. Here, we show that the optimization of LRP heatmaps can minimize the background bias influence on deep classifiers, hindering shortcut learning. By not increasing run-time computational cost, the approach is light and fast. Furthermore, it applies to virtually any classification architecture. After injecting synthetic bias in images’ backgrounds, we compared our approach (dubbed ISNet) to eight state-of-the-art DNNs, quantitatively demonstrating its superior robustness to background bias. Mixed datasets are common for COVID-19 and tuberculosis classification with chest X-rays, fostering background bias. By focusing on the lungs, the ISNet reduced shortcut learning. Thus, its generalization performance on external (out-of-distribution) test databases significantly surpassed all implemented benchmark models.

引用

共 49 条

[21] Explaining Deep Learning Models for Tabular Data Using Layer-Wise Relevance Propagation
Ullah, Ihsan
Rios, Andre
Gala, Vaibhav
Mckeever, Susan
[J]. APPLIED SCIENCES-BASEL, 2022, 12 (01):
[22] Layer-wise learning based stochastic gradient descent method for the optimization of deep convolutional neural network
Zheng, Qinghe
Tian, Xinyu
Jiang, Nan
Yang, Mingqiang
[J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 5641 - 5654
[23] Uncovering convolutional neural network decisions for diagnosing multiple sclerosis on conventional MRI using layer-wise relevance propagation
Eitel, Fabian
Soehler, Emily
Bellmann-Strobl, Judith
Brandt, Alexander U.
Ruprecht, Klemens
Giess, Rene M.
Kuchling, Joseph
Asseyer, Susanna
Weygandt, Martin
Haynes, John-Dylan
Scheel, Michael
Paul, Friedemann
Ritter, Kerstin
[J]. NEUROIMAGE-CLINICAL, 2019, 24
[24] Explainability of deep reinforcement learning algorithms in robotic domains by using Layer-wise Relevance Propagation
Taghian, Mehran
Miwa, Shotaro
Mitsuka, Yoshihiro
Gunther, Johannes
Golestan, Shadan
Zaiane, Osmar
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
[25] Accelerate Cooperative Deep Inference via Layer-wise Processing Schedule Optimization
Wang, Ning
Duan, Yubin
Wu, Jie
[J]. 30TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN 2021), 2021,
[26] Stable feature selection utilizing Graph Convolutional Neural Network and Layer-wise Relevance Propagation for biomarker discovery in breast cancer
Chereda, Hryhorii
Leha, Andreas
Beissbarth, Tim
[J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 151
[27] A neural network-based control chart for monitoring and interpreting autocorrelated multivariate processes using layer-wise relevance propagation
Sun, Jinwen
Zhou, Shiyu
Veeramani, Dharmaraj
[J]. QUALITY ENGINEERING, 2023, 35 (01) : 33 - 47
[28] An explainable brain tumor detection and classification model using deep learning and layer-wise relevance propagation
Saurabh Mandloi
Mohd Zuber
Rajeev Kumar Gupta
[J]. Multimedia Tools and Applications, 2024, 83 : 33753 - 33783
[29] An explainable brain tumor detection and classification model using deep learning and layer-wise relevance propagation
Mandloi, Saurabh
Zuber, Mohd
Gupta, Rajeev Kumar
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 33753 - 33783
[30] Beyond saliency: Understanding convolutional neural networks from saliency prediction on layer-wise relevance propagation
Li, Heyi
Tian, Yunke
Mueller, Klaus
Chen, Xin
[J]. IMAGE AND VISION COMPUTING, 2019, 83-84 : 70 - 86

← 1 2 3 4 5 →