Failure Detection in Deep Neural Networks for Medical Imaging

Cited by: 6
Authors
Ahmed, Sabeen [1 ]
Dera, Dimah [2 ]
Ul Hassan, Saud [3 ]
Bouaynaya, Nidhal [1 ]
Rasool, Ghulam [4 ]
Affiliations
[1] Rowan Univ, Dept Elect & Comp Engn, Glassboro, NJ 08028 USA
[2] Univ Texas Rio Grande Valley, Brownsville, TX USA
[3] AMD Inc, Austin, TX USA
[4] H Lee Moffitt Canc Ctr & Res Inst, Machine Learning Dept, Tampa, FL USA
Funding
UK Engineering and Physical Sciences Research Council (EPSRC); US National Science Foundation (NSF)
Keywords
failure detection; robustness; trustworthiness; adversarial attacks; Bayesian deep neural networks; self-assessment; reliability; natural noise; uncertainty
DOI
10.3389/fmedt.2022.919046
Chinese Library Classification
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Deep neural networks (DNNs) have started to find their role in the modern healthcare system. DNNs are being developed for diagnosis, prognosis, treatment planning, and outcome prediction for various diseases. With the increasing number of applications of DNNs in modern healthcare, their trustworthiness and reliability are becoming increasingly important. An essential aspect of trustworthiness is detecting the performance degradation and failure of deployed DNNs in medical settings. The softmax output values produced by DNNs are not a calibrated measure of model confidence: softmax probabilities are generally higher than the actual model confidence, and the gap between confidence and accuracy widens further for wrong predictions and noisy inputs. We employ recently proposed Bayesian deep neural networks (BDNNs) to learn uncertainty in the model parameters. These models simultaneously output the predictions and a measure of confidence in the predictions. By testing these models under various noisy conditions, we show that the (learned) predictive confidence is well calibrated. We use these reliable confidence values for monitoring performance degradation and failure detection in DNNs. We propose two different failure detection methods. In the first method, we define a fixed threshold value based on the behavior of the predictive confidence with changing signal-to-noise ratio (SNR) of the test dataset. The second method learns the threshold value with a neural network. The proposed failure detection mechanisms abstain from making decisions when the confidence of the BDNN is below the defined threshold and hold the decision for manual review. As a result, the accuracy of the models improves on the unseen test samples. We tested our proposed approach on three medical imaging datasets, PathMNIST, DermaMNIST, and OrganAMNIST, under different levels and types of noise. An increase in the noise of the test images increases the number of abstained samples. BDNNs are inherently robust and show more than 10% accuracy improvement with the proposed failure detection methods. An increased number of abstained samples or an abrupt increase in the predictive variance indicates model performance degradation or possible failure. Our work has the potential to improve the trustworthiness of DNNs and enhance user confidence in the model predictions.
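The fixed-threshold abstention rule described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the use of per-sample predictive variance as an inverse proxy for confidence, the threshold value 0.10, and the toy numbers are all hypothetical and chosen only to show the mechanism.

```python
from typing import List, Optional, Sequence, Tuple


def abstain_on_low_confidence(
    predictions: Sequence[int],
    variances: Sequence[float],
    max_variance: float,
) -> List[Optional[int]]:
    """Return each prediction, or None when its predictive variance is too high.

    Assumption: high predictive variance is treated as low confidence, so
    those samples are held for manual review instead of being decided.
    """
    if len(predictions) != len(variances):
        raise ValueError("predictions and variances must have the same length")
    return [
        pred if var <= max_variance else None
        for pred, var in zip(predictions, variances)
    ]


def accuracy_and_abstention_rate(
    decisions: Sequence[Optional[int]],
    labels: Sequence[int],
) -> Tuple[Optional[float], float]:
    """Accuracy over retained samples, and the fraction of abstained samples."""
    kept = [(d, y) for d, y in zip(decisions, labels) if d is not None]
    abstained = len(decisions) - len(kept)
    # Accuracy is undefined (None) when every sample was abstained.
    accuracy = sum(d == y for d, y in kept) / len(kept) if kept else None
    rate = abstained / len(decisions) if decisions else 0.0
    return accuracy, rate


# Made-up toy values, purely illustrative (not results from the paper).
labels = [0, 1, 2, 1, 0]
predictions = [0, 1, 1, 1, 2]
variances = [0.02, 0.05, 0.40, 0.03, 0.55]

decisions = abstain_on_low_confidence(predictions, variances, max_variance=0.10)
accuracy, rate = accuracy_and_abstention_rate(decisions, labels)
```

In this toy setup the two misclassified samples also carry the highest variance, so abstaining on them raises accuracy on the retained samples. Tracking the abstention rate over time is one plausible way to watch for the degradation signal the abstract mentions, and the learned-threshold variant would replace the hypothetical `max_variance` constant with a value produced by a separate model.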
Pages: 20