Improving the repeatability of deep learning models with Monte Carlo dropout

被引:0
|
作者
Andreanne Lemay
Katharina Hoebel
Christopher P. Bridge
Brian Befano
Silvia De Sanjosé
Didem Egemen
Ana Cecilia Rodriguez
Mark Schiffman
John Peter Campbell
Jayashree Kalpathy-Cramer
机构
[1] Martinos Center for Biomedical Imaging,Department of Epidemiology
[2] NeuroPoly,Division of Cancer Epidemiology & Genetics
[3] Polytechnique Montreal,undefined
[4] Massachusetts Institute of Technology,undefined
[5] MGH & BWH Center for Clinical Data Science,undefined
[6] University of Washington School of Public Health,undefined
[7] National Cancer Institute,undefined
[8] Oregon Health and Science University,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The integration of artificial intelligence into clinical workflows requires reliable and robust models. Repeatability is a key attribute of model robustness. Ideal repeatable models output predictions without variation during independent tests carried out under similar conditions. However, slight variations, though not ideal, may be unavoidable and acceptable in practice. During model development and evaluation, much attention is given to classification performance while model repeatability is rarely assessed, leading to the development of models that are unusable in clinical practice. In this work, we evaluate the repeatability of four model types (binary classification, multi-class classification, ordinal classification, and regression) on images that were acquired from the same patient during the same visit. We study the each model’s performance on four medical image classification tasks from public and private datasets: knee osteoarthritis, cervical cancer screening, breast density estimation, and retinopathy of prematurity. Repeatability is measured and compared on ResNet and DenseNet architectures. Moreover, we assess the impact of sampling Monte Carlo dropout predictions at test time on classification performance and repeatability. Leveraging Monte Carlo predictions significantly increases repeatability, in particular at the class boundaries, for all tasks on the binary, multi-class, and ordinal models leading to an average reduction of the 95% limits of agreement by 16% points and of the class disagreement rate by 7% points. The classification accuracy improves in most settings along with the repeatability. Our results suggest that beyond about 20 Monte Carlo iterations, there is no further gain in repeatability. In addition to the higher test-retest agreement, Monte Carlo predictions are better calibrated which leads to output probabilities reflecting more accurately the true likelihood of being correctly classified.
引用
收藏
相关论文
共 50 条
  • [31] Self-learning Monte Carlo with deep neural networks
    Shen, Huitao
    Liu, Junwei
    Fu, Liang
    [J]. PHYSICAL REVIEW B, 2018, 97 (20)
  • [32] On the Robustness of Monte Carlo Dropout Trained with Noisy Labels
    Goel, Purvi
    Li Chen
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 2219 - 2228
  • [33] Ensemble of Deep Convolutional Neural Networks with Monte Carlo Dropout Sampling for Automated Image Segmentation Quality Control and Robust Deep Learning Using Small Datasets
    Hann, Evan
    Gonzales, Ricardo A.
    Popescu, Iulia A.
    Zhang, Qiang
    Ferreira, Vanessa M.
    Piechnik, Stefan K.
    [J]. MEDICAL IMAGE UNDERSTANDING AND ANALYSIS (MIUA 2021), 2021, 12722 : 280 - 293
  • [34] Bayesian deep learning-based 1H-MRS of the brain: Metabolite quantification with uncertainty estimation using Monte Carlo dropout
    Lee, Hyeong Hun
    Kim, Hyeonjin
    [J]. MAGNETIC RESONANCE IN MEDICINE, 2022, 88 (01) : 38 - 52
  • [35] Feasibility of Monte Carlo dropout-based uncertainty maps to evaluate deep learning-based synthetic CTs for adaptive proton therapy
    Galapon Jr, Arthur Villanueva
    Thummerer, Adrian
    Langendijk, Johannes Albertus
    Wagenaar, Dirk
    Both, Stefan
    [J]. MEDICAL PHYSICS, 2024, 51 (04) : 2499 - 2509
  • [36] Monte Carlo Dropout for Uncertainty Estimation and Motor Imagery Classification
    Milanes-Hermosilla, Daily
    Codorniu, Rafael Trujillo
    Lopez-Baracaldo, Rene
    Sagaro-Zamora, Roberto
    Delisle-Rodriguez, Denis
    Villarejo-Mayor, John Jairo
    Nunez-Alvarez, Jose Ricardo
    [J]. SENSORS, 2021, 21 (21)
  • [37] Fast Monte-Carlo dose simulation with recurrent deep learning
    Martinot, S.
    Bus, N.
    Vakalopoulou, M.
    Robert, C.
    Deutsch, E.
    Paragios, N.
    [J]. RADIOTHERAPY AND ONCOLOGY, 2021, 161 : S216 - S217
  • [38] Quantum monte carlo for economics: Stress testing and macroeconomic deep learning
    Skavysh, Vladimir
    Priazhkina, Sofia
    Guala, Diego
    Bromley, Thomas R.
    [J]. JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2023, 153
  • [39] Explainable Fingerprint ROI Segmentation Using Monte Carlo Dropout
    Joshi, Indu
    Kothari, Riya
    Utkarsh, Ayush
    Kurmi, Vinod K.
    Dantcheva, Antitza
    Roy, Sumantra Dutta
    Kalra, Prem Kumar
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2021), 2021, : 60 - 69
  • [40] An Anomaly Detection Method for Satellites Using Monte Carlo Dropout
    Sadr, Mohammad Amin Maleki
    Zhu, Yeying
    Hu, Peng
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (02) : 2044 - 2052