The Effect of Training Data Quantity on Monte Carlo Dropout Uncertainty Quantification in Deep Learning

被引:1
|
作者
Cusack, Harrison [1 ]
Bialkowski, Alina [1 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia
关键词
D O I
10.1109/IJCNN54540.2023.10191327
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When deploying deep neural networks, quantification of a model's uncertainty is necessary to provide confidence in its predictions by distinguishing between accurate predictions and coincidentally correct guesses. While it is known that the accuracy of predictions is dependent on the data on which the model was trained, to date, limited work has examined the relationship between training data quantity and uncertainty quantification. In this paper, we propose two metrics to assess the 'quality' of uncertainty quantification, and investigate the relationship between training data quantity and Monte Carlo Dropout uncertainty quantification in supervised and semisupervised learning across various text-based datasets. We found that in supervised learning, uncertainty quantification quality (across both metrics) initially increased for larger quantities of training data, but interestingly, after a certain threshold, began to gradually decline. In semi-supervised learning, uncertainty quantification was enhanced by both a greater number of training samples and greater proportion of pre-labelled data. These results suggest that for supervised learning, data scientists generally ought not to invest resources into acquiring more training data solely for superior uncertainty quantification. However, if semi-supervised learning is necessary, then there is a marked benefit in obtaining more data.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Combining unsupervised deep learning and Monte Carlo dropout for seismic data reconstruction and its uncertainty quantification
    Chen, Gui
    Liu, Yang
    [J]. Geophysics, 2023, 89 (01)
  • [2] Combining unsupervised deep learning and Monte Carlo dropout for seismic data reconstruction and its uncertainty quantification
    Chen, Gui
    Liu, Yang
    [J]. GEOPHYSICS, 2024, 89 (01) : WA53 - WA65
  • [3] Assessing the uncertainty of deep learning soil spectral models using Monte Carlo dropout
    Padarian, J.
    Minasny, B.
    McBratney, A. B.
    [J]. GEODERMA, 2022, 425
  • [4] Assessing the uncertainty of deep learning soil spectral models using Monte Carlo dropout
    Padarian, J.
    Minasny, B.
    McBratney, A.B.
    [J]. Geoderma, 2022, 425
  • [5] Improving the repeatability of deep learning models with Monte Carlo dropout
    Lemay, Andreanne
    Hoebel, Katharina
    Bridge, Christopher P.
    Befano, Brian
    De Sanjose, Silvia
    Egemen, Didem
    Rodriguez, Ana Cecilia
    Schiffman, Mark
    Campbell, John Peter
    Kalpathy-Cramer, Jayashree
    [J]. NPJ DIGITAL MEDICINE, 2022, 5 (01)
  • [6] Improving the repeatability of deep learning models with Monte Carlo dropout
    Andreanne Lemay
    Katharina Hoebel
    Christopher P. Bridge
    Brian Befano
    Silvia De Sanjosé
    Didem Egemen
    Ana Cecilia Rodriguez
    Mark Schiffman
    John Peter Campbell
    Jayashree Kalpathy-Cramer
    [J]. npj Digital Medicine, 5
  • [7] Epistemic Uncertainty and Model Transparency in Rock Facies Classification Using Monte Carlo Dropout Deep Learning
    Hossain, Touhid Mohammad
    Hermana, Maman
    Abdulkadir, Said Jadid
    [J]. IEEE ACCESS, 2023, 11 : 89349 - 89358
  • [8] Uncertainty estimation for deep learning-based pectoral muscle segmentation via Monte Carlo dropout
    Klanecek, Zan
    Wagner, Tobias
    Wang, Yao-Kuan
    Cockmartin, Lesley
    Marshall, Nicholas
    Schott, Brayden
    Deatsch, Ali
    Studen, Andrej
    Hertl, Kristijana
    Jarm, Katja
    Krajc, Mateja
    Vrhovec, Milos
    Bosmans, Hilde
    Jeraj, Robert
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (11):
  • [9] Bayesian deep learning-based 1H-MRS of the brain: Metabolite quantification with uncertainty estimation using Monte Carlo dropout
    Lee, Hyeong Hun
    Kim, Hyeonjin
    [J]. MAGNETIC RESONANCE IN MEDICINE, 2022, 88 (01) : 38 - 52
  • [10] Bitcoin Price Prediction Using Deep Bayesian LSTM With Uncertainty Quantification: A Monte Carlo Dropout-Based Approach
    Hassan, Masoud Muhammed
    [J]. STAT, 2024, 13 (03):