Quantification of Deep Neural Network Prediction Uncertainties for VVUQ of Machine Learning Models

被引:6
|
作者
Yaseen, Mahmoud [1 ]
Wu, Xu [1 ]
机构
[1] North Carolina State Univ, Burlington Engn Labs, Dept Nucl Engn, 2500 Stinson Dr, Raleigh, NC 27695 USA
关键词
Uncertainty quantification; Deep Neural Network; Monte Carlo Dropout; Deep Ensemble; Bayesian Neural Network; FUEL PERFORMANCE CODE; VALIDATION; INFERENCE; RELEASE;
D O I
10.1080/00295639.2022.2123203
中图分类号
TL [原子能技术]; O571 [原子核物理学];
学科分类号
0827 ; 082701 ;
摘要
Recent performance breakthroughs in artificial intelligence (AI) and machine learning (ML), especially advances in deep learning, the availability of powerful and easy-to-use ML libraries (e.g., scikit-learn, TensorFlow, PyTorch), and increasing computational power, have led to unprecedented interest in AI/ML among nuclear engineers. For physics-based computational models, verification, validation, and uncertainty quantification (VVUQ) processes have been very widely investigated, and many methodologies have been developed. However, VVUQ of ML models has been relatively less studied, especially in nuclear engineering. This work focuses on uncertainty quantification (UQ) of ML models as a preliminary step of ML VVUQ, more specifically Deep Neural Networks (DNNs) because they are the most widely used supervised ML algorithm for both regression and classification tasks. This work ai3ms at quantifying the prediction or approximation uncertainties of DNNs when they are used as surrogate models for expensive physical models. Three techniques for UQ of DNNs are compared, namely, Monte Carlo Dropout (MCD), Deep Ensembles (DE), and Bayesian Neural Networks (BNNs). Two nuclear engineering examples are used to benchmark these methods: (1) time-dependent fission gas release data using the Bison code and (2) void fraction simulation based on the Boiling Water Reactor Full-size Fine-Mesh Bundle Tests (BFBT) benchmark using the TRACE code. It is found that the three methods typically require different DNN architectures and hyperparameters to optimize their performance. The UQ results also depend on the amount of training data available and the nature of the data. Overall, all three methods can provide reasonable estimations of the approximation uncertainties. The uncertainties are generally smaller when the mean predictions are close to the test data while the BNN methods usually produce larger uncertainties than MCD and DE.
引用
下载
收藏
页码:947 / 966
页数:20
相关论文
共 50 条
  • [1] Frost prediction using machine learning and deep neural network models
    Talsma, Carl J.
    Solander, Kurt C.
    Mudunuru, Maruti K.
    Crawford, Brandon
    Powell, Michelle R.
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 5
  • [2] Private Deep Neural Network Models Publishing for Machine Learning as a Service
    Mao, Yunlong
    Zhu, Boyu
    Hong, Wenbo
    Zhu, Zhifei
    Zhang, Yuan
    Zhong, Sheng
    2020 IEEE/ACM 28TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2020,
  • [3] Comparing the prediction performance of a Deep Learning Neural Network model with conventional machine learning models in landslide susceptibility assessment
    Dieu Tien Bui
    Tsangaratos, Paraskevas
    Viet-Tien Nguyen
    Ngo Van Liem
    Phan Trong Trinh
    CATENA, 2020, 188
  • [4] Runtime Performance Prediction for Deep Learning Models with Graph Neural Network
    Gao, Yanjie
    Gu, Xianyu
    Zhang, Hongyu
    Lin, Haoxiang
    Yang, Mao
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN PRACTICE, ICSE-SEIP, 2023, : 368 - 380
  • [5] Deep UQ: Learning deep neural network surrogate models for high dimensional uncertainty quantification
    Tripathy, Rohit K.
    Bilionis, Ilias
    JOURNAL OF COMPUTATIONAL PHYSICS, 2018, 375 : 565 - 588
  • [6] Neural network models and deep learning
    Kriegeskorte, Nikolaus
    Golan, Tal
    CURRENT BIOLOGY, 2019, 29 (07) : R231 - R236
  • [7] Uncertainty quantification of machine learning models: on conformal prediction
    Akpabio, Inimfon I.
    Savari, Serap A.
    JOURNAL OF MICRO-NANOPATTERNING MATERIALS AND METROLOGY-JM3, 2021, 20 (04):
  • [8] GPU Occupancy Prediction of Deep Learning Models Using Graph Neural Network
    Mei, Hengquan
    Qu, Huaizhi
    Sun, Jingwei
    Gao, Yanjie
    Lin, Haoxiang
    Sun, Guangzhong
    2023 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, CLUSTER, 2023, : 318 - 329
  • [9] Review of machine learning and deep learning models for toxicity prediction
    Guo, Wenjing
    Liu, Jie
    Dong, Fan
    Song, Meng
    Li, Zoe
    Khan, Md Kamrul Hasan
    Patterson, Tucker A.
    Hong, Huixiao
    EXPERIMENTAL BIOLOGY AND MEDICINE, 2023, 248 (21) : 1952 - 1973
  • [10] Machine Learning and Neural Network Models for Customer Churn Prediction in Banking and Telecom Sectors
    Patil, Ketaki
    Patil, Shivraj
    Danve, Riya
    Patil, Ruchira
    PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND COMMUNICATION SYSTEMS, ICACECS 2021, 2022, : 241 - 253