Quantification of Deep Neural Network Prediction Uncertainties for VVUQ of Machine Learning Models

被引：6

作者：

Yaseen, Mahmoud ^{[1
]}

Wu, Xu ^{[1
]}

机构：

[1] North Carolina State Univ, Burlington Engn Labs, Dept Nucl Engn, 2500 Stinson Dr, Raleigh, NC 27695 USA

来源：

NUCLEAR SCIENCE AND ENGINEERING | 2023年 / 197卷 / 05期

关键词：

Uncertainty quantification; Deep Neural Network; Monte Carlo Dropout; Deep Ensemble; Bayesian Neural Network; FUEL PERFORMANCE CODE; VALIDATION; INFERENCE; RELEASE;

D O I：

10.1080/00295639.2022.2123203

中图分类号：

TL [原子能技术]; O571 [原子核物理学];

学科分类号：

0827 ; 082701 ;

摘要：

Recent performance breakthroughs in artificial intelligence (AI) and machine learning (ML), especially advances in deep learning, the availability of powerful and easy-to-use ML libraries (e.g., scikit-learn, TensorFlow, PyTorch), and increasing computational power, have led to unprecedented interest in AI/ML among nuclear engineers. For physics-based computational models, verification, validation, and uncertainty quantification (VVUQ) processes have been very widely investigated, and many methodologies have been developed. However, VVUQ of ML models has been relatively less studied, especially in nuclear engineering. This work focuses on uncertainty quantification (UQ) of ML models as a preliminary step of ML VVUQ, more specifically Deep Neural Networks (DNNs) because they are the most widely used supervised ML algorithm for both regression and classification tasks. This work ai3ms at quantifying the prediction or approximation uncertainties of DNNs when they are used as surrogate models for expensive physical models. Three techniques for UQ of DNNs are compared, namely, Monte Carlo Dropout (MCD), Deep Ensembles (DE), and Bayesian Neural Networks (BNNs). Two nuclear engineering examples are used to benchmark these methods: (1) time-dependent fission gas release data using the Bison code and (2) void fraction simulation based on the Boiling Water Reactor Full-size Fine-Mesh Bundle Tests (BFBT) benchmark using the TRACE code. It is found that the three methods typically require different DNN architectures and hyperparameters to optimize their performance. The UQ results also depend on the amount of training data available and the nature of the data. Overall, all three methods can provide reasonable estimations of the approximation uncertainties. The uncertainties are generally smaller when the mean predictions are close to the test data while the BNN methods usually produce larger uncertainties than MCD and DE.

引用

下载

页码：947 / 966

页数：20

共 50 条

[1] Frost prediction using machine learning and deep neural network models
Talsma, Carl J.
Solander, Kurt C.
Mudunuru, Maruti K.
Crawford, Brandon
Powell, Michelle R.
FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 5
[2] Private Deep Neural Network Models Publishing for Machine Learning as a Service
Mao, Yunlong
Zhu, Boyu
Hong, Wenbo
Zhu, Zhifei
Zhang, Yuan
Zhong, Sheng
2020 IEEE/ACM 28TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2020,
[3] Comparing the prediction performance of a Deep Learning Neural Network model with conventional machine learning models in landslide susceptibility assessment
Dieu Tien Bui
Tsangaratos, Paraskevas
Viet-Tien Nguyen
Ngo Van Liem
Phan Trong Trinh
CATENA, 2020, 188
[4] Runtime Performance Prediction for Deep Learning Models with Graph Neural Network
Gao, Yanjie
Gu, Xianyu
Zhang, Hongyu
Lin, Haoxiang
Yang, Mao
2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN PRACTICE, ICSE-SEIP, 2023, : 368 - 380
[5] Deep UQ: Learning deep neural network surrogate models for high dimensional uncertainty quantification
Tripathy, Rohit K.
Bilionis, Ilias
JOURNAL OF COMPUTATIONAL PHYSICS, 2018, 375 : 565 - 588
[6] Neural network models and deep learning
Kriegeskorte, Nikolaus
Golan, Tal
CURRENT BIOLOGY, 2019, 29 (07) : R231 - R236
[7] Uncertainty quantification of machine learning models: on conformal prediction
Akpabio, Inimfon I.
Savari, Serap A.
JOURNAL OF MICRO-NANOPATTERNING MATERIALS AND METROLOGY-JM3, 2021, 20 (04):
[8] GPU Occupancy Prediction of Deep Learning Models Using Graph Neural Network
Mei, Hengquan
Qu, Huaizhi
Sun, Jingwei
Gao, Yanjie
Lin, Haoxiang
Sun, Guangzhong
2023 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, CLUSTER, 2023, : 318 - 329
[9] Review of machine learning and deep learning models for toxicity prediction
Guo, Wenjing
Liu, Jie
Dong, Fan
Song, Meng
Li, Zoe
Khan, Md Kamrul Hasan
Patterson, Tucker A.
Hong, Huixiao
EXPERIMENTAL BIOLOGY AND MEDICINE, 2023, 248 (21) : 1952 - 1973
[10] Machine Learning and Neural Network Models for Customer Churn Prediction in Banking and Telecom Sectors
Patil, Ketaki
Patil, Shivraj
Danve, Riya
Patil, Ruchira
PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND COMMUNICATION SYSTEMS, ICACECS 2021, 2022, : 241 - 253

← 1 2 3 4 5 →