Evaluation and Optimization Methods for Applicability Domain Methods and Their Hyperparameters, Considering the Prediction Performance of Machine Learning Models

Cited by: 3
Authors
Kaneko, Hiromasa [1]
Affiliations
[1] Meiji Univ, Sch Sci & Technol, Dept Appl Chem, Kawasaki, Kanagawa 2148571, Japan
Source
ACS OMEGA | 2024 / Vol. 9 / Issue 10
Keywords
CRITICAL-TEMPERATURE; DATA SET; QSAR; POINT
DOI
10.1021/acsomega.3c08036
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
In molecular, material, and process design and control, the applicability domain (AD) of a mathematical model y = f(x) between properties or activities y and features x is constructed. As there are multiple AD methods, each with its own set of hyperparameters, it is necessary to select an appropriate AD method and hyperparameters for each data set and mathematical model. However, there is no method for optimizing the AD model. This study proposes a method for evaluating and optimizing the AD model for each data set and mathematical model. Using the predictions of double cross-validation with all samples, the relationship between coverage and root-mean-squared error (RMSE) was calculated for all combinations of AD methods and their hyperparameters, and the area under the coverage and RMSE curve (AUCR) was calculated. The AD model with the lowest AUCR value was selected as the optimal fit for the mathematical model. The proposed method was validated using eight data sets, including molecules, materials, and spectra, demonstrating that the proposed method could generate optimal AD models for all data sets. The Python code for the proposed method is available at https://github.com/hkaneko1985/dcekit.
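The coverage-RMSE procedure described in the abstract can be illustrated with a minimal sketch. It is not the dcekit API, and the exact AUCR definition in the paper may differ in detail. It assumes that each candidate AD model gives every sample a score where larger means "more inside the domain", that y_pred holds out-of-sample predictions (for example from double cross-validation), and that the area is taken with the trapezoidal rule. The function names coverage_rmse_curve, aucr, and select_ad_model are hypothetical.

```python
import numpy as np


def coverage_rmse_curve(y_true, y_pred, ad_index):
    """Return (coverage, rmse) arrays for samples ranked by an AD index.

    Assumption: a larger ad_index means the sample lies deeper inside the AD.
    """
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ad_index = np.asarray(ad_index, dtype=float)

    # Rank samples from most to least reliable according to the AD index.
    order = np.argsort(-ad_index, kind="stable")
    squared_error = (y_true[order] - y_pred[order]) ** 2

    # For the k most reliable samples: coverage = k / n, RMSE over those k.
    counts = np.arange(1, len(squared_error) + 1)
    coverage = counts / len(squared_error)
    rmse = np.sqrt(np.cumsum(squared_error) / counts)
    return coverage, rmse


def aucr(coverage, rmse):
    """Area under the coverage-RMSE curve (trapezoidal rule, an assumption)."""
    return float(np.sum(np.diff(coverage) * (rmse[1:] + rmse[:-1]) / 2.0))


def select_ad_model(y_true, y_pred, ad_indices):
    """Pick the candidate AD model (dict key) with the lowest AUCR."""
    scores = {
        name: aucr(*coverage_rmse_curve(y_true, y_pred, index))
        for name, index in ad_indices.items()
    }
    best = min(scores, key=scores.get)
    return best, scores
```

An AD index that ranks low-error samples first keeps the RMSE small at low coverage, which gives a smaller area. This is why the lowest AUCR is treated as the best AD method and hyperparameter combination.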
Pages: 11453-11458
Number of pages: 6
Related papers
50 in total
  • [11] Performance Evaluation of Machine Learning Methods in Cultural Modeling
    Xiao-Chen Li
    Wen-Ji Mao
    Daniel Zeng
    Peng Su
    Fei-Yue Wang
    Journal of Computer Science and Technology, 2009, 24 : 1010 - 1017
  • [12] Application of machine learning methods in performance prediction and multi-objective optimization of fuel cell
    School of Energy and Power Engineering, Northeast Electric Power University, China
    Proc. Int. Conf. Power Eng., ICOPE,
  • [13] PREDICTION OF GAS TURBINE PERFORMANCE USING MACHINE LEARNING METHODS
    Goyal, Vipul
    Xu, Mengyu
    Kapat, Jayanta
    Vesely, Ladislav
    PROCEEDINGS OF THE ASME TURBO EXPO: TURBOMACHINERY TECHNICAL CONFERENCE AND EXPOSITION, VOL 6, 2020,
  • [14] Performance attribution of machine learning methods for stock returns prediction
    Daul, Stephane
    Jaisson, Thibault
    Nagy, Alexandra
    JOURNAL OF FINANCE AND DATA SCIENCE, 2022, 8 : 86 - 104
  • [15] Performance Comparison of Machine Learning Models for Annual Precipitation Prediction Using Different Decomposition Methods
    Song, Chao
    Chen, Xiaohong
    REMOTE SENSING, 2021, 13 (05) : 1 - 27
  • [16] Assessment of XCMS Optimization Methods with Machine-Learning Performance
    Lassen, Johan
    Nielsen, Kirstine Lykke
    Johannsen, Mogens
    Villesen, Palle
    ANALYTICAL CHEMISTRY, 2021, 93 (40) : 13459 - 13466
  • [17] Fuel Consumption Prediction Models Based on Machine Learning and Mathematical Methods
    Xie, Xianwei
    Sun, Baozhi
    Li, Xiaohe
    Olsson, Tobias
    Maleki, Neda
    Ahlgren, Fredrik
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (04)
  • [18] Comprehensive input models and machine learning methods to improve permeability prediction
    Davari, Mohammad Ali
    Kadkhodaie, Ali
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [19] Performance evaluation of machine learning methods for path loss prediction in rural environment at 3.7 GHz
    Moraitis, Nektarios
    Tsipi, Lefteris
    Vouyioukas, Demosthenes
    Gkioni, Angelina
    Louvros, Spyridon
    WIRELESS NETWORKS, 2021, 27 (06) : 4169 - 4188
  • [20] Methods to Evaluate Temporal Cognitive Biases in Machine Learning Prediction Models
    Harris, Christopher G.
    WWW'20: COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2020, 2020, : 572 - 575