Evaluation and Optimization Methods for Applicability Domain Methods and Their Hyperparameters, Considering the Prediction Performance of Machine Learning Models

被引:3
|
作者
Kaneko, Hiromasa [1 ]
机构
[1] Meiji Univ, Sch Sci & Technol, Dept Appl Chem, Kawasaki, Kanagawa 2148571, Japan
来源
ACS OMEGA | 2024年 / 9卷 / 10期
关键词
CRITICAL-TEMPERATURE; DATA SET; QSAR; POINT;
D O I
10.1021/acsomega.3c08036
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In molecular, material, and process design and control, the applicability domain (AD) of a mathematical model y = f(x) between properties, activities, and features x is constructed. As there are multiple AD methods, each with its own set of hyperparameters, it is necessary to select an appropriate AD method and hyperparameters for each data set and mathematical model. However, there is no method for optimizing the AD model. This study proposes a method for evaluating and optimizing the AD model for each data set and a mathematical model. Using the predictions of double cross-validation with all samples, the relationship between coverage and root-mean-squared error (RMSE) was calculated for all combinations of AD methods and their hyperparameters, and the area under the coverage and RMSE curve (AUCR) was calculated. The AD model with the lowest AUCR value was selected as the optimal fit for the mathematical model. The proposed method was validated using eight data sets, including molecules, materials, and spectra, demonstrating that the proposed method could generate optimal AD models for all data sets. The Python code for the proposed method is available at https://github.com/hkaneko1985/dcekit.
引用
收藏
页码:11453 / 11458
页数:6
相关论文
共 50 条
  • [31] Assessment of Methods To Define the Applicability Domain of Structural Alert Models
    Ellison, C. M.
    Sherhod, R.
    Cronin, M. T. D.
    Enoch, S. J.
    Madden, J. C.
    Judson, P. N.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (05) : 975 - 985
  • [33] Optimization of model parameters and hyperparameters in deep learning models for spatial interaction prediction
    Liu, Lin
    Cao, Xiaojing
    Wang, Hengsheng
    Xiang, Junying
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 266
  • [34] Machine learning and domain decomposition methods - a survey
    Axel Klawonn
    Martin Lanser
    Janine Weber
    Computational Science and Engineering, 1 (1):
  • [35] Metaheuristic optimization of data preparation and machine learning hyperparameters for prediction of dynamic methane production
    Meola, Alberto
    Winkler, Manuel
    Weinrich, Soeren
    BIORESOURCE TECHNOLOGY, 2023, 372
  • [36] Machine learning methods applied to drilling rate of penetration prediction and optimization - A review
    Barbosa, Luis Felipe F. M.
    Nascimento, Andreas
    Mathias, Mauro Hugo
    de Carvalho, Joao Andrade, Jr.
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2019, 183
  • [37] Comparison and evaluation of advanced machine learning methods for performance and emissions prediction of a gasoline Wankel rotary engine
    Wang, Huaiyu
    Ji, Changwei
    Shi, Cheng
    Ge, Yunshan
    Meng, Hao
    Yang, Jinxin
    Chang, Ke
    Wang, Shuofeng
    Energy, 2022, 248
  • [38] Comparison and evaluation of advanced machine learning methods for performance and emissions prediction of a gasoline Wankel rotary engine
    Wang, Huaiyu
    Ji, Changwei
    Shi, Cheng
    Ge, Yunshan
    Meng, Hao
    Yang, Jinxin
    Chang, Ke
    Wang, Shuofeng
    ENERGY, 2022, 248
  • [39] Methods for Prediction Optimization of the Constrained State-Preserved Extreme Learning Machine
    Goodman, Garrett
    Hirt, Quinn
    Shimizu, Cogan
    Ktistakis, Iosif Papadakis
    Alamaniotis, Miltiadis
    Bourbakis, Nikolaos
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 639 - 646
  • [40] The prediction and optimization of Hydraulic fracturing by integrating the numerical simulation and the machine learning methods
    Lizhe, Li
    Fujian, Zhou
    You, Zhou
    Zhuolin, Cai
    Bo, Wang
    Yingying, Zhao
    Yutian, Lu
    Energy Reports, 2022, 8 : 15338 - 15349