Machine learning-based surrogate modeling for data-driven optimization: a comparison of subset selection for regression techniques

被引:0
|
作者
Sun Hye Kim
Fani Boukouvala
机构
[1] Georgia Institute of Technology,School of Chemical & Biomolecular Engineering
来源
Optimization Letters | 2020年 / 14卷
关键词
Machine Learning; Surrogate modeling; Black-box optimization; Data-driven optimization; Subset selection for regression;
D O I
暂无
中图分类号
学科分类号
摘要
Optimization of simulation-based or data-driven systems is a challenging task, which has attracted significant attention in the recent literature. A very efficient approach for optimizing systems without analytical expressions is through fitting surrogate models. Due to their increased flexibility, nonlinear interpolating functions, such as radial basis functions and Kriging, have been predominantly used as surrogates for data-driven optimization; however, these methods lead to complex nonconvex formulations. Alternatively, commonly used regression-based surrogates lead to simpler formulations, but they are less flexible and inaccurate if the form is not known a priori. In this work, we investigate the efficiency of subset selection regression techniques for developing surrogate functions that balance both accuracy and complexity. Subset selection creates sparse regression models by selecting only a subset of original features, which are linearly combined to generate a diverse set of surrogate models. Five different subset selection techniques are compared with commonly used nonlinear interpolating surrogate functions with respect to optimization solution accuracy, computation time, sampling requirements, and model sparsity. Our results indicate that subset selection-based regression functions exhibit promising performance when the dimensionality is low, while interpolation performs better for higher dimensional problems.
引用
收藏
页码:989 / 1010
页数:21
相关论文
共 50 条
  • [1] Machine learning-based surrogate modeling for data-driven optimization: a comparison of subset selection for regression techniques
    Kim, Sun Hye
    Boukouvala, Fani
    [J]. OPTIMIZATION LETTERS, 2020, 14 (04) : 989 - 1010
  • [2] Data-driven surrogate modeling of multiphase flows using machine learning techniques
    Ganti, Himakar
    Khare, Prashant
    [J]. COMPUTERS & FLUIDS, 2020, 211
  • [3] Machine learning-based data-driven robust optimization approach under uncertainty
    Zhang, Chenhan
    Wang, Zhenlei
    Wang, Xin
    [J]. JOURNAL OF PROCESS CONTROL, 2022, 115 : 1 - 11
  • [4] Adapting Data-Driven Techniques to Improve Surrogate Machine Learning Model Performance
    Jones, Huw Rhys
    Popescu, Andrei C.
    Sulehman, Yusuf
    Mu, Tingting
    [J]. IEEE ACCESS, 2023, 11 : 23909 - 23925
  • [5] A Data-Driven Methodology for Guiding the Selection of Preprocessing Techniques in a Machine Learning Pipeline
    Garcia-Carraseo, Jorge
    Mate, Alejandro
    Trujillo, Juan
    [J]. INTELLIGENT INFORMATION SYSTEMS, CAISE FORUM 2023, 2023, 477 : 34 - 42
  • [6] Data-Driven Learning-Based Optimization for Distribution System State Estimation
    Zamzam, Ahmed S.
    Fu, Xiao
    Sidiropoulos, Nicholas D.
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2019, 34 (06) : 4796 - 4805
  • [7] A Framework for Modeling and Optimization of Data-Driven Energy Systems Using Machine Learning
    Danish, Mir Sayed Shah
    [J]. IEEE Transactions on Artificial Intelligence, 2024, 5 (05): : 2434 - 2443
  • [8] Data-driven Autism Biomarkers Selection by using Signal Processing and Machine Learning Techniques
    Antovski, Antonio
    Kostadinovska, Stefani
    Simjanoska, Monika
    Eftimov, Tome
    Ackovska, Nevena
    Bogdanova, Ana Madevska
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 3 (BIOINFORMATICS), 2019, : 201 - 208
  • [9] A data-driven Machine Learning approach to creativity and innovation techniques selection in solution development
    de Carvalho Botega, Luiz Fernando
    da Silva, Jonny Carlos
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 257
  • [10] A machine learning-based data-driven method for risk analysis of marine accidents
    Feng, Yinwei
    Wang, Huanxin
    Xia, Guoqing
    Cao, Wenjie
    Li, Tianyi
    Wang, Xinjian
    Liu, Zhengjiang
    [J]. JOURNAL OF MARINE ENGINEERING AND TECHNOLOGY, 2024,