Selecting Machine Learning Algorithms using Regression Models

被引:25
|
作者
Doan, Tri [1 ]
Kalita, Jugal [1 ]
机构
[1] Univ Colorado, Dept Comp Sci, Colorado Springs, CO 80907 USA
关键词
Meta-learning; regression; dimensionality reduction; combined metric;
D O I
10.1109/ICDMW.2015.43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In performing data mining, a common task is to search for the most appropriate algorithm(s) to retrieve important information from data. With an increasing number of available data mining techniques, it may be impractical to experiment with many techniques on a specific dataset of interest to find the best algorithm(s). In this paper, we demonstrate the suitability of tree-based multi-variable linear regression in predicting algorithm performance. We take into account prior machine learning experience to construct meta-knowledge for supervised learning. The idea is to use summary knowledge about datasets along with past performance of algorithms on these datasets to build this meta-knowledge. We augment pure statistical summaries with descriptive features and a misclassification cost, and discover that transformed datasets obtained by reducing a high dimensional feature space to a smaller dimension still retain significant characteristic knowledge necessary to predict algorithm performance. Our approach works well for both numerical and nominal data obtained from real world environments.
引用
收藏
页码:1498 / 1505
页数:8
相关论文
共 50 条
  • [1] Machine learning embedded EM algorithms for semiparametric mixture regression models
    Xue, Jiacheng
    Yao, Weixin
    Xiang, Sijia
    [J]. COMPUTATIONAL STATISTICS, 2024,
  • [2] Capacity Loss Analysis Using Machine Learning Regression Algorithms
    Atay, Sergen
    Ayranci, Ahmet Aytug
    Erkmen, Burcu
    [J]. 2022 9TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ICEEE 2022), 2022, : 10 - 13
  • [3] The Feasibility of Quantum Algorithms Using Machine Learning Models
    Citko, Wieslaw
    Sienko, Wieslaw
    [J]. PRZEGLAD ELEKTROTECHNICZNY, 2019, 95 (09): : 146 - 149
  • [4] Application and comparison of several machine learning algorithms and their integration models in regression problems
    Jui-Chan Huang
    Kuo-Min Ko
    Ming-Hung Shu
    Bi-Min Hsu
    [J]. Neural Computing and Applications, 2020, 32 : 5461 - 5469
  • [5] Application and comparison of several machine learning algorithms and their integration models in regression problems
    Huang, Jui-Chan
    Ko, Kuo-Min
    Shu, Ming-Hung
    Hsu, Bi-Min
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (10): : 5461 - 5469
  • [6] The feasibility and flexibility of selecting quasars by variability using ensemble machine learning algorithms
    羊达明
    谢彰亮
    王俊贤
    [J]. Research in Astronomy and Astrophysics, 2021, 21 (04) : 195 - 205
  • [7] The feasibility and flexibility of selecting quasars by variability using ensemble machine learning algorithms
    Yang, Da-Ming
    Xie, Zhang-Liang
    Wang, Jun-Xian
    [J]. RESEARCH IN ASTRONOMY AND ASTROPHYSICS, 2021, 21 (04)
  • [8] Analysis of Diabetic Prediction & Regression System using Machine Learning Algorithms
    Ghodeswar, Ujwala
    Keote, Minal
    [J]. JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (03) : 750 - 762
  • [9] An Approach for Potato Yield Prediction Using Machine Learning Regression Algorithms
    Patnaik, Prabhu Prasad
    Padhy, Neelamadhab
    [J]. NEXT GENERATION OF INTERNET OF THINGS, 2023, 445 : 327 - 336
  • [10] AHMoSe: A knowledge-based visual support system for selecting regression machine learning models
    Rojo, Diego
    Htun, Nyi Nyi
    Parra, Denis
    De Croon, Robin
    Verbert, Katrien
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 187