Exploring Hyperparameter Usage and Tuning in Machine Learning Research

被引:2
|
作者
Simon, Sebastian [1 ]
Kolyada, Nikolay [2 ]
Akiki, Christopher [3 ]
Potthast, Martin [3 ]
Stein, Benno [2 ]
Siegmund, Norbert [3 ]
机构
[1] Univ Leipzig, Leipzig, Germany
[2] Bauhaus Univ Weimar, Weimar, Germany
[3] Univ Leipzig, ScaDS AI Dresden, Leipzig, Germany
关键词
Hyperparameter; Hyperparameter Tuning; Configuration Settings;
D O I
10.1109/CAIN58948.2023.00016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The success of machine learning (ML) models depends on careful experimentation and optimization of their hyperparameters. Tuning can affect the reliability and accuracy of a trained model and is the subject of ongoing research. However, little is known on whether and how hyperparameters are used and optimized in research practice. This lack of knowledge not only limits the adoption of best practices for tuning in research, but also affects the reproducibility of published results. Our research systematically analyzes the use and tuning of hyperparameters in ML publications. For this, we analyze 2000 code repositories and their associated research papers from Papers with Code. We compare the use and tuning of hyperparameters of three widely used ML libraries: scikit-learn, TensorFlow, and PyTorch. Our results show that the most of the available hyperparameters remain untouched, and those that have been changed use constant values. In particular, there is a significant difference between tuning hyperparameters and the reporting about it in the corresponding research papers. Our results suggest that there is a need for improved research and reporting practices when using ML methods to improve the reproducibility of published results.
引用
收藏
页码:68 / 79
页数:12
相关论文
共 50 条
  • [1] Enabling Hyperparameter Tuning of Machine Learning Classifiers in Production
    Sandha, Sandeep Singh
    Aggarwal, Mohit
    Saha, Swapnil Sayan
    Srivastava, Mani
    [J]. 2021 IEEE THIRD INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2021), 2021, : 262 - 271
  • [2] OptABC: an Optimal Hyperparameter Tuning Approach for Machine Learning Algorithms
    Zahedi, Leila
    Mohammadi, Farid Ghareh
    Amini, M. Hadi
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 1138 - 1145
  • [3] Hyperparameter Tuning for Causal Inference with Double Machine Learning: A Simulation Study
    Bach, Philipp
    Schacht, Oliver
    Chernozhukov, Victor
    Klaassen, Sven
    Spindler, Martin
    [J]. CAUSAL LEARNING AND REASONING, VOL 236, 2024, 236 : 1065 - 1117
  • [4] Effect of hyperparameter tuning on classical machine learning models in detecting potholes
    Govender, Shaolin Lee
    Joseph, Seena
    Singh, Alveen
    [J]. 2023 CONFERENCE ON INFORMATION COMMUNICATIONS TECHNOLOGY AND SOCIETY, ICTAS, 2023, : 120 - 126
  • [5] Impact of Hyperparameter Tuning on Machine Learning Models in Stock Price Forecasting
    Hoque, Kazi Ekramul
    Aljamaan, Hamoud
    [J]. IEEE ACCESS, 2021, 9 : 163815 - 163830
  • [6] A Statistical Approach to Hyperparameter Tuning of Deep Learning for Construction Machine Classification
    Ottoni, Andre Luiz C.
    Novo, Marcela S.
    Oliveira, Marcos S.
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (04) : 5117 - 5128
  • [7] Automatic Hyperparameter Tuning of Machine Learning Models under Time Constraints
    Wang, Zhen
    Agung, Mulya
    Egawa, Ryusuke
    Suda, Reiji
    Takizawa, Hiroyuki
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4967 - 4973
  • [8] Hyperparameter Tuning for Machine Learning Algorithms Used for Arabic Sentiment Analysis
    Elgeldawi, Enas
    Sayed, Awny
    Galal, Ahmed R.
    Zaki, Alaa M.
    [J]. INFORMATICS-BASEL, 2021, 8 (04):
  • [9] Diabetes Prediction Using Machine Learning with Feature Engineering and Hyperparameter Tuning
    El Massari, Hakim
    Gherabi, Noreddine
    Qanouni, Fatima
    Mhammedi, Sajida
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 171 - 179
  • [10] A Statistical Approach to Hyperparameter Tuning of Deep Learning for Construction Machine Classification
    André Luiz C. Ottoni
    Marcela S. Novo
    Marcos S. Oliveira
    [J]. Arabian Journal for Science and Engineering, 2024, 49 : 5117 - 5128