Comparative Analysis of Machine Learning Models for Performance Prediction of the SPEC Benchmarks

Cited by: 2

Authors:

Tousi, Ashkan [1]

Lujan, Mikel [1]

Affiliations:

[1] Univ Manchester, Dept Comp Sci, Manchester M13 9PL, Lancs, England

Source:

IEEE ACCESS | 2022 / Vol. 10

Funding:

UK Engineering and Physical Sciences Research Council (EPSRC);

Keywords:

Benchmark testing; Predictive models; Data models; Feature extraction; Software; Hardware; Analytical models; Machine learning; performance analysis; predictive models; SPEC CPU2017; supervised learning; REGRESSION; SELECTION;

DOI:

10.1109/ACCESS.2022.3142240

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812;

Abstract:

Simulation-based performance prediction is cumbersome and time-consuming. An alternative approach is to consider supervised learning as a means of predicting the performance scores of Standard Performance Evaluation Corporation (SPEC) benchmarks. SPEC CPU2017 contains a public dataset of results obtained by executing 43 standardised performance benchmarks, organised into 4 suites, on various system configurations. This paper analyses the dataset and aims to answer the following questions: I) can we accurately predict the SPEC results based on the configurations provided in the dataset, without having to actually run the benchmarks? II) what are the most important hardware and software features? III) what are the best predictive models and hyperparameters, in terms of prediction error and time? and IV) can we predict the performance of future systems using past data? We present how to prepare data, select features, tune hyperparameters and evaluate regression models based on Multi-Task Elastic-Net, Decision Tree, Random Forest, and Multi-Layer Perceptron neural network estimators. Feature selection is performed in three steps: removing zero-variance features, removing highly correlated features, and Recursive Feature Elimination based on different feature importance metrics: elastic-net coefficients, tree-based importance measures and Permutation Importance. We select the best models using grid search on the hyperparameter space and, finally, compare and evaluate the performance of the models. We show that tree-based models with the original 29 features provide accurate predictions with an average error of less than 4%. The average error of the faster Decision Tree and Random Forest models with 10 features is still below 6% and 5%, respectively.
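
To illustrate the first two feature-selection steps named in the abstract (dropping zero-variance features, then dropping highly correlated ones), here is a minimal sketch in plain Python. It is not the authors' code: the function name `filter_features`, the 0.95 correlation threshold, and the toy configuration data are all assumptions made for illustration. The third step, Recursive Feature Elimination, is not shown.

```python
import math


def variance(col):
    """Population variance of a list of numbers."""
    mean = sum(col) / len(col)
    return sum((x - mean) ** 2 for x in col) / len(col)


def pearson(a, b):
    """Pearson correlation of two equal-length, non-constant lists."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    norm_a = math.sqrt(sum((x - mean_a) ** 2 for x in a))
    norm_b = math.sqrt(sum((y - mean_b) ** 2 for y in b))
    return cov / (norm_a * norm_b)


def filter_features(columns, corr_threshold=0.95):
    """Return the names of the features that survive both filters.

    columns: dict mapping feature name -> list of numeric values.
    corr_threshold: assumed cut-off, not a value taken from the paper.
    """
    # Step 1: a constant column carries no information for a regressor.
    varying = [name for name, col in columns.items() if variance(col) > 0.0]

    # Step 2: keep a feature only if it is not strongly correlated
    # with any feature that has already been kept.
    selected = []
    for name in varying:
        if all(
            abs(pearson(columns[name], columns[kept])) < corr_threshold
            for kept in selected
        ):
            selected.append(name)
    return selected


# Hypothetical system configurations (not from the SPEC CPU2017 dataset).
data = {
    "cores": [4, 8, 16, 32],
    "threads": [8, 16, 32, 64],   # always 2 x cores, so perfectly correlated
    "os_version": [7, 7, 7, 7],   # constant, so zero variance
    "mem_gb": [64, 16, 128, 32],
}

print(filter_features(data))
```

In this toy data, `os_version` is removed by the first filter and `threads` by the second. Which member of a correlated pair is kept depends on column order here, which is a simplification.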

Pages: 11994-12011

Page count: 18
