Comparative Analysis of Machine Learning Models for Performance Prediction of the SPEC Benchmarks

被引：2

作者：

Tousi, Ashkan ^{[1
]}

Lujan, Mikel ^{[1
]}

机构：

[1] Univ Manchester, Dept Comp Sci, Manchester M13 9PL, Lancs, England

来源：

IEEE ACCESS | 2022年 / 10卷

基金：

英国工程与自然科学研究理事会;

关键词：

Benchmark testing; Predictive models; Data models; Feature extraction; Software; Hardware; Analytical models; Machine learning; performance analysis; predictive models; SPEC CPU2017; supervised learning; REGRESSION; SELECTION;

D O I：

10.1109/ACCESS.2022.3142240

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Simulation-based performance prediction is cumbersome and time-consuming. An alternative approach is to consider supervised learning as a means of predicting the performance scores of Standard Performance Evaluation Corporation (SPEC) benchmarks. SPEC CPU2017 contains a public dataset of results obtained by executing 43 standardised performance benchmarks organised into 4 suites on various system configurations. This paper analyses the dataset and aims to answer the following questions: I) can we accurately predict the SPEC results based on the configurations provided in the dataset, without having to actually run the benchmarks? II) what are the most important hardware and software features? III) what are the best predictive models and hyperparameters, in terms of prediction error and time? and IV) can we predict the performance of future systems using the past data? We present how to prepare data, select features, tune hyperparameters and evaluate regression models based on Multi-Task Elastic-Net, Decision Tree, Random Forest, and Multi-Layer Perceptron neural networks estimators. Feature selection is performed in three steps: removing zero variance features, removing highly correlated features, and Recursive Feature Elimination based on different feature importance metrics: elastic-net coefficients, tree-based importance measures and Permutation Importance. We select the best models using grid search on the hyperparameter space, and finally, compare and evaluate the performance of the models. We show that tree-based models with the original 29 features provide accurate predictions with an average error of less than 4%. The average error of faster Decision Tree and Random Forest models with 10 features is still below 6% and 5% respectively.

引用

页码：11994 / 12011

页数：18

共 50 条

[41] A comparative study on student performance prediction using machine learning
Yawen Chen
Linbo Zhai
Education and Information Technologies, 2023, 28 : 12039 - 12057
[42] Performance analysis of machine learning models for AQI prediction in Gorakhpur City: a critical study
Patel, Prabhat Kumar
Singh, Hrishikesh Kumar
ENVIRONMENTAL MONITORING AND ASSESSMENT, 2024, 196 (10)
[43] Comparative analysis of machine learning models for daily suspended sediment concentration prediction in environmental monitoring
Goldi Jarbais
Pon Harshavardhanan
Sādhanā, 50 (2)
[44] River water quality index prediction and uncertainty analysis: A comparative study of machine learning models
Asadollah, Seyed Babak Haji Seyed
Sharafati, Ahmad
Motta, Davide
Yaseen, Zaher Mundher
JOURNAL OF ENVIRONMENTAL CHEMICAL ENGINEERING, 2021, 9 (01):
[45] Multisource Data Integration and Comparative Analysis of Machine Learning Models for On-Street Parking Prediction
Inam, Saba
Mahmood, Azhar
Khatoon, Shaheen
Alshamari, Majed
Nawaz, Nazia
SUSTAINABILITY, 2022, 14 (12)
[46] Comparative Analysis of Conventional Machine Learning and Graph Neural Network Models for Perovskite Property Prediction
Jin, Jirui
Faraji, Somayeh
Liu, Bin
Liu, Mingjie
JOURNAL OF PHYSICAL CHEMISTRY C, 2024,
[47] Comparative Analysis of Machine Learning Models for Crop Yield Prediction Across Multiple Crop Types
Yashraj Patil
Harikrishnan Ramachandran
Sridhevi Sundararajan
P. Srideviponmalar
SN Computer Science, 6 (1)
[48] Performance Assessment of Machine Learning Based Models for Diabetes Prediction
Deo, Ridhi
Panigrahi, Suranjan
2019 IEEE HEALTHCARE INNOVATIONS AND POINT OF CARE TECHNOLOGIES (HI-POCT), 2019, : 147 - 150
[49] Comparative Performance of Recurrent Heart Failure Prediction Models: Incorporating Social Determinants and Applying Machine Learning
Deshpande, Aniruddha
Morris, Alanna A.
Ho, Joyce C.
Patel, Shivani A.
CIRCULATION, 2022, 146
[50] Comparative performance of machine learning models for the classification of human gait
Thakur, Divya
Lalwani, Praveen
BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2024, 10 (02):

← 1 2 3 4 5 →