Comparative Analysis of Machine Learning Models for Performance Prediction of the SPEC Benchmarks

被引:2
|
作者
Tousi, Ashkan [1 ]
Lujan, Mikel [1 ]
机构
[1] Univ Manchester, Dept Comp Sci, Manchester M13 9PL, Lancs, England
基金
英国工程与自然科学研究理事会;
关键词
Benchmark testing; Predictive models; Data models; Feature extraction; Software; Hardware; Analytical models; Machine learning; performance analysis; predictive models; SPEC CPU2017; supervised learning; REGRESSION; SELECTION;
D O I
10.1109/ACCESS.2022.3142240
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Simulation-based performance prediction is cumbersome and time-consuming. An alternative approach is to consider supervised learning as a means of predicting the performance scores of Standard Performance Evaluation Corporation (SPEC) benchmarks. SPEC CPU2017 contains a public dataset of results obtained by executing 43 standardised performance benchmarks organised into 4 suites on various system configurations. This paper analyses the dataset and aims to answer the following questions: I) can we accurately predict the SPEC results based on the configurations provided in the dataset, without having to actually run the benchmarks? II) what are the most important hardware and software features? III) what are the best predictive models and hyperparameters, in terms of prediction error and time? and IV) can we predict the performance of future systems using the past data? We present how to prepare data, select features, tune hyperparameters and evaluate regression models based on Multi-Task Elastic-Net, Decision Tree, Random Forest, and Multi-Layer Perceptron neural networks estimators. Feature selection is performed in three steps: removing zero variance features, removing highly correlated features, and Recursive Feature Elimination based on different feature importance metrics: elastic-net coefficients, tree-based importance measures and Permutation Importance. We select the best models using grid search on the hyperparameter space, and finally, compare and evaluate the performance of the models. We show that tree-based models with the original 29 features provide accurate predictions with an average error of less than 4%. The average error of faster Decision Tree and Random Forest models with 10 features is still below 6% and 5% respectively.
引用
收藏
页码:11994 / 12011
页数:18
相关论文
共 50 条
  • [41] A comparative study on student performance prediction using machine learning
    Yawen Chen
    Linbo Zhai
    Education and Information Technologies, 2023, 28 : 12039 - 12057
  • [42] Performance analysis of machine learning models for AQI prediction in Gorakhpur City: a critical study
    Patel, Prabhat Kumar
    Singh, Hrishikesh Kumar
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2024, 196 (10)
  • [43] Comparative analysis of machine learning models for daily suspended sediment concentration prediction in environmental monitoring
    Goldi Jarbais
    Pon Harshavardhanan
    Sādhanā, 50 (2)
  • [44] River water quality index prediction and uncertainty analysis: A comparative study of machine learning models
    Asadollah, Seyed Babak Haji Seyed
    Sharafati, Ahmad
    Motta, Davide
    Yaseen, Zaher Mundher
    JOURNAL OF ENVIRONMENTAL CHEMICAL ENGINEERING, 2021, 9 (01):
  • [45] Multisource Data Integration and Comparative Analysis of Machine Learning Models for On-Street Parking Prediction
    Inam, Saba
    Mahmood, Azhar
    Khatoon, Shaheen
    Alshamari, Majed
    Nawaz, Nazia
    SUSTAINABILITY, 2022, 14 (12)
  • [46] Comparative Analysis of Conventional Machine Learning and Graph Neural Network Models for Perovskite Property Prediction
    Jin, Jirui
    Faraji, Somayeh
    Liu, Bin
    Liu, Mingjie
    JOURNAL OF PHYSICAL CHEMISTRY C, 2024,
  • [47] Comparative Analysis of Machine Learning Models for Crop Yield Prediction Across Multiple Crop Types
    Yashraj Patil
    Harikrishnan Ramachandran
    Sridhevi Sundararajan
    P. Srideviponmalar
    SN Computer Science, 6 (1)
  • [48] Performance Assessment of Machine Learning Based Models for Diabetes Prediction
    Deo, Ridhi
    Panigrahi, Suranjan
    2019 IEEE HEALTHCARE INNOVATIONS AND POINT OF CARE TECHNOLOGIES (HI-POCT), 2019, : 147 - 150
  • [49] Comparative Performance of Recurrent Heart Failure Prediction Models: Incorporating Social Determinants and Applying Machine Learning
    Deshpande, Aniruddha
    Morris, Alanna A.
    Ho, Joyce C.
    Patel, Shivani A.
    CIRCULATION, 2022, 146
  • [50] Comparative performance of machine learning models for the classification of human gait
    Thakur, Divya
    Lalwani, Praveen
    BIOMEDICAL PHYSICS & ENGINEERING EXPRESS, 2024, 10 (02):