Data-efficient performance learning for configurable systems

被引:57
|
作者
Guo, Jianmei [1 ]
Yang, Dingyu [2 ]
Siegmund, Norbert [3 ]
Apel, Sven [4 ]
Sarkar, Atrisha [5 ,6 ]
Valov, Pavel [7 ]
Czarnecki, Krzysztof [8 ]
Wasowski, Andrzej [9 ]
Yu, Huiqun [10 ]
机构
[1] Alibaba Grp, Hangzhou, Zhejiang, Peoples R China
[2] Shanghai Dianji Univ, Shanghai, Peoples R China
[3] Bauhaus Univ Weimar, Weimar, Germany
[4] Univ Passau, Software Engn, Passau, Germany
[5] Univ Waterloo, David R Cheriton Sch Comp, Waterloo, ON, Canada
[6] Univ Waterloo, Autonomoose Self Driving Car Project, Waterloo, ON, Canada
[7] Univ Waterloo, Waterloo, ON, Canada
[8] Univ Waterloo, Elect & Comp Engn, Waterloo, ON, Canada
[9] IT Univ Copenhagen, Copenhagen, Denmark
[10] East China Univ Sci & Technol, Shanghai, Peoples R China
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
Performance prediction; Configurable systems; Regression; Model selection; Parameter tuning; PREDICTION;
D O I
10.1007/s10664-017-9573-6
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many software systems today are configurable, offering customization of functionality by feature selection. Understanding how performance varies in terms of feature selection is key for selecting appropriate configurations that meet a set of given requirements. Due to a huge configuration space and the possibly high cost of performance measurement, it is usually not feasible to explore the entire configuration space of a configurable system exhaustively. It is thus a major challenge to accurately predict performance based on a small sample of measured system variants. To address this challenge, we propose a data-efficient learning approach, called DECART, that combines several techniques of machine learning and statistics for performance prediction of configurable systems. DECART builds, validates, and determines a prediction model based on an available sample of measured system variants. Empirical results on 10 real-world configurable systems demonstrate the effectiveness and practicality of DECART. In particular, DECART achieves a prediction accuracy of 90% or higher based on a small sample, whose size is linear in the number of features. In addition, we propose a sample quality metric and introduce a quantitative analysis of the quality of a sample for performance prediction.
引用
收藏
页码:1826 / 1867
页数:42
相关论文
共 50 条
  • [1] Data-efficient performance learning for configurable systems
    Jianmei Guo
    Dingyu Yang
    Norbert Siegmund
    Sven Apel
    Atrisha Sarkar
    Pavel Valov
    Krzysztof Czarnecki
    Andrzej Wasowski
    Huiqun Yu
    [J]. Empirical Software Engineering, 2018, 23 : 1826 - 1867
  • [2] Data-Efficient Reinforcement Learning for Complex Nonlinear Systems
    Donge, Vrushabh S.
    Lian, Bosen
    Lewis, Frank L.
    Davoudi, Ali
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (03) : 1391 - 1402
  • [3] Data-Efficient Graph Learning
    Ding, Kaize
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22663 - 22663
  • [4] Data-Efficient Performance Modeling for Configurable Big Data Frameworks by Reducing Information Overlap Between Training Examples
    Liu, Zhiqiang
    Shi, Xuanhua
    Jin, Hai
    [J]. BIG DATA RESEARCH, 2022, 30
  • [5] Data-Efficient Hierarchical Reinforcement Learning
    Nachum, Ofir
    Gu, Shixiang
    Lee, Honglak
    Levine, Sergey
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [6] Uniform Priors for Data-Efficient Learning
    Sinha, Samarth
    Roth, Karsten
    Goyal, Anirudh
    Ghassemi, Marzyeh
    Akata, Zeynep
    Larochelle, Hugo
    Garg, Animesh
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4026 - 4037
  • [7] Data-efficient Learning of Morphology and Controller for a Microrobot
    Liao, Thomas
    Wang, Grant
    Yang, Brian
    Lee, Rene
    Pister, Kristofer
    Levine, Sergey
    Calandra, Roberto
    [J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 2488 - 2494
  • [8] Data-Efficient Reinforcement Learning for Malaria Control
    Zou, Lixin
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 507 - 513
  • [9] Pretraining Representations for Data-Efficient Reinforcement Learning
    Schwarzer, Max
    Rajkumar, Nitarshan
    Noukhovitch, Michael
    Anand, Ankesh
    Charlin, Laurent
    Hjelm, Devon
    Bachman, Philip
    Courville, Aaron
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] Elliptic PDE learning is provably data-efficient
    Boulle, Nicolas
    Halikias, Diana
    Townsend, Alex
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (39)