Data-efficient performance learning for configurable systems

被引：57

作者：

Guo, Jianmei ^{[1
]}

Yang, Dingyu ^{[2
]}

Siegmund, Norbert ^{[3
]}

Apel, Sven ^{[4
]}

Sarkar, Atrisha ^{[5
,6
]}

Valov, Pavel ^{[7
]}

Czarnecki, Krzysztof ^{[8
]}

Wasowski, Andrzej ^{[9
]}

Yu, Huiqun ^{[10
]}

机构：

[1] Alibaba Grp, Hangzhou, Zhejiang, Peoples R China

[2] Shanghai Dianji Univ, Shanghai, Peoples R China

[3] Bauhaus Univ Weimar, Weimar, Germany

[4] Univ Passau, Software Engn, Passau, Germany

[5] Univ Waterloo, David R Cheriton Sch Comp, Waterloo, ON, Canada

[6] Univ Waterloo, Autonomoose Self Driving Car Project, Waterloo, ON, Canada

[7] Univ Waterloo, Waterloo, ON, Canada

[8] Univ Waterloo, Elect & Comp Engn, Waterloo, ON, Canada

[9] IT Univ Copenhagen, Copenhagen, Denmark

[10] East China Univ Sci & Technol, Shanghai, Peoples R China

来源：

EMPIRICAL SOFTWARE ENGINEERING | 2018年 / 23卷 / 03期

基金：

加拿大自然科学与工程研究理事会; 中国国家自然科学基金;

关键词：

Performance prediction; Configurable systems; Regression; Model selection; Parameter tuning; PREDICTION;

D O I：

10.1007/s10664-017-9573-6

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Many software systems today are configurable, offering customization of functionality by feature selection. Understanding how performance varies in terms of feature selection is key for selecting appropriate configurations that meet a set of given requirements. Due to a huge configuration space and the possibly high cost of performance measurement, it is usually not feasible to explore the entire configuration space of a configurable system exhaustively. It is thus a major challenge to accurately predict performance based on a small sample of measured system variants. To address this challenge, we propose a data-efficient learning approach, called DECART, that combines several techniques of machine learning and statistics for performance prediction of configurable systems. DECART builds, validates, and determines a prediction model based on an available sample of measured system variants. Empirical results on 10 real-world configurable systems demonstrate the effectiveness and practicality of DECART. In particular, DECART achieves a prediction accuracy of 90% or higher based on a small sample, whose size is linear in the number of features. In addition, we propose a sample quality metric and introduce a quantitative analysis of the quality of a sample for performance prediction.

引用

页码：1826 / 1867

页数：42

共 50 条

[1] Data-efficient performance learning for configurable systems
Jianmei Guo
Dingyu Yang
Norbert Siegmund
Sven Apel
Atrisha Sarkar
Pavel Valov
Krzysztof Czarnecki
Andrzej Wasowski
Huiqun Yu
[J]. Empirical Software Engineering, 2018, 23 : 1826 - 1867
[2] Data-Efficient Reinforcement Learning for Complex Nonlinear Systems
Donge, Vrushabh S.
Lian, Bosen
Lewis, Frank L.
Davoudi, Ali
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (03) : 1391 - 1402
[3] Data-Efficient Graph Learning
Ding, Kaize
[J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22663 - 22663
[4] Data-Efficient Performance Modeling for Configurable Big Data Frameworks by Reducing Information Overlap Between Training Examples
Liu, Zhiqiang
Shi, Xuanhua
Jin, Hai
[J]. BIG DATA RESEARCH, 2022, 30
[5] Data-Efficient Hierarchical Reinforcement Learning
Nachum, Ofir
Gu, Shixiang
Lee, Honglak
Levine, Sergey
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[6] Uniform Priors for Data-Efficient Learning
Sinha, Samarth
Roth, Karsten
Goyal, Anirudh
Ghassemi, Marzyeh
Akata, Zeynep
Larochelle, Hugo
Garg, Animesh
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4026 - 4037
[7] Data-efficient Learning of Morphology and Controller for a Microrobot
Liao, Thomas
Wang, Grant
Yang, Brian
Lee, Rene
Pister, Kristofer
Levine, Sergey
Calandra, Roberto
[J]. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 2488 - 2494
[8] Data-Efficient Reinforcement Learning for Malaria Control
Zou, Lixin
[J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 507 - 513
[9] Pretraining Representations for Data-Efficient Reinforcement Learning
Schwarzer, Max
Rajkumar, Nitarshan
Noukhovitch, Michael
Anand, Ankesh
Charlin, Laurent
Hjelm, Devon
Bachman, Philip
Courville, Aaron
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[10] Elliptic PDE learning is provably data-efficient
Boulle, Nicolas
Halikias, Diana
Townsend, Alex
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (39)

← 1 2 3 4 5 →