DeepCAT+: A Low-Cost and Transferrable Online Configuration Auto-Tuning Approach for Big Data Frameworks

被引:0
|
作者
Dou, Hui [1 ]
Wang, Yilun [1 ]
Zhang, Yiwen [1 ]
Chen, Pengfei [2 ]
Zheng, Zibin [2 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Hefei 230093, Peoples R China
[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510275, Peoples R China
基金
中国国家自然科学基金;
关键词
Performance optimization; online configuration tuning; big data framework; deep reinforcement learning; TUNING SYSTEM;
D O I
10.1109/TPDS.2024.3459889
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Big data frameworks usually provide a large numberof performance-related parameters. Online auto-tuning these pa-rameters based on deep reinforcement learning (DRL) to achieve abetter performance has shown their advantages over search-basedand machine learning-based approaches. Unfortunately, the timecost during the online tuning phase of conventional DRL-basedmethods is still heavy, especially for Big Data applications. There-fore, in this paper, we propose DeepCAT+, a low-cost and trans-ferrable deep reinforcement learning-based approach to achieveonline configuration auto-tuning for Big Data frameworks. Toreduce the total online tuning cost and increase the adaptability: 1)DeepCAT+utilizes the TD3 algorithm instead of DDPG to alleviatevalue overestimation; 2) DeepCAT+modifies the conventional ex-perience replay to fully utilize the rare but valuable transitions viaa novel reward-driven prioritized experience replay mechanism;3) DeepCAT+designs a Twin-Q Optimizer to estimate the exe-cution time of each action without the costly configuration eval-uation and optimize the sub-optimal ones to achieve a low-costexploration-exploitation tradeoff; 4) Furthermore, DeepCAT+also implements an Online Continual Learner module based onProgressive Neural Networks to transfer knowledge from historicaltuning experiences. Experimental results based on a lab Spark clus-ter with HiBench benchmark applications show that DeepCAT+is able to speed up the best execution time by a factor of 1.49x,1.63xand 1.65xon average respectively over the baselines, whileconsuming up to 50.08%, 53.39% and 70.79% less total tuningtime. In addition, DeepCAT+also has a strong adaptability to thetime-varying environment of Big Data frameworks
引用
收藏
页码:2114 / 2131
页数:18
相关论文
共 10 条
  • [1] DeepCAT: A Cost-Efficient Online Configuration Auto-Tuning Approach for Big Data Frameworks
    Dou, Hui
    Wang, Yilun
    Zhang, Yiwen
    Chen, Pengfei
    51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2022, 2022,
  • [2] TurBO: A cost-efficient configuration-based auto-tuning approach for cluster-based big data frameworks
    Dou, Hui
    Zhang, Lei
    Zhang, Yiwen
    Chen, Pengfei
    Zheng, Zibin
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2023, 177 : 89 - 105
  • [3] ATConf: auto-tuning high dimensional configuration parameters for big data processing frameworks
    Dou, Hui
    Wang, Kang
    Zhang, Yiwen
    Chen, Pengfei
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2023, 26 (05): : 2737 - 2755
  • [4] ATConf: auto-tuning high dimensional configuration parameters for big data processing frameworks
    Hui Dou
    Kang Wang
    Yiwen Zhang
    Pengfei Chen
    Cluster Computing, 2023, 26 : 2737 - 2755
  • [5] TuneChain: An Online Configuration Auto-Tuning Approach for Permissioned Blockchain Systems
    Lin, Junxiong
    Deng, Ruijun
    Lu, Zhihui
    Zhang, Yiguang
    Duan, Qiang
    2024 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, ICWS 2024, 2024, : 512 - 523
  • [6] Teaching PID Auto-Tuning Using a Low-Cost Control Kit
    Ibrahim, Dogan
    Abu Hasna, Jamal F.
    INTERNATIONAL JOURNAL OF ENGINEERING EDUCATION, 2013, 29 (01) : 239 - 247
  • [8] Low-cost embedded solution for auto-tuning DC motor controllers
    Dobra, P.
    Petreus, D.
    Moga, D.
    Trusca, Mirela
    Tomesc, L.
    Duma, R.
    2008 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR 2008), THETA 16TH EDITION, VOL II, PROCEEDINGS, 2008, : 280 - 284
  • [9] ATCS: Auto-Tuning Configurations of Big Data Frameworks Based on Generative Adversarial Nets
    Li, Mingyu
    Liu, Zhiqiang
    Shi, Xuanhua
    Jin, Hai
    IEEE ACCESS, 2020, 8 : 50485 - 50496
  • [10] LOCAT: Low-Overhead Online Configuration Auto-Tuning of Spark SQL Applications
    Xin, Jinhan
    Hwang, Kai
    Yu, Zhibin
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 674 - 684