ATCS: Auto-Tuning Configurations of Big Data Frameworks Based on Generative Adversarial Nets

被引:18
|
作者
Li, Mingyu [1 ,2 ]
Liu, Zhiqiang [1 ]
Shi, Xuanhua [1 ]
Jin, Hai [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Natl Engn Res Ctr Big Data Technol & Syst, Serv Comp Technol & Syst Lab,Cluster & Grid Comp, Wuhan 430074, Peoples R China
[2] Liupanshui Normal Univ, Sch Math & Comp Sci, Liupanshui 553004, Peoples R China
关键词
Big data; generative adversarial nets; spark; genetic algorithm; automatic tune parameters; OPTIMIZATION; ALGORITHM;
D O I
10.1109/ACCESS.2020.2979812
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big data processing frameworks (e.g., Spark, Storm) have been extensively used for massive data processing in the industry. To improve the performance and robustness of these frameworks, developers provide users with highly-configurable parameters. Due to the high-dimensional parameter space and complicated interactions of parameters, manual tuning of parameters is time-consuming and ineffective. Building performance-predicting models for big data frameworks is challenging for several reasons: (1) the significant time required to collect training data and (2) the poor accuracy of the prediction model when training data are limited. To meet this challenge, we proposes an auto-tuning configuration parameters system (ATCS), a new auto-tuning approach based on Generative Adversarial Nets (GAN). ATCS can build a performance prediction model with less training data and without sacrificing model accuracy. Moreover, an optimized Genetic Algorithm (GA) is used in ATCS to explore the parameter space for optimum solutions. To prove the effectiveness of ATCS, we select five frequently-used workloads in Spark, each of which runs on five different sized data sets. The results demonstrate that ATCS improves the performance of five frequently-used Spark workloads compared to the default configurations. We achieved a performance increase of 3.5x on average, with a maximum of 6.9x. To obtain similar model accuracy, experiment results also demonstrate that the quantity of ATCS training data is only 6% of Deep Neural Network (DNN) data, 13% of Support Vector Machine (SVM) data, 18% of Decision Tree (DT) data. Moreover, compared to other machine learning models, the average performance increase of ATCS is 1.7x that of DNN, 1.6x that of SVM, 1.7x that of DT on the five typical Spark programs.
引用
收藏
页码:50485 / 50496
页数:12
相关论文
共 50 条
  • [41] A Generative Adversarial Nets Method for Monitoring Data Generation on Aircraft Engines
    FU Qiang
    WANG Huawei
    Transactions of Nanjing University of Aeronautics and Astronautics, 2019, 36 (04) : 609 - 616
  • [42] Image Data Augmentation for SAR Sensor via Generative Adversarial Nets
    Cui, Zongyong
    Zhang, Mingrui
    Cao, Zongjie
    Cao, Changjie
    IEEE ACCESS, 2019, 7 : 42255 - 42268
  • [43] Attribute Augmented Network Embedding Based on Generative Adversarial Nets
    Zheng, Conghui
    Pan, Li
    Wu, Peng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3473 - 3487
  • [44] Imbalanced sentiment classification based on sequence generative adversarial nets
    Wang, Chuantao
    Yang, Xuexin
    Ding, Linkai
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (05) : 7909 - 7919
  • [45] Compressed Sensing MRI Reconstruction Based on Generative Adversarial Nets
    Jiang, Tao
    Tao, Jinxu
    Ye, Zhongfu
    Qiu, Bensheng
    Xu, Jinzhang
    2018 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (CSSE 2018), 2018, : 148 - 156
  • [46] A rub fault recognition method based on generative adversarial nets
    Wei Wang
    Weidong Liu
    Jing Li
    Wei Peng
    Journal of Mechanical Science and Technology, 2020, 34 : 1389 - 1397
  • [47] Data Augmentation for Communication Emitter Identification Using Generative Adversarial Nets
    Tian, Jianwei
    Zhang, Xu
    Zhu, Hongyu
    Yue, Gang
    Sun, Zhuo
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, VOL. 1, 2022, 878 : 227 - 235
  • [48] A Road Extraction Method Based on Conditional Generative Adversarial Nets
    Lu, Chuanwei
    Sun, Qun
    Zhao, Yunpeng
    Sun, Shijie
    Ma, Jingzhen
    Cheng, Mianmian
    Li, Yuanfu
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2021, 46 (06): : 807 - 815
  • [49] GANCCRobot: Generative adversarial nets based chinese calligraphy robot
    Wu, Ruiqi
    Zhou, Changle
    Chao, Fei
    Yang, Longzhi
    Lin, Chih-Min
    Shang, Changjing
    INFORMATION SCIENCES, 2020, 516 : 474 - 490
  • [50] A rub fault recognition method based on generative adversarial nets
    Wang, Wei
    Liu, Weidong
    Li, Jing
    Peng, Wei
    JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2020, 34 (04) : 1389 - 1397