Fast Modeling of Analytics Workloads for Big Data Services

Citations: 0
Authors
Yang, Lin [1]
Li, Changsheng [1]
Fan, Liya [1]
Xu, Jingmin [1]
Affiliations
[1] IBM Res China, Beijing, Peoples R China
Keywords
big data; analytics; cloud computing; service; modeling; machine learning; MapReduce
DOI
10.1109/ICSS.2014.37
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Building models to predict the execution of analytics workloads is a foundational capability that enables key scenarios for big data services, such as SLA-driven service provisioning and elastic auto-scaling. Given the variety of infrastructure and workload characteristics, it is preferable to build the models in a "black-box" fashion, for example by leveraging machine learning techniques. However, this approach makes assumptions about the volume and quality of a workload's existing records to learn from, which require sophisticated benchmarking or long-term monitoring. In this paper, we present a method to accelerate the modeling process of an analytics workload for quick time-to-value in the context of big data services. Specifically, clustering and transfer learning techniques are leveraged for this acceleration by shifting the data collection from the online service phase to the offline preparation phase. This paper focuses on the conceived service model and the fast modeling techniques. Their feasibility is demonstrated by experiments.
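The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea it names: models prepared offline for clusters of benchmark workloads are reused online, so that a new workload needs only a few observed runs. Everything here is an assumption for illustration, not the authors' method: the cluster names, the two-value workload signature, the linear runtime model, and the single-scale-factor adaptation standing in for transfer learning.

```python
from math import dist

# Hypothetical offline library, assumed to be built in the preparation phase.
# Each cluster holds a centroid over an assumed (cpu_share, io_share) signature
# and a base model: runtime_seconds = slope * input_gb + intercept.
OFFLINE_CLUSTERS = {
    "cpu_bound": {"centroid": (0.9, 0.1), "slope": 12.0, "intercept": 30.0},
    "io_bound": {"centroid": (0.2, 0.8), "slope": 25.0, "intercept": 10.0},
}


def nearest_cluster(signature):
    """Return the name of the offline cluster whose centroid is closest."""
    return min(
        OFFLINE_CLUSTERS,
        key=lambda name: dist(signature, OFFLINE_CLUSTERS[name]["centroid"]),
    )


def transfer_model(cluster_name, online_samples):
    """Adapt the cluster's base model with a few online (input_gb, runtime) pairs.

    Only one scale factor is fitted (least squares), so very few online
    observations are needed. This is a simple stand-in for transfer learning.
    """
    base = OFFLINE_CLUSTERS[cluster_name]

    def base_predict(input_gb):
        return base["slope"] * input_gb + base["intercept"]

    numerator = sum(y * base_predict(x) for x, y in online_samples)
    denominator = sum(base_predict(x) ** 2 for x, _ in online_samples)
    scale = numerator / denominator
    return lambda input_gb: scale * base_predict(input_gb)


if __name__ == "__main__":
    cluster = nearest_cluster((0.8, 0.2))
    predict = transfer_model(cluster, [(1.0, 84.0), (2.0, 108.0)])
    print(cluster, predict(5.0))
```

Fitting a single parameter online is a deliberate simplification: the fewer parameters left to estimate during the service phase, the fewer monitored runs are needed before the model becomes usable.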
Pages: 101 - 105
Page count: 5
Related papers
50 in total
  • [1] Understanding Big Data Analytics Workloads on Modern Processors
    Jia, Zhen
    Zhan, Jianfeng
    Wang, Lei
    Luo, Chunjie
    Gao, Wanling
    Jin, Yi
    Han, Rui
    Zhang, Lixin
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (06) : 1797 - 1810
  • [2] Modeling and Optimization for Big Data Analytics
    Slavakis, Konstantinos
    Giannakis, Georgios B.
    Mateos, Gonzalo
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2014, 31 (05) : 18 - 31
  • [3] Monitoring Data Integrity in Big Data Analytics Services
    Mantzoukas, Konstantinos
    Kloukinas, Christos
    Spanoudakis, George
    [J]. PROCEEDINGS 2018 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2018 : 904 - 907
  • [4] Fast Big Data Analytics for Smart Meter Data
    Mohajeri, Morteza
    Ghassemi, Abolfazl
    Gulliver, T. Aaron
    [J]. IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2020, 1 : 1864 - 1871
  • [5] Characterizing big data analytics workloads on POWER8 SMT processors
    Jia Zhen
    Zhan Jianfeng
    Wang Lei
    Zhang Lixin
    [J]. High Technology Letters, 2017, 23 (03) : 245 - 251
  • [6] Comparative Evaluation of Big-Data Systems on Scientific Image Analytics Workloads
    Mehta, Parmita
    Dorkenwald, Sven
    Zhao, Dongfang
    Kaftan, Tomer
    Cheung, Alvin
    Balazinska, Magdalena
    Rokem, Ariel
    Connolly, Andrew
    Vanderplas, Jacob
    AlSayyad, Yusra
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (11) : 1226 - 1237
  • [7] Big Data Analytics Services for Enhancing Business Intelligence
    Sun, Zhaohao
    Sun, Lizhe
    Strang, Kenneth
    [J]. JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2018, 58 (02) : 162 - 169
  • [8] Enhancing Digital Health Services with Big Data Analytics
    Berros, Nisrine
    El Mendili, Fatna
    Filaly, Youness
    El Idrissi, Younes El Bouzekri
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (02)
  • [9] Evolutionary Scheduling of Dynamic Multitasking Workloads for Big-Data Analytics in Elastic Cloud
    Zhang, Fan
    Cao, Junwei
    Tan, Wei
    Khan, Samee U.
    Li, Keqin
    Zomaya, Albert Y.
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (03) : 338 - 351
  • [10] Dynamic and Transparent Memory Sharing for Accelerating Big Data Analytics Workloads in Virtualized Cloud
    Cao, Wenqi
    Liu, Ling
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018 : 191 - 200