DRS: Auto-Scaling for Real-Time Stream Analytics

被引:42
|
作者
Fu, Tom Z. J. [1 ,2 ]
Ding, Jianbing [3 ]
Ma, Richard T. B. [4 ]
Winslett, Marianne [5 ]
Yang, Yin [6 ]
Zhang, Zhenjie [1 ]
机构
[1] Illinois Singapore Pte Ltd, Adv Digital Sci Ctr, Singapore 138602, Singapore
[2] Guangdong Univ Technol, Guangzhou, Guangdong, Peoples R China
[3] Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou 510275, Guangdong, Peoples R China
[4] Natl Univ Singapore, Sch Comp, Singapore 117418, Singapore
[5] Univ Illinois, Dept Comp Sci, 1304 W Springfield Ave, Urbana, IL 61801 USA
[6] Hamad Bin Khalifa Univ, Coll Sci & Engn, Doha, Qatar
基金
中国博士后科学基金;
关键词
Cloud computing; queueing network model; resource auto-scaling; stream data analytics; MAPREDUCE; SYSTEM;
D O I
10.1109/TNET.2017.2741969
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In a stream data analytics system, input data arrive continuously and trigger the processing and updating of analytics results. We focus on applications with real-time constraints, in which, any data unit must be completely processed within a given time duration. To handle fast data, it is common to place the stream data analytics system on top of a cloud infrastructure. Because stream properties, such as arrival rates can fluctuate unpredictably, cloud resources must be dynamically provisioned and scheduled accordingly to ensure real-time responses. It is essential, for existing systems or future developments, to possess the ability of scaling resources dynamically according to the instantaneous workload, in order to avoid wasting resources or failing in delivering the correct analytics results on time. Motivated by this, we propose DRS, a dynamic resource scaling framework for cloud-based stream data analytics systems. DRS overcomes three fundamental challenges: 1) how to model the relationship between the provisioned resources and the application performance, 2) where to best place resources, and 3) how to measure the system load with minimal overhead. In particular, DRS includes an accurate performance model based on the theory of Jackson open queueing networks and is capable of handling arbitrary operator topologies, possibly with loops, splits, and joins. Extensive experiments with real data show that DRS is capable of detecting sub-optimal resource allocation and making quick and effective resource adjustment.
引用
收藏
页码:3338 / 3352
页数:15
相关论文
共 50 条
  • [1] Auto-scaling for real-time stream analytics on HPC cloud
    Cheng, Yingchao
    Hao, Zhifeng
    Cai, Ruichu
    SERVICE ORIENTED COMPUTING AND APPLICATIONS, 2019, 13 (02) : 169 - 183
  • [2] Auto-scaling for real-time stream analytics on HPC cloud
    Yingchao Cheng
    Zhifeng Hao
    Ruichu Cai
    Service Oriented Computing and Applications, 2019, 13 : 169 - 183
  • [3] Real-time Intrusion Detection in Network Traffic Using Adaptive and Auto-scaling Stream Processor
    Loganathan, Gobinath
    Samarabandu, Jagath
    Wang, Xianbin
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [4] Auto-scaling Techniques for Elastic Data Stream Processing
    Heinze, Thomas
    Pappalardo, Valerio
    Jerzak, Zbigniew
    Fetzer, Christof
    2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2014, : 296 - 302
  • [5] A Data Analytics Based Approach to Cloud Resource Auto-Scaling
    Hao, Fang
    Kodialam, Murali
    Mukherjee, Sarit
    Lakshman, T., V
    2022 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (IEEE HPSR), 2022, : 224 - 231
  • [6] The Non-Expert Tax: Quantifying the cost of auto-scaling in Cloud-based data stream analytics
    Wang, Yuanli
    Lyu, Baiqing
    Kalavri, Vasiliki
    PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON BIGIG DATA IN EMERGENT DISTRIBUTED ENVIRONMENTS (BIDEDE 2022), 2022,
  • [7] An auto-scaling wide dynamic range current to frequency converter for real-time monitoring of signals in neuromorphic systems
    Qiao, Ning
    Indiveri, Giacomo
    PROCEEDINGS OF 2016 IEEE BIOMEDICAL CIRCUITS AND SYSTEMS CONFERENCE (BIOCAS), 2016, : 160 - 163
  • [8] An Auto-Scaling Mechanism for Virtual Resources to Support Mobile, Pervasive, Real-Time Healthcare Applications in Cloud Computing
    Ahn, Yong Woon
    Cheng, Albert M. K.
    Baek, Jinsuk
    Jo, Minho
    Chen, Hsiao-Hwa
    IEEE NETWORK, 2013, 27 (05): : 62 - 68
  • [9] Social Auto-Scaling
    Smith, Peter
    Gonzalez-Velez, Horacio
    Caton, Simon
    2018 26TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2018), 2018, : 186 - 195
  • [10] Auto-scaling Walkability Analytics through Kubernetes and Docker SWARM on the Cloud
    Chen, Lu
    Pan, Yiru
    Sinnott, Richard O.
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE (CLOSER), 2020, : 261 - 272