DRS: Auto-Scaling for Real-Time Stream Analytics

被引:45
|
作者
Fu, Tom Z. J. [1 ,2 ]
Ding, Jianbing [3 ]
Ma, Richard T. B. [4 ]
Winslett, Marianne [5 ]
Yang, Yin [6 ]
Zhang, Zhenjie [1 ]
机构
[1] Illinois Singapore Pte Ltd, Adv Digital Sci Ctr, Singapore 138602, Singapore
[2] Guangdong Univ Technol, Guangzhou, Guangdong, Peoples R China
[3] Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou 510275, Guangdong, Peoples R China
[4] Natl Univ Singapore, Sch Comp, Singapore 117418, Singapore
[5] Univ Illinois, Dept Comp Sci, 1304 W Springfield Ave, Urbana, IL 61801 USA
[6] Hamad Bin Khalifa Univ, Coll Sci & Engn, Doha, Qatar
基金
中国博士后科学基金;
关键词
Cloud computing; queueing network model; resource auto-scaling; stream data analytics; MAPREDUCE; SYSTEM;
D O I
10.1109/TNET.2017.2741969
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In a stream data analytics system, input data arrive continuously and trigger the processing and updating of analytics results. We focus on applications with real-time constraints, in which, any data unit must be completely processed within a given time duration. To handle fast data, it is common to place the stream data analytics system on top of a cloud infrastructure. Because stream properties, such as arrival rates can fluctuate unpredictably, cloud resources must be dynamically provisioned and scheduled accordingly to ensure real-time responses. It is essential, for existing systems or future developments, to possess the ability of scaling resources dynamically according to the instantaneous workload, in order to avoid wasting resources or failing in delivering the correct analytics results on time. Motivated by this, we propose DRS, a dynamic resource scaling framework for cloud-based stream data analytics systems. DRS overcomes three fundamental challenges: 1) how to model the relationship between the provisioned resources and the application performance, 2) where to best place resources, and 3) how to measure the system load with minimal overhead. In particular, DRS includes an accurate performance model based on the theory of Jackson open queueing networks and is capable of handling arbitrary operator topologies, possibly with loops, splits, and joins. Extensive experiments with real data show that DRS is capable of detecting sub-optimal resource allocation and making quick and effective resource adjustment.
引用
收藏
页码:3338 / 3352
页数:15
相关论文
共 50 条
  • [21] AMAS: Adaptive Auto-Scaling on the Edge
    Mukherjee, Saptarshi
    Sidhanta, Subhajit
    21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 618 - 621
  • [22] Model-based Stream Processing Auto-scaling in Geo-Distributed Environments
    Arkian, HamidReza
    Pierre, Guillaume
    Tordsson, Johan
    Elmroth, Erik
    30TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN 2021), 2021,
  • [23] RASP: Real-time Network Analytics with Distributed NoSQL Stream Processing
    Touloupas, Georgios
    Konstantinou, Ioannis
    Koziris, Nectarios
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 2414 - 2419
  • [24] Auto-scaling of Scientific Workflows in Kubernetes
    Balis, Bartosz
    Bronski, Andrzej
    Szarek, Mateusz
    COMPUTATIONAL SCIENCE, ICCS 2022, PT II, 2022, : 33 - 40
  • [25] A Hybrid Mechanism of Horizontal Auto-scaling Based on Thresholds and Time Series
    Pereira, Paulo
    Araujo, Jean
    Maciel, Paulo
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 2065 - 2070
  • [26] DEPAS: a decentralized probabilistic algorithm for auto-scaling
    Calcavecchia, Nicolo M.
    Caprarescu, Bogdan A.
    Di Nitto, Elisabetta
    Dubois, Daniel J.
    Petcu, Dana
    COMPUTING, 2012, 94 (8-10) : 701 - 730
  • [27] Parameter Optimization for Hybrid Auto-scaling Mechanism
    Hirashima, Yoko
    Komoda, Norihisa
    2016 17TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI 2016), 2016, : 111 - 116
  • [28] On the Value of Service Demand Estimation for Auto-scaling
    Bauer, Andre
    Grohmann, Johannes
    Herbst, Nikolas
    Kounev, Samuel
    MEASUREMENT, MODELLING AND EVALUATION OF COMPUTING SYSTEMS, MMB 2018, 2018, 10740 : 142 - 156
  • [29] RES: Real-Time Video Stream Analytics Using Edge Enhanced Clouds
    Ali, Muhammad
    Anjum, Ashiq
    Rana, Omer
    Zamani, Ali Reza
    Balouek-Thomert, Daniel
    Parashar, Manish
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (02) : 792 - 804
  • [30] A Hybrid approach for containerized Microservices auto-scaling
    Merkouche, Souheir
    Bouanaka, Chafia
    2022 IEEE/ACS 19TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2022,