Resource Management and Scheduling in Distributed Stream Processing Systems: A Taxonomy, Review, and Future Directions

被引:24
|
作者
Liu, Xunyun [1 ]
Buyya, Rajkumar [2 ]
机构
[1] Natl Innovat Inst Def Technol, Artificial Intelligence Res Ctr, Beijing 100071, Peoples R China
[2] Univ Melbourne, Cloud Comp & Distributed Syst CLOUDS Lab, Sch Comp & Informat Syst, Parkville, Vic 3010, Australia
关键词
Resource management; stream processing; distributed stream processing systems; task scheduling; REAL-TIME; PLATFORM; GRAPH;
D O I
10.1145/3355399
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Stream processing is an emerging paradigm to handle data streams upon arrival, powering latency-critical application such as fraud detection, algorithmic trading, and health surveillance. Though there are a variety of Distributed Stream Processing Systems (DSPSs) that facilitate the development of streaming applications, resource management and task scheduling is not automatically handled by the DSPS middleware and requires a laborious process to tune toward specific deployment targets. As the advent of cloud computing has supported renting resources on-demand, it is of great interest to review the research progress of hosting streaming systems in clouds under certain Service Level Agreements (SLA) and cost constraints. In this article, we introduce the hierarchical structure of streaming systems, define the scope of the resource management problem, and present a comprehensive taxonomy in this context covering critical research topics such as resource provisioning, operator parallelisation, and task scheduling. The literature is then reviewed following the taxonomy structure, facilitating a deeper understanding of the research landscape through classification and comparison of existing works. Finally, we discuss the open issues and future research directions toward realising an automatic, SLA-aware resource management framework.
引用
收藏
页数:41
相关论文
共 50 条
  • [1] Distributed data stream processing and edge computing: A survey on resource elasticity and future directions
    de Assuncao, Marcos Dias
    Veith, Alexandre da Silva
    Buyya, Rajkumar
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2018, 103 : 1 - 17
  • [2] Critical review on resource scheduling in IaaS clouds: Taxonomy, issues, challenges, and future directions
    Madni, Syed Hamid Hussain
    Faheem, Muhammad
    Younas, Muhammad
    Masum, Maidul Hasan
    Shah, Sajid
    [J]. JOURNAL OF ENGINEERING-JOE, 2024, 2024 (08):
  • [3] Multiple Workflows Scheduling in Multi-tenant Distributed Systems: A Taxonomy and Future Directions
    Hilman, Muhammad H.
    Rodriguez, Maria A.
    Buyya, Rajkumar
    [J]. ACM COMPUTING SURVEYS, 2020, 53 (01)
  • [4] Distributed resource allocation in stream processing systems
    Xia, Cathy H.
    Broberg, James A.
    Liu, Zhen
    Zhang, Li
    [J]. Distributed Computing, Proceedings, 2006, 4167 : 489 - 504
  • [5] Resource Allocation and Task Scheduling in Fog Computing and Internet of Everything Environments: A Taxonomy, Review, and Future Directions
    Jamil, Bushra
    Ijaz, Humaira
    Shojafar, Mohammad
    Munir, Kashif
    Buyya, Rajkumar
    [J]. ACM COMPUTING SURVEYS, 2022, 54 (11S)
  • [6] Poster: Iterative Scheduling for Distributed Stream Processing Systems
    Eskandari, Leila
    Mair, Jason
    Huang, Zhiyi
    Eyers, David
    [J]. DEBS'18: PROCEEDINGS OF THE 12TH ACM INTERNATIONAL CONFERENCE ON DISTRIBUTED AND EVENT-BASED SYSTEMS, 2018, : 234 - 237
  • [7] Priority-based Resource Scheduling in Distributed Stream Processing Systems for Big Data Applications
    Bellavista, Paolo
    Corradi, Antonio
    Reale, Andrea
    Ticca, Nicola
    [J]. 2014 IEEE/ACM 7TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2014, : 363 - 370
  • [8] Resource Estimation in Distributed Data Stream Processing Systems
    Fan, Minglu
    Liang, Yi
    Liu, Fei
    Yang, Mangmang
    Wang, Haihua
    [J]. PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 1824 - 1827
  • [9] Big Data Resource Management & Networks: Taxonomy, Survey, and Future Directions
    Awaysheh, Feras M.
    Alazab, Mamoun
    Garg, Sahil
    Niyato, Dusit
    Verikoukis, Christos
    [J]. IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (04): : 2098 - 2130
  • [10] Model-driven scheduling for distributed stream processing systems
    Shukla, Anshu
    Simmhan, Yogesh
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 117 : 98 - 114