Poster: Iterative Scheduling for Distributed Stream Processing Systems

被引:9
|
作者
Eskandari, Leila [1 ]
Mair, Jason [1 ]
Huang, Zhiyi [1 ]
Eyers, David [1 ]
机构
[1] Univ Otago, Dept Comp Sci, Dunedin, New Zealand
关键词
Stream processing; Scheduling; Graph Partitioning;
D O I
10.1145/3210284.3219768
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Nowadays data stream processing systems need to efficiently handle large volumes of data in near real-time. To achieve this, the schedulers within such systems minimise the data movement between highly communicating tasks, improving system throughput. However, finding an optimal schedule for these systems is NP-hard. In this research, we propose a heuristic scheduling algorithm which reliably and efficiently finds the highly communicating tasks by exploiting graph partitioning algorithms and a mathematical optimisation software package. We evaluate our scheduler with two popular existing schedulers R-Storm and Aniello et al.'s 'Online scheduler' using two real-world applications and show that our proposed scheduler outperforms R-Storm, increasing throughput by between 3% and 30% and Online scheduler by 20 86% as a result of finding a more efficient schedule.
引用
收藏
页码:234 / 237
页数:4
相关论文
共 50 条
  • [1] I-Scheduler: Iterative scheduling for distributed stream processing systems
    Eskandari, Leila
    Mair, Jason
    Huang, Zhiyi
    Eyers, David
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 117 (117): : 219 - 233
  • [2] Model-driven scheduling for distributed stream processing systems
    Shukla, Anshu
    Simmhan, Yogesh
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 117 : 98 - 114
  • [3] Online Scheduling for Shuffle Grouping in Distributed Stream Processing Systems
    Rivetti, Nicolo
    Anceaume, Emmanuelle
    Busnel, Yann
    Querzoni, Leonardo
    Sericola, Bruno
    [J]. MIDDLEWARE '16: PROCEEDINGS OF THE 17TH INTERNATIONAL MIDDLEWARE CONFERENCE, 2016,
  • [4] Poster: Raptor: Rapid prototyping of distributed stream processing applications at scale
    Ifath, Md. Monzurul Amin
    Neves, Miguel
    Haque, Israat
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON EMERGING NETWORKING EXPERIMENTS AND TECHNOLOGIES, CONEXT 2021, 2021, : 485 - 486
  • [5] Reliable stream data processing for elastic distributed stream processing systems
    Xiaohui Wei
    Yuan Zhuang
    Hongliang Li
    Zhiliang Liu
    [J]. Cluster Computing, 2020, 23 : 555 - 574
  • [6] An empirical analysis of stateful operator migration for online scheduling in distributed stream processing systems
    Sornalakshmi, K.
    Vadivu, G.
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 2023, 98
  • [7] Resource Management and Scheduling in Distributed Stream Processing Systems: A Taxonomy, Review, and Future Directions
    Liu, Xunyun
    Buyya, Rajkumar
    [J]. ACM COMPUTING SURVEYS, 2020, 53 (03)
  • [8] Reliable stream data processing for elastic distributed stream processing systems
    Wei, Xiaohui
    Zhuang, Yuan
    Li, Hongliang
    Liu, Zhiliang
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (02): : 555 - 574
  • [9] Signal processing challenges in distributed stream processing systems
    Frossard, Pascal
    Verscheure, Olivier
    Venkatramani, Chitra
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 5903 - 5906
  • [10] Priority-based Resource Scheduling in Distributed Stream Processing Systems for Big Data Applications
    Bellavista, Paolo
    Corradi, Antonio
    Reale, Andrea
    Ticca, Nicola
    [J]. 2014 IEEE/ACM 7TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2014, : 363 - 370