Control-based scheduling in a distributed stream processing system

被引:0
|
作者
Khorlin, Andrey [1 ]
Chandy, K. Mani [1 ]
机构
[1] CALTECH, Comp Sci 256-80, Pasadena, CA 91125 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Stream processing systems receive continuous streams of messages with raw information and produce streams of messages with processed information. The utility of a stream-processing system depends, in part, on the accuracy and timeliness of the output. Streams in complex event processing systems are processed on distributed systems; several steps are taken on different processors to process each incoming message, and messages may be enqueued between steps. This paper deals with the problems of distributed dynamic control of streams to optimize the total utility provided by the system. A challenge of distributed control is that timeliness of output depends only on the total end-to-end time and is otherwise independent of the delays at each separate processor whereas the controller for each processor takes action to control only the steps on that processor and cannot directly control the entire network. This paper identifies key problems in distributed control and analyzes two scheduling algorithms that help in an initial analysis of a difficult problem.
引用
收藏
页码:55 / +
页数:3
相关论文
共 50 条
  • [1] Scheduling parallel and distributed processing for automotive data stream management system
    Rho, Jaeyong
    Azumi, Takuya
    Nakagawa, Mayo
    Sato, Kenya
    Nishio, Nobuhiko
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2017, 109 : 286 - 300
  • [2] Poster: Iterative Scheduling for Distributed Stream Processing Systems
    Eskandari, Leila
    Mair, Jason
    Huang, Zhiyi
    Eyers, David
    DEBS'18: PROCEEDINGS OF THE 12TH ACM INTERNATIONAL CONFERENCE ON DISTRIBUTED AND EVENT-BASED SYSTEMS, 2018, : 234 - 237
  • [3] Online Scheduling for Shuffle Grouping in Distributed Stream Processing Systems
    Rivetti, Nicolo
    Anceaume, Emmanuelle
    Busnel, Yann
    Querzoni, Leonardo
    Sericola, Bruno
    MIDDLEWARE '16: PROCEEDINGS OF THE 17TH INTERNATIONAL MIDDLEWARE CONFERENCE, 2016,
  • [4] A Predictive Scheduling Framework for Fast and Distributed Stream Data Processing
    Li, Teng
    Tang, Jian
    Xu, Jielong
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 333 - 338
  • [5] Model-driven scheduling for distributed stream processing systems
    Shukla, Anshu
    Simmhan, Yogesh
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 117 : 98 - 114
  • [6] An On-the-fly Scheduling Strategy for Distributed Stream Processing Platform
    Wang, Wen'an
    Zhang, Chuang
    Chen, Xiaojun
    Li, Zhao
    Ding, Hong
    Wen, Xin
    2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS, 2018, : 773 - 780
  • [7] Hone: Mitigating Stragglers in Distributed Stream Processing With Tuple Scheduling
    Li, Wenxin
    Liu, Duowen
    Chen, Kai
    Li, Keqiu
    Qi, Heng
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (08) : 2021 - 2034
  • [8] Modeling Data Stream Intensity in Distributed Stream Processing System
    Gorawski, Marcin
    Marks, Pawel
    Gorawski, Michal
    COMPUTER NETWORKS, CN 2013, 2013, 370 : 372 - 383
  • [9] Priority-based Resource Scheduling in Distributed Stream Processing Systems for Big Data Applications
    Bellavista, Paolo
    Corradi, Antonio
    Reale, Andrea
    Ticca, Nicola
    2014 IEEE/ACM 7TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2014, : 363 - 370
  • [10] Control-based quality adaptation in data stream management systems
    Tu, YC
    Hefeeda, M
    Xia, YN
    Prabhakar, S
    Liu, S
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, 3588 : 746 - 755