Model-driven scheduling for distributed stream processing systems

被引:35
|
作者
Shukla, Anshu [1 ]
Simmhan, Yogesh [1 ]
机构
[1] Indian Inst Sci, Dept Computat & Data Sci, Bangalore, Karnataka, India
关键词
Stream processing; Scheduling algorithms; Performance models; Big data; Cloud computing; Distributed systems; INTERNET; FUTURE;
D O I
10.1016/j.jpdc.2018.02.003
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Distributed Stream Processing Systems (DSPS) are "Fast Data" platforms that allow streaming applications to be composed and executed with low latency on commodity clusters and Clouds. Such applications are composed as a Directed Acyclic Graph (DAG) of tasks, with data parallel execution using concurrent task threads on distributed resource slots. Scheduling such DAGs for DSPS has two parts-allocation of threads and resources for a DAG, and mapping threads to resources. Existing schedulers often address just one of these, make the assumption that performance linearly scales, or use ad hoc empirical tuning at runtime. Instead, we propose model-driven techniques for both mapping and allocation that rely on low-overhead a priori performance modeling of tasks. Our scheduling algorithms are able to offer predictable and low resource needs that is suitable for elastic pay-as-you-go Cloud resources, support a high input rate through high VM utilization, and can be combined with other mapping approaches as well. These are validated for micro and application benchmarks, and compared with contemporary schedulers, for the Apache Storm DSPS. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:98 / 114
页数:17
相关论文
共 50 条
  • [31] UML-Based Modeling and Model-Driven Development of Distributed Control Systems
    Basile, Francesco
    Chiacchio, Pasquale
    Del Grosso, Domenico
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION, PROCEEDINGS, 2008, : 1120 - 1127
  • [32] A model-driven framework for the generation of gateways in distributed real-time systems
    Obermaisser, R.
    [J]. RTSS 2007: 28TH IEEE INTERNATIONAL REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, 2007, : 93 - 104
  • [33] An empirical analysis of stateful operator migration for online scheduling in distributed stream processing systems
    Sornalakshmi, K.
    Vadivu, G.
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 2023, 98
  • [34] Resource Management and Scheduling in Distributed Stream Processing Systems: A Taxonomy, Review, and Future Directions
    Liu, Xunyun
    Buyya, Rajkumar
    [J]. ACM COMPUTING SURVEYS, 2020, 53 (03)
  • [35] Reliable stream data processing for elastic distributed stream processing systems
    Xiaohui Wei
    Yuan Zhuang
    Hongliang Li
    Zhiliang Liu
    [J]. Cluster Computing, 2020, 23 : 555 - 574
  • [36] Reliable stream data processing for elastic distributed stream processing systems
    Wei, Xiaohui
    Zhuang, Yuan
    Li, Hongliang
    Liu, Zhiliang
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (02): : 555 - 574
  • [37] MDDRSPF: A Model Driven Distributed Real-time Stream Processing Framework
    Wen, Yijun
    Zhang, Li
    Wang, Cheng
    [J]. 2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 1352 - 1358
  • [38] Model-driven Performance Prediction of Systems of Systems
    Falkner, Katrina
    Szabo, Claudia
    Chiprianov, Vanea
    [J]. 19TH ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS (MODELS'16), 2016, : 44 - 44
  • [39] Model-driven performance prediction of systems of systems
    Falkner, Katrina
    Szabo, Claudia
    Chiprianov, Vanea
    Puddy, Gavin
    Rieckmann, Marianne
    Fraser, Dan
    Aston, Cathlyn
    [J]. SOFTWARE AND SYSTEMS MODELING, 2018, 17 (02): : 415 - 441
  • [40] Model-driven performance prediction of systems of systems
    Katrina Falkner
    Claudia Szabo
    Vanea Chiprianov
    Gavin Puddy
    Marianne Rieckmann
    Dan Fraser
    Cathlyn Aston
    [J]. Software & Systems Modeling, 2018, 17 : 415 - 441