Priority-based Resource Scheduling in Distributed Stream Processing Systems for Big Data Applications

被引:0
|
作者
Bellavista, Paolo [1 ]
Corradi, Antonio [1 ]
Reale, Andrea [1 ]
Ticca, Nicola [1 ]
机构
[1] Univ Bologna, Dept Comp Sci & Engn, Bologna, Italy
关键词
Distributed Stream Processing; Big Data; Priority-based Resource Scheduling; Application-level and Application-specific Scheduling; Cloud Computing Optimization; Vehicular Traffic Analysis;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Distributed Stream Processing Systems (DSPSs) are attracting increasing industrial and academic interest as flexible tools to implement scalable and cost-effective on-line analytics applications over Big Data streams. Often hosted in private/public cloud deployment environments, DSPSs offer datastream processing services that transparently exploit the distributed computing resources made available to them at runtime. Given the volume of data of interest, possible (hard/soft) real-time processing requirements, and the time-variable characteristics of input datastreams, it is very important for DSPSs to use smart and innovative scheduling techniques that allocate computing resources properly and avoid static over-provisioning. In this paper, we originally investigate the suitability of exploiting application-level indications about differentiated priorities of different stream processing tasks to enable application-specific DSPS resource scheduling, e.g., capable of re-shaping processing resources in order to dynamically follow input data peaks of prioritized tasks, with no static over-provisioning. We originally propose a general and simple technique to design and implement priority-based resource scheduling in flow-graph-based DSPSs, by allowing application developers to augment DSPS graphs with priority metadata and by introducing an extensible set of priority schemas to be automatically handled by the extended DSPS. In addition, we show the effectiveness of our approach via its implementation and integration in our Quasit DSPS and through experimental evaluation of this prototype on a real-world stream processing application of Big Data vehicular traffic analysis.
引用
收藏
页码:363 / 370
页数:8
相关论文
共 50 条
  • [1] Priority-Based Job Scheduling in Distributed Systems
    Bansal, Sunita
    Hota, Chittaranjan
    [J]. INFORMATION SYSTEMS, TECHNOLOGY AND MANAGEMENT-THIRD INTERNATIONAL CONFERENCE, ICISTM 2009, 2009, 31 : 110 - +
  • [2] Priority-based operator scheduling strategy in data stream system
    Li Maozeng
    Wang Dan
    Du Dongming
    [J]. Advanced Computer Technology, New Education, Proceedings, 2007, : 332 - 337
  • [3] A path priority-based task scheduling algorithm for heterogeneous distributed systems
    Eswari, R.
    Nickolas, S.
    Arock, Michael
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2014, 12 (02) : 183 - 201
  • [4] Priority Based Resource Scheduling Techniques for a Resource Constrained Stream Processing System
    Chakraborty, Rudraneel
    Majumdar, Shikharesh
    [J]. BDCAT'17: PROCEEDINGS OF THE FOURTH IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, 2017, : 21 - 31
  • [5] Priority-based Management Algorithm in Distributed Energy Resource Management Systems
    Bilakanti, Nishant
    Gurung, Niroj
    Chen, Heng
    Kothandaraman, Sri Raghavan
    [J]. 2021 13TH ANNUAL IEEE GREEN TECHNOLOGIES CONFERENCE GREENTECH 2021, 2021, : 351 - 356
  • [6] Resource Estimation in Distributed Data Stream Processing Systems
    Fan, Minglu
    Liang, Yi
    Liu, Fei
    Yang, Mangmang
    Wang, Haihua
    [J]. PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 1824 - 1827
  • [7] Data Driven Priority Scheduling on Spark Based Stream Processing
    Ajila, Tobi
    Majumdar, Shikaresh
    [J]. 2018 IEEE/ACM 5TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING APPLICATIONS AND TECHNOLOGIES (BDCAT), 2018, : 208 - 210
  • [8] Adaptive priority-based data placement and multi-task scheduling in geo-distributed cloud systems
    Li, Chunlin
    Liu, Jun
    Li, Weigang
    Luo, Youlong
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 224
  • [9] Priority-based event message scheduling in distributed virtual environment
    Yu, Seokjong
    [J]. HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, PROCEEDINGS, 2006, 4208 : 884 - 893
  • [10] A priority-based resource allocation strategy in distributed computing networks
    Huang, YF
    Chao, BW
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2001, 58 (03) : 221 - 233