Priority-based Resource Scheduling in Distributed Stream Processing Systems for Big Data Applications

Cited by: 0
Authors
Bellavista, Paolo [1]
Corradi, Antonio [1]
Reale, Andrea [1]
Ticca, Nicola [1]
Affiliation
[1] Univ Bologna, Dept Comp Sci & Engn, Bologna, Italy
Keywords
Distributed Stream Processing; Big Data; Priority-based Resource Scheduling; Application-level and Application-specific Scheduling; Cloud Computing Optimization; Vehicular Traffic Analysis
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Distributed Stream Processing Systems (DSPSs) are attracting increasing industrial and academic interest as flexible tools for implementing scalable and cost-effective online analytics applications over Big Data streams. Often hosted in private or public cloud deployment environments, DSPSs offer data stream processing services that transparently exploit the distributed computing resources made available to them at runtime. Given the volume of data of interest, possible (hard or soft) real-time processing requirements, and the time-varying characteristics of input data streams, DSPSs need smart scheduling techniques that allocate computing resources properly and avoid static over-provisioning. In this paper, as an original contribution, we investigate the suitability of exploiting application-level indications about the differentiated priorities of stream processing tasks to enable application-specific DSPS resource scheduling, e.g., scheduling capable of re-shaping processing resources to dynamically follow input data peaks of prioritized tasks, with no static over-provisioning. We propose an original, general, and simple technique to design and implement priority-based resource scheduling in flow-graph-based DSPSs: application developers augment DSPS graphs with priority metadata, and the extended DSPS automatically handles an extensible set of priority schemas. In addition, we show the effectiveness of our approach through its implementation and integration in our Quasit DSPS and through an experimental evaluation of this prototype on a real-world Big Data stream processing application for vehicular traffic analysis.
Pages: 363-370
Number of pages: 8