Data Driven Priority Scheduling on Spark Based Stream Processing

被引:7
|
作者
Ajila, Tobi [1 ]
Majumdar, Shikaresh [1 ]
机构
[1] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON, Canada
关键词
Spark; Spark Streaming; priority scheduling;
D O I
10.1109/BDCAT.2018.00034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper focuses on priority based processing of streaming data. One of the greatest challenges in big data analytics is responding to a bursty input load. The common solutions are to use dynamic resource provisioning techniques, however, these techniques may not respond quickly enough to the change in the load. Another option is to overprovision, but this results in wasted computing resources. This paper describes a technique that can be used in cases where resources are statically provisioned. This technique enables users to prioritize certain input data items so that in cases where the load suddenly increases, the high priority items are given precedence over low priority items. This technique is implemented on the Spark Streaming engine.
引用
收藏
页码:208 / 210
页数:3
相关论文
共 50 条
  • [1] Data Driven Priority Scheduling on a Spark Streaming System
    Ajila, Tobi
    Majumdar, Shikharesh
    [J]. 2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 561 - 568
  • [2] Priority-based Resource Scheduling in Distributed Stream Processing Systems for Big Data Applications
    Bellavista, Paolo
    Corradi, Antonio
    Reale, Andrea
    Ticca, Nicola
    [J]. 2014 IEEE/ACM 7TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2014, : 363 - 370
  • [3] A Spark™ Based Client for Synchrophasor Data Stream Processing
    Menon, Vijay Krishna
    Variyar, Sajith V.
    Soman, K. P.
    Gopalakrishnan, E. A.
    Kottayil, Sasi K.
    Almas, Muhammad Shoaib
    Nordstrom, Lars
    [J]. PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE AND UTILITY EXHIBITION ON GREEN ENERGY FOR SUSTAINABLE DEVELOPMENT (ICUE 2018), 2018,
  • [4] Data-priority Aware Fair Task Scheduling for Stream Processing at the Edge
    Akram, Faiza
    Kang, Peng
    Lama, Palden
    Khan, Samee U.
    [J]. 2024 IEEE CLOUD SUMMIT, CLOUD SUMMIT 2024, 2024, : 117 - 122
  • [5] Priority-based operator scheduling strategy in data stream system
    Li Maozeng
    Wang Dan
    Du Dongming
    [J]. Advanced Computer Technology, New Education, Proceedings, 2007, : 332 - 337
  • [6] Priority Based Resource Scheduling Techniques for a Resource Constrained Stream Processing System
    Chakraborty, Rudraneel
    Majumdar, Shikharesh
    [J]. BDCAT'17: PROCEEDINGS OF THE FOURTH IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, 2017, : 21 - 31
  • [7] Task Scheduling in Data Stream Processing
    Falt, Zbynek
    Yaghob, Jakub
    [J]. DATESO 2011: DATABASES, TEXTS, SPECIFICATIONS, OBJECTS, 2011, 706 : 85 - 96
  • [8] A Real-Time Scheduling Strategy Based on Priority in Data Stream System
    Wang, Yan
    Xuan, Weihong
    Li, Wei
    Song, Baoyan
    Li, Xiaoguang
    [J]. HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 3, PROCEEDINGS, 2009, : 268 - 272
  • [9] Priority Based Data Scheduling in VANETs
    Kumar, Vishal
    Vaisla, Kunwar Singh
    Sudarsan, S. D.
    [J]. 2016 THIRD INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND ENGINEERING (ICACCE 2016), 2016, : 19 - 22
  • [10] Model-driven scheduling for distributed stream processing systems
    Shukla, Anshu
    Simmhan, Yogesh
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 117 : 98 - 114