Use case-based evaluation of workflow optimization strategy in real-time computation system

被引:3
|
作者
Ahmad, Saima Gulzar [1 ]
Khan, Hikmat Ullah [1 ]
Ijaz, Samia [1 ]
Munir, Ehsan Ullah [1 ]
机构
[1] COMSATS Univ Islamabad, Wah Campus, Islamabad, Pakistan
来源
JOURNAL OF SUPERCOMPUTING | 2020年 / 76卷 / 01期
关键词
Workflow optimization; STORM topology; Partitions; Data intensive; Stream data processing; STREAMING APPLICATIONS; THROUGHPUT;
D O I
10.1007/s11227-019-03060-9
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the start of big data era, data stream computing has emerged as a well-known approach to optimize data-intensive workflows. Apache STORM is an open-source real-time distributed computation system for processing data streams and has been opted by famous organizations such as Twitter, Yahoo, Alibaba, Baidu, Groupon. The workflows are implemented as topologies in STORM. The main aspect that controls the execution performance of a workflow in STORM is the strategy of scheduling the topology components (spout and bolts). In this paper, we evaluate and analyze the performance of our algorithm Partition-based Data-intensive Workflow optimization Algorithm (PDWA) in Apache STORM using a use case workflow, EURExpressII. It is a real-world application-based workflow that builds a transcriptome-wide atlas of gene expression for the developing mouse embryo established by ribonucleic acid (RNA) in situ hybridization. Our proposed algorithm, PDWA, partitions the application task graph so that the data movement between partitions is minimum. Each partition is then mapped on one machine for the execution of tasks of that partition. It provides minimum execution time for that particular partition. Partial task duplication is also part of this algorithm that enhances the performance. A STORM-based computing cluster is developed in OpenStack cloud which is used as a computing environment. The performance of PDWA-based optimizer is evaluated with the data sets of different sizes. The achieved results show that PDWA performs with 21% improved average execution time for different sizes of data sets and varying execution nodes. In addition, the comparative results show that on average the efficiency of PDWA is 20.4% higher as compared to STORM default scheduler (SDS).
引用
收藏
页码:708 / 725
页数:18
相关论文
共 50 条
  • [1] Use case-based evaluation of workflow optimization strategy in real-time computation system
    Saima Gulzar Ahmad
    Hikmat Ullah Khan
    Samia Ijaz
    Ehsan Ullah Munir
    [J]. The Journal of Supercomputing, 2020, 76 : 708 - 725
  • [2] Real-Time Strategy Generation System using Case-based Reasoning
    Kim, Wonil
    Baik, Sung Wook
    Kwon, Soonil
    Han, Changhee
    Hong, Chuleui
    Kim, Junghyun
    [J]. 2014 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2014), 2014, : 1159 - 1162
  • [3] Real-time plan adaptation for case-based planning in real-time strategy games
    Sugandh, Neha
    Ontanon, Santiago
    Ram, Ashwin
    [J]. ADVANCES IN CASE-BASED REASONING, PROCEEDINGS, 2008, 5239 : 533 - 547
  • [4] Case-based planning and execution for real-time strategy games
    Ontanon, Santiago
    Mishra, Kinshuk
    Sugandh, Neha
    Ram, Ashwin
    [J]. CASE-BASED REASONING RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2007, 4626 : 164 - +
  • [5] Learning to win: Case-based plan selection in a real-time strategy game
    Aha, DW
    Molineaux, M
    Ponsen, M
    [J]. CASE-BASED REASONING RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2005, 3620 : 5 - 20
  • [6] EPISODIC CASES AND REAL-TIME PERFORMANCE IN A CASE-BASED PLANNING SYSTEM
    MCCARTNEY, R
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 1993, 6 (01) : 9 - 22
  • [7] Integrating Case-Based Reasoning with Reinforcement Learning for Real-Time Strategy Game Micromanagement
    Wender, Stefan
    Watson, Ian
    [J]. PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 64 - 76
  • [8] Home energy system optimization based on time-of-use price and real-time control strategy of battery
    Wu, Huicheng
    Wang, Chun
    Zuo, Yuanlong
    Chen, Yujie
    Liu, Kuan
    [J]. Dianli Xitong Baohu yu Kongzhi/Power System Protection and Control, 2019, 47 (19): : 23 - 30
  • [9] A prototype case-based reasoning system for real-time freeway traffic routing
    Sadek, AW
    Smith, BL
    Demetsky, MJ
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2001, 9 (05) : 353 - 380
  • [10] GRIDCC: Real-time workflow system
    McGough, Andrew Stephen
    Akram, Asif
    Guo, Li
    Krznaric, Marko
    Dickens, Luke
    Colling, David
    Martyniak, Janusz
    Powell, Roger
    Kyberd, Paul
    Kotsokalis, Constantinos
    [J]. Proceedings of the 2nd Workshop on Workflows in Support of Large-scale Science, WORKS'07, 2007, : 3 - 12