Use case-based evaluation of workflow optimization strategy in real-time computation system

被引:3
|
作者
Ahmad, Saima Gulzar [1 ]
Khan, Hikmat Ullah [1 ]
Ijaz, Samia [1 ]
Munir, Ehsan Ullah [1 ]
机构
[1] COMSATS Univ Islamabad, Wah Campus, Islamabad, Pakistan
来源
JOURNAL OF SUPERCOMPUTING | 2020年 / 76卷 / 01期
关键词
Workflow optimization; STORM topology; Partitions; Data intensive; Stream data processing; STREAMING APPLICATIONS; THROUGHPUT;
D O I
10.1007/s11227-019-03060-9
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the start of big data era, data stream computing has emerged as a well-known approach to optimize data-intensive workflows. Apache STORM is an open-source real-time distributed computation system for processing data streams and has been opted by famous organizations such as Twitter, Yahoo, Alibaba, Baidu, Groupon. The workflows are implemented as topologies in STORM. The main aspect that controls the execution performance of a workflow in STORM is the strategy of scheduling the topology components (spout and bolts). In this paper, we evaluate and analyze the performance of our algorithm Partition-based Data-intensive Workflow optimization Algorithm (PDWA) in Apache STORM using a use case workflow, EURExpressII. It is a real-world application-based workflow that builds a transcriptome-wide atlas of gene expression for the developing mouse embryo established by ribonucleic acid (RNA) in situ hybridization. Our proposed algorithm, PDWA, partitions the application task graph so that the data movement between partitions is minimum. Each partition is then mapped on one machine for the execution of tasks of that partition. It provides minimum execution time for that particular partition. Partial task duplication is also part of this algorithm that enhances the performance. A STORM-based computing cluster is developed in OpenStack cloud which is used as a computing environment. The performance of PDWA-based optimizer is evaluated with the data sets of different sizes. The achieved results show that PDWA performs with 21% improved average execution time for different sizes of data sets and varying execution nodes. In addition, the comparative results show that on average the efficiency of PDWA is 20.4% higher as compared to STORM default scheduler (SDS).
引用
收藏
页码:708 / 725
页数:18
相关论文
共 50 条
  • [31] Real-time heliostat field aiming strategy optimization based on reinforcement learning
    Zeng, Zhichen
    Ni, Dong
    Xiao, Gang
    [J]. APPLIED ENERGY, 2022, 307
  • [32] A Real-Time Scheduling Strategy Based on Priority in Data Stream System
    Wang, Yan
    Xuan, Weihong
    Li, Wei
    Song, Baoyan
    Li, Xiaoguang
    [J]. HIS 2009: 2009 NINTH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, VOL 3, PROCEEDINGS, 2009, : 268 - 272
  • [33] Use a real-time operating system
    Beal, J
    [J]. CONTROL ENGINEERING, 1996, 43 (01) : 81 - 82
  • [34] Evaluation of Tensor-Based Algorithms for Real-Time Bidding Optimization
    Szwabe, Andrzej
    Misiorek, Pawel
    Ciesielczyk, Michal
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 160 - 169
  • [35] Research on Optimization of Case-Based Reasoning System
    Tong, Lin
    Wu, Di
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS ENGINEERING (CASE-13), 2013, 45 : 34 - 37
  • [36] Real-Time Visual Hull Computation Based on GPU
    Gong Dafeng
    Wu Xiaojun
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2015, : 1792 - 1797
  • [37] Control strategy of dual-drive powertrain system of pure electric vehicle based on real-time optimization
    Wang Y.
    Deng J.
    [J]. International Journal of Performability Engineering, 2019, 15 (12) : 3108 - 3116
  • [38] Model-Based Real-Time Evaluation of Security Patterns: A SCADA System Case Study
    Motii, Anas
    Lanusse, Agnes
    Hamid, Brahim
    Bruel, Jean-Michel
    [J]. COMPUTER SAFETY, RELIABILITY, AND SECURITY, SAFECOMP 2016, 2016, 9923 : 375 - 389
  • [39] Learning-Based Modeling and Optimization for Real-Time System Availability
    Li, Liying
    Zhou, Junlong
    Wei, Tongquan
    Chen, Mingsong
    Hu, Xiaobo Sharon
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (04) : 581 - 594
  • [40] Research on Software Optimization of Real-Time Processing System Based on DSP
    Gong, Mingju
    Liu, Yuanlai
    [J]. MECHANICAL AND ELECTRONICS ENGINEERING III, PTS 1-5, 2012, 130-134 : 2944 - +