Multi-objective scheduling of MapReduce jobs in big data processing

被引:0
|
作者
Ibrahim Abaker Targio Hashem
Nor Badrul Anuar
Mohsen Marjani
Abdullah Gani
Arun Kumar Sangaiah
Adewole Kayode Sakariyah
机构
[1] University of Malaya,Faculty of Computer Science and Information Technology
[2] VIT University,School of Computing Science and Engineering
来源
关键词
Hadoop; MapReduce; Cloud computing; Big data; Scheduling algorithms;
D O I
暂无
中图分类号
学科分类号
摘要
Data generation has increased drastically over the past few years due to the rapid development of Internet-based technologies. This period has been called the big data era. Big data offer an emerging paradigm shift in data exploration and utilization. The MapReduce computational paradigm is a well-known framework and is considered the main enabler for the distributed and scalable processing of a large amount of data. However, despite recent efforts toward improving the performance of MapReduce, scheduling MapReduce jobs across multiple nodes has been considered a multi-objective optimization problem. This problem can become increasingly complex when virtualized clusters in cloud computing are used to execute a large number of tasks. This study aims to optimize MapReduce job scheduling based on the completion time and cost of cloud service models. First, the problem is formulated as a multi-objective model. The model consists of two objective functions, namely, (i) completion time and (ii) cost minimization. Second, a scheduling algorithm using earliest finish time scheduling that considers resource allocation and job scheduling in the cloud is proposed. Lastly, experimental results show that the proposed scheduler exhibits better performance than other well-known schedulers, such as FIFO and Fair.
引用
收藏
页码:9979 / 9994
页数:15
相关论文
共 50 条
  • [1] Multi-objective scheduling of MapReduce jobs in big data processing
    Hashem, Ibrahim Abaker Targio
    Anuar, Nor Badrul
    Marjani, Mohsen
    Gani, Abdullah
    Sangaiah, Arun Kumar
    Sakariyah, Adewole Kayode
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (08) : 9979 - 9994
  • [2] A hierarchical multi-objective task scheduling approach for fast big data processing
    Zahra Jalalian
    Mohsen Sharifi
    The Journal of Supercomputing, 2022, 78 : 2307 - 2336
  • [3] A hierarchical multi-objective task scheduling approach for fast big data processing
    Jalalian, Zahra
    Sharifi, Mohsen
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (02): : 2307 - 2336
  • [4] Towards decomposition based multi-objective workflow scheduling for big data processing in clouds
    Bugingo, Emmanuel
    Zhang, Defu
    Chen, Zhaobin
    Zheng, Wei
    Cluster Computing, 2021, 24 (01) : 115 - 139
  • [5] Towards decomposition based multi-objective workflow scheduling for big data processing in clouds
    Emmanuel Bugingo
    Defu Zhang
    Zhaobin Chen
    Wei Zheng
    Cluster Computing, 2021, 24 : 115 - 139
  • [6] Towards decomposition based multi-objective workflow scheduling for big data processing in clouds
    Bugingo, Emmanuel
    Zhang, Defu
    Chen, Zhaobin
    Zheng, Wei
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (01): : 115 - 139
  • [7] Multi-objective Scheduling for Parallel Jobs on Grid
    Liu, Pengfei
    Dong, Shoubin
    ADVANCED MEASUREMENT AND TEST, PARTS 1 AND 2, 2010, 439-440 : 1281 - +
  • [8] Energy-Aware Scheduling of MapReduce Jobs for Big Data Applications
    Mashayekhy, Lena
    Nejad, Mahyar Movahed
    Grosu, Daniel
    Zhang, Quan
    Shi, Weisong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (10) : 2720 - 2733
  • [9] Multi-Objective Task Scheduling in Cloud IoT Environments Using Differential Evaluation for Big Data Processing
    Pal, Souvik
    Kumar, Raghvendra
    Alkhayyat, Ahmed Hussein
    INTERNET TECHNOLOGY LETTERS, 2025,
  • [10] Cooperative grid jobs scheduling with multi-objective genetic algorithm
    Zeng, Bin
    Wei, Jun
    Wang, Wei
    Wang, Pu
    PARALLEL AND DISTRIBUTED PROCESSING AND APPLICATIONS, PROCEEDINGS, 2007, 4742 : 545 - 555