Energy-Aware Heuristic Scheduling Using Bin Packing MapReduce Scheduler for Heterogeneous Workloads Performance in Big Data

被引:0
|
作者
S. Aarthee
R. Prabakaran
机构
[1] Anna University,Department of Electrical and Electronics Engineering, University College of Engineering, BIT Campus
关键词
Job scheduler; Map/reduce task; Task scheduling; Energy consumption; Virtual machine instances; Resource utilization;
D O I
暂无
中图分类号
学科分类号
摘要
Big data refers to diverse large data types from heterogeneous sources such as mobile devices, the web, social media, and the internet of things. The cloud offers a wide variety of tools to handle the big data on-demand for pay-per-service basis through a cluster of virtual machines that are hosted across cloud datacenter in heterogeneous physical machines. The primary goal tends to analyze the big data at the point of creation and scaling by data-intensive computing. Hadoop MapReduce helps to solve scalability and complexity by adding more jobs in a virtual cluster across the racks in a distributed cloud datacenter. By default, MapReduce schedulers do not perform the computational jobs in heterogeneity and the virtual machines will execute the blocks in equal numbers despite their capacity decreasing the performance dynamically. The virtual machines on a virtual cluster are not aware of energy efficiency, which is highly important in a heterogeneous environment. Hence, we propose the dynamic performance heuristic-based bin packing (DP-HBP) MapReduce scheduler, which increases the utilization of resources in heterogeneous virtual machines. The proposed DP-HBP scheduler improves the makespan and latency by 49% and 39% over the roulette wheel scheme and heuristic-based MapReduce job schedulers from our experimentation. DP-HBP derived the average number of data nonlocal execution as 27%, which is lesser, compared to the existing schedulers. The resource utilization for the average number of unused vCPU and memory is improved by 34% and 41%, which enhances the performance workloads in handling big data in a heterogeneous environment.
引用
收藏
页码:1891 / 1905
页数:14
相关论文
共 33 条
  • [1] Energy-Aware Heuristic Scheduling Using Bin Packing MapReduce Scheduler for Heterogeneous Workloads Performance in Big Data
    Aarthee, S.
    Prabakaran, R.
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (02) : 1891 - 1905
  • [2] Energy-Aware Scheduling of MapReduce Jobs for Big Data Applications
    Mashayekhy, Lena
    Nejad, Mahyar Movahed
    Grosu, Daniel
    Zhang, Quan
    Shi, Weisong
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (10) : 2720 - 2733
  • [3] Linear Weighted Regression and Energy-Aware Greedy Scheduling for Heterogeneous Big Data
    Kallam, Suresh
    Patan, Rizwan
    Ramana, Tathapudi V.
    Gandomi, Amir H.
    [J]. ELECTRONICS, 2021, 10 (05) : 1 - 16
  • [4] An energy-aware scheduling algorithm for big data applications in Spark
    Hongjian Li
    Huochen Wang
    Shuyong Fang
    Yang Zou
    Wenhong Tian
    [J]. Cluster Computing, 2020, 23 : 593 - 609
  • [5] An energy-aware scheduling algorithm for big data applications in Spark
    Li, Hongjian
    Wang, Huochen
    Fang, Shuyong
    Zou, Yang
    Tian, Wenhong
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (02): : 593 - 609
  • [6] An Energy-aware Task Scheduling Algorithm for a Heterogeneous Data Center
    Zhang, Shuo
    Wang, Baosheng
    Zhao, Baokang
    Tao, Jing
    [J]. 2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 1471 - 1477
  • [7] Energy-Aware Scheduling of Workflow Using a Heuristic Method on Green Cloud
    Peng, Zhihao
    Barzegar, Behnam
    Yarahmadi, Maryam
    Motameni, Homayun
    Pirouzmand, Poria
    [J]. SCIENTIFIC PROGRAMMING, 2020, 2020
  • [8] An energy-aware gradient-based scheduling heuristic for heterogeneous multiprocessor embedded systems
    Goh, Lee Kee
    Veeravalli, Bharadwaj
    Viswanathan, Sivakumar
    [J]. HIGH PERFORMANCE COMPUTING - HIPC 2007, PROCEEDINGS, 2007, 4873 : 331 - +
  • [9] An Energy-Aware Heuristic Scheduling for Data-Intensive Workflows in Virtualized Datacenters
    Peng Xiao
    Zhi-Gang Hu
    Yan-Ping Zhang
    [J]. Journal of Computer Science and Technology, 2013, 28 : 948 - 961
  • [10] Energy-Aware Virtual Machine Scheduling on Data Centers with Heterogeneous Bandwidths
    Lago, Daniel Guimaraes
    Madeira, Edmundo R. M.
    Medhi, Deep
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (01) : 83 - 98