Hybridhadoop: CPU-GPU hybrid scheduling in hadoop

被引:0
|
作者
Oh, Chanyoung [1 ]
Yi, Saehanseul [2 ]
Seok, Jongkyu [3 ]
Jung, Hyeonjin [3 ]
Yoon, Illo [3 ]
Yi, Youngmin [3 ]
机构
[1] Kongju Natl Univ, Dept Software, Cheonan 31080, Chungcheongnam, South Korea
[2] Univ Calif Irvine, Sch Informat & Comp Sci, Irvine, CA 92697 USA
[3] Univ Seoul, Sch Elect & Comp Engn, Seoul 02504, South Korea
关键词
CPU-GPU heterogeneous computing; Distributed systems; Performance estimation; Hadoop;
D O I
10.1007/s10586-023-04178-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As a GPU has become an essential component in high performance computing, it has been attempted by many works to leverage GPU computing in Hadoop. However, few works considered to fully utilize the GPU in Hadoop and only a few works studied utilizing both CPU and GPU at the same time. In this paper, we propose a CPU-GPU hybrid scheduling in Hadoop, where both CPUs and GPUs in a node are exploited as much as possible in an adaptive manner. The technical barrier stands in that the optimal number of GPU tasks is not known in advance, and the total number of Containers in a node cannot be changed once a Hadoop job starts. In the proposed approach, we first determine the initial number of Containers as well as the hybrid execution mode, then the proposed dynamic scheduler adjusts the number of Containers for a GPU and a CPU with the help of a GPU monitor during the job execution. It also employs a load-balancing algorithm for the tail. The experiments with various benchmarks show that the proposed CPU-GPU hybrid scheduling achieves 3.87x of speedup on average against the 12-core CPU-only Hadoop.
引用
收藏
页码:3875 / 3892
页数:18
相关论文
共 50 条
  • [1] HybridHadoop: CPU-GPU Hybrid Scheduling in Hadoop
    Oh, Chanyoung
    Jung, Hyeonjin
    Yi, Saehanseul
    Yoon, Illo
    Yi, Youngmin
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING IN ASIA-PACIFIC REGION (HPC ASIA 2021), 2020, : 40 - 49
  • [2] Hybrid CPU-GPU scheduling and execution of tree traversals
    Liu, Jianqiao
    Hegde, Nikhil
    Kulkarni, Milind
    ACM SIGPLAN NOTICES, 2016, 51 (08) : 385 - 386
  • [3] A user mode CPU-GPU scheduling framework for hybrid workloads
    Wang, Bin
    Ma, Ruhui
    Qi, Zhengwei
    Yao, Jianguo
    Guan, Haibing
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2016, 63 : 25 - 36
  • [4] ONLINE SCHEDULING OF MIXED CPU-GPU JOBS
    Chen, Lin
    Ye, Deshi
    Zhang, Guochuang
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2014, 25 (06) : 745 - 761
  • [5] Scheduling concurrent applications on a cluster of CPU-GPU nodes
    Ravi, Vignesh T.
    Becchi, Michela
    Jiang, Wei
    Agrawal, Gagan
    Chakradhar, Srimat
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2013, 29 (08): : 2262 - 2271
  • [6] A Survey on Task Scheduling of CPU-GPU Heterogeneous Cluster
    ZHOU Yiheng
    ZENG Wei
    ZHENG Qingfang
    LIU Zhilong
    CHEN Jianping
    ZTE Communications, 2024, 22 (03) : 83 - 90
  • [7] A Flexible Scheduling Framework for Heterogeneous CPU-GPU Clusters
    Sajjapongse, Kittisak
    Agarwal, Tejaswi
    Becchi, Michela
    2014 21ST INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2014,
  • [8] Memory-Constrained Vectorization and Scheduling of Dataflow Graphs for Hybrid CPU-GPU Platforms
    Lin, Shuoxin
    Wu, Jiahao
    Bhattacharyya, Shuvra S.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2018, 17 (02)
  • [9] Energy Efficient Real-time Task Scheduling on CPU-GPU Hybrid Clusters
    Mei, Xinxin
    Chu, Xiaowen
    Liu, Hai
    Leung, Yiu-Wing
    Li, Zongpeng
    IEEE INFOCOM 2017 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2017,
  • [10] Prediction Model for Scheduling an Irregular Graph Algorithms on CPU-GPU Hybrid Cluster Framework
    Chandrashekhar, B. N.
    Sanjay, H. A.
    Lakshmi, H.
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT-2020), 2020, : 584 - 589