Lit: A High Performance Massive Data Computing Framework Based on CPU/GPU Cluster

被引:0
|
作者
Zhai, Yanlong [1 ]
Mbarushimana, Emmanuel [1 ]
Li, Wei [2 ]
Zhang, Jing [2 ]
Guo, Ying [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing Engn Res Ctr Mass Language Informat Proc, Beijing 100081, Peoples R China
[2] Sci & Technol Complex Syst Simulat, Beijing, Peoples R China
关键词
MAPREDUCE;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Big data processing is receiving significant amount of interest as an important technology to reveal the information behind the data, such as trends, characteristics, etc. MapReduce is considered as the most efficient distributed parallel data processing framework. However, some high-end applications, especially some scientific analyses have both data-intensive and computation-intensive features. Current big data processing techniques like Hadoop are not designed for computation-intensive applications, thus have insufficient computation power. In this paper, we presented Lit, a high performance massive data computing framework based on CPU/GPU cluster. Lit integrated GPU with Hadoop to improve the computational power of each node in the cluster. Since the architecture and programming model of GPU is different from CPU, Lit provided an annotation based approach to automatically generate CUDA codes from Hadoop codes. Lit hided the complexity of programming on CPU/GPU cluster by providing extended compiler and optimizer. To utilize the simplified programming, scalability and fault tolerance benefits of Hadoop and combine them with the high performance computation power of GPU, Lit extended the Hadoop by applying a GPUClassloader to detect the GPU, generate and compile CUDA codes, and invoke the shared library. Our experimental results show that Lit can achieve an average speedup of 1x to 3x on three typical applications over Hadoop.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Mille Cheval: a GPU-based in-memory high-performance computing framework for accelerated processing of big-data streams
    Kumar, Vivek
    Sharma, Dilip Kumar
    Mishra, Vinay Kumar
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (07): : 6936 - 6960
  • [42] Mille Cheval: a GPU-based in-memory high-performance computing framework for accelerated processing of big-data streams
    Vivek Kumar
    Dilip Kumar Sharma
    Vinay Kumar Mishra
    The Journal of Supercomputing, 2021, 77 : 6936 - 6960
  • [43] Embedded GPU Cluster Computing Framework for Inference of Convolutional Neural Networks
    Kain, Evan
    Wildenstein, Diego
    Pineda, Andrew C.
    2019 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2019,
  • [44] A multi-GPU based high-performance computing framework in elastodynamics simulation using octree meshes
    Mohammadian, Shayan
    Kumar, Ankit S.
    Song, Chongmin
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2025, 436
  • [45] High Performance FFT Based Poisson Solver on a CPU-GPU Heterogeneous Platform
    Wu, Jing
    JaJa, Joseph
    IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013), 2013, : 115 - 125
  • [46] Efficient parallel implementation of crowd simulation using a hybrid CPU plus GPU high performance computing system
    Skrzypczak, Jakub
    Czarnul, Pawel
    SIMULATION MODELLING PRACTICE AND THEORY, 2023, 123
  • [47] Performance Optimization Strategies of High Performance Computing on GPU
    Ma, Anguo
    Cai, Jing
    Cheng, Yu
    Ni, Xiaoqiang
    Tang, Yuxing
    Xing, Zuocheng
    ADVANCED PARALLEL PROCESSING TECHNOLOGIES, PROCEEDINGS, 2009, 5737 : 150 - 164
  • [48] GPU-based high-performance computing for radiation therapy
    Jia, Xun
    Ziegenhein, Peter
    Jiang, Steve B.
    PHYSICS IN MEDICINE AND BIOLOGY, 2014, 59 (04): : R151 - R182
  • [49] Analysis and Optimization of Massive Data Processing on High Performance Computing Architecture
    Huang, He
    Li, Shanshan
    Yi, Xiaodong
    Zhang, Feng
    Liao, Xiangke
    Dong, Pan
    THIRD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, GRIDS, AND VIRTUALIZATION (CLOUD COMPUTING 2012), 2012, : 186 - 191
  • [50] GPU Clusters for High-Performance Computing
    Kindratenko, Volodymyr V.
    Enos, Jeremy J.
    Shi, Guochun
    Showerman, Michael T.
    Arnold, Galen W.
    Stone, John E.
    Phillips, James C.
    Hwu, Wen-mei
    2009 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING AND WORKSHOPS, 2009, : 638 - +