Lit: A High Performance Massive Data Computing Framework Based on CPU/GPU Cluster

被引:0
|
作者
Zhai, Yanlong [1 ]
Mbarushimana, Emmanuel [1 ]
Li, Wei [2 ]
Zhang, Jing [2 ]
Guo, Ying [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci, Beijing Engn Res Ctr Mass Language Informat Proc, Beijing 100081, Peoples R China
[2] Sci & Technol Complex Syst Simulat, Beijing, Peoples R China
关键词
MAPREDUCE;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Big data processing is receiving significant amount of interest as an important technology to reveal the information behind the data, such as trends, characteristics, etc. MapReduce is considered as the most efficient distributed parallel data processing framework. However, some high-end applications, especially some scientific analyses have both data-intensive and computation-intensive features. Current big data processing techniques like Hadoop are not designed for computation-intensive applications, thus have insufficient computation power. In this paper, we presented Lit, a high performance massive data computing framework based on CPU/GPU cluster. Lit integrated GPU with Hadoop to improve the computational power of each node in the cluster. Since the architecture and programming model of GPU is different from CPU, Lit provided an annotation based approach to automatically generate CUDA codes from Hadoop codes. Lit hided the complexity of programming on CPU/GPU cluster by providing extended compiler and optimizer. To utilize the simplified programming, scalability and fault tolerance benefits of Hadoop and combine them with the high performance computation power of GPU, Lit extended the Hadoop by applying a GPUClassloader to detect the GPU, generate and compile CUDA codes, and invoke the shared library. Our experimental results show that Lit can achieve an average speedup of 1x to 3x on three typical applications over Hadoop.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Design and Optimization of a Big Data Computing Framework based on CPU/GPU Cluster
    Zhai, Yanlong
    Guo, Ying
    Chen, Qiurui
    Yang, Kai
    Mbarushimana, Emmanuel
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 1039 - 1046
  • [2] HETEROGENEOUS GPU&CPU CLUSTER FOR HIGH PERFORMANCE COMPUTING IN CRYPTOGRAPHY
    Marks, Michal
    Jantura, Jaroslaw
    Niewiadomska-Szynkiewicz, Ewa
    Strzelczyk, Przemyslaw
    Gozdz, Krzysztof
    COMPUTER SCIENCE-AGH, 2012, 13 (02): : 63 - 79
  • [3] A MapReduce Computing Framework Based on GPU Cluster
    Gao, Heng
    Tang, Jie
    Wu, Gangshan
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 1902 - 1907
  • [4] Research on LogGP Based Parallel Computing Model for CPU/GPU Cluster
    Wu, Yongwen
    Song, Junqiang
    Ren, Kaijun
    Li, Xiaoyong
    INFORMATION TECHNOLOGY AND INTELLIGENT TRANSPORTATION SYSTEMS, VOL 2, 2017, 455 : 409 - 420
  • [5] High performance computing and quantum trajectory method in CPU and GPU systems
    Wisniewska, Joanna
    Sawerwain, Marek
    Leonski, Wieslaw
    3RD INTERNATIONAL CONFERENCE ON MATHEMATICAL MODELING IN PHYSICAL SCIENCES (IC-MSQUARE 2014), 2015, 574
  • [6] Benchmarking of High Performance Computing Clusters with Heterogeneous CPU/GPU Architecture
    Sukharev, Pavel V.
    Vasilyev, Nikolay P.
    Rovnyagin, Mikhail M.
    Durnov, Maxim A.
    PROCEEDINGS OF THE 2017 IEEE RUSSIA SECTION YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING CONFERENCE (2017 ELCONRUS), 2017, : 574 - 577
  • [7] Parallel Optimization of Relion: Performance Comparison based on Cluster for CPU/GPU and KNL
    Zhou, Heng
    Ni, FuChuan
    Zhao, Liang
    Zheng, Fang
    PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND SYSTEMS (ICACS 2018), 2018, : 48 - 52
  • [8] Learning Based Performance and Power Efficient Cluster Resource Manager for CPU-GPU Cluster
    Das, Soumen Kumar
    Sudhakaran, G.
    Ashok, V.
    2014 FOURTH INTERNATIONAL CONFERENCE OF EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2014, : 161 - 166
  • [9] From CPU to GPU: GPU-based electromagnetic computing (GPUECO)
    Tao, Y. B.
    Lin, H.
    Bao, H. J.
    PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2008, 81 : 1 - 19
  • [10] CPU and GPU parallel efficiency of ARM based single board computing cluster for CFD applications
    Di Pierro, Bastien
    Hank, Sarah
    COMPUTERS & FLUIDS, 2024, 272