Mille Cheval: a GPU-based in-memory high-performance computing framework for accelerated processing of big-data streams

被引:0
|
作者
Vivek Kumar
Dilip Kumar Sharma
Vinay Kumar Mishra
机构
[1] Dr. A. P. J. Abdul Kalam Technical University,
[2] GLA University,undefined
[3] SRMGPC,undefined
来源
关键词
GPGPU; Stream; High-performance computing; HPC; In-memory; Single scan; Parallel; CUDA; Nvidia; Kernel;
D O I
暂无
中图分类号
学科分类号
摘要
Streams are temporally ordered, rapid changing, ample in volume, and infinite in nature. It is nearly impossible to store the entire data stream due to its large volume and high velocity. In this work, the principle of parallelism is employed to accelerate stream data computing. GPU-based high-performance computing (HPC) framework is proposed for accelerated processing of big-data streams using the in-memory data structure. We have implemented three parallel algorithms to prove the viability of the framework. The contributions of Mille Cheval are: (1) the viability of streaming on accelerators to increase throughput, (2) carefully chosen hash algorithms to achieve low collision rate and high randomness, and (3) memory sketches for approximation. The objective is to leverage the power of a single node using in-memory computing and hybrid computing. HPC does not always require high-end hardware but well-designed algorithms. Achievements of Mille Cheval are: (1) relative error is 1.32 when error rate and overestimate rate are chosen as 0.001 and (2) the host memory space requirement is just 63 MB for 1 terabyte of data. The proposed algorithms are pragmatic. It is evident from experimental results that the framework demonstrates 10X speed-up as compared with CPU implementations and 3X speed-up as compared with GPU implementations.
引用
收藏
页码:6936 / 6960
页数:24
相关论文
共 35 条
  • [1] Mille Cheval: a GPU-based in-memory high-performance computing framework for accelerated processing of big-data streams
    Kumar, Vivek
    Sharma, Dilip Kumar
    Mishra, Vinay Kumar
    [J]. JOURNAL OF SUPERCOMPUTING, 2021, 77 (07): : 6936 - 6960
  • [2] GPU-based high-performance computing for radiation therapy
    Jia, Xun
    Ziegenhein, Peter
    Jiang, Steve B.
    [J]. PHYSICS IN MEDICINE AND BIOLOGY, 2014, 59 (04): : R151 - R182
  • [3] High-Performance Genomic Analysis Framework with In-Memory Computing
    Li, Xueqi
    Tan, Guangming
    Wang, Bingchen
    Sun, Ninghui
    [J]. ACM SIGPLAN NOTICES, 2018, 53 (01) : 317 - +
  • [4] High-Performance Computing for Big Data Processing
    Wu, Yulei
    Xiang, Yang
    Ge, Jingguo
    Muller, Peter
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 88 : 693 - 695
  • [5] FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework
    Mei, Junyi
    Sun, Shixuan
    Li, Chao
    Xu, Cheng
    Chen, Cheng
    Liu, Yibo
    Wang, Jing
    Zhao, Cheng
    Hou, Xiaofeng
    Guo, Minyi
    He, Bingsheng
    Cong, Xiaoliang
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (08): : 1788 - 1801
  • [6] GPU-based high-performance computing of multichannel EEG phase wavelet synchronization
    Efitorov, Alexander
    Knyazeva, Irina
    Yulia, Boytsova
    Danko, Sergey
    [J]. 8TH ANNUAL INTERNATIONAL CONFERENCE ON BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, BICA 2017 (EIGHTH ANNUAL MEETING OF THE BICA SOCIETY), 2018, 123 : 128 - 133
  • [7] EVOLVE: Towards Converging Big-Data, High-Performance and Cloud-Computing Worlds
    Tzenetopoulos, Achilleas
    Masouros, Dimosthenis
    Koliogeorgi, Konstantina
    Xydis, Sotirios
    Soudris, Dimitrios
    Chazapis, Antony
    Kozanitis, Christos
    Bilas, Angelos
    Pinto, Christian
    Huy-Nam Nguyen
    Louloudakis, Stelios
    Gardikis, Georgios
    Vamvakas, George
    Aubrun, Michelle
    Symeonidou, Christy
    Spitadakis, Vassilis
    Xylogiannopoulos, Konstantinos
    Peischl, Bernhard
    Kalayci, Tahir Emre
    Stocker, Alexander
    Acquaviva, Jean-Thomas
    [J]. PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 975 - 980
  • [8] FlinkCL: An OpenCL-Based In-Memory Computing Architecture on Heterogeneous CPU-GPU Clusters for Big Data
    Chen, Cen
    Li, Kenli
    Ouyang, Aijia
    Li, Keqin
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (12) : 1765 - 1779
  • [9] GPU-based high-performance computing for integrated surface-sub-surface flow modeling
    Le, Phong V. V.
    Kumar, Praveen
    Valocchi, Albert J.
    Dang, Hoang-Vu
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2015, 73 : 1 - 13
  • [10] Anonymizing Big Data Streams Using In-memory Processing: A Novel Model Based on One-time Clustering
    Shamsinejad, Elham
    Banirostam, Touraj
    Pedram, Mir Mohsen
    Rahmani, Amir Masoud
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2024, 96 (6-7): : 333 - 356