Massively Parallel Computation of Linear Recurrence Equations with Graphics Processing Units

Cited by: 3
Authors
Sung, Wonyong [1]
Lee, Dong-hwan [1]
Hwang, Kyuyeon [1]
Affiliations
[1] Seoul Natl Univ, Dept Elect Engn & Comp Sci, Seoul, South Korea
Funding
National Research Foundation, Singapore
Keywords
Graphics processing unit (GPU); massively parallel processing; linear recurrence equation; prefix-sum; scan; implementation; algorithm
DOI
10.1145/3229631.3229649
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Graphics processing units (GPUs) show very high performance when executing many parallel programs; however, their use in solving linear recurrence equations is considered difficult because of the sequential nature of the problem. Previously developed parallel algorithms, such as recursive doubling and multi-block processing, do not achieve high efficiency on GPUs because they scale poorly with the number of threads. In this work, we have developed a highly efficient GPU-based algorithm for recurrences using a thread-level parallel (TLP) approach instead of conventional thread-block-level parallel (TBLP) methods. The proposed TLP method executes all of the threads as independently as possible to improve computational efficiency and employs a hierarchical structure for inter-thread communication. Recurrence equations with both constant and time-varying coefficients are implemented on NVIDIA GTX285, GTX580, and GTX TITAN X GPUs, and their performance is compared with results on single-core and multi-core SIMD CPU-based PCs.
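To make the problem class concrete, the sketch below shows a first-order linear recurrence with time-varying coefficients, y[n] = a[n] * y[n-1] + x[n], computed two ways: sequentially, and as an inclusive scan using recursive doubling, one of the earlier approaches the abstract names. This is not the authors' TLP algorithm. The function names, the first-order form, and the initial value `y0` are assumptions made for illustration, and the inner loop runs serially in Python where a GPU would assign one thread per element.

```python
def recurrence_sequential(a, x, y0=0.0):
    """Reference loop: y[n] = a[n] * y[n-1] + x[n], with y[-1] = y0."""
    y = []
    prev = y0
    for an, xn in zip(a, x):
        prev = an * prev + xn
        y.append(prev)
    return y


def combine(left, right):
    """Compose two affine maps y -> a*y + b, where 'left' is applied first."""
    a1, b1 = left
    a2, b2 = right
    return (a2 * a1, a2 * b1 + b2)


def recurrence_recursive_doubling(a, x, y0=0.0):
    """Inclusive scan over (a[n], x[n]) pairs by recursive doubling.

    Each pass of the while loop stands for one parallel step: element i
    combines with the element 'stride' positions before it. The number of
    passes grows as log2(n), but total work grows as n * log2(n).
    """
    pairs = list(zip(a, x))
    n = len(pairs)
    stride = 1
    while stride < n:
        prev = pairs  # read only from the previous pass (double buffering)
        pairs = [
            combine(prev[i - stride], prev[i]) if i >= stride else prev[i]
            for i in range(n)
        ]
        stride *= 2
    # pairs[i] is now (A, B) such that y[i] = A * y0 + B
    return [A * y0 + B for A, B in pairs]
```

The scan works because composing affine maps is associative, so the partial products can be grouped in any order. The extra work factor of log2(n) in this formulation is one commonly cited reason such schemes can be inefficient, which is consistent with the scalability concern raised in the abstract.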
Pages: 10-17
Number of pages: 8