Parallel Triangular Matrix System Solving on CPU-GPU System

被引:0
|
作者
Mahfoudhi, Ryma [1 ]
Achour, Sarni [1 ]
Mahjoub, Zaher [1 ]
机构
[1] Univ Tunis El Manar, Fac Sci Turns, Univ Campus,Manar 2, Tunis 2092, Tunisia
关键词
Divide & Conquer; GPU; Heterogeneous Linear Algebra; Recursive Algorithms; Triangular Matrix System Solving;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
GPU-accelerated computing consists in using a graphics processing unit (GPU) together with a CPU in order to enhance the performance of scientific and engineering applications. With the increasing spread of GPUs as hardware accelerators for scientific applications, several optimized linear algebra libraries have emerged to make use of this additional computing power. In this paper we present an implementation of a recursive algorithm for triangular matrix system solving targeting a hybrid multicore + GPU system.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Performance Optimization for CPU-GPU Heterogeneous Parallel System
    Wang, Yanhua
    Qiao, Jianzhong
    Lin, Shukuan
    Zhao, Tinglei
    [J]. 2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1259 - 1266
  • [2] Optimization of Parallel Algorithm for Kalman Filter on CPU-GPU Heterogeneous System
    Xu, Dandan
    Xiao, Zheng
    Li, Dapu
    Wu, Fan
    [J]. 2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 2165 - 2172
  • [3] Profiling based optimization method for CPU-GPU heterogeneous parallel processing system
    Zhang, Bao
    Dong, Xiaoshe
    Bai, Xiuxiu
    Cao, Haijun
    Liu, Chao
    Mei, Yiduo
    [J]. Dong, X., 1600, Xi'an Jiaotong University (46): : 17 - 23
  • [4] Parallel computation of Entropic Lattice Boltzmann method on hybrid CPU-GPU accelerated system
    Ye, Yu
    Li, Kenli
    Wang, Yan
    Deng, Tan
    [J]. COMPUTERS & FLUIDS, 2015, 110 : 114 - 121
  • [5] Parallel Graph Partitioning on a CPU-GPU Architecture
    Goodarzi, Bahareh
    Burtscher, Martin
    Goswami, Dhrubajyoti
    [J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, : 58 - 66
  • [6] Fast Snippet Generation Based On CPU-GPU Hybrid System
    Liu, Ding
    Li, Ruixuan
    Gu, Xiwu
    Wen, Kunmei
    He, Heng
    Gao, Guoqiang
    [J]. 2011 IEEE 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2011, : 252 - 259
  • [7] Binomial American Option Pricing on CPU-GPU Hetergenous System
    Zhang, Nan
    Lei, Chi-Un
    Man, Ka Lok
    [J]. ENGINEERING LETTERS, 2012, 20 (03) : 279 - 285
  • [8] CPU-GPU hybrid parallel strategy for cosmological simulations
    Wang, Yueqing
    Dou, Yong
    Guo, Song
    Lei, Yuanwu
    Zou, Dan
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (03): : 748 - 765
  • [9] Heterogeneous parallel_for Template for CPU-GPU Chips
    Navarro, Angeles
    Corbera, Francisco
    Rodriguez, Andres
    Vilches, Antonio
    Asenjo, Rafael
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 47 (02) : 213 - 233
  • [10] EFFICIENT PARALLEL PROCESSING BY IMPROVED CPU-GPU INTERACTION
    Khatter, Harsh
    Aggarwal, Vaishali
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ISSUES AND CHALLENGES IN INTELLIGENT COMPUTING TECHNIQUES (ICICT), 2014, : 159 - 161