tpSpMV: A two-phase large-scale sparse matrix-vector multiplication kernel for manycore architectures

Cited by: 10
Authors
Chen, Yuedan [1, 2]
Xiao, Guoqing [1, 2]
Wu, Fan [1, 2]
Tang, Zhuo [1, 2]
Li, Keqin [1, 2, 3]
Affiliations
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China
[2] Natl Supercomp Ctr Changsha, Changsha 410082, Hunan, Peoples R China
[3] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA
Funding
National Natural Science Foundation of China
Keywords
CSR; Manycore; Parallelization; Sparse matrix-vector multiplication (SpMV); SW26010; SPMV; OPTIMIZATION; SYSTEMS;
DOI
10.1016/j.ins.2020.03.020
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Sparse matrix-vector multiplication (SpMV) is an important subroutine in numerical linear algebra and is widely used in large-scale applications. Accelerating SpMV on multicore and manycore architectures using the Compressed Sparse Row (CSR) format with row-wise parallelization is one of the most popular directions. However, optimizing parallel CSR-based SpMV faces three main challenges: (a) the limited local memory of each computing unit can be overwhelmed when long rows of large-scale sparse matrices are assigned to it; (b) irregular accesses to the input vector incur high memory access latency; (c) the sparse data structure leads to low bandwidth utilization. This paper proposes a two-phase large-scale SpMV kernel, called tpSpMV, that is designed around the memory structure and computing architecture of multicore and manycore processors to alleviate these three difficulties. First, we propose a two-phase parallel execution technique that splits parallel CSR-based SpMV into two separate phases to overcome the computational scale limitation. Second, for each of the two phases we propose adaptive partitioning methods and parallelization designs that use local memory caching to exploit the architectural advantages of high-performance computing platforms and reduce memory access latency. Third, we design several optimizations, such as data reduction, aligned memory access, and pipelining, to improve bandwidth utilization and tpSpMV's performance. Experimental results on the SW26010 CPUs of the Sunway TaihuLight supercomputer show that tpSpMV achieves speedups of up to 28.61× and an average performance improvement of 13.16% over the state-of-the-art work. © 2020 Elsevier Inc. All rights reserved.
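For readers unfamiliar with the CSR layout the abstract refers to, the following is a minimal Python sketch of row-wise CSR SpMV. The abstract does not say what each of tpSpMV's two phases computes, so the second function is only one generic way to split the work (element-wise products over fixed-size chunks of nonzeros, then a per-row reduction) and should be read as an assumption for illustration, not as the authors' method. The names `csr_spmv`, `two_phase_spmv_sketch`, and `chunk` are hypothetical.

```python
from typing import List, Sequence


def csr_spmv(row_ptr: Sequence[int], col_idx: Sequence[int],
             values: Sequence[float], x: Sequence[float]) -> List[float]:
    """Row-wise CSR SpMV: each row i is an independent unit of work."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        acc = 0.0
        # Nonzeros of row i occupy positions row_ptr[i] .. row_ptr[i+1]-1.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            acc += values[k] * x[col_idx[k]]  # irregular access into x
        y[i] = acc
    return y


def two_phase_spmv_sketch(row_ptr: Sequence[int], col_idx: Sequence[int],
                          values: Sequence[float], x: Sequence[float],
                          chunk: int = 2) -> List[float]:
    """Generic two-phase split (an assumption, NOT the paper's algorithm)."""
    nnz = len(values)
    products = [0.0] * nnz
    # Phase 1: multiply in fixed-size chunks of nonzeros, so the work per
    # chunk is bounded regardless of how long any single row is.
    for start in range(0, nnz, chunk):
        for k in range(start, min(start + chunk, nnz)):
            products[k] = values[k] * x[col_idx[k]]
    # Phase 2: reduce the partial products belonging to each row.
    n_rows = len(row_ptr) - 1
    return [sum(products[row_ptr[i]:row_ptr[i + 1]]) for i in range(n_rows)]


# 3x3 example matrix [[1, 0, 2], [0, 3, 0], [4, 0, 5]] in CSR form.
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 1, 0, 2]
values = [1.0, 2.0, 3.0, 4.0, 5.0]
x = [1.0, 2.0, 3.0]

y_rowwise = csr_spmv(row_ptr, col_idx, values, x)
y_two_phase = two_phase_spmv_sketch(row_ptr, col_idx, values, x)
```

Both functions return the same vector for this input. The chunked variant is shown only because bounding the work per chunk is one plausible response to challenge (a), where long rows overflow a computing unit's local memory.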
Pages: 279-295
Number of pages: 17