Exploiting Locality in Sparse Matrix-Matrix Multiplication on Many-Core Architectures

被引:27
|
作者
Akbudak, Kadir [1 ]
Aykanat, Cevdet [1 ]
机构
[1] Bilkent Univ, Comp Engn Dept, TR-06800 Ankara, Turkey
关键词
Data locality; sparse matrix; sparse matrix-matrix multiplication; SpGEMM; computational hypergraph model; hypergraph partitioning; hypergraph clustering; graph model; bipartite graph model; graph partitioning; graph clustering; many-core architecture; Intel Xeon Phi; IMPLEMENTATION; MODELS;
D O I
10.1109/TPDS.2017.2656893
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Exploiting spatial and temporal localities is investigated for efficient row-by-row parallelization of general sparse matrix-matrix multiplication (SpGEMM) operation of the form C = AB on many-core architectures. Hypergraph and bipartite graph models are proposed for 1D rowwise partitioning of matrix A to evenly partition the work across threads with the objective of reducing the number of B-matrix words to be transferred from the memory and between different caches. A hypergraph model is proposed for B-matrix column reordering to exploit spatial locality in accessing entries of thread-private temporary arrays, which are used to accumulate results for C-matrix rows. A similarity graph model is proposed for B-matrix row reordering to increase temporal reuse of these accumulation array entries. The proposed models and methods are tested on a wide range of sparse matrices from real applications and the experiments were carried on a 60-core Intel Xeon Phi processor, as well as a two-socket Xeon processor. Results show the validity of the models and methods proposed for enhancing the locality in parallel SpGEMM operations.
引用
收藏
页码:2258 / 2271
页数:14
相关论文
共 50 条
  • [1] Multithreaded sparse matrix-matrix multiplication for many-core and GPU architectures
    Deveci, Mehmet
    Trott, Christian
    Rajamanickam, Sivasankaran
    [J]. PARALLEL COMPUTING, 2018, 78 : 33 - 46
  • [2] Performance-Portable Sparse Matrix-Matrix Multiplication for Many-Core Architectures
    Deveci, Mehmet
    Trott, Christian
    Rajamanickam, Sivasankaran
    [J]. 2017 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2017, : 693 - 702
  • [3] MEMORY-EFFICIENT SPARSE MATRIX-MATRIX MULTIPLICATION BY ROW MERGING ON MANY-CORE ARCHITECTURES
    Gremse, Felix
    Kuepper, Kerstin
    Naumann, Uwe
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2018, 40 (04): : C429 - C449
  • [4] Sparse Matrix-Matrix Multiplication on Modern Architectures
    Matam, Kiran
    Indarapu, Siva Rama Krishna Bharadwaj
    Kothapalli, Kishore
    [J]. 2012 19TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2012,
  • [5] Sparse Matrix Multiplication on a Reconfigurable Many-Core Architecture
    Pinhao, Joao
    Jose, Wilson
    Neto, Horacio
    Vestias, Mario
    [J]. 2015 EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD), 2015, : 330 - 336
  • [6] Adaptive Optimization of Sparse Matrix-Vector Multiplication on Emerging Many-Core Architectures
    Chen, Shizhao
    Fang, Jianbin
    Chen, Donglin
    Xu, Chuanfu
    Wang, Zheng
    [J]. IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 649 - 658
  • [7] Scale-Free Sparse Matrix-Vector Multiplication on Many-Core Architectures
    Liang, Yun
    Tang, Wai Teng
    Zhao, Ruizhe
    Lu, Mian
    Huynh Phung Huynh
    Goh, Rick Siow Mong
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2017, 36 (12) : 2106 - 2119
  • [8] EXPLOITING MULTIPLE LEVELS OF PARALLELISM IN SPARSE MATRIX-MATRIX MULTIPLICATION
    Azad, Ariful
    Ballard, Grey
    Buluc, Aydin
    Demmel, James
    Grigori, Laura
    Schwartz, Oded
    Toledo, Sivan
    Williams, Samuel
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2016, 38 (06): : C624 - C651
  • [9] Locality-Aware Parallel Sparse Matrix-Vector and Matrix-Transpose-Vector Multiplication on Many-Core Processors
    Karsavuran, M. Ozan
    Akbudak, Kadir
    Aykanat, Cevdet
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (06) : 1713 - 1726
  • [10] Performance optimization, modeling and analysis of sparse matrix-matrix products on multi-core and many-core processors
    Nagasaka, Yusuke
    Matsuoka, Satoshi
    Azad, Ariful
    Buluc, Aydin
    [J]. PARALLEL COMPUTING, 2019, 90