Exploiting Locality in Sparse Matrix-Matrix Multiplication on Many-Core Architectures

被引:27
|
作者
Akbudak, Kadir [1 ]
Aykanat, Cevdet [1 ]
机构
[1] Bilkent Univ, Comp Engn Dept, TR-06800 Ankara, Turkey
关键词
Data locality; sparse matrix; sparse matrix-matrix multiplication; SpGEMM; computational hypergraph model; hypergraph partitioning; hypergraph clustering; graph model; bipartite graph model; graph partitioning; graph clustering; many-core architecture; Intel Xeon Phi; IMPLEMENTATION; MODELS;
D O I
10.1109/TPDS.2017.2656893
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Exploiting spatial and temporal localities is investigated for efficient row-by-row parallelization of general sparse matrix-matrix multiplication (SpGEMM) operation of the form C = AB on many-core architectures. Hypergraph and bipartite graph models are proposed for 1D rowwise partitioning of matrix A to evenly partition the work across threads with the objective of reducing the number of B-matrix words to be transferred from the memory and between different caches. A hypergraph model is proposed for B-matrix column reordering to exploit spatial locality in accessing entries of thread-private temporary arrays, which are used to accumulate results for C-matrix rows. A similarity graph model is proposed for B-matrix row reordering to increase temporal reuse of these accumulation array entries. The proposed models and methods are tested on a wide range of sparse matrices from real applications and the experiments were carried on a 60-core Intel Xeon Phi processor, as well as a two-socket Xeon processor. Results show the validity of the models and methods proposed for enhancing the locality in parallel SpGEMM operations.
引用
收藏
页码:2258 / 2271
页数:14
相关论文
共 50 条
  • [41] Column-Segmented Sparse Matrix-Matrix Multiplication on Multicore CPUs
    An, Xiaojing
    Catalyurek, Umit, V
    [J]. 2021 IEEE 28TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2021), 2021, : 202 - 211
  • [42] Sparse matrix-vector multiplication on the Single-Chip Cloud Computer many-core processor
    Pichel, Juan C.
    Rivera, Francisco F.
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2013, 73 (12) : 1539 - 1550
  • [43] Predicting optimal sparse general matrix-matrix multiplication algorithm on GPUs
    Wei, Bingxin
    Wang, Yizhuo
    Chang, Fangli
    Gao, Jianhua
    Ji, Weixing
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2024, 38 (03): : 245 - 259
  • [44] Generalized Sparse Matrix-Matrix Multiplication for Vector Engines and Graph Applications
    Li, Jiayu
    Wang, Fugang
    Araki, Takuya
    Qiu, Judy
    [J]. PROCEEDINGS OF MCHPC'19: 2019 IEEE/ACM WORKSHOP ON MEMORY CENTRIC HIGH PERFORMANCE COMPUTING (MCHPC), 2019, : 33 - 42
  • [45] A framework for general sparse matrix-matrix multiplication on GPUs and heterogeneous processors
    Liu, Weifeng
    Vinter, Brian
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2015, 85 : 47 - 61
  • [46] Locality-aware parallel block-sparse matrix-matrix multiplication using the Chunks and Tasks programming model
    Rubensson, Emanuel H.
    Rudberg, Elias
    [J]. PARALLEL COMPUTING, 2016, 57 : 87 - 106
  • [47] A high-performance matrix-matrix multiplication methodology for CPU and GPU architectures
    Kelefouras, Vasilios
    Kritikakou, A.
    Mporas, Iosif
    Kolonias, Vasilios
    [J]. JOURNAL OF SUPERCOMPUTING, 2016, 72 (03): : 804 - 844
  • [48] Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Modern Multi- and Many-Core Processors
    Elafrou, Athena
    Goumas, Georgios
    Koziris, Nectarios
    [J]. 2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2017, : 292 - 301
  • [49] Exploiting memory allocations in clusterised many-core architectures
    Garibotti, Rafael
    Ost, Luciano
    Butko, Anastasiia
    Reis, Ricardo
    Gamatie, Abdoulaye
    Sassatelli, Gilles
    [J]. IET COMPUTERS AND DIGITAL TECHNIQUES, 2019, 13 (04): : 302 - 311
  • [50] Register-based Implementation of the Sparse General Matrix-Matrix Multiplication on GPUs
    Liu, Junhong
    He, Xin
    Liu, Weifeng
    Tan, Guangming
    [J]. ACM SIGPLAN NOTICES, 2018, 53 (01) : 407 - 408