Exploiting Matrix Dependency for Efficient Distributed Matrix Computation

被引:19
|
作者
Yu, Lele [1 ]
Shao, Yingxia [1 ]
Cui, Bin [1 ]
机构
[1] Peking Univ, Sch EECS, Key Lab High Confidence Software Technol MOE, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
matrix computing; dependency analysis; distributed system;
D O I
10.1145/2723372.2723712
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed matrix computation is a popular approach for many large-scale data analysis and machine learning tasks. However existing distributed matrix computation systems generally incur heavy communication cost during the runtime, which degrades the overall performance. In this paper, we propose a novel matrix computation system, named DMac, which exploits the matrix dependencies in matrix programs for efficient matrix computation in the distributed environment. We decompose each matrix program into a sequence of operations, and reveal the matrix dependencies between operations in the program. We next design a dependency-oriented cost model to select an optimal execution strategy for each operation, and generate a communication efficient execution plan for the matrix computation program. To facilitate the matrix computation in distributed systems, we further divide the execution plan into multiple un-interleaved stages which can run in a distributed cluster with efficient local execution strategy on each worker. The DMac system has been implemented on a popular general-purpose data processing framework, Spark. The experimental results demonstrate that our techniques can significantly improve the performance of a wide range of matrix programs.
引用
收藏
页码:93 / 105
页数:13
相关论文
共 50 条
  • [31] Efficient computation of the spectral projections of regular matrix pairs
    Sadkane, Miloud
    Sidje, Roger B.
    [J]. JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2016, 298 : 72 - 81
  • [32] EFFICIENT GLOBAL MATRIX APPROACH TO THE COMPUTATION OF SYNTHETIC SEISMOGRAMS
    SCHMIDT, H
    TANGO, G
    [J]. GEOPHYSICAL JOURNAL OF THE ROYAL ASTRONOMICAL SOCIETY, 1986, 84 (02): : 331 - 359
  • [33] EFFICIENT LARGE SCALED PROFILE MATRIX COMPUTATION PROGRAMS
    PARK, MS
    [J]. COMPUTERS & STRUCTURES, 1995, 54 (04) : 731 - 742
  • [34] Efficient computation of the matrix exponential by generalized polar decompositions
    Iserles, A
    Zanna, A
    [J]. SIAM JOURNAL ON NUMERICAL ANALYSIS, 2005, 42 (05) : 2218 - 2256
  • [35] Efficient computation of matrix elements of generic Slater determinants
    Rodriguez-Laguna, Javier
    Miguel Robledo, Luis
    Dukelsky, Jorge
    [J]. PHYSICAL REVIEW A, 2020, 101 (01)
  • [36] Efficient Computation of the Fisher Information Matrix in the EM Algorithm
    Meng, Lingyao
    Spall, James C.
    [J]. 2017 51ST ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2017,
  • [37] In-network computation of the Transition Matrix for Distributed Subspace Projection
    Insausti, Xabier
    Crespo, Pedro M.
    Beferull-Lozano, Baltasar
    [J]. 2012 IEEE 8TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS (DCOSS), 2012, : 124 - 131
  • [38] Distributed and Private Coded Matrix Computation with Flexible Communication Load
    Aliasgari, Malihe
    Simeone, Osvaldo
    Kliewer, Jorg
    [J]. 2019 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2019, : 1092 - 1096
  • [39] CodedSketch: A Coding Scheme for Distributed Computation of Approximated Matrix Multiplication
    Jahani-Nezhad, Tayyebeh
    Maddah-Ali, Mohammad Ali
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (06) : 4185 - 4196
  • [40] Efficient distributed skyline computation using dependency-based data partitioning
    Yin, Bo
    Zhou, Siwang
    Lin, Yaping
    Liu, Yonghe
    Hu, Yupeng
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2014, 93 : 69 - 83