Exploiting Matrix Dependency for Efficient Distributed Matrix Computation

被引:19
|
作者
Yu, Lele [1 ]
Shao, Yingxia [1 ]
Cui, Bin [1 ]
机构
[1] Peking Univ, Sch EECS, Key Lab High Confidence Software Technol MOE, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
matrix computing; dependency analysis; distributed system;
D O I
10.1145/2723372.2723712
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed matrix computation is a popular approach for many large-scale data analysis and machine learning tasks. However existing distributed matrix computation systems generally incur heavy communication cost during the runtime, which degrades the overall performance. In this paper, we propose a novel matrix computation system, named DMac, which exploits the matrix dependencies in matrix programs for efficient matrix computation in the distributed environment. We decompose each matrix program into a sequence of operations, and reveal the matrix dependencies between operations in the program. We next design a dependency-oriented cost model to select an optimal execution strategy for each operation, and generate a communication efficient execution plan for the matrix computation program. To facilitate the matrix computation in distributed systems, we further divide the execution plan into multiple un-interleaved stages which can run in a distributed cluster with efficient local execution strategy on each worker. The DMac system has been implemented on a popular general-purpose data processing framework, Spark. The experimental results demonstrate that our techniques can significantly improve the performance of a wide range of matrix programs.
引用
收藏
页码:93 / 105
页数:13
相关论文
共 50 条
  • [41] An efficient implementation of matrix clocks in the distributed computing environment
    Singh, Avaneesh
    Badal, Neelendra
    [J]. 2015 INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT), 2015, : 275 - 279
  • [42] An efficient implementation of multi matrix clocks in the distributed system
    Singh, Avaneesh
    Badal, Neelendra
    [J]. 2015 INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT), 2015, : 209 - 213
  • [43] Efficient computation of the discrete autocorrelation wavelet inner product matrix
    Eckley, IA
    Nason, GP
    [J]. STATISTICS AND COMPUTING, 2005, 15 (02) : 83 - 92
  • [44] Efficient computation of the inverse of gametic relationship matrix for a marked QTL
    Mehdi Sargolzaei
    Hiroaki Iwaisaki
    Jean-Jacques Colleau
    [J]. Genetics Selection Evolution, 38 (3)
  • [45] Efficient computation of the discrete autocorrelation wavelet inner product matrix
    Idris A. Eckley
    Guy P. Nason
    [J]. Statistics and Computing, 2005, 15 : 83 - 92
  • [46] EFFICIENT COMPUTATION OF ENCLOSURES FOR THE EXACT SOLVENTS OF A QUADRATIC MATRIX EQUATION
    Hashemi, Behnam
    Dehghan, Mehdi
    [J]. ELECTRONIC JOURNAL OF LINEAR ALGEBRA, 2010, 20 : 519 - 536
  • [47] Efficient Parallel Computation of the Characteristic Polynomial of a Sparse, Separable Matrix
    J. H. Reif
    [J]. Algorithmica, 2001, 29 : 487 - 510
  • [48] Enabling Efficient Publicly Verifiable Outsourcing Computation for Matrix Multiplication
    Li, Hongwei
    Zhang, Shenmin
    Luan, Tom H.
    Ren, Hao
    Dai, Yuanshun
    Zhou, Liang
    [J]. 25TH INTERNATIONAL TELECOMMUNICATION NETWORKS AND APPLICATIONS CONFERENCE (ITNAC 2015), 2015, : 44 - 50
  • [49] Efficient Secure Outsourcing Computation of Matrix Multiplication in Cloud Computing
    Zhang, Shenmin
    Li, Hongwei
    Jia, Kun
    Dai, Yunshun
    Zhao, Lian
    [J]. 2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,
  • [50] EFFICIENT COMPUTATION OF THE MATRIX EXPONENTIAL USING PADE-APPROXIMATION
    LIN, YY
    HWANG, LP
    [J]. COMPUTERS & CHEMISTRY, 1992, 16 (04): : 285 - 293