Parallel Tensor Compression for Large-Scale Scientific Data

被引:84
|
作者
Austin, Woody [1 ]
Ballard, Grey [2 ]
Kolda, Tamara G. [2 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
[2] Sandia Natl Labs, Livermore, CA USA
关键词
Tucker tensor decomposition; compression; SINGULAR-VALUE DECOMPOSITION; COLLECTIVE COMMUNICATION; ALGORITHMS;
D O I
10.1109/IPDPS.2016.67
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As parallel computing trends towards the exascale, scientific data produced by high-fidelity simulations are growing increasingly massive. For instance, a simulation on a three-dimensional spatial grid with 512 points per dimension that tracks 64 variables per grid point for 128 time steps yields 8 TB of data, assuming double precision. By viewing the data as a dense five-way tensor, we can compute a Tucker decomposition to find inherent low-dimensional multilinear structure, achieving compression ratios of up to 5000 on real-world data sets with negligible loss in accuracy. So that we can operate on such massive data, we present the first-ever distributed-memory parallel implementation for the Tucker decomposition, whose key computations correspond to parallel linear algebra operations, albeit with nonstandard data layouts. Our approach specifies a data distribution for tensors that avoids any tensor data redistribution, either locally or in parallel. We provide accompanying analysis of the computation and communication costs of the algorithms. To demonstrate the compression and accuracy of the method, we apply our approach to real-world data sets from combustion science simulations. We also provide detailed performance results, including parallel performance in both weak and strong scaling experiments.
引用
收藏
页码:912 / 922
页数:11
相关论文
共 50 条
  • [31] Fundamental tensor operations for large-scale data analysis using tensor network formats
    Lee, Namgil
    Cichocki, Andrzej
    [J]. MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2018, 29 (03) : 921 - 960
  • [32] A survey of the techniques of volume rendering for large-scale scientific data
    Wang H.
    He L.
    Cao Y.
    Xiao L.
    [J]. Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2020, 42 (02): : 1 - 12
  • [33] DISTRIBUTED LARGE-SCALE TENSOR DECOMPOSITION
    de Almeida, Andre L. F.
    Kibangou, Alain Y.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [34] High-Order Tensor Decomposition for Large-Scale Data Analysis
    Li, Longzhuang
    Boulware, Douglas
    [J]. 2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 665 - 668
  • [35] A data parallel approach for large-scale Gaussian process modeling
    Choudhury, A
    Nair, PB
    Keane, AJ
    [J]. PROCEEDINGS OF THE SECOND SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2002, : 95 - 111
  • [36] A Workflow for Parallel and Distributed Computing of Large-Scale Genomic Data
    Choi, Hyun-Hwa
    Kim, Byoung-Seob
    Ahn, Shin-Young
    Bae, Seung-Jo
    [J]. 2013 8TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2013, : 215 - 218
  • [37] Dynamic group communication for large-scale parallel data mining
    Katti, Amogh
    Di Fatta, Giuseppe
    [J]. CONCURRENT ENGINEERING-RESEARCH AND APPLICATIONS, 2013, 21 (03): : 227 - 234
  • [38] Parallel Partial Reduction for Large-Scale Data Analysis and Visualization
    He, Wenbin
    Guo, Hanqi
    Peterka, Tom
    Di, Sheng
    Cappello, Franck
    Shen, Han-Wei
    [J]. 2018 IEEE 8TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION (LDAV), 2018, : 45 - 55
  • [39] Parallel Approach and Platform for Large-scale Web Data Extraction
    Shen, Yi
    Shi, Shengsheng
    Wang, Haitao
    Wei, Wu
    Yuan, Chunfeng
    Huang, Yihua
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2013, : 192 - 196
  • [40] Parallel stochastic configuration networks for large-scale data regression
    Zhang, Chenglong
    Ding, Shifei
    Zhang, Jian
    Jia, Weikuan
    [J]. APPLIED SOFT COMPUTING, 2021, 103