Low-overhead Load-balanced Scheduling for Sparse Tensor Computations

被引:0
|
作者
Baskaran, Muthu [1 ]
Meister, Benoit [1 ]
Lethin, Richard [1 ]
机构
[1] Reservoir Labs Inc, New York, NY 10012 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Irregular computations over large-scale sparse data are prevalent in critical data applications and they have significant room for improvement on modern computer systems from the aspects of parallelism and data locality. We introduce new techniques to efficiently map large irregular computations onto modern multi-core systems with non-uniform memory access (NUMA) behavior. Our techniques are broadly applicable for irregular computations with multi-dimensional sparse arrays (or sparse tensors). We implement a static-cum-dynamic task scheduling scheme with low overhead for effective parallelization of sparse computations. We introduce locality-aware optimizations to the task scheduling mechanism that are driven by the sparse input data pattern. We evaluate our techniques using two popular sparse tensor decomposition methods that have wide applications in data mining, graph analysis, signal processing, and elsewhere. Our techniques not only improve parallel performance but also result in improved performance scalability with increasing number of cores. We achieve around 45x improvement in performance over existing parallel approaches and observe "scalable" parallel performance on modern multi-core systems with up to 32 processor cores. We take real sparse data sets as input to the sparse tensor computations and demonstrate the achieved improvements.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Load-Balanced Sparse MTTKRP on GPUs
    Nisa, Israt
    Li, Jiajia
    Sukumaran-Rajam, Aravind
    Vuduc, Richard
    Sadayappan, P.
    [J]. 2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, : 123 - 133
  • [2] Low-overhead scheduling of nested parallelism
    Hummel, S.F.
    Schonberg, E.
    [J]. 1600, (35): : 5 - 6
  • [3] A Path-Length Efficient, Low-Overhead, Load-Balanced Routing Protocol for Maximum Network Lifetime in Wireless Sensor Networks with Holes
    Phi Le Nguyen
    Thanh Hung Nguyen
    Kien Nguyen
    [J]. SENSORS, 2020, 20 (09)
  • [4] LOW-OVERHEAD SCHEDULING OF NESTED PARALLELISM
    HUMMEL, SF
    SCHONBERG, E
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1991, 35 (5-6) : 743 - 765
  • [5] Hybrid Load-Balanced Scheduling in Scalable Cloud Environment
    Jayswal, Anant Kumar
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2020, 11 (03) : 62 - 78
  • [6] Adaptive TDM Scheduling Scheme for Load-Balanced Switches
    Xia, Yu
    Zeng, Huaxin
    Shen, Zhijun
    Gao, Zhijiang
    [J]. IEEE COMMUNICATIONS LETTERS, 2011, 15 (07) : 758 - 760
  • [7] Low-Delay Scheduling for Internet of Vehicles: Load-Balanced Multipath Communication With FEC
    Pokhrel, Shiva Raj
    Choi, Jinho
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (12) : 8489 - 8501
  • [8] Low-overhead uplink scheduling through load prediction for WiMAX real-time services
    Nie, W.
    Wang, H.
    Xiong, N.
    [J]. IET COMMUNICATIONS, 2011, 5 (08) : 1060 - 1067
  • [9] Load-balanced sparse matrix-vector multiplication on parallel computers
    Nastea, SG
    Frieder, O
    El-Ghazawi, T
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1997, 46 (02) : 180 - 193
  • [10] LBcast: Load-Balanced Broadcast Scheduling for Low-Duty-Cycle Wireless Sensor Networks
    Xu, Lijie
    Chen, Guihai
    [J]. 2013 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2013, : 7 - 12