Hierarchical algorithms on hierarchical architectures

被引:15
|
作者
Keyes, D. E. [1 ]
Ltaief, H. [1 ]
Turkiyyah, G. [2 ]
机构
[1] King Abdullah Univ Sci & Technol, Extreme Comp Res Ctr, Thuwal 239556900, Saudi Arabia
[2] Amer Univ Beirut, Dept Comp Sci, Beirut 11072020, Lebanon
关键词
computational linear algebra; hierarchical matrices; exascale architectures; LOW-RANK APPROXIMATION; MATRICES; QR;
D O I
10.1098/rsta.2019.0055
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A traditional goal of algorithmic optimality, squeezing out flops, has been superseded by evolution in architecture. Flops no longer serve as a reasonable proxy for all aspects of complexity. Instead, algorithms must now squeeze memory, data transfers, and synchronizations, while extra flops on locally cached data represent only small costs in time and energy. Hierarchically low-rank matrices realize a rarely achieved combination of optimal storage complexity and high-computational intensity for a wide class of formally dense linear operators that arise in applications for which exascale computers are being constructed. They may be regarded as algebraic generalizations of the fast multipole method. Methods based on these hierarchical data structures and their simpler cousins, tile low-rank matrices, are well proportioned for early exascale computer architectures, which are provisioned for high processing power relative to memory capacity and memory bandwidth. They are ushering in a renaissance of computational linear algebra. A challenge is that emerging hardware architecture possesses hierarchies of its own that do not generally align with those of the algorithm. We describe modules of a software toolkit, hierarchical computations on manycore architectures, that illustrate these features and are intended as building blocks of applications, such as matrix-free higher-order methods in optimization and large-scale spatial statistics. Some modules of this open-source project have been adopted in the software libraries of major vendors. This article is part of a discussion meeting issue 'Numerical algorithms for high-performance computational science'.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Architectures for hierarchical and other block matching algorithms
    Gupta, G
    Chakrabarti, C
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1995, 5 (06) : 477 - 489
  • [2] Trends in algorithms for nonuniform applications on hierarchical distributed architectures
    Keyes, DE
    [J]. COMPUTATIONAL AEROSCIENCES IN THE 21ST CENTURY, 2000, 8 : 103 - 137
  • [3] QoS provision in hierarchical and non-hierarchical switch architectures
    Cano-Cano, Javier
    Andujar, Francisco J.
    Alfaro-Cortes, Francisco J.
    Sanchez, Jose L.
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 148 : 138 - 150
  • [4] ρ:: Hierarchical modeling of parallel architectures
    Weper, R
    Zehendner, E
    Erhard, W
    [J]. PROCEEDINGS OF THE SEVENTH EUROMICRO WORKSHOP ON PARALLEL AND DISTRIBUTED PROCESSING, PDP'99, 1999, : 233 - 240
  • [5] Towards an Understanding of Hierarchical Architectures
    Goerick, Christian
    [J]. IEEE TRANSACTIONS ON AUTONOMOUS MENTAL DEVELOPMENT, 2011, 3 (01) : 54 - 63
  • [6] Hierarchical Skills and Cognitive Architectures
    Langley, Pat
    Cummings, Kirstin
    Shapiro, Daniel
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 2004, : 779 - 784
  • [7] Hierarchical parallel algorithms for module placement of large chips on distributed memory architectures
    Yang, LTR
    [J]. DCABES 2002, PROCEEDING, 2002, : 47 - 51
  • [8] Swapped networks: Unifying the architectures and algorithms of a wide class of hierarchical parallel processors
    Yeh, CH
    Parhami, B
    [J]. 1996 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1996, : 230 - 237
  • [9] ALGORITHMS FOR HIERARCHICAL POWER
    DE, M
    HIPEL, KW
    KILGOUR, DM
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 1990, 39 (01) : 21 - 36
  • [10] COORDINATION IN HIERARCHICAL ALGORITHMS
    XINOGALAS, TC
    DASIGI, S
    SINGH, MG
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (03): : 397 - 406