New data structures for matrices and specialized inner kernels: Low overhead for high performance

Cited by: 0
Author
Herrero, Jose R. [1]
Affiliation
[1] Univ Politecn Cataluna, Comp Architecture Dept, Barcelona, Spain
Keywords
specialized inner kernels; new data structures; dense linear algebra; low overhead; high performance;
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Dense linear algebra codes are often expressed and coded in terms of BLAS calls. This approach, however, achieves suboptimal performance due to the overheads associated with such calls. Taking as an example the dense Cholesky factorization of a symmetric positive definite matrix, we show that the potential of non-canonical data structures for dense linear algebra can be better exploited with the use of specialized inner kernels. The use of non-canonical data structures together with specialized inner kernels has low overhead and can produce excellent performance.
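
The idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a block-major layout (each square block of the lower triangle stored contiguously) as one possible non-canonical data structure, and it uses three small kernels that work directly on those blocks instead of calling general BLAS routines. All function names (`to_blocks`, `potrf_kernel`, `trsm_kernel`, `gemm_nt_kernel`, `blocked_cholesky`, `from_blocks`) are hypothetical, and the block size `nb` is passed as a parameter here, whereas a truly specialized kernel would typically fix it at compile time.

```python
import math


def to_blocks(a, nb):
    """Copy the lower triangle of a dense n x n matrix (list of lists) into
    nb x nb blocks, each stored contiguously as a flat row-major list."""
    n = len(a)
    assert n % nb == 0, "sketch assumes n is a multiple of nb"
    nblk = n // nb
    blocks = {}
    for bi in range(nblk):
        for bj in range(bi + 1):
            blocks[(bi, bj)] = [a[bi * nb + i][bj * nb + j]
                                for i in range(nb) for j in range(nb)]
    return blocks, nblk


def potrf_kernel(d, nb):
    """In-place Cholesky factorization of one diagonal block (lower factor)."""
    for j in range(nb):
        s = d[j * nb + j] - sum(d[j * nb + k] ** 2 for k in range(j))
        d[j * nb + j] = math.sqrt(s)
        for i in range(j + 1, nb):
            t = d[i * nb + j] - sum(d[i * nb + k] * d[j * nb + k] for k in range(j))
            d[i * nb + j] = t / d[j * nb + j]
        for i in range(j):
            d[i * nb + j] = 0.0  # clear the strictly upper part


def trsm_kernel(l, b, nb):
    """In-place triangular solve B := B * L^{-T} for one off-diagonal block."""
    for r in range(nb):
        for j in range(nb):
            s = b[r * nb + j] - sum(b[r * nb + k] * l[j * nb + k] for k in range(j))
            b[r * nb + j] = s / l[j * nb + j]


def gemm_nt_kernel(c, a, b, nb):
    """In-place update C := C - A * B^T on three contiguous blocks."""
    for i in range(nb):
        for j in range(nb):
            c[i * nb + j] -= sum(a[i * nb + k] * b[j * nb + k] for k in range(nb))


def blocked_cholesky(blocks, nblk, nb):
    """Right-looking blocked Cholesky operating on the block-major storage."""
    for k in range(nblk):
        potrf_kernel(blocks[(k, k)], nb)
        for i in range(k + 1, nblk):
            trsm_kernel(blocks[(k, k)], blocks[(i, k)], nb)
        for i in range(k + 1, nblk):
            for j in range(k + 1, i + 1):
                gemm_nt_kernel(blocks[(i, j)], blocks[(i, k)], blocks[(j, k)], nb)


def from_blocks(blocks, nblk, nb):
    """Expand the block storage back into a dense lower-triangular matrix."""
    n = nblk * nb
    dense = [[0.0] * n for _ in range(n)]
    for (bi, bj), blk in blocks.items():
        for i in range(nb):
            for j in range(nb):
                dense[bi * nb + i][bj * nb + j] = blk[i * nb + j]
    return dense
```

A possible use: call `to_blocks(a, nb)` on a symmetric positive definite matrix, run `blocked_cholesky`, and recover the factor with `from_blocks`. Because every kernel touches only contiguous blocks of a known size, there is no per-call argument checking or stride handling of the kind a general BLAS interface needs, which is the overhead the abstract refers to.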
Pages: 659 - 667
Page count: 9