New data structures for matrices and specialized inner kernels: Low overhead for high performance

Authors
Herrero, Jose R. [1]
Affiliations
[1] Univ Politecn Cataluna, Comp Architecture Dept, Barcelona, Spain
Keywords
specialized inner kernels; new data structures; dense linear algebra; low overhead; high performance;
DOI
Not available
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
Dense linear algebra codes are often expressed and coded in terms of BLAS calls. This approach, however, achieves suboptimal performance due to the overheads associated with such calls. Taking as an example the dense Cholesky factorization of a symmetric positive definite matrix, we show that the potential of non-canonical data structures for dense linear algebra can be better exploited with the use of specialized inner kernels. The use of non-canonical data structures together with specialized inner kernels has low overhead and can produce excellent performance.
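The idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the square-block layout, the block size `NB`, and all function names below are assumptions chosen for illustration, and pure Python shows only the structure, not the performance. The lower triangle is packed into contiguous fixed-size blocks (a non-canonical layout), and each block operation is handled by a small kernel written for exactly that block size instead of a general BLAS call.

```python
import math

NB = 2  # fixed block size (hypothetical); kernels are written for exactly NB x NB

def to_blocks(a, n):
    """Pack the lower block triangle of a dense n x n matrix (list of rows)
    into contiguous NB x NB blocks, each stored row-major in a flat list."""
    nb = n // NB
    return {(i, j): [float(a[i * NB + r][j * NB + c])
                     for r in range(NB) for c in range(NB)]
            for i in range(nb) for j in range(i + 1)}

def potrf_kernel(d):
    """In-place Cholesky of one diagonal block; leaves the lower factor."""
    for j in range(NB):
        s = d[j * NB + j] - sum(d[j * NB + k] ** 2 for k in range(j))
        d[j * NB + j] = math.sqrt(s)
        for i in range(j + 1, NB):
            t = d[i * NB + j] - sum(d[i * NB + k] * d[j * NB + k] for k in range(j))
            d[i * NB + j] = t / d[j * NB + j]
        for i in range(j):
            d[i * NB + j] = 0.0  # clear the strict upper part of the block

def trsm_kernel(l, b):
    """In place, overwrite b with X such that X * l^T = b (l lower triangular)."""
    for r in range(NB):
        for j in range(NB):
            t = b[r * NB + j] - sum(b[r * NB + k] * l[j * NB + k] for k in range(j))
            b[r * NB + j] = t / l[j * NB + j]

def gemm_kernel(c, a, b):
    """In place, c -= a * b^T for NB x NB blocks."""
    for r in range(NB):
        for s in range(NB):
            c[r * NB + s] -= sum(a[r * NB + k] * b[s * NB + k] for k in range(NB))

def cholesky_blocked(blocks, nb):
    """Right-looking blocked Cholesky over the packed lower block triangle."""
    for k in range(nb):
        potrf_kernel(blocks[(k, k)])
        for i in range(k + 1, nb):
            trsm_kernel(blocks[(k, k)], blocks[(i, k)])
        for i in range(k + 1, nb):
            for j in range(k + 1, i + 1):
                gemm_kernel(blocks[(i, j)], blocks[(i, k)], blocks[(j, k)])

# Usage: factor a 4 x 4 symmetric positive definite matrix.
A = [[4, 2, 0, 2],
     [2, 10, 3, 1],
     [0, 3, 5, 2],
     [2, 1, 2, 3]]
blocks = to_blocks(A, 4)
cholesky_blocked(blocks, 4 // NB)
```

Because every kernel knows the block size and layout in advance, a compiled version of such kernels could avoid the parameter checking and generality of a library call, which is the kind of overhead the abstract refers to.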
Pages: 659-667
Number of pages: 9