A Basic Linear Algebra Compiler for Embedded Processors

被引:0
|
作者
Kyrtatas, Nikolaos [1 ]
Spampinato, Daniele G. [1 ]
Pueschel, Markus [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, Zurich, Switzerland
来源
2015 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE) | 2015年
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Many applications in signal processing, control, and graphics on embedded devices require efficient linear algebra computations. On general-purpose computers, program generators have proven useful to produce such code, or important building blocks, automatically. An example is LGen, a compiler for basic linear algebra computations of fixed size. In this work, we extend LGen towards the embedded domain using as example targets Intel Atom, ARM Cortex-A8, ARM Cortex-A9, and ARM1176 (Raspberry Pi). To efficiently support these processors we introduce support for the NEON vector ISA and a methodology for domain-specific load/store optimizations. Our experimental evaluation shows that the new version of LGen produces code that performs in many cases considerably better than well-established, commercial and non-commercial libraries (Intel MKL and IPP), software generators (Eigen and ATLAS), and compilers (icc, gcc, and clang).
引用
收藏
页码:1054 / 1059
页数:6
相关论文
共 50 条
  • [41] Basic Linear Algebra Operations on TensorCore GPU
    Zhang, Shaoshuai
    Karihaloo, Vivek
    Wu, Panruo
    PROCEEDINGS OF SCALA 2020: 11TH WORKSHOP ON LATEST ADVANCES IN SCALABLE ALGORITHMS FOR LARGE-SCALE SYSTEMS, 2020, : 44 - 52
  • [42] Compiler-assisted leakage-aware loop scheduling for embedded VLIW DSP processors
    Wang, Meng
    Wang, Yi
    Liu, Duo
    Qin, Zhiwei
    Shao, Zili
    JOURNAL OF SYSTEMS AND SOFTWARE, 2010, 83 (05) : 772 - 785
  • [43] OPTICAL LINEAR ALGEBRA PROCESSORS - NOISE AND ERROR-SOURCE MODELING
    CASASENT, D
    GHOSH, A
    OPTICS LETTERS, 1985, 10 (06) : 252 - 254
  • [44] BIPOLAR BLASING IN HIGH-ACCURACY OPTICAL LINEAR ALGEBRA PROCESSORS
    CASASENT, D
    PERLEE, C
    APPLIED OPTICS, 1986, 25 (07): : 1033 - 1035
  • [45] REALIZATION OF CONJUGATE-GRADIENT ALGORITHM ON OPTICAL LINEAR ALGEBRA PROCESSORS
    GHOSH, AK
    APPLIED OPTICS, 1987, 26 (04): : 611 - 613
  • [46] Compiler Optimization for Superscalar and Pipelined Processors
    Bharadwaj, Vishnu P.
    Rao, Mahesh
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING, VLSI, ELECTRICAL CIRCUITS AND ROBOTICS (DISCOVER), 2016, : 232 - 236
  • [47] Compiler optimizations for processors with SIMD instructions
    Pryanishnikov, Ivan
    Krall, Andreas
    Horspool, Nigel
    SOFTWARE-PRACTICE & EXPERIENCE, 2007, 37 (01): : 93 - 113
  • [48] COMP: Compiler Optimizations for Manycore Processors
    Song, Linhai
    Feng, Min
    Ravi, Nishkam
    Yang, Yi
    Chakradhar, Srimat
    2014 47TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2014, : 659 - 671
  • [49] SyCAP - Synthesis compiler for applicationspecific processors
    Schoppa, I.
    Gremzow, C.
    Menge, M.
    ITG-Fachbericht, 2001, (164):
  • [50] Compiler scheduling for STA-processors
    Cichon, G
    Robelly, P
    Seidel, H
    Bronzel, M
    Fettweis, G
    INTERNATIONAL CONFERENCE ON PARALLEL COMPUTING IN ELECTRICAL ENGINEERING, 2004, : 45 - 50