Minimal Unroll Factor for Code Generation of Software Pipelining

被引:0
|
作者
Mounira Bachir
Sid-Ahmed-Ali Touati
Frederic Brault
David Gregg
Albert Cohen
机构
[1] University of Nice Sophia-Antipolis,
关键词
Periodic register allocation; Software pipelining; Code generation; Instruction level parallelism; Embedded systems; Compilation;
D O I
暂无
中图分类号
学科分类号
摘要
We address the problem of generating compact code from software pipelined loops. Although software pipelining is a powerful technique to extract fine-grain parallelism, it generates lifetime intervals spanning multiple loop iterations. These intervals require periodic register allocation (also called variable expansion), which in turn yields a code generation challenge. We are looking for the minimal unrolling factor enabling the periodic register allocation of software pipelined kernels. This challenge is generally addressed through one of: (1) hardware support in the form of rotating register files, which solve the unrolling problem but are expensive in hardware; (2) register renaming by inserting register moves, which increase the number of operations in the loop, and may damage the schedule of the software pipeline and reduce throughput; (3) post-pass loop unrolling that does not compromise throughput but often leads to impractical code growth. The latter approach relies on the proof that MAXLIVE registers (maximal number of values simultaneously alive) are sufficient for periodic register allocation (Eisenbeis et al. in PACT ’95: Proceedings of the IFIP WG10.3 working conference on Parallel Architectures and Compilation Techniques, pages 264–267, Manchester, UK, 1995; Hendren et al. in CC ’92: Proceedings of the 4th International Conference on Compiler Construction, pages 176–191, London, UK, 1992). However, the best existing heuristic for controlling this code growth—modulo variable expansion (Lam in SIGPLAN Not 23(7):318–328, 1988)—may not apply the correct amount of loop unrolling to guarantee that MAXLIVE registers are enough, which may result in register spills Eisenbeis et al. in PACT ’95: Proceedings of the IFIP WG10.3 working conference on Parallel Architectures and Compilation Techniques, pages 264–267, Manchester, UK, 1995. This paper presents our research results on the open problem of minimal loop unrolling, allowing a software-only code generation that does not trade the optimality of the initiation interval (II) for the compactness of the generated code. Our novel idea is to use the remaining free registers after periodic register allocation to relax the constraints on register reuse. The problem of minimal loop unrolling arises either before or after software pipelining, either with a single or with multiple register types (classes). We provide a formal problem definition for each scenario, and we propose and study a dedicated algorithm for each problem. Our solutions are implemented within an industrial-strength compiler for a VLIW embedded processor from STMicroelectronics, and validated on multiple benchmarks suites.
引用
收藏
页码:1 / 58
页数:57
相关论文
共 50 条
  • [21] Software de-pipelining technique
    Su, BG
    Wang, J
    Hu, EW
    Manzano, J
    [J]. FOURTH IEEE INTERNATIONAL WORKSHOP ON SOURCE CODE ANALYSIS AND MANIPULATION, PROCEEDINGS, 2004, : 7 - 16
  • [22] Register-sensitive software pipelining
    Dani, AK
    Ramanan, VJ
    Govindarajan, R
    [J]. FIRST MERGED INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, 1998, : 194 - 198
  • [23] Flexible data dependence and software pipelining
    Rong, H.
    Tang, Z.
    [J]. Ruan Jian Xue Bao/Journal of Software, 2001, 12 (06): : 894 - 906
  • [24] Software Pipelining for Quantum Loop Programs
    Guo, Jingzhe
    Ying, Mingsheng
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (04) : 2815 - 2828
  • [25] Software pipelining with cache profiling information
    Key Laboratory of Computer System and Architecture, Chinese Academy of Sciences, Beijing 100190, China
    [J]. Jisuanji Yanjiu yu Fazhan, 2008, 5 (834-840):
  • [26] Optimizing an Architecture with Software Pipelining Strategies
    Nunes, Denis F. L.
    Fernandes, Silvio R.
    Kreutz, Marcio E.
    [J]. 2019 IFIP/IEEE 27TH INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2019, : 299 - 304
  • [27] AN EXTENDED SCHEDULING TECHNIQUE FOR SOFTWARE PIPELINING
    LU, DG
    BISWAS, P
    [J]. MICROPROCESSING AND MICROPROGRAMMING, 1993, 37 (1-5): : 99 - 104
  • [28] Reducing register pressure in software pipelining
    Universitat Politecnica de Catalunya, Barcelona, Spain
    [J]. J Inf Sci Eng, 1 (265-279):
  • [29] Reducing register pressure in software pipelining
    Sanchez, F
    Cortadella, J
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 1998, 14 (01) : 265 - 279
  • [30] Software pipelining loops with conditional branches
    Stoodley, MG
    Lee, CG
    [J]. PROCEEDINGS OF THE 29TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE - MICRO-29, 1996, : 262 - 273