Multiloop Parallelisation Using Unrolling and Fission

Cited by: 3
Authors
Lam, Yuet Ming [1]
Coutinho, Jose Gabriel F. [2]
Ho, Chun Hok [2]
Leong, Philip Heng Wai [3]
Luk, Wayne [2]
Affiliations
[1] Macau Univ Sci & Technol, Fac Informat Technol, Taipa, Peoples R China
[2] Imperial Coll London, Dept Comp, London, England
[3] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW, Australia
Funding
Engineering and Physical Sciences Research Council (UK);
Keywords
All Open Access; Gold; Green;
DOI
10.1155/2010/475620
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
A technique for parallelising multiple loops in a heterogeneous computing system is presented. Loops are first unrolled and then split into multiple tasks, which are mapped to reconfigurable hardware. A performance-driven optimisation finds the best unrolling factor for each loop under hardware size constraints. The approach is demonstrated on three applications: speech recognition, image processing, and the N-Body problem. Experimental results show a maximum speedup of 34 times for the N-Body problem on a 274 MHz FPGA over a 2.6 GHz microprocessor, which is 4.1 times higher than that of an approach without unrolling.
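The two steps named in the abstract, unrolling followed by fission into tasks and then choosing an unrolling factor per loop under a size constraint, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the loop body, the function names, and in particular the cost model (each unrolled copy costs a fixed area, loops run concurrently, and the slowest loop sets the total time) are hypothetical and are not taken from the paper.

```python
from itertools import product


def original_loop(a, b):
    """Reference: one sequential loop whose iterations are independent."""
    out = [0] * len(a)
    for i in range(len(a)):
        out[i] = a[i] * b[i] + 1
    return out


def unrolled_and_fissioned(a, b, factor):
    """Unroll by `factor`, then split the unrolled body into `factor` tasks.

    Task k handles iterations k, k + factor, k + 2*factor, ...
    The tasks are independent, so hardware could run them in parallel;
    here they are executed one after another for illustration only.
    """
    n = len(a)
    out = [0] * n

    def task(k):
        for i in range(k, n, factor):
            out[i] = a[i] * b[i] + 1

    for k in range(factor):
        task(k)
    return out


def best_unroll_factors(loops, area_budget, max_factor=8):
    """Exhaustively pick one unrolling factor per loop under an area budget.

    `loops` is a list of (iterations, cycles_per_iteration, area_per_copy).
    Hypothetical cost model (an assumption, not the paper's): a loop
    unrolled by f needs f hardware copies, loops run concurrently, and
    total time is that of the slowest loop.
    Returns (time, area, factors), or None if nothing fits the budget.
    """
    best = None
    for factors in product(range(1, max_factor + 1), repeat=len(loops)):
        area = sum(f * loop[2] for f, loop in zip(factors, loops))
        if area > area_budget:
            continue
        # -(-x // f) is ceiling division: iterations each copy must run.
        time = max(-(-loop[0] // f) * loop[1] for f, loop in zip(factors, loops))
        if best is None or (time, area) < (best[0], best[1]):
            best = (time, area, factors)
    return best
```

Calling `best_unroll_factors([(100, 2, 10), (50, 1, 20)], area_budget=60, max_factor=4)` searches all factor pairs that fit in 60 area units. Exhaustive search is only practical for a few loops and small factors; it is used here because it makes the constraint and the objective easy to see.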
Pages: 10