Multiloop Parallelisation Using Unrolling and Fission

Cited by: 3
Authors
Lam, Yuet Ming [1]
Coutinho, Jose Gabriel F. [2]
Ho, Chun Hok [2]
Leong, Philip Heng Wai [3]
Luk, Wayne [2]
Affiliations
[1] Macau Univ Sci & Technol, Fac Informat Technol, Taipa, Peoples R China
[2] Imperial Coll London, Dept Comp, London, England
[3] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW, Australia
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
All Open Access; Gold; Green;
DOI
10.1155/2010/475620
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
A technique for parallelising multiple loops in a heterogeneous computing system is presented. Loops are first unrolled and then broken up into multiple tasks, which are mapped to reconfigurable hardware. A performance-driven optimisation is applied to find the best unrolling factor for each loop under hardware size constraints. The approach is demonstrated using three applications: speech recognition, image processing, and the N-Body problem. Experimental results show a maximum speedup of 34 times for the N-Body problem on a 274 MHz FPGA over a 2.6 GHz microprocessor, which is 4.1 times higher than the speedup of an approach without unrolling.
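The unroll-then-split idea in the abstract can be illustrated with a minimal software sketch. This is not the authors' implementation: Python threads stand in for hardware tasks, and the names `unrolled_and_split` and `best_factor`, the loop body, and the area and cycle cost model are all hypothetical, chosen only to show the shape of the technique.

```python
from concurrent.futures import ThreadPoolExecutor


def original_loop(a, b):
    """Reference loop: one element per iteration."""
    out = [0] * len(a)
    for i in range(len(a)):
        out[i] = a[i] * b[i] + 1
    return out


def make_task(a, b, out, lane, factor):
    """One task produced by fission: handles iterations lane, lane + factor, ..."""
    def task():
        for i in range(lane, len(a), factor):
            out[i] = a[i] * b[i] + 1
    return task


def unrolled_and_split(a, b, factor):
    """Unroll by `factor`, then split the copies of the body into independent tasks."""
    out = [0] * len(a)
    tasks = [make_task(a, b, out, lane, factor) for lane in range(factor)]
    # Threads are a stand-in for tasks mapped to hardware (assumption).
    with ThreadPoolExecutor(max_workers=factor) as pool:
        futures = [pool.submit(t) for t in tasks]
        for f in futures:
            f.result()  # re-raise any exception from a task
    return out


def best_factor(n_iters, area_per_copy, area_budget, max_factor):
    """Toy search: smallest estimated cycle count that fits the area budget.

    Assumed cost model (hypothetical): area grows linearly with the factor,
    and cycles equal ceil(n_iters / factor).
    """
    best, best_cycles = 1, n_iters
    for f in range(1, max_factor + 1):
        if f * area_per_copy > area_budget:
            break  # larger factors cannot fit either under this model
        cycles = -(-n_iters // f)  # ceiling division
        if cycles < best_cycles:
            best, best_cycles = f, cycles
    return best
```

Each task writes to a disjoint set of output indices, so the tasks have no dependencies between them, which is the property that makes the split safe in this sketch. A real optimisation would replace the toy cost model with measured resource and timing estimates for each loop.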
Pages: 10