A GPU-based tabu search for very large hardware/software partitioning with limited resource usage

被引:6
|
作者
Hou, Neng [1 ]
He, Fazhi [1 ,2 ]
Zhou, Yi [3 ]
Ai, Haojun [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, State Key Lab Software Engn, Wuhan 430072, Hubei, Peoples R China
[2] State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Hubei, Peoples R China
[3] Wuhan Univ Sci & Technol, Sch Informat Sci & Engn, Wuhan 430081, Hubei, Peoples R China
基金
美国国家科学基金会;
关键词
Hardware/software co-design; Hardware/software partitioning; GPU-based tabu search; GPU resource-limitation; Time-space tradeoff; ALGORITHMIC ASPECTS; OPTIMIZATION; TRACKING; DESIGN;
D O I
10.1299/jamdsm.2017jamdsm0060
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In hardware/software (HW/SW) co-design, HW/SW partitioning is the most important step since it determines which components are implemented in hardware and which are implemented in software. Since most of HW/SW partitioning problems are NP hard, heuristic methods have to be utilized to solve them, especially for the large size problems. GPU-based heuristic methods to accelerate HW/SW co-design are a promising way to reduce run time. However, the existing methods cannot deal with very large embedded applications because of GPU resource limitations. This paper presents a method to overcome the GPU resource limitations for very large partitioning while keeping a reasonable runtime. First, at the stage of computing the costs of the candidates, we propose a fast method of 2-flipping computing for very large HW/SW co-design. Our method is also general and can deal with both odd and even numbers of nodes. More importantly, our method avoids utilizing double-precision arithmetic units, which are scarce resources in GPU architecture. Second, since the GPU is constrained by memory limitations and the costs of candidates cannot be directly stored in the GPU's global memory, we present a time-space tradeoff strategy to break memory limitations for very large HW/SW partitioning. In this way, the following steps can be run under the constraint of GPU's memory limitations. Third, an in-place removal of infeasible solutions is proposed to reduce the overhead of global memory by half when the neighborhood is compacted. Fourth, when evaluating the tabu status of feasible candidates, we present a bitwise representation of tabu status to minimize the transfer overhead. Finally, we conduct a number of experiments. The results show that the proposed 2-flipping method of single precision data types works well. The results also demonstrate that the proposed approach expands the number of nodes of the task graph from 10,000 to 30,000 under the limitation of the GPU's global memory of 6 GB. The correlations between compression intensity and solution quality are analyzed to ensure the fairness and soundness of our method. Our work is general and can provide guidance for other applications.
引用
收藏
页数:18
相关论文
共 17 条
  • [1] An efficient GPU-based parallel tabu search algorithm for hardware/software co-design
    Hou, Neng
    He, Fazhi
    Zhou, Yi
    Chen, Yilin
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2020, 14 (05)
  • [2] An efficient GPU-based parallel tabu search algorithm for hardware/software co-design
    Neng Hou
    Fazhi He
    Yi Zhou
    Yilin Chen
    [J]. Frontiers of Computer Science, 2020, 14
  • [3] A metaheuristic based on the tabu search for hardware-software partitioning
    Jemai, Mehdi
    Dimassi, Sonia
    Ouni, Bouraoui
    Mtibaa, Abdellatif
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (02) : 901 - 912
  • [4] A Tabu Search-Based Memetic Algorithm for Hardware/Software Partitioning
    Lin, Geng
    Zhu, Wenxing
    Ali, M. Montaz
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [5] Efficient heuristic and tabu search for hardware/software partitioning
    Jigang Wu
    Pu Wang
    Siew-Kei Lam
    Thambipillai Srikanthan
    [J]. The Journal of Supercomputing, 2013, 66 : 118 - 134
  • [6] Efficient heuristic and tabu search for hardware/software partitioning
    Wu, Jigang
    Wang, Pu
    Lam, Siew-Kei
    Srikanthan, Thambipillai
    [J]. JOURNAL OF SUPERCOMPUTING, 2013, 66 (01): : 118 - 134
  • [7] System level hardware/software partitioning based on simulated annealing and tabu search
    Eles, P
    Peng, Z
    Kuchcinski, K
    Doboli, A
    [J]. DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 1997, 2 (01) : 5 - 32
  • [8] HARDWARE/SOFTWARE PARTITIONING ALGORITHM BASED ON THE COMBINATION OF GENETIC ALGORITHM AND TABU SEARCH
    Li, G.
    Feng, J.
    Wang, C.
    Wang, J.
    [J]. ENGINEERING REVIEW, 2014, 34 (02) : 151 - 160
  • [9] System Level Hardware/Software Partitioning Based on Simulated Annealing and Tabu Search
    Petru Eles
    Zebo Peng
    Krzysztof Kuchcinski
    Alexa Doboli
    [J]. Design Automation for Embedded Systems, 1997, 2 : 5 - 32
  • [10] Tabu search with intensification strategy for functional partitioning in hardware-software codesign
    Wiangtong, T
    Cheung, PYK
    Luk, W
    [J]. 10TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2002, : 297 - 298