A GPU-based tabu search for very large hardware/software partitioning with limited resource usage

被引:6
|
作者
Hou, Neng [1 ]
He, Fazhi [1 ,2 ]
Zhou, Yi [3 ]
Ai, Haojun [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, State Key Lab Software Engn, Wuhan 430072, Hubei, Peoples R China
[2] State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Hubei, Peoples R China
[3] Wuhan Univ Sci & Technol, Sch Informat Sci & Engn, Wuhan 430081, Hubei, Peoples R China
基金
美国国家科学基金会;
关键词
Hardware/software co-design; Hardware/software partitioning; GPU-based tabu search; GPU resource-limitation; Time-space tradeoff; ALGORITHMIC ASPECTS; OPTIMIZATION; TRACKING; DESIGN;
D O I
10.1299/jamdsm.2017jamdsm0060
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In hardware/software (HW/SW) co-design, HW/SW partitioning is the most important step since it determines which components are implemented in hardware and which are implemented in software. Since most of HW/SW partitioning problems are NP hard, heuristic methods have to be utilized to solve them, especially for the large size problems. GPU-based heuristic methods to accelerate HW/SW co-design are a promising way to reduce run time. However, the existing methods cannot deal with very large embedded applications because of GPU resource limitations. This paper presents a method to overcome the GPU resource limitations for very large partitioning while keeping a reasonable runtime. First, at the stage of computing the costs of the candidates, we propose a fast method of 2-flipping computing for very large HW/SW co-design. Our method is also general and can deal with both odd and even numbers of nodes. More importantly, our method avoids utilizing double-precision arithmetic units, which are scarce resources in GPU architecture. Second, since the GPU is constrained by memory limitations and the costs of candidates cannot be directly stored in the GPU's global memory, we present a time-space tradeoff strategy to break memory limitations for very large HW/SW partitioning. In this way, the following steps can be run under the constraint of GPU's memory limitations. Third, an in-place removal of infeasible solutions is proposed to reduce the overhead of global memory by half when the neighborhood is compacted. Fourth, when evaluating the tabu status of feasible candidates, we present a bitwise representation of tabu status to minimize the transfer overhead. Finally, we conduct a number of experiments. The results show that the proposed 2-flipping method of single precision data types works well. The results also demonstrate that the proposed approach expands the number of nodes of the task graph from 10,000 to 30,000 under the limitation of the GPU's global memory of 6 GB. The correlations between compression intensity and solution quality are analyzed to ensure the fairness and soundness of our method. Our work is general and can provide guidance for other applications.
引用
收藏
页数:18
相关论文
共 18 条
  • [11] Tabu search with intensification strategy for functional partitioning in hardware-software codesign
    Wiangtong, T
    Cheung, PYK
    Luk, W
    [J]. 10TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2002, : 297 - 298
  • [12] GA-based Algorithm for Hardware/Software Partitioning with Resource Contentions
    Dou, Shuang
    Ding, Shan
    Zhang, Shi
    Zhu, Liucun
    [J]. 2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 1, 2010, : 68 - 72
  • [13] GPUTucker: Large-Scale GPU-Based Tucker Decomposition Using Tensor Partitioning
    Lee, Jihye
    Han, Donghyoung
    Kwon, Oh-Kyoung
    Chon, Kang-Wook
    Kim, Min-Soo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [14] A hovering swarm particle swarm optimization algorithm based on node resource attributes for hardware/software partitioning
    Shao Deng
    Shanzhu Xiao
    Qiuqun Deng
    Huanzhang Lu
    [J]. The Journal of Supercomputing, 2024, 80 : 4625 - 4647
  • [15] A hovering swarm particle swarm optimization algorithm based on node resource attributes for hardware/software partitioning
    Deng, Shao
    Xiao, Shanzhu
    Deng, Qiuqun
    Lu, Huanzhang
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (04): : 4625 - 4647
  • [16] A new tabu search-based hyper-heuristic algorithm for solving construction leveling problems with limited resource availabilities
    Koulinas, G. K.
    Anagnostopoulos, K. P.
    [J]. AUTOMATION IN CONSTRUCTION, 2013, 31 : 169 - 175
  • [17] A Discrete Multi-Objective Optimization Method for Hardware/Software Partitioning Problem Based on Cuckoo Search and Elite Strategy
    Xiong, Wei
    Guo, Bing
    Shen, Yan
    Zhang, Wenli
    [J]. NEUROQUANTOLOGY, 2018, 16 (05) : 749 - 756
  • [18] Solving Clique Covering in Very Large Sparse Random Graphs by a Technique Based on k-Fixed Coloring Tabu Search
    Chalupa, David
    [J]. EVOLUTIONARY COMPUTATION IN COMBINATORIAL OPTIMIZATION (EVOCOP 2013), 2013, 7832 : 238 - 249