High-Performance Placement Engine for Modern Large-Scale FPGAs With Heterogeneity and Clock Constraints

被引:2
|
作者
Zhu, Ziran [1 ]
Mei, Yangjie [1 ]
Deng, Kangkang [2 ]
He, Huan [3 ]
Chen, Jianli [4 ]
Yang, Jun [1 ]
Chang, Yao-Wen [5 ,6 ]
机构
[1] Southeast Univ, Natl ASIC Syst Engn Res Ctr, Sch Integrated Circuits, Nanjing 210096, Peoples R China
[2] Peking Univ, Beijing Int Ctr Math Res, Beijing 100871, Peoples R China
[3] Hangzhou Huawei Enterprises Telecommun Technol Co, Cent Hardware Planning & Architecture Design Dept, Hangzhou 310000, Peoples R China
[4] Fudan Univ, State Key Lab ASIC & Syst, Shanghai 200433, Peoples R China
[5] Natl Taiwan Univ, Grad Inst Elect Engn, Taipei 10617, Taiwan
[6] Natl Taiwan Univ, Dept Elect Engn, Taipei 10617, Taiwan
关键词
Clock constraint; field-programmable gate array (FPGA); packing; physical design; placement;
D O I
10.1109/TCAD.2023.3329774
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As field-programmable gate array (FPGA) architectures continue to evolve and become more complex, the heterogeneity and clock constraints imposed by modern FPGAs have posed significant challenges to FPGA placement. This article proposes a high-performance placement engine for modern large-scale FPGAs with heterogeneity and clock constraints. To improve efficiency and scalability, we develop a clustering method considering both internal/external connectivity and the balance of block types to build the hierarchy. In each hierarchy level, we propose a hybrid penalty and augmented Lagrangian method (HPALM) to convert the FPGA global placement with heterogeneity and clock constraints into a series of unconstrained optimization subproblems, then use the Adam method to solve each subproblem. In particular, we prove that the HPALM is globally convergent for global placement. Besides, a matching-based IP block legalization is developed to legalize the DSPs and RAMs, and a multistage packing is presented to cluster LUTs and FFs into HCLBs. Finally, we propose a history-based legalization to legalize CLBs in an FPGA, and a simulated-annealing-based detailed placement is presented to reduce the wirelength while maintaining legality. Compared with the state-of-the-art works, experimental results based on the ISPD 2017 contest benchmarks show that the proposed algorithm can achieve the shortest routed wirelength in a reasonable runtime.
引用
收藏
页码:956 / 969
页数:14
相关论文
共 50 条
  • [1] High-performance Placement for Large-scale Heterogeneous FPGAs with Clock Constraints
    Zhu, Ziran
    Mei, Yangjie
    Li, Zijun
    Lin, Jingwen
    Chen, Jianli
    Yang, Jun
    Chang, Yao-Wen
    PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 643 - 648
  • [2] A High-Performance Routing Engine for Large-Scale FPGAs
    Martin, Timothy
    Maarouf, Dani
    Grewal, Gary
    Areibi, Shawki
    2024 34TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL 2024, 2024, : 53 - 59
  • [3] Clock-Aware Placement for Large-Scale Heterogeneous FPGAs
    Chen, Jianli
    Lin, Zhifeng
    Kuo, Yun-Chih
    Huang, Chau-Chin
    Chang, Yao-Wen
    Chen, Shih-Chun
    Chiang, Chun-Han
    Kuo, Sy-Yen
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (12) : 5042 - 5055
  • [4] Clock-Aware Placement for Large-Scale Heterogeneous FPGAs
    Kuo, Yun-Chih
    Huang, Chau-Chin
    Chen, Shih-Chun
    Chiang, Chun-Han
    Chang, Yao-Wen
    Kuo, Sy-Yen
    2017 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2017, : 519 - 526
  • [5] Fast Placement for Large-scale Hierarchical FPGAs
    Dai, Hui
    Zhou, Qiang
    Cai, Yici
    Bian, Jinian
    Hong, Xianlong
    2009 11TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN AND COMPUTER GRAPHICS, PROCEEDINGS, 2009, : 190 - 194
  • [6] LSC: A Large-Scale Consensus-Based Clustering Algorithm for High-Performance FPGAs
    Singhal, Love
    Iyer, Mahesh A.
    Adya, Saurabh
    PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,
  • [7] UTPlaceF 2.0: A High-Performance Clock-Aware FPGA Placement Engine
    Li, Wuxi
    Lin, Yibo
    Li, Meng
    Dhar, Shounak
    Pan, David Z.
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2018, 23 (04)
  • [8] Efficient and Effective Packing and Analytical Placement for Large-Scale Heterogeneous FPGAs
    Chen, Yu-Chen
    Chen, Sheng-Yen
    Chang, Yao-Wen
    2014 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2014, : 647 - 654
  • [9] elfPlace: Electrostatics-Based Placement for Large-Scale Heterogeneous FPGAs
    Meng, Yibai
    Li, Wuxi
    Lin, Yibo
    Pan, David Z.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (01) : 155 - 168
  • [10] RippleFPGA: A Routability-Driven Placement for Large-Scale Heterogeneous FPGAs
    Pui, Chak-Wa
    Chen, Gengjie
    Chow, Wing-Kai
    Lam, Ka-Chun
    Kuang, Jian
    Tu, Peishan
    Zhang, Hang
    Young, Evangeline F. Y.
    Yu, Bei
    2016 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2016,