DARIC: A Data Reuse-Friendly CGRA for Parallel Data Access via Elastic FIFOs

被引:1
|
作者
Liu, Dajiang [1 ]
Mou, Di [1 ]
Zhu, Rong [1 ]
Zhuang, Yan [1 ]
Shang, Jiaxing [1 ]
Zhong, Jiang [1 ]
Yin, Shouyi [2 ]
机构
[1] Chongqing Univ, Coll Comp Sci, Chongqing 400044, Peoples R China
[2] Tsinghua Univ, Sch Integrated Circuits, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
CGRA; Data Reuse; Parallel Data Access; FIFO;
D O I
10.1109/DAC56929.2023.10247862
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Coarse-Grained Reconfigurable Arrays (CGRAs) are a promising architecture for data-intensive applications. For parallel data accesses, uniform memory partitioning is usually introduced to CGRA for better pipelining performance. However, uniform memory partitioning not only suffers from a local minimum, but also introduces non-negligible overhead for banking function, which may greatly degrade the performance of CGRA. To this end, this paper introduces non-uniform memory partitioning and proposes a data-reuse-friendly CGRA (DARIC). With well elaborated configurable bank groups cooperated with register chains, elastic FIFOs can be achieved for non-uniform memory partitioning. Based on the resource graph of DARIC, a mapping algorithm supporting path sharing is proposed. Finally, the experimental results show that DARIC can achieve 2.35 x throughput and 2.59 x energy efficiency while having even less area and power overhead, as compared to the state-of-the-art.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Efficient Memory Partitioning for Parallel Data Access via Data Reuse
    Su, Jincheng
    Yang, Fan
    Zeng, Xuan
    Zhou, Dian
    [J]. PROCEEDINGS OF THE 2016 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'16), 2016, : 138 - 147
  • [2] Efficient Memory Partitioning for Parallel Data Access in FPGA via Data Reuse
    Su, Jincheng
    Yang, Fan
    Zeng, Xuan
    Zhou, Dian
    Chen, Jie
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2017, 36 (10) : 1674 - 1687
  • [3] CASCADE: High Throughput Data Streaming via Decoupled Access-Execute CGRA
    Wijerathne, Dhananjaya
    Li, Zhaoying
    Karunarathne, Manupa
    Pathania, Anuj
    Mitra, Tulika
    [J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2019, 18 (05)
  • [4] An Efficient Memory Partitioning Approach for Multi-Pattern Data Access via Data Reuse
    Li, Wensong
    Yang, Fan
    Zhu, Hengliang
    Zeng, Xuan
    Zhou, Dian
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2019, 12 (01)
  • [5] Access to and reuse of public data: open data and open government
    Ferrer-Sapena, Antonia
    Peset, Fernanda
    Aleixandre-Benavent, Rafael
    [J]. PROFESIONAL DE LA INFORMACION, 2011, 20 (03): : 260 - 269
  • [6] Optimizing Data Reuse for CGRA Mapping Using Polyhedral-based Loop Transformations
    Huang, Liao
    Liu, Dajiang
    [J]. 2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [7] Parallel Data Reuse Theory for OpenMP Applications
    Wu, Junjie
    Pan, Xiaohui
    Liu, Guanghui
    Zhan, Baida
    Yang, Xuejun
    [J]. SNPD 2009: 10TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCES, NETWORKING AND PARALLEL DISTRIBUTED COMPUTING, PROCEEDINGS, 2009, : 516 - 523
  • [8] An Efficient Data Reuse Strategy for Multi-Pattern Data Access
    Li, Wensong
    Yang, Fan
    Zhu, Hengliang
    Zeng, Xuan
    Zhou, Dian
    [J]. 2018 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD) DIGEST OF TECHNICAL PAPERS, 2018,
  • [9] Optimize Parallel Data Access in Big Data Processing
    Yin, Jiangling
    Wang, Jun
    [J]. 2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 721 - 724
  • [10] ON ACCESS AND ALIGNMENT OF DATA IN A PARALLEL PROCESSOR
    LEE, DL
    [J]. INFORMATION PROCESSING LETTERS, 1989, 33 (01) : 11 - 14