Exploring Memory Access Techniques for Efficient FPGA based 3D CNN Accelerator Design

被引:0
|
作者
Khan, Fatima Hameed [1 ]
Pasha, Muhammad Adeel [1 ]
Masud, Shahid [1 ]
机构
[1] Lahore Univ Management Sci LUMS, Dept Elect Engn, Lahore, Pakistan
关键词
3D CNNs; Parameterized Hardware Accelerator; FPGA; Memory Access Optimization; Systolic Architecture;
D O I
10.1109/AICAS59952.2024.10595963
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advancements in 3D Convolutional Neural Network (CNN) architectures have demonstrated superior performance across diverse computer vision tasks, albeit with a trade-off of intense computational and memory demands. Thus, the tiling of incoming data becomes mandatory for 3D CNN acceleration in memory-constrained platforms such as Field Programmable Gate Arrays (FPGA). In this paper, different memory access techniques are explored to reduce the data traffic between on- chip and off-chip memories during the inference stage of a 3D CNN. The most suitable data traffic mode is identified by considering multiple parameters like latency, on-chip memory utilization and off-chip memory access. A parameterized and modular design approach for 3D CNNs has been implemented on an FPGA, where the input and weight data mapping modules are designed to minimize the onchip memory requirements. These modules are parameterized for variable tiling sizes and different memory access modes while the main computation is performed on a systolic-array-based pipelined architecture. The experiments conducted on three widely adopted 3D networks, I3D, C3D, and R(2+1)D, have shown 16%, 28%, and 10% improvement in latency respectively. The proposed methodology also results in a lower energy dissipation profile.
引用
收藏
页码:218 / 222
页数:5
相关论文
共 50 条
  • [21] On-chip Memory Efficient Data Layout for 2D FFT on 3D Memory Integrated FPGA
    Singapura, Shreyas G.
    Kannan, Rajgopal
    Prasanna, Viktor K.
    2016 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2016,
  • [22] Toward Efficient Co-Design of CNN Quantization and HW Architecture on FPGA Hybrid-Accelerator
    Zhang, Yiran
    Li, Guiying
    Yuan, Bo
    2024 INTERNATIONAL SYMPOSIUM OF ELECTRONICS DESIGN AUTOMATION, ISEDA 2024, 2024, : 678 - 683
  • [23] Memory-Centric Fusion-based CNN Accelerator with 3D-NoC and 3D-DRAM
    Lu, Wei
    Ge, Pei-Yu
    Huang, Po-Tsang
    Chen, Hung-Ming
    Hwang, Wei
    2022 19TH INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2022, : 169 - 170
  • [24] GP3D: 3D NAND Based In-Memory Graph Processing Accelerator
    Shim, Wonbo
    Yu, Shimeng
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2022, 12 (02) : 500 - 507
  • [25] Exploring alternative 3D FPGA architectures: Design methodology and cad tool support
    Siozios, K.
    Sotiriadis, K.
    Pavlidis, V. F.
    Soudris, D.
    2007 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 2007, : 652 - 655
  • [26] Exploring potential benefits of 3D FPGA integration
    Ababei, C
    Maidee, P
    Bazargan, K
    FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, 2004, 3203 : 874 - 880
  • [27] An Efficient Accelerator Based on Lightweight Deformable 3D-CNN for Video Super-Resolution
    Zhang, Siyu
    Mao, Wendong
    Wang, Zhongfeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (06) : 2384 - 2397
  • [28] VEA: An FPGA-Based Voxel Encoding Accelerator for 3D Object Detection with LiDAR
    Li, Xin
    Ren, Ao
    Tan, Yujuan
    Li, Xusheng
    Huang, Zhetong
    Wang, Chengliang
    Chen, Xianzhang
    Liu, Duo
    2022 IEEE 40TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2022), 2022, : 509 - 516
  • [29] Design of 3D FFTs with FPGA Clusters
    Sheng, Jiayi
    Humphries, Ben
    Zhang, Hansen
    Herbordt, Martin C.
    2014 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2014,
  • [30] Optimized FPGA-based Deep Learning Accelerator for Sparse CNN using High Bandwidth Memory
    Jiang, Chao
    Ojika, David
    Patel, Bhavesh
    Lam, Herman
    2021 IEEE 29TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2021), 2021, : 157 - 164