An FPGA-Based Transformer Accelerator Using Output Block Stationary Dataflow for Object Recognition Applications

被引:6
|
作者
Zhao, Zhongyu [1 ,2 ]
Cao, Rujian [1 ,2 ]
Un, Ka-Fai [1 ,2 ]
Yu, Wei-Han [1 ,2 ]
Mak, Pui-In [1 ,2 ]
Martins, Rui P. [1 ,2 ,3 ]
机构
[1] Univ Macau, State Key Lab Analog & Mixed Signal VLSI, Inst Microelect, Macau, Peoples R China
[2] Univ Macau, Fac Sci & Technol ECE, Macau, Peoples R China
[3] Univ Lisbon, Inst Super Tecn, P-1649004 Lisbon, Portugal
关键词
Transformers; Energy efficiency; Broadcasting; Convolutional neural networks; Integrated circuit modeling; Field programmable gate arrays; Random access memory; Dataflow; digital accelerator; energy-efficient; field-programmable gate array (FPGA); energy efficiency; image recognition; transformer; CNN ACCELERATOR; EFFICIENT; HARDWARE;
D O I
10.1109/TCSII.2022.3196055
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The transformer-based model has great potential to deliver higher accuracy for object recognition applications when comparing it with the convolution neural network (CNN). Yet, the amount of weight sharing of a transformer-based model is significantly lower than that of the CNN, which should apply different dataflow to reduce the memory access. This brief proposes a transformer accelerator with an output block stationary (OBS) dataflow to minimize the repeated memory access by block-level and vector-level broadcasting while preserving a high digital signal processor (DSP) utilization rate, leading to higher energy efficiency. It also lowers the memory access bandwidth to the input and output. Verified through an FPGA, the proposed accelerator evaluates a transformer-in-transformer (TNT) model with a throughput of 728.3 GOPs, corresponding to energy efficiency of 58.31 GOPs/W.
引用
收藏
页码:281 / 285
页数:5
相关论文
共 50 条
  • [41] FPGA-based smart chair recognition system using flex sensors
    AbuTerkia, Ibrahim
    Hannoun, Mustafa
    Suwal, Bikal
    Ahmed, Md Sharif
    Sundaravdivel, Prabha
    [J]. PROCEEDINGS OF THE 2022 15TH IEEE DALLAS CIRCUITS AND SYSTEMS CONFERENCE (DCAS 2022), 2022,
  • [42] FPGA-based Convolutional Neural Network Accelerator design using High Level Synthesize
    Ghaffari, Sina
    Sharifian, Saeed
    [J]. 2016 2ND INTERNATIONAL CONFERENCE OF SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2016, : 29 - 34
  • [43] FPGA-Based Reconfigurable Convolutional Neural Network Accelerator Using Sparse and Convolutional Optimization
    Gowda, Kavitha Malali Vishveshwarappa
    Madhavan, Sowmya
    Rinaldi, Stefano
    Divakarachari, Parameshachari Bidare
    Atmakur, Anitha
    [J]. ELECTRONICS, 2022, 11 (10)
  • [44] Comparison of Different Deployment Approaches of FPGA-Based Hardware Accelerator for 3D Object Detection Models
    Pereira, Pedro
    Silva, Antonio Linhares
    Machado, Rui
    Silva, Joao
    Duraes, Dalila
    Machado, Jose
    Novais, Paulo
    Monteiro, Joao
    Melo-Pinto, Pedro
    Fernandes, Duarte
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2022, 2022, 13566 : 285 - 296
  • [45] FPGA-Based Selected Object Tracking Using LBP, HOG and Motion Detection
    Sledevic, Tomyslav
    Serackis, Arturas
    Plonis, Darius
    [J]. 2018 IEEE 6TH WORKSHOP ON ADVANCES IN INFORMATION, ELECTRONIC AND ELECTRICAL ENGINEERING (AIEEE), 2018,
  • [46] A Multistage Dataflow Implementation of a Deep Convolutional Neural Network Based on FPGA For High-Speed Object Recognition
    Li, Ning
    Takaki, Shunpei
    Tomioka, Yoichi
    Kitazawa, Hitoshi
    [J]. 2016 IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS AND INTERPRETATION (SSIAI), 2016, : 165 - 168
  • [47] Improving FPGA-based Async-logic AES Accelerator with the Integration of Sync-logic Block RAMs
    Ng, Jun-Sheng
    Chen, Juncheng
    Wu, Si
    Kyaw, Nay Aung
    Chong, Kwen-Siong
    Lin, Zhiping
    Gwee, Bah-Hwee
    [J]. 2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [48] A Cost-Efficient FPGA-Based CNN-Transformer Using Neural ODE
    Okubo, Ikumi
    Sugiura, Keisuke
    Matsutani, Hiroki
    [J]. IEEE Access, 2024, 12 : 155773 - 155788
  • [49] FPGA-Based Accelerator for Rank-Enhanced and Highly-Pruned Block-Circulant Neural Networks
    Song, Haena
    Yoon, Jongho
    Kim, Dohun
    Kwon, Eunji
    Oh, Tae-Hyun
    Kang, Seokhyeong
    [J]. 2023 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2023,
  • [50] An FPGA-Based Hardware Accelerator for Real-Time Block-Matching and 3D Filtering
    Wang, Dong
    Xu, Jia
    Xu, Ke
    [J]. IEEE ACCESS, 2020, 8 : 121987 - 121998