Hardware Architecture Exploration for Deep Neural Networks

被引:0
|
作者
Wenqi Zheng
Yangyi Zhao
Yunfan Chen
Jinhong Park
Hyunchul Shin
机构
[1] Hanyang University,Department of Electrical Engineering
[2] Samsung Electronics Inc,undefined
关键词
AI architecture; Neural network architecture; CNN; Design space exploration;
D O I
暂无
中图分类号
学科分类号
摘要
Owing to good performance, deep Convolution Neural Networks (CNNs) are rapidly rising in popularity across a broad range of applications. Since high accuracy CNNs are both computation intensive and memory intensive, many researchers have shown significant interest in the accelerator design. Furthermore, the AI chip market size grows and the competition on the performance, cost, and power consumption of the artificial intelligence SoC designs is increasing. Therefore, it is important to develop design techniques and platforms that are useful for the efficient design of optimized AI architectures to satisfy the given specifications in a short design time. In this research, we have developed design space exploration techniques and environments for the optimal design of the overall system including computing modules and memories. Our current design platform is built using NVIDIA Deep Learning Accelerator as a computing model, SRAM as a buffer, and DRAM with GDDR6 as an off-chip memory. We also developed a program to estimate the processing time of a given neural network. By modifying both the on-chip SRAM size and the computing module size, a designer can explore the design space efficiently, and then choose the optimal architecture which shows the minimal cost while satisfying the performance specification. To illustrate the operation of the design platform, two well-known deep CNNs are used, which are YOLOv3 and faster RCNN. This technology can be used to explore and to optimize the hardware architectures of the CNNs so that the cost can be minimized.
引用
收藏
页码:9703 / 9712
页数:9
相关论文
共 50 条
  • [1] Hardware Architecture Exploration for Deep Neural Networks
    Zheng, Wenqi
    Zhao, Yangyi
    Chen, Yunfan
    Park, Jinhong
    Shin, Hyunchul
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2021, 46 (10) : 9703 - 9712
  • [2] A Fast Compressed Hardware Architecture for Deep Neural Networks
    Ansari, Anaam
    Shelton, Allen
    Ogunfunmi, Tokunbo
    Panchbhaiyye, Vineet
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 370 - 374
  • [3] Efficient Softmax Hardware Architecture for Deep Neural Networks
    Du, Gaoming
    Tian, Chao
    Li, Zhenmin
    Zhang, Duoli
    Yin, Yongsheng
    Ouyang, Yiming
    [J]. GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, : 75 - 80
  • [4] An Efficient and Fast Softmax Hardware Architecture (EFSHA) for Deep Neural Networks
    Hussain, Muhammad Awais
    Tsai, Tsung-Han
    [J]. 2021 IEEE 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS), 2021,
  • [5] Tile-Based Architecture Exploration for Convolutional Accelerators in Deep Neural Networks
    Chen, Yang-Tsai
    Yen, Yu-Xiang
    Chen, Chun-Tse
    Chen, Tzu-Yu
    Huang, Chih-Tsun
    Liou, Jing-Jia
    Lu, Juin-Ming
    [J]. 2021 IEEE 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS), 2021,
  • [6] SLIT: An Energy-Efficient Reconfigurable Hardware Architecture for Deep Convolutional Neural Networks
    Tran, Thi Diem
    Nakashima, Yasuhiko
    [J]. IEICE TRANSACTIONS ON ELECTRONICS, 2021, E104C (07) : 319 - 329
  • [7] Hardware Compilation of Deep Neural Networks: An Overview
    Zhao, Ruizhe
    Liu, Shuanglong
    Ng, Ho-Cheung
    Wang, Erwei
    Davis, James J.
    Niu, Xinyu
    Wang, Xiwei
    Shi, Huifeng
    Constantinides, George A.
    Cheung, Peter Y. K.
    Luk, Wayne
    [J]. 2018 IEEE 29TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2018, : 120 - 127
  • [8] Formal Verification of Deep Neural Networks in Hardware
    Saji, Sincy Ann
    Agrawal, Shreyansh
    Sood, Surinder
    [J]. 2022 IEEE WOMEN IN TECHNOLOGY CONFERENCE (WINTECHCON): SMARTER TECHNOLOGIES FOR A SUSTAINABLE AND HYPER-CONNECTED WORLD, 2022,
  • [9] BioNetExplorer: Architecture-Space Exploration of Biosignal Processing Deep Neural Networks for Wearables
    Prabakaran, Bharath Srinivas
    Akhtar, Asima
    Rehman, Semeen
    Hasan, Osman
    Shafique, Muhammad
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (17): : 13251 - 13265
  • [10] Architecture Disentanglement for Deep Neural Networks
    Hu, Jie
    Cao, Liujuan
    Tong, Tong
    Ye, Qixiang
    Zhang, Shengchuan
    Li, Ke
    Huang, Feiyue
    Shao, Ling
    Ji, Rongrong
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 652 - 661