ViP: A Hierarchical Parallel Vision Processor for Hybrid Vision Chip

被引:6
|
作者
Zheng, Xuemin [1 ,2 ,3 ]
Cheng, Li [1 ,2 ,3 ]
Zhao, Mingxin [1 ,2 ,3 ]
Luo, Qian [1 ,2 ,3 ]
Li, Honglong [1 ,2 ,3 ]
Dou, Runjiang [1 ,2 ,3 ]
Yu, Shuangming [1 ,2 ,3 ]
Wu, Nanjian [1 ,2 ,3 ]
Liu, Liyuan [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Semicond, State Key Lab Superlattices & Microstruct, Beijing 100083, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing 100083, Peoples R China
[3] Univ Chinese Acad Sci, Ctr Mat Sci & Optoelect Engn, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolution; Parallel processing; Artificial neural networks; Kernel; Reduced instruction set computing; Computer architecture; Hardware; Hierarchical parallel; vision chip; computer vision; neural network;
D O I
10.1109/TCSII.2022.3156945
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays, the vision chip bridging sensing and processing has been extensively employed in high-speed image processing, owing to its excellent performance, low power consumption, and economical cost. However, there is a dilemma in designing processors to support conventional computer vision algorithms and neural networks since the two algorithms have a non-trivial trade-off in proposing a unified architecture. By analyzing computation properties, we propose a novel hierarchical parallel vision processor (ViP) for hybrid vision chips to accelerate both traditional computer vision (CV) and neural network (NN). The ViP architecture includes three parallelism levels: PE for pixel-centric, computing core (CC) for block, and vision core (VC) for global. PEs contain dedicated computing units and data paths for convolution operations without degrading its flexibility. Each CC is driven by customized SIMD instructions and can be dynamically connected for meeting block parallelism requirements. ViP is fabricated in 65nm CMOS technology and achieves a peak performance of 614.4 GOPS and an energy efficiency of 640 GOPS/W at 200 MHz clock frequency. Notably, several experiments on CV and NN are performed, illustrating an ultra-low latency in executing hybrid algorithms.
引用
收藏
页码:2957 / 2961
页数:5
相关论文
共 50 条
  • [31] Live Demonstration: Programmable Vision Chip with Neighborhood Level Parallel Processing
    Gharzai, Mahir Kabeer
    Schmitz, Joseph A.
    Balkir, Sina
    Hoffman, Michael W.
    2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 450 - 450
  • [32] A Programmable Vision Chip with Pixel-Neighborhood Level Parallel Processing
    Schmitz, Joseph A.
    Gharzai, Mahir Kabeer
    Balkir, Sina
    Hoffman, Michael W.
    White, Daniel J.
    Schemm, Nathan
    2015 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2015, : 2125 - 2128
  • [33] A general-purpose processor-per-pixel analog SIMD vision chip
    Dudek, P
    Hicks, PJ
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2005, 52 (01) : 13 - 20
  • [35] Robot vision application on embedded vision implementation with digital signal processor
    Guan, Xiaochun
    Huang, Jianlin
    Tang, Tinglong
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (01)
  • [36] Insect-Vision Inspired Collision Warning Vision Processor for Automobiles
    Linan-Cembrano, Gustavo
    Carranza, Luis
    Rind, Claire
    Zarandy, Akos
    Soininen, Martti
    Rodriguez-Vazquez, Angel
    IEEE CIRCUITS AND SYSTEMS MAGAZINE, 2008, 8 (02) : 6 - 24
  • [37] PARALLEL ARCHITECTURES FOR VISION
    MARESCA, M
    LAVIN, MA
    LI, HW
    PROCEEDINGS OF THE IEEE, 1988, 76 (08) : 970 - 981
  • [38] VLSI PROCESSOR ARCHITECTURES FOR COMPUTER VISION
    YANG, W
    CHIANG, AM
    IMAGE UNDERSTANDING WORKSHOP /, 1989, : 193 - 199
  • [39] M(2)VIP - Variety in mechatronics and vision
    Morris, J
    INDUSTRIAL ROBOT, 1995, 22 (06): : 16 - 18
  • [40] A novel architecture for a massively parallel low level vision processing engine on chip
    Tomasi, Matteo
    Vanegas, Mauricio
    Barranco, Francisco
    Diaz, Javier
    Ros, Eduardo
    IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE 2010), 2010, : 3033 - 3039