Smart vision chip

被引:1
|
作者
Liu, Liyuan [1 ,2 ]
Feng, Peng [1 ,3 ]
Yang, Xu [1 ,3 ]
Yu, Shuangming [1 ,3 ]
Dou, Runjiang [1 ,3 ]
Liu, Jian [1 ,3 ]
Wu, Nanjian [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Semicond, State Key Lab Superlattices & Microstruct, Beijing 100083, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Coll Mat Sci & Optoelect Technol, Beijing 100049, Peoples R China
来源
CHINESE SCIENCE BULLETIN-CHINESE | 2023年 / 68卷 / 35期
关键词
vision chip; CMOS image sensor; image processor; edge computing; fusion of sensoring storage and computing; IMAGE SENSOR; PROCESSOR; ACCELERATOR; NETWORK; TEMPERATURE; SPEED; BURST; LOIHI; CNN;
D O I
10.1360/TB-2023-0859
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Vision chips are a fusion of image sensors, processors and data storage. They are a tight combination of image sensors and processors. Data from the image sensor could be processed in situ by a processor, which shows lower latency and lower power performance. Meanwhile, the sensor could also be evaluated continuously and configured in real time, which allows the sensor to always work in a proper status. The design of a vision chip is a nontrivial task that requires a good understanding of image sensor, processor, algorithm and integration technology. Vision chips can be multibit-based or spiking-based according to the type of sensor and processor integrated. The algorithm running on the vision chip can be a classical computational algorithm with handcrafted features or a deep neural network algorithm that relies on training. The CMOS image sensor is the best candidate for building vision chips. The basic principle behind this process is the photoelectric effect in Si materials. A photo diode is the fundamental sensing device. Inventions of pinned photodiode and 4-T pixel topology are two significant processes that greatly reduce dark current and reset noise and improve image quality. The architecture of image sensors includes column-parallel, chip-parallel and pixel-parallel fashions, among which column-parallel architectures have become very popular because of their good balance between complexity and readout parallelism. Currently, image sensors are developing toward higher resolution, higher frame rate, higher dynamic vision, 3D image and multispectrum vision capabilities, which significantly increase data size. To reduce the data burden, spiking image sensors were developed that output spiking maps instead of gray images. To process the image data in situ, two kinds of vision processors can be adopted. For gray images, a multibit-based architecture is needed. This architecture has evolved from application-specified design to flexible programmable design. Currently, programmable processors are mainstream owing to their high flexibility. To process images smartly, convolutional neural networks (CNNs) are now widely adopted. However, CNNs are both computationally intensive and storage intensive. To process CNNs on chips more efficiently, hardware and software optimization techniques are both needed. Hardware-wise, parallel computing is the basis for dealing with intensive computation. Employing quantization, sparsity and data reuse is very useful to reduce computational complexity and power consumption. For spiking images, the spiking-based processor is preferred. Some spiking processors employ brain-like cross-bar topology. Although achieving low power consumption, the hardware cost increases when dealing with complex algorithms. Others adopt a time-multiplexing method to design a spiking processor that is cost effective. Software-wise, classical computer vision (CV) algorithms use handcrafted features and behave well in specific applications. However, CNNs show more robust and accurate performance. As CNNs need more computational capacity, neural network pruning and effective quantization methods are important to reduce the computational burden. For various spiking-based processors, a neural network operating on it can be built up using transfer learning. To hook up the image sensor and image processor, plain integration and 3-D integration could be adopted. Compared to plain integration, 3-D integration allows the optimization of sensors and processors with suitable fabrication technologies. 3-D integration also allows cramming more memories on chip. In the future, mixed-signal processing and computing in memory techniques could be employed for more efficient computing, and novel 2-D materials may open new ways for sensing-storagecomputing fusion in a single device.
引用
收藏
页码:4844 / 4861
页数:18
相关论文
共 97 条
  • [1] True North: Design and Tool Flow of a 65 mW 1 Million Neuron Programmable Neurosynaptic Chip
    Akopyan, Filipp
    Sawada, Jun
    Cassidy, Andrew
    Alvarez-Icaza, Rodrigo
    Arthur, John
    Merolla, Paul
    Imam, Nabil
    Nakamura, Yutaka
    Datta, Pallab
    Nam, Gi-Joon
    Taba, Brian
    Beakes, Michael
    Brezzo, Bernard
    Kuang, Jente B.
    Manohar, Rajit
    Risk, William P.
    Jackson, Bryan
    Modha, Dharmendra S.
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2015, 34 (10) : 1537 - 1557
  • [2] [Anonymous], 2023, 2023 IEEE INT SOL ST, P90
  • [3] An Always-On 3.8 μJ/86% CIFAR-10 Mixed-Signal Binary CNN Processor With All Memory on Chip in 28-nm CMOS
    Bankman, Daniel
    Yang, Lita
    Moons, Bert
    Verhelst, Marian
    Murmann, Boris
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2019, 54 (01) : 158 - 172
  • [4] Low-Power, Adaptive Neuromorphic Systems: Recent Progress and Future Directions
    Basu, Arindam
    Acharya, Jyotibdha
    Karnik, Tanay
    Liu, Huichu
    Li, Hai
    Seo, Jae-Sun
    Song, Chang
    [J]. IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2018, 8 (01) : 6 - 27
  • [5] A 30 μW 30 fps 110 x 110 Pixels Vision Sensor Embedding Local Binary Patterns
    Berkovich, Andrew
    Lecca, Michela
    Gasparini, Leonardo
    Abshire, Pamela A.
    Gottardi, Massimo
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2015, 50 (09) : 2138 - 2148
  • [6] Point-to-point connectivity between neuromorphic chips using address events
    Boahen, KA
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2000, 47 (05) : 416 - 434
  • [7] Bogaerts J, 2016, ISSCC DIG TECH PAP I, V59, P114, DOI 10.1109/ISSCC.2016.7417933
  • [8] IR imager based on 128x128 HgCdTe staring focal plane array
    Boltar, KO
    Bovina, LA
    Saginov, LD
    Stafeev, VI
    Gibin, IS
    Maleev, VM
    [J]. INTERNATIONAL CONFERENCE ON PHOTOELECTRONICS AND NIGHT VISION DEVICES, 1999, 3819 : 92 - 95
  • [9] A 35-mm format 11 M pixel full-frame CCD for professional digital still imaging
    Bosiers, JT
    Dillen, BGM
    Draijer, C
    Kleimann, AC
    Polderdijk, FJ
    de Wolf, MARC
    Klaassens, W
    Theuwissen, AJP
    Peek, HL
    Folkerts, HO
    [J]. IEEE TRANSACTIONS ON ELECTRON DEVICES, 2003, 50 (01) : 254 - 265
  • [10] Dynamic-range compression and contrast enhancement in infrared imaging systems
    Branchitta, Francesco
    Diani, Marco
    Corsini, Giovanni
    Porta, Antonio
    [J]. OPTICAL ENGINEERING, 2008, 47 (07)