Research progress on visual image detection based on convolutional neural network

被引:0
|
作者
Lan J. [1 ]
Wang D. [1 ]
Shen X. [1 ]
机构
[1] School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing
来源
Lan, Jinhui (lanjh@ustb.edu.cn) | 1600年 / Science Press卷 / 41期
关键词
Computer vision; Convolutional neural network (CNN); Deep learning; Image detection;
D O I
10.19650/j.cnki.cjsi.J2006003
中图分类号
学科分类号
摘要
Visual image detection has great research significance and application value in the computer vision field. In recent years, the development of convolutional neural network (CNN) has led to the progress of visual image detection. A large number of new theories and new methods are applied to convolutional neural network, which improves the network feature expression ability, reduces the network complexity and improves the network performance. This paper presents the basic structure of Convolutional CNN, summarizes the improvements of CNN in recent years on different aspects, including convolutional layer, pooling layer, activation function, network regularization and network optimization, sorts various applications of CNN in visual image detection field and summarizes the advantages of CNN in visual image detection field, finally, prospects the future research direction. © 2020, Science Press. All right reserved.
引用
收藏
页码:167 / 182
页数:15
相关论文
共 72 条
  • [1] SHI J, ZHOU Y L, ZHANG Q ZH, Et al., Service robot item recognition system based on improved Mask RCNN and Kinect, Chinese Journal of Scientific Instrument, 40, 4, pp. 216-228, (2019)
  • [2] CHEN P, TANG Y P, HE X, Et al., Detection method for fake plate vehicles based on multitask Faster R-CNN, Chinese Journal of Scientific Instrument, 38, 12, pp. 3079-3089, (2017)
  • [3] WANG Y L, CAO J T, JI X F, Et al., PCB defect detection and recognition algorithm based on convolutional neuralnetwork, Chinese Journal of Scientific Instrument, 33, 8, pp. 78-84, (2019)
  • [4] ZHANG L M, JIA ZH W, XIE J L, Et al., Recent four years review on deep learning classification research in capsule endoscopy image processing, Chinese Journal of Scientific Instrument, 33, 8, pp. 69-77, (2019)
  • [5] HINTON G, SALAKHUTDINOV R R., Reducing the dimensionality of data with neural networks, Science, 313, 5786, pp. 504-507, (2006)
  • [6] HUBEL D H, WIESEL T N., Receptive fields, binocular interaction, and functional architecture in the cat's visual cortex, Journal of Physiology, 160, 1, pp. 106-154, (1962)
  • [7] HUBEL D H, WIESEL T N., Receptive fields of single neurons in the cat's striate cortex, Journal of Physiology, 148, 3, pp. 547-591, (1959)
  • [8] FUKUSHIMA K., Neural network model for a mechanism of pattern recognition unaffected by shift in position-neocognitron, IEICE Technical Report, 62, 10, pp. 658-665, (1989)
  • [9] FUKUSHIMA K., Neocognitron: A self-organizing neural network for a mechanism of pattern recognition unaffected by shift in position, Biological Cybernetics, 36, 4, pp. 193-202, (1980)
  • [10] FUKUSHIMA K., Artificial vision by multi-layered neural networks: Neocognitron and its advances, Neural Networks, 37, pp. 103-119, (2013)