Understanding the Convolutional Neural Networks with Gradient Descent and Backpropagation

被引:15
|
作者
Zhou, XueFei [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
关键词
D O I
10.1088/1742-6596/1004/1/012028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of computer technology, the applications of machine learning are more and more extensive. And machine learning is providing endless opportunities to develop new applications. One of those applications is image recognition by using Convolutional Neural Networks (CNNs). CNN is one of the most common algorithms in image recognition. It is significant to understand its theory and structure for every scholar who is interested in this field. CNN is mainly used in computer identification, especially in voice, text recognition and other aspects of the application. It utilizes hierarchical structure with different layers to accelerate computing speed. In addition, the greatest features of CNNs are the weight sharing and dimension reduction. And all of these consolidate the high effectiveness and efficiency of CNNs with idea computing speed and error rate. With the help of other learning altruisms, CNNs could be used in several scenarios for machine learning, especially for deep learning. Based on the general introduction to the background and the core solution CNN, this paper is going to focus on summarizing how Gradient Descent and Backpropagation work, and how they contribute to the high performances of CNNs. Also, some practical applications will be discussed in the following parts. The last section exhibits the conclusion and some perspectives of future work.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Applying Gradient Descent in Convolutional Neural Networks
    Cui, Nan
    [J]. 2ND INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2018), 2018, 1004
  • [2] Calibrated Stochastic Gradient Descent for Convolutional Neural Networks
    Zhuo, Li'an
    Zhang, Baochang
    Chen, Chen
    Ye, Qixiang
    Liu, Jianzhuang
    Doermann, David
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9348 - 9355
  • [3] Hardware implementation of backpropagation using progressive gradient descent for in situ training of multilayer neural networks
    van Doremaele, Eveline R. W.
    Stevens, Tim
    Ringeling, Stijn
    Spolaor, Simone
    Fattori, Marco
    van de Burgt, Yoeri
    [J]. SCIENCE ADVANCES, 2024, 10 (28):
  • [4] Guaranteed Convergence of Training Convolutional Neural Networks via Accelerated Gradient Descent
    Zhang, Shuai
    Wang, Meng
    Liu, Sijia
    Chen, Pin-Yu
    Xiong, Jinjun
    [J]. 2020 54TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2020, : 41 - 46
  • [5] Homogeneous Vector Capsules Enable Adaptive Gradient Descent in Convolutional Neural Networks
    Byerly, Adam
    Kalganova, Tatiana
    [J]. IEEE ACCESS, 2021, 9 : 48519 - 48530
  • [6] Overfitting and neural networks: Conjugate gradient and backpropagation
    Lawrence, S
    Giles, CL
    [J]. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL I, 2000, : 114 - 119
  • [7] INVERSION OF NEURAL NETWORKS BY GRADIENT DESCENT
    KINDERMANN, J
    LINDEN, A
    [J]. PARALLEL COMPUTING, 1990, 14 (03) : 277 - 286
  • [8] Gradient Descent for Spiking Neural Networks
    Huh, Dongsung
    Sejnowski, Terrence J.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [9] AdaInject: Injection-Based Adaptive Gradient Descent Optimizers for Convolutional Neural Networks
    Dubey S.R.
    Basha S.H.S.
    Singh S.K.
    Chaudhuri B.B.
    [J]. IEEE Transactions on Artificial Intelligence, 2023, 4 (06): : 1540 - 1548
  • [10] A Comparative Analysis of Gradient Descent-Based Optimization Algorithms on Convolutional Neural Networks
    Dogo, E. M.
    Afolabi, O. J.
    Nwulu, N. I.
    Twala, B.
    Aigbavboa, C. O.
    [J]. PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNIQUES, ELECTRONICS AND MECHANICAL SYSTEMS (CTEMS), 2018, : 92 - 99