Understanding the Convolutional Neural Networks with Gradient Descent and Backpropagation

Cited by: 15
Authors
Zhou, XueFei [1 ]
Affiliations
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
DOI
10.1088/1742-6596/1004/1/012028
CLC number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the development of computer technology, the applications of machine learning have become more and more extensive, and machine learning continues to open up opportunities for new applications. One of these applications is image recognition using Convolutional Neural Networks (CNNs). CNNs are among the most common algorithms in image recognition, so it is important for every scholar interested in this field to understand their theory and structure. CNNs are mainly used in recognition tasks, especially speech and text recognition, among other applications. They use a hierarchical structure with different layers to accelerate computation. In addition, the most notable features of CNNs are weight sharing and dimension reduction, which together give CNNs high effectiveness and efficiency, with favorable computing speed and error rate. Combined with other learning algorithms, CNNs can be applied in many machine learning scenarios, especially in deep learning. Building on a general introduction to the background and to CNNs as the core solution, this paper focuses on summarizing how Gradient Descent and Backpropagation work and how they contribute to the high performance of CNNs. Some practical applications are also discussed in the following sections. The last section presents the conclusion and some perspectives on future work.
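The abstract describes Gradient Descent and Backpropagation only in prose. As a rough, hypothetical illustration (not code from the paper), the Python sketch below trains a single fully connected layer with hand-coded backpropagation and gradient-descent updates; the toy data, learning rate, and variable names are assumptions made for the example.

```python
import numpy as np

# Minimal sketch: one fully connected layer trained with mean-squared error,
# using hand-written backpropagation and plain gradient descent.
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 3))        # 4 samples, 3 input features (toy data)
t = rng.normal(size=(4, 2))        # 4 samples, 2 target values (toy data)
W = rng.normal(size=(3, 2)) * 0.1  # layer weights
b = np.zeros(2)                    # layer biases
lr = 0.1                           # learning rate (illustrative choice)

for step in range(100):
    # Forward pass: y = xW + b, loss = mean squared error
    y = x @ W + b
    loss = np.mean((y - t) ** 2)

    # Backpropagation: chain rule gives dL/dW and dL/db from dL/dy
    dy = 2.0 * (y - t) / y.size    # dL/dy
    dW = x.T @ dy                  # dL/dW
    db = dy.sum(axis=0)            # dL/db

    # Gradient descent: move each parameter against its gradient
    W -= lr * dW
    b -= lr * db

print(f"final loss: {loss:.6f}")
```

In a CNN the same update rule applies to the convolution kernels; weight sharing means the gradient of one kernel is accumulated over all spatial positions before the descent step is taken.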
Pages: 5