Advancements in Deep Learning Architectures for Image Recognition and Semantic Segmentation

Cited by: 0

Authors:

Nimma, Divya [1]

Uddagiri, Arjun [2]

Affiliations:

[1] Univ Southern Mississippi, Computat Sci, Hattiesburg, MS 39406 USA

[2] Gloom Dev Pvt Ltd, Vijayawada 521139, Andhra Pradesh, India

Source:

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS | 2024, Vol. 15, No. 08

Keywords:

Convolutional Neural Networks (CNNs); AlexNet; image classification; transfer learning; MNIST Dataset; Custom CNN Architecture; deep learning; model training and evaluation; neural network optimization; activation functions; feature extraction; machine learning; pattern recognition; data preprocessing; loss functions; model accuracy;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202;

Abstract:

This paper focuses on using Convolutional Neural Networks (CNNs) for image classification, covering both pre-trained models and models built from scratch. It begins by demonstrating how to use the well-known AlexNet model, which can be applied effectively to image recognition through transfer learning. It then explains how to load and prepare the MNIST dataset, a common benchmark for image classification methods. Next, it introduces a custom CNN designed specifically for recognizing MNIST digits and outlines its architecture, which combines convolutional layers, activation functions, and fully connected layers to capture the details of handwritten digits. The paper also walks through initializing the model, running it on sample data, reviewing the outputs, and assessing the accuracy of its predictions. It then covers training the custom CNN and evaluating its performance against established benchmarks, using loss functions and optimization techniques to fine-tune the model and measure its classification accuracy. This work integrates theory with practical application, serving as a guide to building and evaluating CNNs for image classification, with implications for both research and real-world computer vision applications.
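As a rough illustration of how a small MNIST CNN of the kind the abstract describes turns a 28x28 input into features for a fully connected layer, the sketch below traces feature-map shapes through a stack of convolution and pooling layers. The layer stack (`LAYERS`), its channel counts, and the helper names `conv_out`, `trace_shapes`, and `flattened_features` are assumptions made for this example; this record does not give the paper's actual architecture.

```python
def conv_out(size: int, kernel: int, stride: int = 1, padding: int = 0) -> int:
    """Spatial output size of a convolution or pooling layer (floor rounding)."""
    return (size + 2 * padding - kernel) // stride + 1


# Hypothetical layer stack, NOT taken from the paper:
# (name, output channels or None for pooling, kernel, stride, padding)
LAYERS = [
    ("conv1", 16, 3, 1, 1),
    ("pool1", None, 2, 2, 0),
    ("conv2", 32, 3, 1, 1),
    ("pool2", None, 2, 2, 0),
]


def trace_shapes(channels: int = 1, size: int = 28):
    """Return (name, channels, height, width) after each layer for a square input."""
    shapes = [("input", channels, size, size)]
    for name, out_channels, kernel, stride, padding in LAYERS:
        size = conv_out(size, kernel, stride, padding)
        if out_channels is not None:  # pooling layers keep the channel count
            channels = out_channels
        shapes.append((name, channels, size, size))
    return shapes


def flattened_features(channels: int = 1, size: int = 28) -> int:
    """Number of inputs the first fully connected layer would receive."""
    _, c, h, w = trace_shapes(channels, size)[-1]
    return c * h * w


if __name__ == "__main__":
    for name, c, h, w in trace_shapes():
        print(f"{name}: {c} x {h} x {w}")
    print("flattened features:", flattened_features())
```

Under these assumed layer settings, a single-channel 28x28 MNIST image is reduced to 32 feature maps of size 7x7, so the first fully connected layer would take 1568 inputs.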

Citation

Pages: 1172-1185

Page count: 14

50 items in total

[1] DEEP LEARNING ARCHITECTURES FOR MEDICAL IMAGE SEGMENTATION
Subramaniam, Sudha
Jayanthi, K. B.
Rajasekaran, C.
Kuchelar, Ramani
[J]. 2020 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS 2020), 2020 : 579 - 584
[2] Deep Neural Architectures for Medical Image Semantic Segmentation: Review
Khan, Muhammad Zubair
Gajendran, Mohan Kumar
Lee, Yugyung
Khan, Muazzam A.
[J]. IEEE ACCESS, 2021, 9 : 83002 - 83024
[3] Deep Dual Learning for Semantic Image Segmentation
Luo, Ping
Wang, Guangrun
Lin, Liang
Wang, Xiaogang
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017 : 2737 - 2745
[4] Image Classification and Semantic Segmentation with Deep Learning
Quazi, Saiman
Musa, Sarhan M.
[J]. 6TH IEEE INTERNATIONAL CONFERENCE ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2021
[5] Automatic Development of Deep Learning Architectures for Image Segmentation
Nistor, Sergiu Cosmin
Ileni, Tudor Alexandru
Darabant, Adrian Sergiu
[J]. SUSTAINABILITY, 2020, 12 (22) : 1 - 18
[6] Semantic Segmentation: A Zoology of Deep Architectures
Artola, Aitor
[J]. IMAGE PROCESSING ON LINE, 2023, 13 : 167 - 182
[7] Multimodal Deep Learning in Semantic Image Segmentation: A Review
Raman, Vishal
Kumari, Madhu
[J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTERNET OF THINGS (CCIOT 2018), 2018 : 7 - 11
[8] Medical image semantic segmentation based on deep learning
Jiang, Feng
Grigorev, Aleksei
Rho, Seungmin
Tian, Zhihong
Fu, YunSheng
Jifara, Worku
Adil, Khan
Liu, Shaohui
[J]. NEURAL COMPUTING & APPLICATIONS, 2018, 29 (05) : 1257 - 1265
[9] Semantic image segmentation network based on deep learning
Chen, Bo
Zhang, Jiahao
Zhou, Jianbang
Chen, Zhong
Yang, Tian
Zhang, Yanna
[J]. MIPPR 2019: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2020, 11429
[10] A survey on deep learning techniques for image and video semantic segmentation
Garcia-Garcia, Alberto
Orts-Escolano, Sergio
Oprea, Sergiu
Villena-Martinez, Victor
Martinez-Gonzalez, Pablo
Garcia-Rodriguez, Jose
[J]. APPLIED SOFT COMPUTING, 2018, 70 : 41 - 65
