Advancements in Deep Learning Architectures for Image Recognition and Semantic Segmentation

Cited by: 0

Authors:

Nimma, Divya ^{[1]}

Uddagiri, Arjun ^{[2]}

Affiliations:

[1] Univ Southern Mississippi, Computat Sci, Hattiesburg, MS 39406 USA

[2] Gloom Dev Pvt Ltd, Vijayawada 521139, Andhra Pradesh, India

Source:

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS | 2024 / Vol. 15 / No. 08

Keywords:

Convolutional Neural Networks (CNNs); AlexNet; image classification; transfer learning; MNIST Dataset; Custom CNN Architecture; deep learning; model training and evaluation; neural network optimization; activation functions; feature extraction; machine learning; pattern recognition; data preprocessing; loss functions; model accuracy;

DOI:

10.14569/IJACSA.2024.01508114

Chinese Library Classification (CLC) number:

TP301 [Theory, methods];

Discipline classification code:

081202;

Abstract:

This paper focuses on using Convolutional Neural Networks (CNNs) for tasks such as image classification, covering both pre-trained models and models built from scratch. It begins by demonstrating how to use the well-known AlexNet model, which can be applied effectively to image recognition through transfer learning. It then explains how to load and prepare the MNIST dataset, a common choice for testing image classification methods. It also introduces a custom CNN designed for recognizing MNIST digits and outlines its architecture, which combines convolutional layers, activation functions, and fully connected layers to capture the details of handwritten digits. The paper walks through initializing the model, running it on sample data, reviewing the outputs, and assessing the accuracy of its predictions. It then covers training the custom CNN, using loss functions and optimization techniques to fine-tune the model, and evaluating its classification accuracy against established benchmarks. The work integrates theory with practical application and serves as a comprehensive guide to creating and evaluating CNNs for image classification, with implications for both research and real-world applications in computer vision.


Pages: 1172 - 1185

Page count: 14

50 items in total

[41] A Survey on Deep Learning-based Architectures for Semantic Segmentation on 2D Images
Ulku, Irem
Akagunduz, Erdem
[J]. APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
[42] Systematic Evaluation of Image Tiling Adverse Effects on Deep Learning Semantic Segmentation
Reina, G. Anthony
Panchumarthy, Ravi
Thakur, Siddhesh Pravin
Bastidas, Alexei
Bakas, Spyridon
[J]. FRONTIERS IN NEUROSCIENCE, 2020, 14
[43] Features to Text: A Comprehensive Survey of Deep Learning on Semantic Segmentation and Image Captioning
Oluwasammi, Ariyo
Aftab, Muhammad Umar
Qin, Zhiguang
Son Tung Ngo
Thang Van Doan
Son Ba Nguyen
Son Hoang Nguyen
Giang Hoang Nguyen
[J]. COMPLEXITY, 2021, 2021
[44] Semantic segmentation of PolSAR image data using advanced deep learning model
Garg, Rajat
Kumar, Anil
Bansal, Nikunj
Prateek, Manish
Kumar, Shashi
[J]. SCIENTIFIC REPORTS, 2021, 11 (01)
[45] Exploring Effects of Colour and Image Quality in Semantic Segmentation by Deep Learning Methods
De, Kanjar
[J]. JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2022, 66 (05)
[46] Combining deep learning and ontology reasoning for remote sensing image semantic segmentation
Li, Yansheng
Ouyang, Song
Zhang, Yongjun
[J]. KNOWLEDGE-BASED SYSTEMS, 2022, 243
[47] Research on Semantic Segmentation Method of Urban Streetscape Image Based on Deep Learning
Gan Peixin
Luo Xiaoyan
Liu Bo
Li Lu
Shi Xiaofeng
[J]. SEVENTH ASIA PACIFIC CONFERENCE ON OPTICS MANUFACTURE (APCOM 2021), 2022, 12166
[48] Subset selection for visualization of relevant image fractions for deep learning based semantic image segmentation
Mauch, Lukas
Wang, Chunlai
Yang, Bin
[J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2018, 355 (04) : 1931 - 1944
[49] Image Semantic Segmentation Method Based on Deep Learning in UAV Aerial Remote Sensing Image
Ling, Min
Cheng, Qun
Peng, Jun
Zhao, Chenyi
Jiang, Ling
[J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
[50] Efficient deep feature selection for remote sensing image recognition with fused deep learning architectures
Fatih Özyurt
[J]. The Journal of Supercomputing, 2020, 76 : 8413 - 8431
