Advancements in Deep Learning Architectures for Image Recognition and Semantic Segmentation

被引:0
|
作者
Nimma, Divya [1 ]
Uddagiri, Arjun [2 ]
机构
[1] Univ Southern Mississippi, Computat Sci, Hattiesburg, MS 39406 USA
[2] Gloom Dev Pvt Ltd, Vijayawada 521139, Andhra Pradesh, India
关键词
Convolutional Neural Networks (CNNs); AlexNet; image classification; transfer learning; MNIST Dataset; Custom CNN Architecture; deep learning; model training and evaluation; neural network optimization; activation functions; feature extraction; machine learning; pattern recognition; data preprocessing; loss functions; model accuracy;
D O I
10.14569/IJACSA.2024.01508114
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper focuses on using Convolutional Neural Networks (CNNs) for tasks such as image classification. It covers both pre-trained models and those that are built from scratch. The paper begins by demonstrating how to utilize the well-known AlexNet model, which is highly effective for image recognition due to transfer learning. It then explains how to load and prepare the MNIST dataset, a common choice for testing image classification methods. Additionally, it introduces a custom CNN designed specifically for recognizing MNIST digits, outlining its architecture, which includes convolutional layers, activation functions, and fully connected layers for capturing handwritten numbers' details. The paper also guides starting the model, running it on sample data, reviewing outputs, and assessing the accuracy of predictions. Furthermore, it delves into training the custom CNN and evaluating its performance by comparing it with established benchmarks, utilizing loss functions and optimization techniques to fine-tune the model and assess its classification accuracy. This work integrates theory with practical application, serving as a comprehensive guide for creating and evaluating CNNs in image classification, with implications for both research and real-world applications in computer vision.
引用
收藏
页码:1172 / 1185
页数:14
相关论文
共 50 条
  • [41] A Survey on Deep Learning-based Architectures for Semantic Segmentation on 2D Images
    Ulku, Irem
    Akagunduz, Erdem
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [42] Systematic Evaluation of Image Tiling Adverse Effects on Deep Learning Semantic Segmentation
    Reina, G. Anthony
    Panchumarthy, Ravi
    Thakur, Siddhesh Pravin
    Bastidas, Alexei
    Bakas, Spyridon
    [J]. FRONTIERS IN NEUROSCIENCE, 2020, 14
  • [43] Features to Text: A Comprehensive Survey of Deep Learning on Semantic Segmentation and Image Captioning
    Oluwasammi, Ariyo
    Aftab, Muhammad Umar
    Qin, Zhiguang
    Son Tung Ngo
    Thang Van Doan
    Son Ba Nguyen
    Son Hoang Nguyen
    Giang Hoang Nguyen
    [J]. COMPLEXITY, 2021, 2021
  • [44] Semantic segmentation of PolSAR image data using advanced deep learning model
    Garg, Rajat
    Kumar, Anil
    Bansal, Nikunj
    Prateek, Manish
    Kumar, Shashi
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [45] Exploring Effects of Colour and Image Quality in Semantic Segmentation by Deep Learning Methods
    De, Kanjar
    [J]. JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2022, 66 (05)
  • [46] Combining deep learning and ontology reasoning for remote sensing image semantic segmentation
    Li, Yansheng
    Ouyang, Song
    Zhang, Yongjun
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 243
  • [47] Research on Semantic Segmentation Method of Urban Streetscape Image Based on Deep Learning
    Gan Peixin
    Luo Xiaoyan
    Liu Bo
    Li Lu
    Shi Xiaofeng
    [J]. SEVENTH ASIA PACIFIC CONFERENCE ON OPTICS MANUFACTURE (APCOM 2021), 2022, 12166
  • [48] Subset selection for visualization of relevant image fractions for deep learning based semantic image segmentation
    Mauch, Lukas
    Wang, Chunlai
    Yang, Bin
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2018, 355 (04): : 1931 - 1944
  • [49] Image Semantic Segmentation Method Based on Deep Learning in UAV Aerial Remote Sensing Image
    Ling, Min
    Cheng, Qun
    Peng, Jun
    Zhao, Chenyi
    Jiang, Ling
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [50] Efficient deep feature selection for remote sensing image recognition with fused deep learning architectures
    Fatih Özyurt
    [J]. The Journal of Supercomputing, 2020, 76 : 8413 - 8431