Advancements in Deep Learning Architectures for Image Recognition and Semantic Segmentation

被引:0
|
作者
Nimma, Divya [1 ]
Uddagiri, Arjun [2 ]
机构
[1] Univ Southern Mississippi, Computat Sci, Hattiesburg, MS 39406 USA
[2] Gloom Dev Pvt Ltd, Vijayawada 521139, Andhra Pradesh, India
关键词
Convolutional Neural Networks (CNNs); AlexNet; image classification; transfer learning; MNIST Dataset; Custom CNN Architecture; deep learning; model training and evaluation; neural network optimization; activation functions; feature extraction; machine learning; pattern recognition; data preprocessing; loss functions; model accuracy;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper focuses on using Convolutional Neural Networks (CNNs) for tasks such as image classification. It covers both pre-trained models and those that are built from scratch. The paper begins by demonstrating how to utilize the well-known AlexNet model, which is highly effective for image recognition due to transfer learning. It then explains how to load and prepare the MNIST dataset, a common choice for testing image classification methods. Additionally, it introduces a custom CNN designed specifically for recognizing MNIST digits, outlining its architecture, which includes convolutional layers, activation functions, and fully connected layers for capturing handwritten numbers' details. The paper also guides starting the model, running it on sample data, reviewing outputs, and assessing the accuracy of predictions. Furthermore, it delves into training the custom CNN and evaluating its performance by comparing it with established benchmarks, utilizing loss functions and optimization techniques to fine-tune the model and assess its classification accuracy. This work integrates theory with practical application, serving as a comprehensive guide for creating and evaluating CNNs in image classification, with implications for both research and real-world applications in computer vision.
引用
收藏
页码:1172 / 1185
页数:14
相关论文
共 50 条
  • [1] DEEP LEARNING ARCHITECTURES FOR MEDICAL IMAGE SEGMENTATION
    Subramaniam, Sudha
    Jayanthi, K. B.
    Rajasekaran, C.
    Kuchelar, Ramani
    [J]. 2020 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS(CBMS 2020), 2020, : 579 - 584
  • [2] Deep Neural Architectures for Medical Image Semantic Segmentation: Review
    Khan, Muhammad Zubair
    Gajendran, Mohan Kumar
    Lee, Yugyung
    Khan, Muazzam A.
    [J]. IEEE ACCESS, 2021, 9 : 83002 - 83024
  • [3] Deep Dual Learning for Semantic Image Segmentation
    Luo, Ping
    Wang, Guangrun
    Lin, Liang
    Wang, Xiaogang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2737 - 2745
  • [4] Image Classification and Semantic Segmentation with Deep Learning
    Quazi, Saiman
    Musa, Sarhan M.
    [J]. 6TH IEEE INTERNATIONAL CONFERENCE ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2021,
  • [5] Automatic Development of Deep Learning Architectures for Image Segmentation
    Nistor, Sergiu Cosmin
    Ileni, Tudor Alexandru
    Darabant, Adrian Sergiu
    [J]. SUSTAINABILITY, 2020, 12 (22) : 1 - 18
  • [6] Semantic Segmentation: A Zoology of Deep Architectures
    Artola, Aitor
    [J]. IMAGE PROCESSING ON LINE, 2023, 13 : 167 - 182
  • [7] Multimodal Deep Learning in Semantic Image Segmentation: A Review
    Raman, Vishal
    Kumari, Madhu
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTERNET OF THINGS (CCIOT 2018), 2018, : 7 - 11
  • [8] Medical image semantic segmentation based on deep learning
    Jiang, Feng
    Grigorev, Aleksei
    Rho, Seungmin
    Tian, Zhihong
    Fu, YunSheng
    Jifara, Worku
    Adil, Khan
    Liu, Shaohui
    [J]. NEURAL COMPUTING & APPLICATIONS, 2018, 29 (05): : 1257 - 1265
  • [9] Semantic image segmentation network based on deep learning
    Chen, Bo
    Zhang, Jiahao
    Zhou, Jianbang
    Chen, Zhong
    Yang, Tian
    Zhang, Yanna
    [J]. MIPPR 2019: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2020, 11429
  • [10] A survey on deep learning techniques for image and video semantic segmentation
    Garcia-Garcia, Alberto
    Orts-Escolano, Sergio
    Oprea, Sergiu
    Villena-Martinez, Victor
    Martinez-Gonzalez, Pablo
    Garcia-Rodriguez, Jose
    [J]. APPLIED SOFT COMPUTING, 2018, 70 : 41 - 65