Compressed Residual-VGG16 CNN Model for Big Data Places Image Recognition

Authors
Qassim, Hussam [1 ]
Verma, Abhishek [2 ]
Feinzimer, David [1 ]
Affiliations
[1] Calif State Univ Fullerton, Dept Comp Sci, Fullerton, CA 92831 USA
[2] New Jersey City Univ, Dept Comp Sci, Jersey City, NJ 07305 USA
Keywords
Convolutional Neural Networks; VGG16; Residual Learning; Squeeze Neural Networks; scene classification;
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep learning has ushered in a new era of machine learning, in computer vision and beyond. Convolutional neural networks have been applied to image classification, segmentation, and object detection. Despite recent advances, the field is still at a very early stage and has yet to settle on best practices for network architectures that are deep, small in size, and quick to train. In this paper, we address the issue of speed and size by proposing a compressed convolutional neural network model, namely Residual Squeeze VGG16. The proposed model compresses the earlier, very successful VGG16 network and improves on it in the following respects: (1) smaller model size; (2) faster speed; (3) residual learning for faster convergence and better generalization, which also addresses the degradation problem; (4) recognition accuracy that matches the non-compressed model on the very large-scale grand challenge MIT Places 365-Standard scene dataset. Compared with VGG16, the proposed model is 88.4% smaller in size and 23.86% faster in training time. This supports our claim that the proposed model inherits the best aspects of VGG16 and further improves upon it. Compared with SqueezeNet, our proposed framework can be more easily adapted and fully integrated with residual learning to compress various other contemporary deep learning convolutional neural network models. The broader impact of our work could be improved performance in specialized tasks such as video-based surveillance, self-driving cars, and mobile GPU applications.
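To give a rough sense of where the size reduction in a squeeze-style compression comes from, the sketch below counts the parameters of a plain 3x3 convolution and of a SqueezeNet-style Fire module (a 1x1 squeeze layer followed by parallel 1x1 and 3x3 expand layers). This is only an illustration: the abstract does not specify the layer configuration of Residual Squeeze VGG16, so the channel widths (256, 32, 128, 128) and the function names are assumptions chosen for the example, not values from the paper.

```python
def conv_params(c_in, c_out, k, bias=True):
    """Learnable parameters in a k x k convolution with c_in -> c_out channels."""
    return c_in * c_out * k * k + (c_out if bias else 0)


def fire_params(c_in, squeeze, expand_1x1, expand_3x3):
    """Parameters of a SqueezeNet-style Fire module.

    A 1x1 'squeeze' convolution reduces the channel count, then parallel
    1x1 and 3x3 'expand' convolutions restore it; their outputs are
    concatenated along the channel axis.
    """
    return (
        conv_params(c_in, squeeze, 1)
        + conv_params(squeeze, expand_1x1, 1)
        + conv_params(squeeze, expand_3x3, 3)
    )


# Hypothetical layer sizes (assumed for illustration, not from the paper).
C_IN, SQUEEZE, E1, E3 = 256, 32, 128, 128

plain = conv_params(C_IN, E1 + E3, 3)        # ordinary 3x3 conv, 256 -> 256
fire = fire_params(C_IN, SQUEEZE, E1, E3)    # Fire module, 256 -> 256
reduction = 1 - fire / plain

# An identity shortcut (residual connection) needs matching input and
# output channel counts and adds no parameters of its own.
shortcut_is_free = (E1 + E3) == C_IN

print(f"plain 3x3 conv: {plain} parameters")
print(f"fire module:    {fire} parameters")
print(f"reduction:      {reduction:.1%}")
print(f"identity shortcut possible: {shortcut_is_free}")
```

With these assumed widths the single-layer reduction is larger than the 88.4% reported for the whole model, which is unsurprising given that a full network also contains layers that are not replaced this way. Read against the abstract's figures, an 88.4% reduction means the compressed model occupies about 11.6% of the original VGG16 size.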
Pages: 169-175
Page count: 7