Compressed Residual-VGG16 CNN Model for Big Data Places Image Recognition

被引：0

作者：

Qassim, Hussam ^{[1
]}

Verma, Abhishek ^{[2
]}

Feinzimer, David ^{[1
]}

机构：

[1] Calif State Univ Fullerton, Dept Comp Sci, Fullerton, CA 92831 USA

[2] New Jersey City Univ, Dept Comp Sci, Jersey City, NJ 07305 USA

来源：

2018 IEEE 8TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC) | 2018年

关键词：

Convolutional Neural Networks; VGG16; Residual Learning; Squeeze Neural Networks; scene classification;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning has given way to a new era of machine learning, apart from computer vision. Convolutional neural networks have been implemented in image classification, segmentation and object detection. Despite recent advancements, we are still in the very early stages and have yet to settle on best practices for network architecture in terms of deep design, small in size and a short training time. In this paper, we address the issue of speed and size by proposing a compressed convolutional neural network model namely Residual Squeeze VGG16. Proposed model compresses the earlier very successful VGG16 network and further improves on following aspects: (1) small model size, (2) faster speed, (3) uses residual learning for faster convergence, better generalization, and solves the issue of degradation, (4) matches the recognition accuracy of the non-compressed model on the very large-scale grand challenge MIT Places 365-Standard scene dataset. In comparison to VGG16 the proposed model is 88.4% smaller in size and 23.86% faster in the training time. This supports our claim that the proposed model inherits the best aspects of VGG16 and further improves upon it. In comparison to SqueezeNet our proposed framework can be more easily adapted and fully integrated with the residual learning for compressing various other contemporary deep learning convolutional neural network models Broader impact of our work could improve the performance in specialized tasks such as video-based surveillance, self-driving cars, and mobile GPU applications.

引用

页码：169 / 175

页数：7

共 50 条

[1] Image Forgery Detection by CNN and Pretrained VGG16 Model
Gupta, Pranjal Raaj
Sharma, Disha
Goel, Nidhi
PROCEEDINGS OF ACADEMIA-INDUSTRY CONSORTIUM FOR DATA SCIENCE (AICDS 2020), 2022, 1411 : 141 - 152
[2] Residual Squeeze CNDS Deep Learning CNN Model for Very Large Scale Places Image Recognition
Verma, Abhishek
Qassim, Hussam
Feinzimer, David
2017 IEEE 8TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (UEMCON), 2017, : 463 - +
[3] Brain Neoplasm Identification using CNN with VGG-16 Model
Rani, Ms Soja S.
Nirmala, Dr M.
Reddy, Sai Kumar J.
Raksha, Shree C.
Hruday, S.
2024 4TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2024, 2024, : 48 - 54
[4] Optical handwritten character recognition for Tamil language using CNN-VGG-16 model with RF classifier
Pughazendi, N.
Harikrishnan, M.
Khilar, Rashmita
Sharmila, L.
OPTICAL AND QUANTUM ELECTRONICS, 2023, 55 (11)
[5] An Improved VGG16 Model for Pneumonia Image Classification
Jiang, Zhi-Peng
Liu, Yi-Yang
Shao, Zhen-En
Huang, Ko-Wei
APPLIED SCIENCES-BASEL, 2021, 11 (23):
[6] Retraction Note: Optical handwritten character recognition for Tamil language using CNN-VGG-16 model with RF classifier
N. Pughazendi
M. HariKrishnan
Rashmita Khilar
L. Sharmila
Optical and Quantum Electronics, 56 (11)
[7] Traffic Sign Recognition Based on Improved VGG-16 Model
Tang Shuyuan
Li Jintao
Liu Chang
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT II, 2023, 14087 : 676 - 687
[8] A Lightweight Model of VGG-16 for Remote Sensing Image Classification
Ye, Mu
Ruiwen, Ni
Chang, Zhang
He, Gong
Tianli, Hu
Shijun, Li
Yu, Sun
Tong, Zhang
Ying, Guo
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 (14) : 6916 - 6922
[9] EEG-dependent automatic speech recognition using deep residual encoder based VGG net CNN
Chinta, Babu
Moorthi, M.
COMPUTER SPEECH AND LANGUAGE, 2023, 79
[10] VGG16-random fourier hybrid model for masked face recognition
O. K. Sikha
Bandla Bharath
Soft Computing, 2022, 26 : 12795 - 12810

← 1 2 3 4 5 →