A Novel Approach using Vision Transformers (VIT) for Classification of Holes Drilled in Melamine Faced Chipboard

Cited by: 0

Authors:

Bukowski, Michal [1]

Jegorowa, Albina [2]

Kurek, Jaroslaw [1]

Affiliations:

[1] Warsaw Univ Life Sci, Inst Informat Technol, Dept Artificial Intelligence, Warsaw, Poland

[2] Warsaw Univ Life Sci, Inst Wood Sci & Furniture, Dept Mech Proc Wood, Warsaw, Poland

Source:

PRZEGLAD ELEKTROTECHNICZNY | 2024 / Vol. 100 / Issue 05

Keywords:

Vision Transformer; Convolutional Neural Network; tool state monitoring; melamine faced chipboard;

DOI:

10.15199/48.2024.05.52

Chinese Library Classification (CLC) codes:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Subject classification codes:

0808 ; 0809 ;

Abstract:

This paper presents a comprehensive performance evaluation of various AI architectures for the classification of holes drilled in melamine faced chipboard, including custom Convolutional Neural Network (CNN-designed), five-fold CNN-designed, VGG19, single and five-fold VGG16, an ensemble of CNN-designed, VGG19, and 5xVGG16, and Vision Transformers (ViT). Each model's performance was measured and compared based on its classification accuracy, with the Vision Transformer models, particularly the B_32 model trained for 8000 epochs, demonstrating superior performance with an accuracy of 71.14%. Despite this achievement, the study underscores the need to balance model performance with other considerations such as computational resources, model complexity, and training times. The results highlight the importance of careful model selection and fine-tuning, guided not only by performance metrics but also by the specific requirements and constraints of the task and context. The study provides a strong foundation for further exploration into other transformer-based models and encourages deeper investigations into model fine-tuning to harness the full potential of these AI architectures for image classification tasks.

Pages: 273-276

Page count: 4

