A Novel Approach using Vision Transformers (VIT) for Classification of Holes Drilled in Melamine Faced Chipboard

被引:0
|
作者
Bukowski, Michal [1 ]
Jegorowa, Albina [2 ]
Kurek, Jaroslaw [1 ]
机构
[1] Warsaw Univ Life Sci, Inst Informat Technol, Dept Artificial Intelligence, Warsaw, Poland
[2] Warsaw Univ Life Sci, Inst Wood Sci & Furniture, Dept Mech Proc Wood, Warsaw, Poland
来源
PRZEGLAD ELEKTROTECHNICZNY | 2024年 / 100卷 / 05期
关键词
Vision Transformer; Convolutional Neural Network; tool state monitoring; melamine faced chipboard;
D O I
10.15199/48.2024.05.52
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a comprehensive performance evaluation of various AI architectures for a classification of holes drilled in melamine faced chipboard, including custom Convolutional Neural Network (CNN-designed), five-fold CNN-designed, VGG19, single and five-fold VGG16, an ensemble of CNN-designed, VGG19, and 5xVGG16, and Vision Transformers (ViT). Each model's performance was measured and compared based on their classification accuracy, with the Vision Transformer models, particularly the B_32 model trained for 8000 epochs, demonstrating superior performance with an accuracy of 71.14%. Despite this achievement, the study underscores the need to balance model performance with other considerations such as computational resources, model complexity, and training times. The results highlight the importance of careful model selection and fine-tuning, guided not only by performance metrics but also by the specific requirements and constraints of the task and context. The study provides a strong foundation for further exploration into other transformer-based models and encourages deeper investigations into model fine-tuning to harness the full potential of these AI architectures for image classification tasks.
引用
收藏
页码:273 / 276
页数:4
相关论文
共 50 条
  • [41] VisionCervix: Papanicolaou cervical smears classification using novel CNN-Vision ensemble approach
    Maurya, Ritesh
    Pandey, Nageshwar Nath
    Dutta, Malay Kishore
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [42] A novel mobile robot localization approach based on classification with rejection option using computer vision
    Marinho, Leandro B.
    Reboucas Filho, Pedro P.
    Almeida, Jefferson S.
    Souza, Joao Wellington M.
    Souza Junior, Amauri H.
    de Albuquerque, Victor Hugo C.
    COMPUTERS & ELECTRICAL ENGINEERING, 2018, 68 : 26 - 43
  • [43] Vision Transformers (ViT) for Blanket-Penetrating Sleep Posture Recognition Using a Triple Ultra-Wideband (UWB) Radar System
    Lai, Derek Ka-Hei
    Yu, Zi-Han
    Leung, Tommy Yau-Nam
    Lim, Hyo-Jung
    Tam, Andy Yiu-Chau
    So, Bryan Pak-Hei
    Mao, Ye-Jiao
    Cheung, Daphne Sze Ki
    Wong, Duo Wai-Chi
    Cheung, James Chung-Wai
    SENSORS, 2023, 23 (05)
  • [44] Enhancing Fashion Classification with Vision Transformer (ViT) and Developing Recommendation Fashion Systems Using DINOVA2
    Abd Alaziz, Hadeer M.
    Elmannai, Hela
    Saleh, Hager
    Hadjouni, Myriam
    Anter, Ahmed M.
    Koura, Abdelrahim
    Kayed, Mohammed
    ELECTRONICS, 2023, 12 (20)
  • [45] GNViT- An enhanced image-based groundnut pest classification using Vision Transformer (ViT) model
    Venkatasaichandrakanth, P.
    Iyapparaja, M.
    PLOS ONE, 2024, 19 (03):
  • [46] Enhancing Retinal Disease Classification with Dual Scale Twin Vision Transformers using OCT Imaging
    Karn, Prakash Kumar
    Abdulla, Waleed H.
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 2362 - 2369
  • [47] Composite Deep Learning Architecture for Vehicle Classification Using Vision Transformers and Wheel Position Features
    Ma S.
    Yang J.J.
    Chorzepa M.G.
    Morris C.
    Kim S.S.
    Durham S.A.
    SN Computer Science, 5 (2)
  • [48] Rice leaf disease classification using a fusion vision approach
    Kumar, B. Naresh
    Sakthivel, S.
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [49] CWC-MP-MC Image-based breast tumor classification using an optimized Vision Transformer (ViT)
    Kabir, Shahriar Mahmud
    Bhuiyan, Mohammed Imamul Hassan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 100
  • [50] Improving satellite image classification accuracy using GAN-based data augmentation and vision transformers
    Alzahem, Ayyub
    Boulila, Wadii
    Koubaa, Anis
    Khan, Zahid
    Alturki, Ibrahim
    EARTH SCIENCE INFORMATICS, 2023, 16 (04) : 4169 - 4186