Comparative Analysis of Vision Transformer Models for Facial Emotion Recognition Using Augmented Balanced Datasets

Cited by: 4
Authors
Bobojanov, Sukhrob [1]
Kim, Byeong Man [1]
Arabboev, Mukhriddin [2]
Begmatov, Shohruh [2]
Affiliations
[1] Kumoh Natl Inst Technol, Comp Software Engn, Gumi 39177, South Korea
[2] Tashkent Univ Informat Technol, Tashkent 10084, Uzbekistan
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 22
Keywords
facial emotion recognition; vision transformer; data augmentation; balanced data; FER2013; RAF-DB
DOI
10.3390/app132212271
Chinese Library Classification (CLC) number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Facial emotion recognition (FER) is of great importance in human-machine interfaces. Because human facial expressions are intricate and images vary in facial pose and lighting conditions, FER remains a challenging task for computer-based models. Vision transformer (ViT) models have recently attained state-of-the-art results across various computer vision tasks, including image classification, object detection, and segmentation. Correcting data imbalance is also one of the most important aspects of building strong machine learning models: to avoid biased predictions and obtain reliable findings, the class distribution of the training dataset must be kept balanced. In this work, we use two widely used open-source datasets, RAF-DB and FER2013. Besides addressing the imbalance problem, we present a new, balanced dataset, created by applying data augmentation techniques and removing poor-quality images from the FER2013 dataset. We then conduct a comprehensive evaluation of thirteen different ViT models on these three datasets. Our investigation concludes that ViT models are a promising approach for FER tasks. Among these ViT models, Mobile ViT and Tokens-to-Token ViT appear to be the most effective, followed by PiT and Cross Former.
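The abstract does not state which augmentation operations were used, so the following is only a minimal sketch of the general idea of balancing a training set by augmenting minority classes. The function names `hflip` and `balance_by_augmentation`, the choice of horizontal flipping, and the toy data are all assumptions made for illustration and are not taken from the paper.

```python
import random
from collections import Counter

def hflip(image):
    """Mirror a 2-D image (a list of pixel rows) left to right."""
    return [row[::-1] for row in image]

def balance_by_augmentation(samples, augment=hflip, seed=0):
    """Oversample every minority class up to the size of the largest class.

    samples: non-empty list of (image, label) pairs.
    augment: hypothetical augmentation applied to each extra copy.
    """
    rng = random.Random(seed)
    by_label = {}
    for image, label in samples:
        by_label.setdefault(label, []).append(image)
    target = max(len(images) for images in by_label.values())
    balanced = list(samples)
    for label, images in by_label.items():
        # Add augmented copies until this class reaches the target size.
        for _ in range(target - len(images)):
            balanced.append((augment(rng.choice(images)), label))
    return balanced

# Toy data (assumed): 5 "happy" images versus 2 "fear" images, each a 2x2 grid.
toy = [([[0, 1], [2, 3]], "happy")] * 5 + [([[4, 5], [6, 7]], "fear")] * 2
balanced = balance_by_augmentation(toy)
print(Counter(label for _, label in balanced))
```

In a sketch like this, only the training split would be balanced, so that evaluation still reflects the original class distribution.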
Pages: 14