Comparative Analysis of Vision Transformer Models for Facial Emotion Recognition Using Augmented Balanced Datasets

Cited by: 6
Authors
Bobojanov, Sukhrob [1]
Kim, Byeong Man [1]
Arabboev, Mukhriddin [2]
Begmatov, Shohruh [2]
Affiliations
[1] Kumoh Natl Inst Technol, Comp Software Engn, Gumi 39177, South Korea
[2] Tashkent Univ Informat Technol, Tashkent 10084, Uzbekistan
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 22
Keywords
facial emotion recognition; vision transformer; data augmentation; balanced data; FER2013; RAF-DB
DOI
10.3390/app132212271
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Facial emotion recognition (FER) is of great importance in human-machine interfaces. Given the intricacies of human facial expressions and the inherent variations in images, which are characterized by diverse facial poses and lighting conditions, FER remains a challenging task for computer-based models. Recently, vision transformer (ViT) models have attained state-of-the-art results across various computer vision tasks, including image classification, object detection, and segmentation. Moreover, one of the most important aspects of building strong machine learning models is correcting data imbalance. To avoid biased predictions and ensure reliable findings, it is essential to keep the class distribution of the training dataset balanced. In this work, we use two widely used open-source datasets, RAF-DB and FER2013. To resolve the imbalance problem, we also present a new, balanced dataset, built by applying data augmentation techniques and removing poor-quality images from the FER2013 dataset. We then conduct a comprehensive evaluation of thirteen different ViT models on these three datasets. Our investigation concludes that ViT models are a promising approach for FER tasks. Among these ViT models, Mobile ViT and Tokens-to-Token ViT appear to be the most effective, followed by PiT and Cross Former.
Pages: 14