Robust Human Face Emotion Classification Using Triplet-Loss-Based Deep CNN Features and SVM

被引:2
|
作者
Haider, Irfan [1 ]
Yang, Hyung-Jeong [1 ]
Lee, Guee-Sang [1 ]
Kim, Soo-Hyung [1 ]
机构
[1] Chonnam Natl Univ, Dept Artificial Intelligence Convergence, Gwangju 500757, South Korea
基金
新加坡国家研究基金会;
关键词
emotion classification; SVM; triplet loss; transfer learning; ResNet18; RECOGNITION;
D O I
10.3390/s23104770
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Human facial emotion detection is one of the challenging tasks in computer vision. Owing to high inter-class variance, it is hard for machine learning models to predict facial emotions accurately. Moreover, a person with several facial emotions increases the diversity and complexity of classification problems. In this paper, we have proposed a novel and intelligent approach for the classification of human facial emotions. The proposed approach comprises customized ResNet18 by employing transfer learning with the integration of triplet loss function (TLF), followed by SVM classification model. Using deep features from a customized ResNet18 trained with triplet loss, the proposed pipeline consists of a face detector used to locate and refine the face bounding box and a classifier to identify the facial expression class of discovered faces. RetinaFace is used to extract the identified face areas from the source image, and a ResNet18 model is trained on cropped face images with triplet loss to retrieve those features. An SVM classifier is used to categorize the facial expression based on the acquired deep characteristics. In this paper, we have proposed a method that can achieve better performance than state-of-the-art (SoTA) methods on JAFFE and MMI datasets. The technique is based on the triplet loss function to generate deep input image features. The proposed method performed well on the JAFFE and MMI datasets with an accuracy of 98.44% and 99.02%, respectively, on seven emotions; meanwhile, the performance of the method needs to be fine-tuned for the FER2013 and AFFECTNET datasets.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] SPEECH EMOTION CLASSIFICATION USING SVM AND MLP ON PROSODIC AND VOICE QUALITY FEATURES
    Idris, Inshirah
    Salam, Md Sah Hj
    Sunar, Mohd Shahrizal
    [J]. JURNAL TEKNOLOGI, 2016, 78 (2-2): : 27 - 33
  • [22] Integrating Geometric and Textural Features for Facial Emotion Classification Using SVM Frameworks
    Datta, Samyak
    Sen, Debashis
    Balasubramanian, R.
    [J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE PROCESSING, CVIP 2016, VOL 1, 2017, 459 : 619 - 628
  • [23] Automated robust human emotion classification system using hybrid EEG features with ICBrainDB dataset
    Deniz, Erkan
    Sobahi, Nebras
    Omar, Naaman
    Sengur, Abdulkadir
    Acharya, U. Rajendra
    [J]. HEALTH INFORMATION SCIENCE AND SYSTEMS, 2022, 10 (01)
  • [24] Automated robust human emotion classification system using hybrid EEG features with ICBrainDB dataset
    Erkan Deniz
    Nebras Sobahi
    Naaman Omar
    Abdulkadir Sengur
    U. Rajendra Acharya
    [J]. Health Information Science and Systems, 10
  • [25] Grasp the Implicit Features: Hierarchical Emotion Classification based on Topic Model and SVM
    Zhang, Fan
    Xu, Hua
    Wang, Jiushuo
    Sun, Xiaomin
    Deng, Junhui
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3592 - 3599
  • [26] Personalized face emotion classification using optimized data of three features
    Karthigayan, M.
    Nagarajan, R.
    Rizon, M.
    Yaacob, Sazah
    [J]. 2007 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, VOL 1, PROCEEDINGS, 2007, : 57 - 60
  • [27] Robust Vehicle Classification Based on Deep Features Learning
    Niroomand, Naghmeh
    Bach, Christian
    Elser, Miriam
    [J]. IEEE ACCESS, 2021, 9 : 95675 - 95685
  • [28] Speech emotion recognition and classification using hybrid deep CNN and BiLSTM model
    Swami Mishra
    Nehal Bhatnagar
    Prakasam P
    Sureshkumar T. R
    [J]. Multimedia Tools and Applications, 2024, 83 : 37603 - 37620
  • [29] Speech emotion recognition and classification using hybrid deep CNN and BiLSTM model
    Mishra, Swami
    Bhatnagar, Nehal
    Prakasam, P.
    Sureshkumar, T. R.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (13) : 37603 - 37620
  • [30] Deep-Net: A Lightweight CNN-Based Speech Emotion Recognition System Using Deep Frequency Features
    Anvarjon, Tursunov
    Mustaqeem
    Kwon, Soonil
    [J]. SENSORS, 2020, 20 (18) : 1 - 16