Robust Human Face Emotion Classification Using Triplet-Loss-Based Deep CNN Features and SVM

被引：2

作者：

Haider, Irfan ^{[1
]}

Yang, Hyung-Jeong ^{[1
]}

Lee, Guee-Sang ^{[1
]}

Kim, Soo-Hyung ^{[1
]}

机构：

[1] Chonnam Natl Univ, Dept Artificial Intelligence Convergence, Gwangju 500757, South Korea

来源：

SENSORS | 2023年 / 23卷 / 10期

基金：

新加坡国家研究基金会;

关键词：

emotion classification; SVM; triplet loss; transfer learning; ResNet18; RECOGNITION;

D O I：

10.3390/s23104770

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Human facial emotion detection is one of the challenging tasks in computer vision. Owing to high inter-class variance, it is hard for machine learning models to predict facial emotions accurately. Moreover, a person with several facial emotions increases the diversity and complexity of classification problems. In this paper, we have proposed a novel and intelligent approach for the classification of human facial emotions. The proposed approach comprises customized ResNet18 by employing transfer learning with the integration of triplet loss function (TLF), followed by SVM classification model. Using deep features from a customized ResNet18 trained with triplet loss, the proposed pipeline consists of a face detector used to locate and refine the face bounding box and a classifier to identify the facial expression class of discovered faces. RetinaFace is used to extract the identified face areas from the source image, and a ResNet18 model is trained on cropped face images with triplet loss to retrieve those features. An SVM classifier is used to categorize the facial expression based on the acquired deep characteristics. In this paper, we have proposed a method that can achieve better performance than state-of-the-art (SoTA) methods on JAFFE and MMI datasets. The technique is based on the triplet loss function to generate deep input image features. The proposed method performed well on the JAFFE and MMI datasets with an accuracy of 98.44% and 99.02%, respectively, on seven emotions; meanwhile, the performance of the method needs to be fine-tuned for the FER2013 and AFFECTNET datasets.

引用

页数：19

共 50 条

[21] SPEECH EMOTION CLASSIFICATION USING SVM AND MLP ON PROSODIC AND VOICE QUALITY FEATURES
Idris, Inshirah
Salam, Md Sah Hj
Sunar, Mohd Shahrizal
[J]. JURNAL TEKNOLOGI, 2016, 78 (2-2): : 27 - 33
[22] Integrating Geometric and Textural Features for Facial Emotion Classification Using SVM Frameworks
Datta, Samyak
Sen, Debashis
Balasubramanian, R.
[J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE PROCESSING, CVIP 2016, VOL 1, 2017, 459 : 619 - 628
[23] Automated robust human emotion classification system using hybrid EEG features with ICBrainDB dataset
Deniz, Erkan
Sobahi, Nebras
Omar, Naaman
Sengur, Abdulkadir
Acharya, U. Rajendra
[J]. HEALTH INFORMATION SCIENCE AND SYSTEMS, 2022, 10 (01)
[24] Automated robust human emotion classification system using hybrid EEG features with ICBrainDB dataset
Erkan Deniz
Nebras Sobahi
Naaman Omar
Abdulkadir Sengur
U. Rajendra Acharya
[J]. Health Information Science and Systems, 10
[25] Grasp the Implicit Features: Hierarchical Emotion Classification based on Topic Model and SVM
Zhang, Fan
Xu, Hua
Wang, Jiushuo
Sun, Xiaomin
Deng, Junhui
[J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3592 - 3599
[26] Personalized face emotion classification using optimized data of three features
Karthigayan, M.
Nagarajan, R.
Rizon, M.
Yaacob, Sazah
[J]. 2007 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, VOL 1, PROCEEDINGS, 2007, : 57 - 60
[27] Robust Vehicle Classification Based on Deep Features Learning
Niroomand, Naghmeh
Bach, Christian
Elser, Miriam
[J]. IEEE ACCESS, 2021, 9 : 95675 - 95685
[28] Speech emotion recognition and classification using hybrid deep CNN and BiLSTM model
Swami Mishra
Nehal Bhatnagar
Prakasam P
Sureshkumar T. R
[J]. Multimedia Tools and Applications, 2024, 83 : 37603 - 37620
[29] Speech emotion recognition and classification using hybrid deep CNN and BiLSTM model
Mishra, Swami
Bhatnagar, Nehal
Prakasam, P.
Sureshkumar, T. R.
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (13) : 37603 - 37620
[30] Deep-Net: A Lightweight CNN-Based Speech Emotion Recognition System Using Deep Frequency Features
Anvarjon, Tursunov
Mustaqeem
Kwon, Soonil
[J]. SENSORS, 2020, 20 (18) : 1 - 16

← 1 2 3 4 5 →