Recognition of emotions from video using acoustic and facial features

Cited by: 8
Authors
Rao, K. Sreenivasa [1]
Koolagudi, Shashidhar G. [2]
Affiliations
[1] Indian Inst Technol, Sch Informat Technol, Kharagpur 721302, W Bengal, India
[2] Natl Inst Technol Karnataka, Dept Comp Sci & Engn, Surathkal 575025, Karnataka, India
Keywords
Emotion recognition; Autoassociative neural network (AANN); Spectral and prosodic features; Facial features; Acoustic features; VOICE CONVERSION; SPEECH; EXPRESSION; FACE
DOI
10.1007/s11760-013-0522-6
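Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract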
This paper explores acoustic and facial features extracted from video for recognizing emotions. The temporal variation of the gray values of pixels within the eye and mouth regions is used as a feature to capture emotion-specific information from facial expressions. Acoustic features representing spectral and prosodic information are explored for recognizing emotions from the speech signal. Autoassociative neural network (AANN) models are used to capture the emotion-specific information from the acoustic and facial features. The basic objective of this work is to examine how well the proposed acoustic and facial features capture emotion-specific information. Further, the correlations among the feature sets are analyzed by combining the evidence at different levels. The emotion recognition systems developed using acoustic and facial features achieve recognition performance of 85.71 % and 88.14 %, respectively. Combining the evidence from the models developed using acoustic and facial features improves the recognition performance to 93.62 %. The performance of the emotion recognition systems developed using neural network models is compared with that of hidden Markov models, Gaussian mixture models and support vector machine models. The proposed features and models are evaluated on a real-life emotional database, the Interactive Emotional Dyadic Motion Capture database, which was recently collected at the University of Southern California.
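The combination of evidence from the acoustic and facial models mentioned in the abstract can be illustrated with a minimal score-level fusion sketch. This is not the authors' implementation: the abstract does not specify the fusion rule, so a weighted sum of min-max normalized scores is assumed here. The label set `EMOTIONS`, the helper names `normalize` and `fuse`, the weight `w_acoustic`, and the score values are all hypothetical.

```python
import numpy as np

# Hypothetical label set, for illustration only.
EMOTIONS = ["anger", "happiness", "neutral", "sadness"]


def normalize(scores):
    """Min-max normalize a vector of per-emotion scores to [0, 1]."""
    scores = np.asarray(scores, dtype=float)
    span = scores.max() - scores.min()
    if span == 0:
        # All scores equal: no model preference, return a flat vector.
        return np.full_like(scores, 0.5)
    return (scores - scores.min()) / span


def fuse(acoustic_scores, facial_scores, w_acoustic=0.5):
    """Weighted sum of normalized scores (assumed fusion rule).

    Returns the index of the winning emotion and the fused score vector.
    """
    fused = (w_acoustic * normalize(acoustic_scores)
             + (1.0 - w_acoustic) * normalize(facial_scores))
    return int(np.argmax(fused)), fused


# Made-up confidence scores, one per emotion, from each modality.
acoustic = [0.2, 0.7, 0.6, 0.1]
facial = [0.1, 0.5, 0.9, 0.2]

idx, fused = fuse(acoustic, facial, w_acoustic=0.5)
print("acoustic only:", EMOTIONS[int(np.argmax(acoustic))])
print("facial only:  ", EMOTIONS[int(np.argmax(facial))])
print("fused:        ", EMOTIONS[idx])
```

In this toy input the two modalities disagree, and the fused decision follows whichever modality is relatively more confident after normalization. In practice the weight would be tuned on held-out data.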
Pages: 1029-1045
Number of pages: 17