Recognition of emotions from video using acoustic and facial features

被引:8
|
作者
Rao, K. Sreenivasa [1 ]
Koolagudi, Shashidhar G. [2 ]
机构
[1] Indian Inst Technol, Sch Informat Technol, Kharagpur 721302, W Bengal, India
[2] Natl Inst Technol Karnataka, Dept Comp Sci & Engn, Surathkal 575025, Karnataka, India
关键词
Emotion recognition; Autoassociative neural network (AANN); Spectral and prosodic features; Facial features; Acoustic features; VOICE CONVERSION; SPEECH; EXPRESSION; FACE;
D O I
10.1007/s11760-013-0522-6
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, acoustic and facial features extracted from video are explored for recognizing emotions. The temporal variation of gray values of the pixels within eye and mouth regions is used as a feature to capture the emotion-specific knowledge from the facial expressions. Acoustic features representing spectral and prosodic information are explored for recognizing emotions from the speech signal. Autoassociative neural network models are used to capture the emotion-specific information from acoustic and facial features. The basic objective of this work is to examine the capability of the proposed acoustic and facial features in view of capturing the emotion-specific information. Further, the correlations among the feature sets are analyzed by combining the evidences at different levels. The performance of the emotion recognition system developed using acoustic and facial features is observed to be 85.71 and 88.14 %, respectively. It has been observed that combining the evidences of models developed using acoustic and facial features improved the recognition performance to 93.62 %. The performance of the emotion recognition systems developed using neural network models is compared with hidden Markov models, Gaussian mixture models and support vector machine models. The proposed features and models are evaluated on real-life emotional database, Interactive Emotional Dyadic Motion Capture database, which was recently collected at University of Southern California.
引用
收藏
页码:1029 / 1045
页数:17
相关论文
共 50 条
  • [41] A Comparative Study of Articulatory Features From Facial Video and Acoustic-To-Articulatory Inversion for Phonetic Discrimination
    Narwekar, Abhishek
    Ghosh, Prasanta Kumar
    2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [42] Emotion recognition from telephone speech using acoustic and nonlinear features
    Bedoya-Jaramillo, S.
    Orozco-Arroyave, J. R.
    Arias-Londono, J. D.
    Vargas-Bonilla, J. F.
    2013 47TH INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2013,
  • [43] Automatic Facial Expression Recognition Using Features of Salient Facial Patches
    Happy, S. L.
    Routray, Aurobinda
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2015, 6 (01) : 1 - 12
  • [44] Recognition of Primary Emotions Using the Paradigm of Intelligent Agents for the Recognition of Subtle Facial Expressions
    Aguirre, Enrique
    Alanis Garza, Arnulfo
    del Rosario Baltazar, Mara
    Lemus Zuniga, Lenin G.
    Magdaleno Palencia, Sergio
    Lino Ramirez, Carlos
    AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, 2015, 38 : 345 - 351
  • [45] Video-based facial expression recognition using learned spatiotemporal pyramid sparse coding features
    Long, Fei
    Bartlett, Marian S.
    NEUROCOMPUTING, 2016, 173 : 2049 - 2054
  • [46] Emotion Recognition using Acoustic and Lexical Features
    Rozgic, Viktor
    Ananthakrishnan, Sankaranarayanan
    Saleem, Shirin
    Kumar, Rohit
    Vembu, Aravind Namandi
    Prasad, Rohit
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 366 - 369
  • [47] FERCE: Facial Expression Recognition for Combined Emotions Using FERCE Algorithm
    Swaminathan, A.
    Vadivel, A.
    Arock, Michael
    IETE JOURNAL OF RESEARCH, 2022, 68 (05) : 3235 - 3250
  • [48] Robust Representation and Recognition of Facial Emotions Using Extreme Sparse Learning
    Shojaeilangari, Seyedehsamaneh
    Yau, Wei-Yun
    Nandakumar, Karthik
    Li, Jun
    Teoh, Eam Khwang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (07) : 2140 - 2152
  • [49] Facial expression recognition using HLAC features and WPCA
    Liu, F
    Wang, ZL
    Wang, L
    Meng, XY
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 88 - 94
  • [50] Facial expression recognition by using differential geometric features
    Zangeneh, Erfan
    Moradi, Aref
    IMAGING SCIENCE JOURNAL, 2018, 66 (08): : 463 - 470