Recognition of emotions from video using acoustic and facial features

被引:8
|
作者
Rao, K. Sreenivasa [1 ]
Koolagudi, Shashidhar G. [2 ]
机构
[1] Indian Inst Technol, Sch Informat Technol, Kharagpur 721302, W Bengal, India
[2] Natl Inst Technol Karnataka, Dept Comp Sci & Engn, Surathkal 575025, Karnataka, India
关键词
Emotion recognition; Autoassociative neural network (AANN); Spectral and prosodic features; Facial features; Acoustic features; VOICE CONVERSION; SPEECH; EXPRESSION; FACE;
D O I
10.1007/s11760-013-0522-6
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, acoustic and facial features extracted from video are explored for recognizing emotions. The temporal variation of gray values of the pixels within eye and mouth regions is used as a feature to capture the emotion-specific knowledge from the facial expressions. Acoustic features representing spectral and prosodic information are explored for recognizing emotions from the speech signal. Autoassociative neural network models are used to capture the emotion-specific information from acoustic and facial features. The basic objective of this work is to examine the capability of the proposed acoustic and facial features in view of capturing the emotion-specific information. Further, the correlations among the feature sets are analyzed by combining the evidences at different levels. The performance of the emotion recognition system developed using acoustic and facial features is observed to be 85.71 and 88.14 %, respectively. It has been observed that combining the evidences of models developed using acoustic and facial features improved the recognition performance to 93.62 %. The performance of the emotion recognition systems developed using neural network models is compared with hidden Markov models, Gaussian mixture models and support vector machine models. The proposed features and models are evaluated on real-life emotional database, Interactive Emotional Dyadic Motion Capture database, which was recently collected at University of Southern California.
引用
收藏
页码:1029 / 1045
页数:17
相关论文
共 50 条
  • [1] Recognition of emotions from video using acoustic and facial features
    K. Sreenivasa Rao
    Shashidhar G. Koolagudi
    Signal, Image and Video Processing, 2015, 9 : 1029 - 1045
  • [2] Positive and Negative Emotions Recognition from Speech Signal Using Acoustic and Lexical Features
    Kurniawati, Pipin
    Lestari, Dessi Puji
    2017 4TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS, CONCEPTS, THEORY, AND APPLICATIONS (ICAICTA) PROCEEDINGS, 2017,
  • [3] Acoustic and facial features for speaker recognition
    Roach, MJ
    Brand, JD
    Mason, JSD
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 258 - 261
  • [4] RECOGNITION OF EMOTIONS FROM TIME AND TIME-FREQUENCY FEATURES USING FACIAL ELECTROMYOGRAPHY SIGNALS
    Shiva J.
    Makaram N.
    Karthick P.A.
    Swaminathan R.
    Biomedical Sciences Instrumentation, 2021, 57 (03) : 386 - 391
  • [5] Automatic Recognition of Spontaneous Emotions in Speech Using Acoustic and Lexical Features
    Truong, Khict P.
    Raaijmakers, Stephan
    MACHINE LEARNING FOR MULTIMODAL INTERACTION, PROCEEDINGS, 2008, 5237 : 161 - +
  • [6] Study of Video based Facial Expression and Emotions Recognition Methods
    Salih, Husam
    Kulkarni, Lalit
    2017 INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC), 2017, : 692 - 696
  • [7] Recognition of emotions from video using neural network models
    Rao, K. Sreenivasa
    Saroj, V. K.
    Maity, Sudhamay
    Koolagudi, Shashidhar G.
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (10) : 13181 - 13185
  • [8] Recognition of Emotions from Speech using Excitation Source Features
    Koolagudi, Shashidhar G.
    Devliyal, Swati
    Chawla, Bhavna
    Barthwal, Anurag
    Rao, K. Sreenivasa
    INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 3409 - 3417
  • [9] Automatic Recognition of Emotions from Facial Expressions
    Xue, Henry
    Gertner, Izidor
    AUTOMATIC TARGET RECOGNITION XXIV, 2014, 9090
  • [10] Emotions Detection Using Facial Expressions Recognition and EEG
    Matlovic, Tomas
    Gaspar, Peter
    Moro, Robert
    Simko, Jakub
    Bielikova, Maria
    2016 11TH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION AND PERSONALIZATION (SMAP), 2016, : 18 - 23