A study on emotion recognition using speech acoustic features and face images

被引:0
|
作者
Son M.-J. [1 ]
Lee S.-P. [2 ]
机构
[1] Dept. of Computer Science, Sangmyung University
[2] Dept. of Electronic Engineering, Sangmyung University
来源
Trans. Korean Inst. Electr. Eng. | 2020年 / 7卷 / 1081-1086期
基金
新加坡国家研究基金会;
关键词
Acoustic feature; Deep learning; Emotion recognition; Facial image;
D O I
10.5370/KIEE.2020.69.7.1081
中图分类号
学科分类号
摘要
Generally, people recognize other people's emotions by their voices and facial expressions. So, speech signals and facial images have been actively studied in the field of emotional recognition. Therefore, in this paper, we present effective acoustic features for emotion recognition and a method to recognize emotions by combining speech signals and facial image sequences. To combine these the two inputs like speech signals and facial image sequences, three models are designed. And these three models are combined by using the Joint Fine Tuning method. The result shows that the performance of our model is very promising for emotion recognitions in comparison with other models using speech signals and facial image sequences. © The Korean Institute of Electrical Engineers
引用
收藏
页码:1081 / 1086
页数:5
相关论文
共 50 条
  • [31] Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference
    Sudarsana Reddy Kadiri
    P. Gangamohan
    Suryakanth V. Gangashetty
    Paavo Alku
    B. Yegnanarayana
    Circuits, Systems, and Signal Processing, 2020, 39 : 4459 - 4481
  • [32] Emotion recognition using acoustic features and textual content
    Chuang, ZJ
    Wu, CH
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 53 - 56
  • [33] Recognizing emotion from Turkish speech using acoustic features
    Oflazoglu, Caglar
    Yildirim, Serdar
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,
  • [34] Recognizing emotion from Turkish speech using acoustic features
    Caglar Oflazoglu
    Serdar Yildirim
    EURASIP Journal on Audio, Speech, and Music Processing, 2013
  • [35] Variational mode decomposition based acoustic and entropy features for speech emotion recognition
    Mishra, Siba Prasad
    Warule, Pankaj
    Deb, Suman
    APPLIED ACOUSTICS, 2023, 212
  • [36] Speech emotion recognition based on the reconstruction of acoustic and text features in latent space
    Santoso, Jennifer
    Sekiguchi, Rintaro
    Yamada, Takeshi
    Ishizuka, Kenkichi
    Hashimoto, Taiichi
    Makino, Shoji
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1678 - 1683
  • [37] The Impact of Face Mask and Emotion on Automatic Speech Recognition (ASR) and Speech Emotion Recognition (SER)
    Oh, Qi Qi
    Seow, Chee Kiat
    Yusuff, Mulliana
    Pranata, Sugiri
    Cao, Qi
    2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 523 - 531
  • [38] Children’s Emotion Recognition from Spontaneous Speech Using a Reduced Set of Acoustic and Linguistic Features
    Santiago Planet
    Ignasi Iriondo
    Cognitive Computation, 2013, 5 : 526 - 532
  • [39] Children's Emotion Recognition from Spontaneous Speech Using a Reduced Set of Acoustic and Linguistic Features
    Planet, Santiago
    Iriondo, Ignasi
    COGNITIVE COMPUTATION, 2013, 5 (04) : 526 - 532
  • [40] Speech Emotion Recognition Using Neural Network and Wavelet Features
    Roy, Tanmoy
    Marwala, Tshilidzi
    Chakraverty, S.
    RECENT TRENDS IN WAVE MECHANICS AND VIBRATIONS, WMVC 2018, 2020, : 427 - 438