Emotion recognition of speech based on RNN

被引:0
|
作者
Park, CH
Lee, DW
Sim, KB
机构
关键词
pitch; RNN; ES; emotion; center-clipping;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Emotion Recognition has various methods. Mainly, it can be performed by visual or aural method. Any boy's expression face. informed others of his emotion. And, when people are talking over the telephone, they can know the opposite person's emotion by only sound data. From this point, we know that it is possible to recognize people's emotion by only sound data. In this paper, we use the pitch of speech as a main feature. And, as the most important thing, we define features of 4-emotions (normal, angry, laugh, surprise) in pitch analysis. And, based on this feature pattern, we implement a simulator by VC++. First of all, this simulator is composed of 'Generation of individuals', 'RNN', 'Evaluation'. And, using the result from learning part of this simulator, we can get results applied to other speech data (excepting for learning data). In detail, Each module uses the following method. First, 'generation of individuals'-part uses (1+100)-ES and (1+1)-ES (that is, random). Thus, we observe the comparison result of both methods. Of course, then, we select the best way. Second, 'RNN(Recurrent Neural Network)'-part is composed of 7-nodes. That is, 1-input node, 2-hidden layer nodes, 4-output nodes. Selection of this structure depends on the characteristics of sequentially inputted speech data. Third, 'evaluation'-part is very important This part is the cause of the extraction speed and satisfaction degree of result Then we implement a simulator from above modules. And, applied other speech data, we observe the result of recognition..
引用
收藏
页码:2210 / 2213
页数:4
相关论文
共 50 条
  • [41] Improving speech emotion recognition based on acoustic words emotion dictionary
    Wei, Wang
    Cao, Xinyi
    Li, He
    Shen, Lingjie
    Feng, Yaqin
    Watters, Paul A.
    NATURAL LANGUAGE ENGINEERING, 2021, 27 (06) : 747 - 761
  • [42] An algorithm study for speech emotion recognition based speech feature analysis
    Zhengbiao, Ji
    Feng, Zhou
    Ming, Zhu
    International Journal of Multimedia and Ubiquitous Engineering, 2015, 10 (11): : 33 - 42
  • [43] CTA-RNN: Channel and Temporal-wise Attention RNN Leveraging Pre-trained ASR Embeddings for Speech Emotion Recognition
    Chen, Chengxin
    Zhang, Pengyuan
    INTERSPEECH 2022, 2022, : 4730 - 4734
  • [44] Speech Emotion Recognition based on Multiple Feature Fusion
    Jiang, Changjiang
    Mao, Rong
    Liu, Geng
    Wang, Mingyi
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 907 - 912
  • [45] Towards Robust Speech-Based Emotion Recognition
    Tabatabaei, Talieh S.
    Krishnan, Sridhar
    2010 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [46] Continuous Wavelet Transform based Speech Emotion Recognition
    Shegokar, Pankaj
    Sircar, Pradip
    2016 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2016,
  • [47] ANN based Decision Fusion for Speech Emotion Recognition
    Xu, Lu
    Xu, Mingxing
    Yang, Dali
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2003 - +
  • [48] Research of emotion recognition based on speech and facial expression
    Wang, Yutai
    Yang, Xinghai
    Zou, Jing
    Telkomnika - Indonesian Journal of Electrical Engineering, 2013, 11 (01): : 83 - 90
  • [49] Multimodal emotion recognition based on speech and ECG signals
    Huang C.
    Jin Y.
    Wang Q.
    Zhao L.
    Zou C.
    Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2010, 40 (05): : 895 - 900
  • [50] Speech Emotion Recognition Based on EMD in Noisy Environments
    Chu, Yunyun
    Xiong, Weihua
    Chen, Wei
    ADVANCES IN CIVIL ENGINEERING AND BUILDING MATERIALS III, 2014, 831 : 460 - 464