Emotion recognition of speech based on RNN

被引:0
|
作者
Park, CH
Lee, DW
Sim, KB
机构
关键词
pitch; RNN; ES; emotion; center-clipping;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Emotion Recognition has various methods. Mainly, it can be performed by visual or aural method. Any boy's expression face. informed others of his emotion. And, when people are talking over the telephone, they can know the opposite person's emotion by only sound data. From this point, we know that it is possible to recognize people's emotion by only sound data. In this paper, we use the pitch of speech as a main feature. And, as the most important thing, we define features of 4-emotions (normal, angry, laugh, surprise) in pitch analysis. And, based on this feature pattern, we implement a simulator by VC++. First of all, this simulator is composed of 'Generation of individuals', 'RNN', 'Evaluation'. And, using the result from learning part of this simulator, we can get results applied to other speech data (excepting for learning data). In detail, Each module uses the following method. First, 'generation of individuals'-part uses (1+100)-ES and (1+1)-ES (that is, random). Thus, we observe the comparison result of both methods. Of course, then, we select the best way. Second, 'RNN(Recurrent Neural Network)'-part is composed of 7-nodes. That is, 1-input node, 2-hidden layer nodes, 4-output nodes. Selection of this structure depends on the characteristics of sequentially inputted speech data. Third, 'evaluation'-part is very important This part is the cause of the extraction speed and satisfaction degree of result Then we implement a simulator from above modules. And, applied other speech data, we observe the result of recognition..
引用
收藏
页码:2210 / 2213
页数:4
相关论文
共 50 条
  • [1] RNN with Improved Temporal Modeling for Speech Emotion Recognition
    Lieskovska, Eva
    Jakubec, Maros
    Jarina, Roman
    [J]. 2022 32ND INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA), 2022, : 5 - 9
  • [2] Extending RNN-T-based speech recognition systems with emotion and language classification
    Kons, Zvi
    Aronowitz, Hagai
    Morais, Edmilson
    Damasceno, Matheus
    Kuo, Hong-Kwang
    Thomas, Samuel
    Saon, George
    [J]. INTERSPEECH 2022, 2022, : 546 - 549
  • [3] Performance Analysis of a Chunk-Based Speech Emotion Recognition Model Using RNN
    Shin, Hyun-Sam
    Hong, Jun-Ki
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (01): : 235 - 248
  • [4] SPEECH EMOTION RECOGNITION WITH I-VECTOR FEATURE AND RNN MODEL
    Zhang, Teng
    Wu, Ji
    [J]. 2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 524 - 528
  • [5] Speech emotion recognition based on emotion perception
    Gang Liu
    Shifang Cai
    Ce Wang
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [6] Speech emotion recognition based on emotion perception
    Liu, Gang
    Cai, Shifang
    Wang, Ce
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [7] English speech emotion recognition method based on speech recognition
    Liu, Man
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (2) : 391 - 398
  • [8] English speech emotion recognition method based on speech recognition
    Man Liu
    [J]. International Journal of Speech Technology, 2022, 25 : 391 - 398
  • [9] Speaker Recognition and Speech Emotion Recognition Based on GMM
    Xu, Shupeng
    Liu, Yan
    Liu, Xiping
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ELECTRIC AND ELECTRONICS, 2013, : 434 - 436
  • [10] An overview of RNN-based Mandarin speech recognition approaches
    Liao, YF
    Hong, WT
    Wang, WJ
    Wang, YR
    Chen, SH
    [J]. JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 1999, 22 (05) : 535 - 547