Emotion recognition of speech based on RNN

被引:0
|
作者
Park, CH
Lee, DW
Sim, KB
机构
关键词
pitch; RNN; ES; emotion; center-clipping;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Emotion Recognition has various methods. Mainly, it can be performed by visual or aural method. Any boy's expression face. informed others of his emotion. And, when people are talking over the telephone, they can know the opposite person's emotion by only sound data. From this point, we know that it is possible to recognize people's emotion by only sound data. In this paper, we use the pitch of speech as a main feature. And, as the most important thing, we define features of 4-emotions (normal, angry, laugh, surprise) in pitch analysis. And, based on this feature pattern, we implement a simulator by VC++. First of all, this simulator is composed of 'Generation of individuals', 'RNN', 'Evaluation'. And, using the result from learning part of this simulator, we can get results applied to other speech data (excepting for learning data). In detail, Each module uses the following method. First, 'generation of individuals'-part uses (1+100)-ES and (1+1)-ES (that is, random). Thus, we observe the comparison result of both methods. Of course, then, we select the best way. Second, 'RNN(Recurrent Neural Network)'-part is composed of 7-nodes. That is, 1-input node, 2-hidden layer nodes, 4-output nodes. Selection of this structure depends on the characteristics of sequentially inputted speech data. Third, 'evaluation'-part is very important This part is the cause of the extraction speed and satisfaction degree of result Then we implement a simulator from above modules. And, applied other speech data, we observe the result of recognition..
引用
收藏
页码:2210 / 2213
页数:4
相关论文
共 50 条
  • [21] Speech emotion recognition based on HMM and SVM
    Lin, YL
    Wei, G
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 4898 - 4901
  • [22] Speech Emotion Recognition Based on Arabic Features
    Meddeb, Mohamed
    Karray, Hichem
    Alimi, Adel M.
    2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 46 - 51
  • [23] Speech Emotion Recognition Based on Feature Fusion
    Shen, Qi
    Chen, Guanggen
    Chang, Lin
    PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON MATERIALS SCIENCE, MACHINERY AND ENERGY ENGINEERING (MSMEE 2017), 2017, 123 : 1071 - 1074
  • [24] Speech Emotion Recognition Based on Modified ReliefF
    Li, Guo-Min
    Liu, Na
    Zhang, Jun-Ao
    SENSORS, 2022, 22 (21)
  • [25] Speech Emotion Recognition Based on Henan Dialect
    Cheng, Zichen
    Li, Yan
    Jiu, Mengfei
    Ge, Jiangwei
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, VOL. 1, 2022, 878 : 498 - 505
  • [26] Empirical Interpretation of Speech Emotion Perception with Attention Based Model for Speech Emotion Recognition
    Jalal, Md Asif
    Milner, Rosanna
    Hain, Thomas
    INTERSPEECH 2020, 2020, : 4113 - 4117
  • [27] Informative Speech Features based on Emotion Classes and Gender in Explainable Speech Emotion Recognition
    Yildirim, Huseyin Ediz
    Iren, Deniz
    2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2023,
  • [28] Speech Emotion Recognition
    Lalitha, S.
    Madhavan, Abhishek
    Bhushan, Bharath
    Saketh, Srinivas
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2014,
  • [29] IMPROVED ROBUSTNESS TO DISFLUENCIES IN RNN-TRANSDUCER BASED SPEECH RECOGNITION
    Mendelev, Valentin
    Raissi, Tina
    Camporese, Guglielmo
    Giollo, Manuel
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6878 - 6882
  • [30] Incremental learning for RNN-Transducer based speech recognition models
    Baby, Deepak
    D'Alterio, Pasquale
    Mendelev, Valentin
    INTERSPEECH 2022, 2022, : 71 - 75