Application of Emotion Recognition and Modification for Emotional Telugu Speech Recognition

被引:0
|
作者
Vishnu Vidyadhara Raju Vegesna
Krishna Gurugubelli
Anil Kumar Vuppala
机构
[1] KCIS,Speech Processing Lab
[2] International Institute of Information Technology,undefined
[3] Hyderabad (IIIT-H),undefined
来源
关键词
ASR; Emotion recognition; Emotive speech;
D O I
暂无
中图分类号
学科分类号
摘要
Majority of the automatic speech recognition systems (ASR) are trained with neutral speech and the performance of these systems are affected due to the presence of emotional content in the speech. The recognition of these emotions in human speech is considered to be the crucial aspect of human-machine interaction. The combined spectral and differenced prosody features are considered for the task of the emotion recognition in the first stage. The task of emotion recognition does not serve the sole purpose of improvement in the performance of an ASR system. Based on the recognized emotions from the input speech, the corresponding adapted emotive ASR model is selected for the evaluation in the second stage. This adapted emotive ASR model is built using the existing neutral and synthetically generated emotive speech using prosody modification method. In this work, the importance of emotion recognition block at the front-end along with the emotive speech adaptation to the ASR system models were studied. The speech samples from IIIT-H Telugu speech corpus were considered for building the large vocabulary ASR systems. The emotional speech samples from IITKGP-SESC Telugu corpus were used for the evaluation. The adapted emotive speech models have yielded better performance over the existing neutral speech models.
引用
收藏
页码:193 / 201
页数:8
相关论文
共 50 条
  • [41] Emotion recognition in Arabic speech
    Klaylat, Samira
    Osman, Ziad
    Hamandi, Lama
    Zantout, Rached
    ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2018, 96 (02) : 337 - 351
  • [42] The Impact of Face Mask and Emotion on Automatic Speech Recognition (ASR) and Speech Emotion Recognition (SER)
    Oh, Qi Qi
    Seow, Chee Kiat
    Yusuff, Mulliana
    Pranata, Sugiri
    Cao, Qi
    2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 523 - 531
  • [43] Assessment of spontaneous emotional speech database toward emotion recognition: Intensity and similarity of perceived emotion from spontaneously expressed emotional speech
    Arimoto, Yoshiko
    Ohno, Sumio
    Iida, Hitoshi
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2011, 32 (01) : 26 - 29
  • [44] Speaker Recognition and Speech Emotion Recognition Based on GMM
    Xu, Shupeng
    Liu, Yan
    Liu, Xiping
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ELECTRIC AND ELECTRONICS, 2013, : 434 - 436
  • [45] Application of Improved Spectral Subtraction Algorithm for Speech Emotion Recognition
    Zhang Wanli
    Li Guoxin
    Wang Lirong
    PROCEEDINGS 2015 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING BDCLOUD 2015, 2015, : 213 - 216
  • [46] Cooperative Learning and its Application to Emotion Recognition from Speech
    Zhang, Zixing
    Coutinho, Eduardo
    Deng, Jun
    Schuller, Bjoern
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (01) : 115 - 126
  • [47] Application of Vector Quantization in Emotion Recognition from Human Speech
    Khanna, Preeti
    Kumar, M. Sasi
    INFORMATION INTELLIGENCE, SYSTEMS, TECHNOLOGY AND MANAGEMENT, 2011, 141 : 118 - +
  • [48] Speech emotion recognition based on emotion perception
    Gang Liu
    Shifang Cai
    Ce Wang
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [49] A study of speech emotion recognition and its application to mobile services
    Yoon, Won-Joong
    Cho, Youn-Ho
    Park, Kyu-Sik
    UBIQUITOUS INTELLIGENCE AND COMPUTING, PROCEEDINGS, 2007, 4611 : 758 - +
  • [50] Speech emotion recognition based on emotion perception
    Liu, Gang
    Cai, Shifang
    Wang, Ce
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)