Application of Emotion Recognition and Modification for Emotional Telugu Speech Recognition

被引:0
|
作者
Vishnu Vidyadhara Raju Vegesna
Krishna Gurugubelli
Anil Kumar Vuppala
机构
[1] KCIS,Speech Processing Lab
[2] International Institute of Information Technology,undefined
[3] Hyderabad (IIIT-H),undefined
来源
关键词
ASR; Emotion recognition; Emotive speech;
D O I
暂无
中图分类号
学科分类号
摘要
Majority of the automatic speech recognition systems (ASR) are trained with neutral speech and the performance of these systems are affected due to the presence of emotional content in the speech. The recognition of these emotions in human speech is considered to be the crucial aspect of human-machine interaction. The combined spectral and differenced prosody features are considered for the task of the emotion recognition in the first stage. The task of emotion recognition does not serve the sole purpose of improvement in the performance of an ASR system. Based on the recognized emotions from the input speech, the corresponding adapted emotive ASR model is selected for the evaluation in the second stage. This adapted emotive ASR model is built using the existing neutral and synthetically generated emotive speech using prosody modification method. In this work, the importance of emotion recognition block at the front-end along with the emotive speech adaptation to the ASR system models were studied. The speech samples from IIIT-H Telugu speech corpus were considered for building the large vocabulary ASR systems. The emotional speech samples from IITKGP-SESC Telugu corpus were used for the evaluation. The adapted emotive speech models have yielded better performance over the existing neutral speech models.
引用
收藏
页码:193 / 201
页数:8
相关论文
共 50 条
  • [1] Application of Emotion Recognition and Modification for Emotional Telugu Speech Recognition
    Vegesna, Vishnu Vidyadhara Raju
    Gurugubelli, Krishna
    Vuppala, Anil Kumar
    MOBILE NETWORKS & APPLICATIONS, 2019, 24 (01): : 193 - 201
  • [2] Application of prosody modification for Speech Recognition in different Emotion conditions
    Raju, V. V. Vidyadhara
    Gangamohan, P.
    Gangashetty, Suryakanth V.
    Vuppala, Anil Kumar
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 951 - 954
  • [3] Speech Based Emotion Recognition in Tamil and Telugu using LPCC and Hurst Parameters
    Renjith, S.
    Manju, K. G.
    PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON CIRCUIT ,POWER AND COMPUTING TECHNOLOGIES (ICCPCT), 2017,
  • [4] Emotion Attribute Projection for Speaker Recognition on Emotional Speech
    Bao, Huanjun
    Xu, Mingxing
    Zheng, Thomas Fang
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 601 - 604
  • [5] Building a Recognition System of Speech Emotion and Emotional States
    Feng, Xiaoyan
    Watada, Junzo
    2013 SECOND INTERNATIONAL CONFERENCE ON ROBOT, VISION AND SIGNAL PROCESSING (RVSP), 2013, : 253 - 258
  • [6] Generative emotional AI for speech emotion recognition: The case for synthetic emotional speech augmentation
    Latif, Siddique
    Shahid, Abdullah
    Qadir, Junaid
    APPLIED ACOUSTICS, 2023, 210
  • [7] Minimum data generation for Telugu speech recognition
    Sunitha, K.
    Sharada, A.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (02) : 217 - 230
  • [8] TELUGU ANKELU: A Telugu Spoken Digits Corpora for Mobile Speech Recognition
    Bhagath, Parabattina
    Pullagura, Meghana
    Das, Pradip K.
    Yandra, Vikram Kumar
    Thetla, Santhi Sri
    2022 12TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION SYSTEMS (ICPRS), 2022,
  • [9] Application of Neural Networks in Emotional Speech Recognition
    Bojanic, Milana
    Crnojevic, Vladimir
    Delic, Vlado
    ELEVENTH SYMPOSIUM ON NEURAL NETWORK APPLICATIONS IN ELECTRICAL ENGINEERING (NEUREL 2012), 2012,
  • [10] Prominence features: Effective emotional features for speech emotion recognition
    Jing, Shaoling
    Mao, Xia
    Chen, Lijiang
    DIGITAL SIGNAL PROCESSING, 2018, 72 : 216 - 231