Application of Emotion Recognition and Modification for Emotional Telugu Speech Recognition

被引：0

作者：

Vishnu Vidyadhara Raju Vegesna

Krishna Gurugubelli

Anil Kumar Vuppala

机构：

[1] KCIS,Speech Processing Lab

[2] International Institute of Information Technology,undefined

[3] Hyderabad (IIIT-H),undefined

来源：

Mobile Networks and Applications | 2019年 / 24卷

关键词：

ASR; Emotion recognition; Emotive speech;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Majority of the automatic speech recognition systems (ASR) are trained with neutral speech and the performance of these systems are affected due to the presence of emotional content in the speech. The recognition of these emotions in human speech is considered to be the crucial aspect of human-machine interaction. The combined spectral and differenced prosody features are considered for the task of the emotion recognition in the first stage. The task of emotion recognition does not serve the sole purpose of improvement in the performance of an ASR system. Based on the recognized emotions from the input speech, the corresponding adapted emotive ASR model is selected for the evaluation in the second stage. This adapted emotive ASR model is built using the existing neutral and synthetically generated emotive speech using prosody modification method. In this work, the importance of emotion recognition block at the front-end along with the emotive speech adaptation to the ASR system models were studied. The speech samples from IIIT-H Telugu speech corpus were considered for building the large vocabulary ASR systems. The emotional speech samples from IITKGP-SESC Telugu corpus were used for the evaluation. The adapted emotive speech models have yielded better performance over the existing neutral speech models.

引用

页码：193 / 201

页数：8

共 50 条

[31] Persian Speech Emotion Recognition
Savargiv, Mohammad
Bastanfard, Azam
2015 7TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2015,
[32] Multiroom Speech Emotion Recognition
Shalev, Erez
Cohen, Israel
2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 135 - 139
[33] Emotion recognition in Arabic speech
Samira Klaylat
Ziad Osman
Lama Hamandi
Rached Zantout
Analog Integrated Circuits and Signal Processing, 2018, 96 : 337 - 351
[34] Windowing for Speech Emotion Recognition
Puterka, Boris
Kacur, Juraj
Pavlovicova, Jarmila
2019 61ST INTERNATIONAL SYMPOSIUM ELMAR, 2019, : 147 - 150
[35] Mandarin emotion recognition in speech
Pao, TL
Chen, YT
ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 227 - 230
[36] Emotion recognition in Arabic speech
Hadjadji, Imene
Falek, Leila
Demri, Lyes
Teffahi, Hocine
2019 INTERNATIONAL CONFERENCE ON ADVANCED ELECTRICAL ENGINEERING (ICAEE), 2019,
[37] Progress in speech emotion recognition
Zhang, Xueying
Sun, Ying
Duan, Shufei
TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,
[38] Review on speech emotion recognition
Han, W.-J. (hanwenjing07@gmail.com), 1600, Chinese Academy of Sciences (25):
[39] Bengali Speech Emotion Recognition
Mohanta, Abhijit
Sharma, Uzzal
PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 2812 - 2814
[40] Multiroom Speech Emotion Recognition
Shalev, Erez
Cohen, Israel
European Signal Processing Conference, 2022, 2022-August : 135 - 139

← 1 2 3 4 5 →