Influence of Emotional Speech on Continuous Speech Recognition

被引：0

作者：

Zgank, Andrej ^{[1
]}

Maucec, Mirjam Sepesy ^{[1
]}

机构：

[1] Univ Maribor, Fac Elect Engn & Comp Sci, Maribor, Slovenia

来源：

13TH INTERNATIONAL CONFERENCE ON ELEKTRO (ELEKTRO 2020) | 2020年

关键词：

speech recognition; emotional speech; highly inflected language; Human-Computer Interaction; CLASSIFICATION; FEATURES;

D O I：

10.1109/elektro49696.2020.9130316

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Emotions are an important part of human communication, but they can present harsh conditions for an automatic continuous speech recognition system. This paper presents an analysis of to which level the emotional speech degrades speech recognition accuracy, when dealing with a highly inflected Slovenian language. Namely, the language characteristics are those that also influence the speech recognition performance, and inflection is one of the most challenging ones. Moreover, Slovenian belongs to the group of under-resourced languages, like other Slavic languages. The speech recognition system was developed with the Slovenian BNSI Broadcast News speech database. The Interface speech database was used for the experiments with the emotional speech. The analysis was carried out with HMM and DNN acoustic models, combined with a 3-gram statistical language model. The results show that emotional speech degrades speech recognition accuracy in the range between 5% and 7% absolutely.

引用

页数：4

共 50 条

[21] SPEECH ENHANCEMENT AND FEATURES COMPENSATION ALGORITHMS FOR CONTINUOUS SPEECH RECOGNITION
Arcos, Christian
Grivet, Marco
Alcaim, Abraham
[J]. 2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 27 - 31
[22] Recognition of alkohol influence on speech
Mensík, R
[J]. TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 384 - 387
[23] COMPUTER RECOGNITION OF VOWELS IN CONTINUOUS SPEECH
PALIWAL, KK
RAO, PVS
[J]. INDIAN JOURNAL OF TECHNOLOGY, 1980, 18 (07): : 285 - 289
[24] Constraints on the recognition of words in continuous speech
McQueen, JM
[J]. INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2000, 35 (3-4) : 39 - 39
[25] Segmental search for continuous speech recognition
Laface, P
Fissore, L
Maro, A
Ravera, F
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2155 - 2158
[26] A time continuous model for speech recognition
Euler, S
[J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 889 - 892
[27] Robust Mizo Continuous Speech Recognition
Dey, Abhishek
Sarma, Biswajit Dev
Lalhminghlui, Wendy
Ngente, Lalnunsiami
Gogoi, Parismita
Sarmah, Priyankoo
Prasanna, S. R. M.
Sinha, Rohit
Nirmala, S. R.
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1036 - 1040
[28] Lexical Stress in Continuous Speech Recognition
van Dalen, Rogier C.
Wiggers, Pascal
Rothkrantz, Leon J. M.
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2382 - 2385
[29] Continuous speech recognition for radiology reporting
Schwartz, LH
Kijewski, PK
Hertogen, HI
Roossin, PS
Castellino, RA
[J]. RADIOLOGY, 1996, 201 : 9708 - 9708
[30] Continuous speech recognition in radiology reporting
Aggarwal, V
Rudrapatna, V
Rajappan, S
Raval, B
Tonnesen, AS
Zhang, JJ
[J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2000, : 946 - 946

← 1 2 3 4 5 →