Influence of Emotional Speech on Continuous Speech Recognition

Cited: 0
Authors
Zgank, Andrej [1 ]
Maucec, Mirjam Sepesy [1 ]
Affiliations
[1] Univ Maribor, Fac Elect Engn & Comp Sci, Maribor, Slovenia
Keywords
speech recognition; emotional speech; highly inflected language; Human-Computer Interaction; CLASSIFICATION; FEATURES;
DOI
10.1109/elektro49696.2020.9130316
Chinese Library Classification
TP39 [Applications of Computers];
Subject Classification Codes
081203; 0835
Abstract
Emotions are an important part of human communication, but they can present harsh conditions for an automatic continuous speech recognition system. This paper presents an analysis of the extent to which emotional speech degrades speech recognition accuracy for the highly inflected Slovenian language. Language characteristics also influence speech recognition performance, and inflection is among the most challenging of them. Moreover, Slovenian, like other Slavic languages, belongs to the group of under-resourced languages. The speech recognition system was developed with the Slovenian BNSI Broadcast News speech database, and the Interface speech database was used for the experiments with emotional speech. The analysis was carried out with HMM and DNN acoustic models, combined with a 3-gram statistical language model. The results show that emotional speech degrades speech recognition accuracy by between 5% and 7% absolute.
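The reported degradation is an absolute drop in recognition accuracy between neutral and emotional test conditions. As a minimal illustrative sketch (not the authors' evaluation code), the Python snippet below computes word accuracy as 100% minus word error rate via word-level Levenshtein alignment and reports the absolute degradation between two conditions; the transcripts and values in it are hypothetical.

# Minimal sketch (hypothetical example, not the paper's evaluation code):
# word accuracy from word error rate via Levenshtein alignment, and the
# absolute accuracy degradation between a neutral and an emotional condition.

def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    # Dynamic-programming edit distance over word sequences.
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + cost)   # substitution / match
    return d[len(ref)][len(hyp)] / max(len(ref), 1)

def word_accuracy(reference: str, hypothesis: str) -> float:
    return 100.0 * (1.0 - word_error_rate(reference, hypothesis))

if __name__ == "__main__":
    # Hypothetical neutral vs. emotional recognition outputs.
    ref = "the weather will be sunny tomorrow"
    hyp_neutral = "the weather will be sunny tomorrow"
    hyp_emotional = "the weather will be funny tomorrow"
    acc_neutral = word_accuracy(ref, hyp_neutral)
    acc_emotional = word_accuracy(ref, hyp_emotional)
    print(f"neutral accuracy:     {acc_neutral:.1f}%")
    print(f"emotional accuracy:   {acc_emotional:.1f}%")
    print(f"absolute degradation: {acc_neutral - acc_emotional:.1f}%")

In this toy run the degradation is 1/6 of the reference words, i.e. about 16.7% absolute; the paper's reported 5-7% absolute figures refer to its actual Slovenian test sets, not to this example.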
Pages: 4