Influence of Emotional Speech on Continuous Speech Recognition

被引:0
|
作者
Zgank, Andrej [1 ]
Maucec, Mirjam Sepesy [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, Maribor, Slovenia
关键词
speech recognition; emotional speech; highly inflected language; Human-Computer Interaction; CLASSIFICATION; FEATURES;
D O I
10.1109/elektro49696.2020.9130316
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Emotions are an important part of human communication, but they can present harsh conditions for an automatic continuous speech recognition system. This paper presents an analysis of to which level the emotional speech degrades speech recognition accuracy, when dealing with a highly inflected Slovenian language. Namely, the language characteristics are those that also influence the speech recognition performance, and inflection is one of the most challenging ones. Moreover, Slovenian belongs to the group of under-resourced languages, like other Slavic languages. The speech recognition system was developed with the Slovenian BNSI Broadcast News speech database. The Interface speech database was used for the experiments with the emotional speech. The analysis was carried out with HMM and DNN acoustic models, combined with a 3-gram statistical language model. The results show that emotional speech degrades speech recognition accuracy in the range between 5% and 7% absolutely.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] SPEECH ENHANCEMENT AND FEATURES COMPENSATION ALGORITHMS FOR CONTINUOUS SPEECH RECOGNITION
    Arcos, Christian
    Grivet, Marco
    Alcaim, Abraham
    [J]. 2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 27 - 31
  • [22] Recognition of alkohol influence on speech
    Mensík, R
    [J]. TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 384 - 387
  • [23] COMPUTER RECOGNITION OF VOWELS IN CONTINUOUS SPEECH
    PALIWAL, KK
    RAO, PVS
    [J]. INDIAN JOURNAL OF TECHNOLOGY, 1980, 18 (07): : 285 - 289
  • [24] Constraints on the recognition of words in continuous speech
    McQueen, JM
    [J]. INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2000, 35 (3-4) : 39 - 39
  • [25] Segmental search for continuous speech recognition
    Laface, P
    Fissore, L
    Maro, A
    Ravera, F
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2155 - 2158
  • [26] A time continuous model for speech recognition
    Euler, S
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 889 - 892
  • [27] Robust Mizo Continuous Speech Recognition
    Dey, Abhishek
    Sarma, Biswajit Dev
    Lalhminghlui, Wendy
    Ngente, Lalnunsiami
    Gogoi, Parismita
    Sarmah, Priyankoo
    Prasanna, S. R. M.
    Sinha, Rohit
    Nirmala, S. R.
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1036 - 1040
  • [28] Lexical Stress in Continuous Speech Recognition
    van Dalen, Rogier C.
    Wiggers, Pascal
    Rothkrantz, Leon J. M.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2382 - 2385
  • [29] Continuous speech recognition for radiology reporting
    Schwartz, LH
    Kijewski, PK
    Hertogen, HI
    Roossin, PS
    Castellino, RA
    [J]. RADIOLOGY, 1996, 201 : 9708 - 9708
  • [30] Continuous speech recognition in radiology reporting
    Aggarwal, V
    Rudrapatna, V
    Rajappan, S
    Raval, B
    Tonnesen, AS
    Zhang, JJ
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2000, : 946 - 946