Emotional Speech Recognition of Holocaust Survivors with Deep Neural Network Models for Russian Language

被引:0
|
作者
Bukreeva, Liudmila [1 ]
Guseva, Daria [1 ]
Dolgushin, Mikhail [1 ]
Evdokimova, Vera [1 ]
Obotnina, Vasilisa [1 ]
机构
[1] St Petersburg State Univ, Univ Skaya Emb 7-9, St Petersburg 199034, Russia
来源
关键词
Question Answering; Corpora; Visual History Archives;
D O I
10.1007/978-3-031-48309-7_6
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recognition of highly emotional speech remains a challenging case of automatic speech recognition task. The aim of this article is to carry out experiments on highly emotional speech recognition by investigating oral history archives provided by the Yad Vashem foundation. The material consists of elderly peoples' emotional speech full of accents and common language. We analyze and preprocess 26 h of publicly available video interviews with Holocaust survivors. Our objective was to develop a system able to perform emotional speech recognition based on deep neural network models. We present and evaluate the obtained results that contribute to the research field of oral history archives.
引用
收藏
页码:68 / 76
页数:9
相关论文
共 50 条
  • [1] A study of neural network Russian language models for automatic continuous speech recognition systems
    Kipyatkova, I. S.
    Karpov, A. A.
    AUTOMATION AND REMOTE CONTROL, 2017, 78 (05) : 858 - 867
  • [2] A study of neural network Russian language models for automatic continuous speech recognition systems
    I. S. Kipyatkova
    A. A. Karpov
    Automation and Remote Control, 2017, 78 : 858 - 867
  • [3] Comparison of Various Neural Network Language Models in Speech Recognition
    Zuo, Lingyun
    Liu, Jian
    Wan, Xin
    2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2016, : 894 - 898
  • [4] Deep Neural Networks in Russian Speech Recognition
    Markovnikov, Nikita
    Kipyatkova, Irina
    Karpov, Alexey
    Filchenkov, Andrey
    ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE, 2018, 789 : 54 - 67
  • [5] Speech Recognition Model for Assamese Language Using Deep Neural Network
    Singh, Moirangthem Tiken
    Barman, Partha Pratim
    Gogoi, Rupjyoti
    2018 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN ELECTRICAL, ELECTRONICS & COMMUNICATION ENGINEERING (ICRIEECE 2018), 2018, : 2722 - 2727
  • [6] Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition
    Chen, X.
    Ragni, A.
    Liu, X.
    Gales, M. J. F.
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 269 - 273
  • [7] BIDIRECTIONAL RECURRENT NEURAL NETWORK LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION
    Arisoy, Ebru
    Sethy, Abhinav
    Ramabhadran, Bhuvana
    Chen, Stanley
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5421 - 5425
  • [8] Structured Output Layer Neural Network Language Models for Speech Recognition
    Le, Hai-Son
    Oparin, Ilya
    Allauzen, Alexandre
    Gauvain, Jean-Luc
    Yvon, Francois
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (01): : 195 - 204
  • [9] Empirical study of neural network language models for Arabic speech recognition
    Emami, Ahmad
    Mangu, Lidia
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 147 - 152
  • [10] Emotional Speech Recognition Using Deep Neural Networks
    Trinh Van, Loan
    Dao Thi Le, Thuy
    Le Xuan, Thanh
    Castelli, Eric
    SENSORS, 2022, 22 (04)