Emotional Speech Recognition of Holocaust Survivors with Deep Neural Network Models for Russian Language

被引:0
|
作者
Bukreeva, Liudmila [1 ]
Guseva, Daria [1 ]
Dolgushin, Mikhail [1 ]
Evdokimova, Vera [1 ]
Obotnina, Vasilisa [1 ]
机构
[1] St Petersburg State Univ, Univ Skaya Emb 7-9, St Petersburg 199034, Russia
来源
关键词
Question Answering; Corpora; Visual History Archives;
D O I
10.1007/978-3-031-48309-7_6
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recognition of highly emotional speech remains a challenging case of automatic speech recognition task. The aim of this article is to carry out experiments on highly emotional speech recognition by investigating oral history archives provided by the Yad Vashem foundation. The material consists of elderly peoples' emotional speech full of accents and common language. We analyze and preprocess 26 h of publicly available video interviews with Holocaust survivors. Our objective was to develop a system able to perform emotional speech recognition based on deep neural network models. We present and evaluate the obtained results that contribute to the research field of oral history archives.
引用
收藏
页码:68 / 76
页数:9
相关论文
共 50 条
  • [41] Deep Neural Network Approaches to Speaker and Language Recognition
    Richardson, Fred
    Reynolds, Douglas
    Dehak, Najim
    IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (10) : 1671 - 1675
  • [42] CONTEXT DEPENDENT STATE TYING FOR SPEECH RECOGNITION USING DEEP NEURAL NETWORK ACOUSTIC MODELS
    Bacchiani, Michiel
    Rybach, David
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [43] Deep Neural Network-Based Speech Recognition with Combination of Speaker-Class Models
    Kosaka, Tetsuo
    Konno, Kazuki
    Kato, Masaharu
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 1203 - 1206
  • [44] Designing and Implementing of Intelligent Emotional Speech Recognition with Wavelet and Neural Network
    Mansouri, Bibi Zahra
    Mirvaziri, Hamid
    Sadeghi, Faramarz
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (09) : 26 - 30
  • [45] Deep Learning for Emotional Speech Recognition
    Sanchez-Gutierrez, Maximo E.
    Marcelo Albornoz, E.
    Martinez-Licona, Fabiola
    Leonardo Rufiner, H.
    Goddard, John
    PATTERN RECOGNITION, MCPR 2014, 2014, 8495 : 311 - +
  • [46] Deep Learning for Emotional Speech Recognition
    Alhamada, M., I
    Khalifa, O. O.
    Abdalla, A. H.
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA2020), 2020, 2306
  • [47] Audiovisual speech recognition based on a deep convolutional neural network
    Rudregowda S.
    Patilkulkarni S.
    Ravi V.
    H.L. G.
    Krichen M.
    Data Science and Management, 2024, 7 (01): : 25 - 34
  • [48] A Study on Speech Emotion Recognition Using a Deep Neural Network
    Lee, Kyong Hee
    Choi, Hyun Kyun
    Jang, Byung Tae
    Kim, Do Hyun
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 1162 - 1165
  • [49] Transfer Learning of Deep Neural Network for Speech Emotion Recognition
    Huang, Ying
    Hu, Mingqing
    Yu, Xianguo
    Wang, Tao
    Yang, Chen
    PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 721 - 729
  • [50] Deep neural network architectures for dysarthric speech analysis and recognition
    Brahim Fares Zaidi
    Sid Ahmed Selouani
    Malika Boudraa
    Mohammed Sidi Yakoub
    Neural Computing and Applications, 2021, 33 : 9089 - 9108