Automatic Recognition of Kazakh Speech Using Deep Neural Networks

Cited by: 14
Authors
Mamyrbayev, Orken [1]
Turdalyuly, Mussa [1]
Mekebayev, Nurbapa [2]
Alimhan, Keylan [1]
Kydyrbekova, Aizat [2]
Turdalykyzy, Tolganay [1]
Affiliations
[1] Inst Informat & Computat Technol, Alma Ata 050010, Kazakhstan
[2] al Farabi Kazakh Natl Univ, Alma Ata 050040, Kazakhstan
Keywords
DNN; ASR; Kazakh speech recognition; LM;
DOI
10.1007/978-3-030-14802-7_40
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This article presents a deep neural network (DNN) based automatic speech recognition (ASR) system for the Kazakh language, developed using the Kaldi speech recognition toolkit. The DNNs are initialized using restricted Boltzmann machines (RBMs) and trained by standard error backpropagation with cross-entropy as the objective function. To achieve optimal results, the training has been modified to account for the peculiarities of the Kazakh language. A 76-hour corpus was used for training. Results are compared for two different sets of values between classical models and various DNN settings.
Pages: 465-474
Page count: 10
Related papers
50 records in total
  • [1] Automatic Speech Recognition with Deep Neural Networks for Impaired Speech
    Espana-Bonet, Cristina
    Fonollosa, Jose A. R.
    [J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2016, 2016, 10077 : 97 - 107
  • [2] Deep Spiking Neural Networks for Large Vocabulary Automatic Speech Recognition
    Wu, Jibin
    Yilmaz, Emre
    Zhang, Malu
    Li, Haizhou
    Tan, Kay Chen
    [J]. FRONTIERS IN NEUROSCIENCE, 2020, 14
  • [3] Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition
    Sainath, Tara N.
    Weiss, Ron J.
    Wilson, Kevin W.
    Li, Bo
    Narayanan, Arun
    Variani, Ehsan
    Bacchiani, Michiel
    Shafran, Izhak
    Senior, Andrew
    Chin, Kean
    Misra, Ananya
    Kim, Chanwoo
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 965 - 979
  • [4] Emotional Speech Recognition Using Deep Neural Networks
    Trinh Van, Loan
    Dao Thi Le, Thuy
    Le Xuan, Thanh
    Castelli, Eric
    [J]. SENSORS, 2022, 22 (04)
  • [5] A comprehensive survey on automatic speech recognition using neural networks
    Amandeep Singh Dhanjal
    Williamjeet Singh
    [J]. Multimedia Tools and Applications, 2024, 83 : 23367 - 23412
  • [6] A comprehensive survey on automatic speech recognition using neural networks
    Dhanjal, Amandeep Singh
    Singh, Williamjeet
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 23367 - 23412
  • [7] Multi-resolution speech analysis for automatic speech recognition using deep neural networks: Experiments on TIMIT
    Toledano, Doroteo T.
    Pilar Fernandez-Gallego, Maria
    Lozano-Diez, Alicia
    [J]. PLOS ONE, 2018, 13 (10)
  • [8] ADAPTATION OF CONTEXT-DEPENDENT DEEP NEURAL NETWORKS FOR AUTOMATIC SPEECH RECOGNITION
    Yao, Kaisheng
    Yu, Dong
    Seide, Frank
    Su, Hang
    Deng, Li
    Gong, Yifan
    [J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012 : 366 - 369
  • [9] DEEP NEURAL NETWORKS BASED AUTOMATIC SPEECH RECOGNITION FOR FOUR ETHIOPIAN LANGUAGES
    Abate, Solomon Teferra
    Tachbelie, Martha Ylfiru
    Schultz, Tanja
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020 : 8274 - 8278
  • [10] Automatic Speech Recognition Based on Neural Networks
    Schlueter, Ralf
    Doetsch, Patrick
    Golik, Pavel
    Kitza, Markus
    Menne, Tobias
    Irie, Kazuki
    Tueske, Zoltan
    Zeyer, Albert
    [J]. SPEECH AND COMPUTER, 2016, 9811 : 3 - 17