Automatic Recognition of Kazakh Speech Using Deep Neural Networks

Cited by: 14

Authors:

Mamyrbayev, Orken [1]

Turdalyuly, Mussa [1]

Mekebayev, Nurbapa [2]

Alimhan, Keylan [1]

Kydyrbekova, Aizat [2]

Turdalykyzy, Tolganay [1]

Affiliations:

[1] Inst Informat & Computat Technol, Alma Ata 050010, Kazakhstan

[2] al Farabi Kazakh Natl Univ, Alma Ata 050040, Kazakhstan

Source:

INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT II | 2019 / Vol. 11432

Keywords:

DNN; ASR; Kazakh speech recognition; LM;

DOI:

10.1007/978-3-030-14802-7_40

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This article presents an automatic speech recognition (ASR) system for the Kazakh language based on deep neural networks (DNNs), developed using the Kaldi speech recognition toolkit. The DNNs are initialized using restricted Boltzmann machines (RBMs) and trained with standard error backpropagation, using cross-entropy as the objective function. To achieve optimal results, the training was adapted to the peculiarities of the Kazakh language. A 76-hour corpus was used for training. Results are compared for two different sets of values between classical models and various DNN settings.

Pages: 465-474

Page count: 10

