CONTINUOUS SPEECH RECOGNITION OF KAZAKH LANGUAGE

Cited by: 2
Authors
Mamyrbayev, Orken [1]
Turdalyuly, Mussa [1]
Mekebayev, Nurbapa [2]
Mukhsina, Kuralay [2]
Keylan, Alimukhan [1]
BabaAli, Bagher [1]
Nabieva, Gulnaz [1]
Duisenbayeva, Aigerim [2]
Akhmetov, Bekturgan [1]
Affiliations
[1] Inst Informat & Computat Technol, Alma Ata, Kazakhstan
[2] Al Farabi Kazakh Natl Univ, Informat Technol Dept, Alma Ata, Kazakhstan
DOI
10.1051/itmconf/20192401012
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This article describes methods for building a continuous speech recognition system for the Kazakh language. Compared with other languages, research on Kazakh speech recognition began relatively recently, after the country gained independence, and Kazakh remains a low-resource language. A large amount of data is required to build a reliable system and to evaluate it accurately. A database has been created for the Kazakh language, consisting of speech signals and their corresponding transcriptions. The corpus contains continuous speech from 200 speakers of different genders and ages, together with a pronunciation vocabulary for the language. Traditional models and deep neural networks have been used to train the system. As a result, a word error rate (WER) of 30.01% has been obtained.
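For readers unfamiliar with the metric reported in the abstract, the following is a minimal sketch of how a word error rate is commonly computed: the word-level edit distance (substitutions, deletions, insertions) between a reference transcription and a recognizer hypothesis, divided by the number of reference words. The function name `word_error_rate` and the toy strings are illustrative assumptions; this record does not describe the authors' own scoring tool, which may normalize text differently.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration only; tokenization here is a
    plain whitespace split, which is an assumption.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1        # reference word missing from hypothesis
            insertion = curr[j - 1] + 1   # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


# Toy example (not data from the paper): one substitution and one deletion
# against a four-word reference.
score = word_error_rate("a b c d", "a x c")
```

Under this definition, a WER of 30.01% means roughly 30 word-level errors per 100 reference words. Because insertions are counted, the value can exceed 100% when the hypothesis is much longer than the reference.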
Pages: 5