CONTINUOUS SPEECH RECOGNITION OF KAZAKH LANGUAGE

被引:2
|
作者
Mamyrbayev, Orken [1 ]
Turdalyuly, Mussa [1 ]
Mekebayev, Nurbapa [2 ]
Mukhsina, Kuralay [2 ]
Keylan, Alimukhan [1 ]
BabaAli, Bagher [1 ]
Nabieva, Gulnaz [1 ]
Duisenbayeva, Aigerim [2 ]
Akhmetov, Bekturgan [1 ]
机构
[1] Inst Informat & Computat Technol, Alma Ata, Kazakhstan
[2] Al Farabi Kazakh Natl Univ, Informat Technol Dept, Alma Ata, Kazakhstan
关键词
D O I
10.1051/itmconf/20192401012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article describes the methods of creating a system of recognizing the continuous speech of Kazakh language. Studies on recognition of Kazakh speech in comparison with other languages began relatively recently, that is after obtaining independence of the country, and belongs to low resource languages. A large amount of data is required to create a reliable system and evaluate it accurately. A database has been created for the Kazakh language, consisting of a speech signal and corresponding transcriptions. The continuous speech has been composed of 200 speakers of different genders and ages, and the pronunciation vocabulary of the selected language. Traditional models and deep neural networks have been used to train the system. As a result, a word error rate (WER) of 30.01% has been obtained.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] CONTINUOUS TOPIC LANGUAGE MODELING FOR SPEECH RECOGNITION
    Chueh, Chuang-Hua
    Chien, Jen-Tzung
    [J]. 2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 193 - 196
  • [2] Integration of speech and language processing in Chinese continuous speech recognition
    ZHAO Li ZOU Cairong WU Zhenyang(Department of Radio Engineering
    [J]. Chinese Journal of Acoustics, 2002, (04) : 343 - 351
  • [3] Cascade Speech Translation for the Kazakh Language
    Kozhirbayev, Zhanibek
    Islamgozhayev, Talgat
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (15):
  • [4] A Free Kazakh Speech Database and a Speech Recognition Baseline
    Shi, Ying
    Hamdulla, Askar
    Tang, Zhiyuan
    Wang, Dong
    Zheng, Thomas Fang
    [J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 745 - 748
  • [5] Named entity recognition for the Kazakh language
    Kozhirbayev, Z. M.
    Yessenbayev, Z. A.
    [J]. JOURNAL OF MATHEMATICS MECHANICS AND COMPUTER SCIENCE, 2020, 107 (03): : 57 - 66
  • [6] Syllable modeling in continuous speech recognition for Tamil language
    Thangarajan, R.
    Natarajan, A.
    Selvam, M.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2009, 12 (01) : 47 - 57
  • [7] Feature sets in continuous speech recognition for the Portuguese language
    dos Santos, SCB
    Alcaim, A
    [J]. ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 126 - 129
  • [8] SySRA: A System of a Continuous Speech Recognition in Arab Language
    Abdelhamid, Samir
    Bouguechal, Noureddine
    [J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 11, 2006, 11 : 207 - +
  • [9] A Study of Kazakh Speech Recognition in Hiformer Model
    Mamyrbayev, Orken
    Kurmetkan, Turdbek
    Oralbekova, Dina
    Zhumazhan, Nurdaulet
    [J]. RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, ACIIDS 2024, 2024, 2145 : 330 - 340
  • [10] A study of transformer-based end-to-end speech recognition system for Kazakh language
    Mamyrbayev, Orken
    Oralbekova, Dina
    Alimhan, Keylan
    Turdalykyzy, Tolganay
    Othman, Mohamed
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)