KSC2: An Industrial-Scale Open-Source Kazakh Speech Corpus

被引:3
|
作者
Mussakhojayeva, Saida [1 ]
Khassanov, Yerbolat [1 ]
Varol, Huseyin Atakan [1 ]
机构
[1] Nazarbayev Univ, Inst Smart Syst & Artificial Intelligence ISSAI, Nur Sultan, Kazakhstan
来源
关键词
speech corpus; Kazakh; speech recognition; streaming ASR; spontaneous; code-switching; agglutinative; RECOGNITION;
D O I
10.21437/Interspeech.2022-421
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present the first industrial-scale open-source Kazakh speech corpus for automatic speech recognition research and development. Our corpus subsumes two previously presented corpora: 1) Kazakh speech corpus (KSC) and 2) Kazakh text-to-speech 2 (KazakhTTS2). We also provide additional data from other sources, including television news, television and radio programs, parliament speeches, and podcasts. Our corpus, which we have named KSC2, contains over a thousand hours of high-quality transcribed data, which is triple the size of KSC. KSC2 was manually transcribed with the help of native Kazakh speakers and validated via preliminary speech recognition experiments on various evaluation sets. Moreover, it contains utterances with Kazakh-Russian code-switching, a conversational practice common among Kazakh speakers. We believe that our corpus will facilitate speech processing research for Kazakh, which is widely considered an under-resourced language. To ensure the reproducibility of experiments, we share the KSC2 corpus, training recipes, and pretrained models(1).
引用
收藏
页码:1367 / 1371
页数:5
相关论文
共 50 条
  • [41] Optimization of an industrial heat exchanger using an open-source CFD code
    Selma, Brahim
    Desilets, Martin
    Proulx, Pierre
    [J]. APPLIED THERMAL ENGINEERING, 2014, 69 (1-2) : 241 - 250
  • [42] INDUSTRIAL-SCALE HOP EXTRACTION WITH LIQUID CO2
    GARDNER, DS
    [J]. CHEMISTRY & INDUSTRY, 1982, (12) : 402 - 405
  • [43] The Implementation of a Vocabulary and Grammar for an Open-Source Speech-Recognition Programming Platform
    Rodriguez-Cartagena, Jean K.
    Claudio-Palacios, Andrea
    Pacheco-Tallaj, Natalia
    Santiago-Gonzalez, Valerie
    Ordonez-Franco, Patricia
    [J]. ASSETS'15: PROCEEDINGS OF THE 17TH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS & ACCESSIBILITY, 2015, : 447 - 448
  • [44] Bad Speech, Good Evidence: Content Moderation in the Context of Open-Source Investigations
    Hubley, Hillary
    [J]. INTERNATIONAL CRIMINAL LAW REVIEW, 2022, 22 (5-6) : 989 - 1015
  • [45] Praaline: An Open-Source System for Managing, Annotating, Visualising and Analysing Speech Corpora
    Christodoulides, George
    [J]. 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, 2018, : 111 - 115
  • [46] Evaluating Open-source Toolkits for Automatic Speech Recognition of South African Languages
    Naidoo, Ashentha
    Tsoeu, Mohohlo
    [J]. 2019 SOUTHERN AFRICAN UNIVERSITIES POWER ENGINEERING CONFERENCE/ROBOTICS AND MECHATRONICS/PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA (SAUPEC/ROBMECH/PRASA), 2019, : 160 - 165
  • [47] Using Open-Source Automatic Speech Recognition Tools for the Annotation of Dutch Infant-Directed Speech
    van der Klis, Anika
    Adriaans, Frans
    Han, Mengru
    Kager, Rene
    [J]. MULTIMODAL TECHNOLOGIES AND INTERACTION, 2023, 7 (07)
  • [48] Sage: An Open-Source Tool for Fast Proteomics Searching and Quantification at Scale
    Lazear, Michael R.
    [J]. JOURNAL OF PROTEOME RESEARCH, 2023, 22 (11) : 3652 - 3659
  • [49] An Open-Source Benchmark for Scale-Aware Visual Odometry Algorithms
    Choi, Hyukdoo
    [J]. INTERNATIONAL JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, 2019, 19 (02) : 119 - 128
  • [50] An Open-Source Scale Model Platform for Teaching Autonomous Vehicle Technologies
    Vincke, Bastien
    Florez, Sergio Rodriguez
    Aubert, Pascal
    [J]. SENSORS, 2021, 21 (11)