KSC2: An Industrial-Scale Open-Source Kazakh Speech Corpus

被引:3
|
作者
Mussakhojayeva, Saida [1 ]
Khassanov, Yerbolat [1 ]
Varol, Huseyin Atakan [1 ]
机构
[1] Nazarbayev Univ, Inst Smart Syst & Artificial Intelligence ISSAI, Nur Sultan, Kazakhstan
来源
关键词
speech corpus; Kazakh; speech recognition; streaming ASR; spontaneous; code-switching; agglutinative; RECOGNITION;
D O I
10.21437/Interspeech.2022-421
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present the first industrial-scale open-source Kazakh speech corpus for automatic speech recognition research and development. Our corpus subsumes two previously presented corpora: 1) Kazakh speech corpus (KSC) and 2) Kazakh text-to-speech 2 (KazakhTTS2). We also provide additional data from other sources, including television news, television and radio programs, parliament speeches, and podcasts. Our corpus, which we have named KSC2, contains over a thousand hours of high-quality transcribed data, which is triple the size of KSC. KSC2 was manually transcribed with the help of native Kazakh speakers and validated via preliminary speech recognition experiments on various evaluation sets. Moreover, it contains utterances with Kazakh-Russian code-switching, a conversational practice common among Kazakh speakers. We believe that our corpus will facilitate speech processing research for Kazakh, which is widely considered an under-resourced language. To ensure the reproducibility of experiments, we share the KSC2 corpus, training recipes, and pretrained models(1).
引用
收藏
页码:1367 / 1371
页数:5
相关论文
共 50 条
  • [31] Autoscore: An open-source automated tool for scoring listener perception of speech
    Borrie, Stephanie A.
    Barrett, Tyson S.
    Yoho, Sarah E.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 145 (01): : 392 - 399
  • [32] Towards an Open-Source Dutch Speech Recognition System for the Healthcare Domain
    Tejedor-Garcia, Cristian
    van der Molen, Berrie
    van den Heuvel, Henk
    van Hessen, Arjan
    Pieters, Toine
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1032 - 1039
  • [33] Open-Source License Violations of Binary Software at Large Scale
    Feng, Muyue
    Mao, Weixuan
    Yuan, Zimu
    Xiao, Yang
    Ban, Gu
    Wang, Wei
    Wang, Shiyang
    Tang, Qian
    Xu, Jiahuan
    Su, He
    Liu, Binghong
    Huo, Wei
    [J]. 2019 IEEE 26TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER), 2019, : 564 - 568
  • [34] An Open Source Emotional Speech Corpus for Human Robot Interaction Applications
    James, Jesin
    Tian, Li
    Watson, Catherine Inez
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2768 - 2772
  • [35] Open Source German Distant Speech Recognition: Corpus and Acoustic Model
    Radeck-Arneth, Stephan
    Milde, Benjamin
    Lange, Arvid
    Gouvea, Evandro
    Radomski, Stefan
    Muehlhaeuser, Max
    Biemann, Chris
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 480 - 488
  • [36] AutoNetkit: Simplifying Large Scale, Open-Source Network Experimentation
    Knight, Simon
    Jaboldinov, Askar
    Maennel, Olaf
    Phillips, Iain
    Roughan, Matthew
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2012, 42 (04) : 97 - 98
  • [37] An open-source model for optimal design and operation of industrial energy systems
    Atabay, Dennis
    [J]. ENERGY, 2017, 121 : 803 - 821
  • [38] XiUOS: an open-source ubiquitous operating system for industrial Internet of Things
    Donggang Cao
    Dongliang Xue
    Zhiyi Ma
    Hong Mei
    [J]. Science China Information Sciences, 2022, 65
  • [39] XiUOS: an open-source ubiquitous operating system for industrial Internet of Things
    Cao, Donggang
    Xue, Dongliang
    Ma, Zhiyi
    Mei, Hong
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (01)
  • [40] An open-source, industrial-strength optimizing compiler for quantum programs
    Smith, R. S.
    Peterson, E. C.
    Skilbeck, M. G.
    Davis, E. J.
    [J]. QUANTUM SCIENCE AND TECHNOLOGY, 2020, 5 (04):