Developing an Open-Source Corpus of Yoruba Speech

被引:8
|
作者
Gutkin, Alexander [1 ]
Demirsahin, Isin [1 ]
Kjartansson, Oddur [1 ]
Rivera, Clara [1 ]
Tnbastin, Kola [2 ]
机构
[1] Google Res, London, England
[2] British Lib, London, England
来源
关键词
speech corpora; open-source; West Africa;
D O I
10.21437/Interspeech.2020-1096
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
This paper introduces an open-source speech dataset for Yoruba - one of the largest low-resource West African languages spoken by at least 22 million people. Yoruba is one of the official languages of Nigeria, Benin and Togo, and is spoken in other neighboring African countries and beyond. The corpus consists of over four hours of 48 kHz recordings from 36 male and female volunteers and the corresponding transcriptions that include disfluency annotation. The transcriptions have full diacritization, which is vital for pronunciation and lexical disambiguation. The annotated speech dataset described in this paper is primarily intended for use in text-to-speech systems, serve as adaptation data in automatic speech recognition and speech-to-speech translation, and provide insights in West African corpus linguistics. We demonstrate the use of this corpus in a simple statistical parametric speech synthesis (SPSS) scenario evaluating it against the related languages from the CMU Wilderness dataset and the Yoruba Lagos-NWU corpus.
引用
收藏
页码:404 / 408
页数:5
相关论文
共 50 条
  • [1] A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
    Khassanov, Yerbolat
    Mussakhojayeva, Saida
    Mirzakhmetov, Almas
    Adiyev, Alen
    Nurpeiissov, Mukhamet
    Varol, Huseyin Atakan
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 697 - 706
  • [2] AISHELL-1: AN OPEN-SOURCE MANDARIN SPEECH CORPUS AND A SPEECH RECOGNITION BASELINE
    Bu, Hui
    Du, Jiayu
    Na, Xingyu
    Wu, Bengu
    Zheng, Hao
    [J]. 2017 20TH CONFERENCE OF THE ORIENTAL CHAPTER OF THE INTERNATIONAL COORDINATING COMMITTEE ON SPEECH DATABASES AND SPEECH I/O SYSTEMS AND ASSESSMENT (O-COCOSDA), 2017, : 58 - 62
  • [3] Open-Source Boundary-Annotated Corpus for Arabic Speech and Language Processing
    Brierley, Claire
    Sawalha, Majdi
    Atwell, Eric
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1011 - 1016
  • [4] KSC2: An Industrial-Scale Open-Source Kazakh Speech Corpus
    Mussakhojayeva, Saida
    Khassanov, Yerbolat
    Varol, Huseyin Atakan
    [J]. INTERSPEECH 2022, 2022, : 1367 - 1371
  • [5] THE BAVIECA OPEN-SOURCE SPEECH RECOGNITION TOOLKIT
    Bolanos, Daniel
    [J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 354 - 359
  • [6] TALCS: AN OPEN-SOURCE MANDARIN-ENGLISH CODE-SWITCHING CORPUS AND A SPEECH RECOGNITION BASELINE
    Li, Chengfei
    Deng, Shuhao
    Wang, Yaoping
    Wang, Guangjing
    Gong, Yaguang
    Chen, Changbin
    Bai, Jinfeng
    [J]. INTERSPEECH 2022, 2022, : 1741 - 1745
  • [7] speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment
    Zhang, Junbo
    Zhang, Zhiwen
    Wang, Yongqing
    Yan, Zhiyong
    Song, Qiong
    Huang, Yukai
    Li, Ke
    Povey, Daniel
    Wang, Yujun
    [J]. INTERSPEECH 2021, 2021, : 3710 - 3714
  • [8] Developing Open-Source Molecular Modeling Software
    不详
    [J]. CHEMICAL ENGINEERING PROGRESS, 2021, 117 (03) : 12 - 12
  • [9] Developing open-source tools to streamline computational research
    Votapka, Lane W.
    Czapla, Luke
    Zhenirovskyy, Maxim
    Demir, Ozlem
    Amaro, Rommie E.
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2012, 243
  • [10] Developing Secure Agent Infrastructures with Open Standards and Open-Source Technologies
    Bellver, Joan
    Such, Jose M.
    Espinosa, Agustin
    Garcia-Fornes, Ana
    [J]. HIGHLIGHTS IN PRACTICAL APPLICATIONS OF AGENTS AND MULTIAGENT SYSTEMS, 2011, 89 : 37 - 44