Developing an Open-Source Corpus of Yoruba Speech

Cited by: 8

Authors:

Gutkin, Alexander [1]

Demirsahin, Isin [1]

Kjartansson, Oddur [1]

Rivera, Clara [1]

Tubosun, Kola [2]

Affiliations:

[1] Google Research, London, England

[2] British Library, London, England

Source:

INTERSPEECH 2020 | 2020

Keywords:

speech corpora; open-source; West Africa

DOI:

10.21437/Interspeech.2020-1096

Chinese Library Classification:

R36 [Pathology]; R76 [Otorhinolaryngology]

Discipline classification codes:

100104; 100213

Abstract:

This paper introduces an open-source speech dataset for Yoruba, one of the largest low-resource West African languages, spoken by at least 22 million people. Yoruba is one of the official languages of Nigeria, Benin and Togo, and is spoken in other neighboring African countries and beyond. The corpus consists of over four hours of 48 kHz recordings from 36 male and female volunteers, together with the corresponding transcriptions, which include disfluency annotation. The transcriptions have full diacritization, which is vital for pronunciation and lexical disambiguation. The annotated speech dataset described in this paper is primarily intended for use in text-to-speech systems, but it can also serve as adaptation data in automatic speech recognition and speech-to-speech translation, and provide insights into West African corpus linguistics. We demonstrate the use of this corpus in a simple statistical parametric speech synthesis (SPSS) scenario, evaluating it against related languages from the CMU Wilderness dataset and against the Yoruba Lagos-NWU corpus.

Pages: 404-408

Page count: 5
