Voicer: A Crowd Sourcing Tool for Speech Data Collection

Authors
Buddhika, Darshana [1]
Liyadipita, Ranula [1]
Nadeeshan, Sudeepa [1]
Witharana, Hasini [1]
Jayasena, Sanath [1]
Thayasivam, Uthayasanker [1]
Affiliations
[1] Univ Moratuwa, Dept Comp Sci & Engn, Moratuwa, Sri Lanka
Keywords
data corpus; data collection tool; low resourced languages
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Speech corpora do not exist for most low-resource languages, so creating one for such a language is challenging and requires significant time and effort. This paper gives an overview of related data collection strategies and highlights a few issues prevalent in existing approaches. Its objectives are threefold. First, it introduces an open-source tool called "Voicer", accessible from both handheld devices and computers, that can be used to collect speech data for a specific domain in a short span of time irrespective of the language. Second, it demonstrates the tool by using it to build a Sinhala speech corpus consisting of 10 hours of speech data for 39 different sentences in the banking domain. Finally, it provides a framework for evaluating a speech data corpus, along with lessons learned during data collection, with a view to contributing to future research.
Pages: 174-181
Page count: 8