speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment

被引:8
|
作者
Zhang, Junbo [1 ]
Zhang, Zhiwen [2 ]
Wang, Yongqing [1 ]
Yan, Zhiyong [1 ]
Song, Qiong [2 ]
Huang, Yukai [2 ]
Li, Ke [2 ]
Povey, Daniel [1 ]
Wang, Yujun [1 ]
机构
[1] Xiaomi Corp, Beijing, Peoples R China
[2] SpeechOcean Ltd, Beijing, Peoples R China
来源
关键词
corpus; computer-assisted language learning (CALL); second language (L2); MISPRONUNCIATION DETECTION; LEXICAL STRESS;
D O I
10.21437/Interspeech.2021-1259
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
This paper introduces a new open-source speech corpus named "speechocean762" designed for pronunciation assessment use, consisting of 5000 English utterances from 250 non-native speakers, where half of the speakers are children. Five experts annotated each of the utterances at sentence-level, word-level and phoneme-level. A baseline system is released in open source to illustrate the phoneme-level pronunciation assessment workflow on this corpus. This corpus is allowed to be used freely for commercial and non-commercial purposes. It is available for free download from OpenSLR, and the corresponding baseline system is published in the Kaldi speech recognition toolkit.
引用
收藏
页码:3710 / 3714
页数:5
相关论文
共 50 条
  • [21] EXPLOITING INFORMATION FROM NATIVE DATA FOR NON-NATIVE AUTOMATIC PRONUNCIATION ASSESSMENT
    Lin, Binghuai
    Wang, Liyuan
    [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 708 - 714
  • [22] A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
    Khassanov, Yerbolat
    Mussakhojayeva, Saida
    Mirzakhmetov, Almas
    Adiyev, Alen
    Nurpeiissov, Mukhamet
    Varol, Huseyin Atakan
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 697 - 706
  • [23] Intelligibility of English Mosaic Speech: Comparison between Native and Non-Native Speakers of English
    Santi
    Nakajima, Yoshitaka
    Ueda, Kazuo
    Remijn, Gerard B.
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (19): : 1 - 13
  • [24] Exploring Native and Non-Native English Child Speech Recognition With Whisper
    Jain, Rishabh
    Barcovschi, Andrei
    Yiwere, Mariam Yahayah
    Corcoran, Peter
    Cucu, Horia
    [J]. IEEE ACCESS, 2024, 12 : 41601 - 41610
  • [25] TALCS: AN OPEN-SOURCE MANDARIN-ENGLISH CODE-SWITCHING CORPUS AND A SPEECH RECOGNITION BASELINE
    Li, Chengfei
    Deng, Shuhao
    Wang, Yaoping
    Wang, Guangjing
    Gong, Yaguang
    Chen, Changbin
    Bai, Jinfeng
    [J]. INTERSPEECH 2022, 2022, : 1741 - 1745
  • [26] Non-native pronunciation variants of city names as a problem for speech technology applications
    Schaden, S
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2003, 2807 : 229 - 236
  • [27] A Hybrid Acoustic and Pronunciation Model Adaptation Approach for Non-native Speech Recognition
    Oh, Yoo Rhee
    Kim, Hong Kook
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09): : 2379 - 2387
  • [28] Evaluating Different Non-native Pronunciation Scoring Metrics with the Japanese Speakers of the SAMPLE Corpus
    Alvarez, Vandria Alvarez
    Escudero Mancebo, David
    Gonzalez Ferreras, Cesar
    Cardenoso Payo, Valentin
    [J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2016, 2016, 10077 : 205 - 214
  • [29] NON-NATIVE SPEECH CORPORA FOR THE DEVELOPMENT OF COMPUTER ASSISTED PRONUNCIATION TRAINING SYSTEMS
    Carranza, M.
    Cucchiarini, C.
    Burgos, P.
    Strik, H.
    [J]. EDULEARN14: 6TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2014, : 3624 - 3633
  • [30] MLLR/MAP Adaptation Using Pronunciation Variation for Non-native Speech Recognition
    Oh, Yoo Rhee
    Kim, Hong Kook
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 216 - 221