speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment

Cited by: 8
Authors
Zhang, Junbo [1]
Zhang, Zhiwen [2]
Wang, Yongqing [1]
Yan, Zhiyong [1]
Song, Qiong [2]
Huang, Yukai [2]
Li, Ke [2]
Povey, Daniel [1]
Wang, Yujun [1]
Affiliations
[1] Xiaomi Corp, Beijing, People's Republic of China
[2] SpeechOcean Ltd, Beijing, People's Republic of China
Source
Keywords
corpus; computer-assisted language learning (CALL); second language (L2); mispronunciation detection; lexical stress
DOI
10.21437/Interspeech.2021-1259
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification code
100104; 100213
Abstract
This paper introduces a new open-source speech corpus named "speechocean762", designed for pronunciation assessment. It consists of 5000 English utterances from 250 non-native speakers, half of whom are children. Five experts annotated each utterance at the sentence, word, and phoneme levels. A baseline system is released as open source to illustrate the phoneme-level pronunciation assessment workflow on this corpus. The corpus may be used freely for both commercial and non-commercial purposes. It is available for free download from OpenSLR, and the corresponding baseline system is published in the Kaldi speech recognition toolkit.
Pages: 3710-3714
Page count: 5