speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment

被引:8
|
作者
Zhang, Junbo [1 ]
Zhang, Zhiwen [2 ]
Wang, Yongqing [1 ]
Yan, Zhiyong [1 ]
Song, Qiong [2 ]
Huang, Yukai [2 ]
Li, Ke [2 ]
Povey, Daniel [1 ]
Wang, Yujun [1 ]
机构
[1] Xiaomi Corp, Beijing, Peoples R China
[2] SpeechOcean Ltd, Beijing, Peoples R China
来源
关键词
corpus; computer-assisted language learning (CALL); second language (L2); MISPRONUNCIATION DETECTION; LEXICAL STRESS;
D O I
10.21437/Interspeech.2021-1259
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
This paper introduces a new open-source speech corpus named "speechocean762" designed for pronunciation assessment use, consisting of 5000 English utterances from 250 non-native speakers, where half of the speakers are children. Five experts annotated each of the utterances at sentence-level, word-level and phoneme-level. A baseline system is released in open source to illustrate the phoneme-level pronunciation assessment workflow on this corpus. This corpus is allowed to be used freely for commercial and non-commercial purposes. It is available for free download from OpenSLR, and the corresponding baseline system is published in the Kaldi speech recognition toolkit.
引用
收藏
页码:3710 / 3714
页数:5
相关论文
共 50 条
  • [31] Evaluating Different Non-native Pronunciation Scoring Metrics with the Japanese Speakers of the SAMPLE Corpus
    Alvarez, Vandria Alvarez
    Escudero Mancebo, David
    Gonzalez Ferreras, Cesar
    Cardenoso Payo, Valentin
    [J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2016, 2016, 10077 : 205 - 214
  • [32] Non-native English Teachers' Views towards Pedagogic Goals and Models of Pronunciation
    Takagishi, Ryosuke
    [J]. ASIAN ENGLISHES, 2012, 15 (02) : 108 - 135
  • [33] General adaptation to accented English: Speech intelligibility unaffected by perceived source of non-native accent
    Melguy, Yevgeniy Vasilyevich
    Johnson, Keith
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 149 (04): : 2602 - 2614
  • [34] TEACHING IDIOMS AND FIGURES OF SPEECH TO NON-NATIVE SPEAKERS OF ENGLISH
    ADKINS, PG
    [J]. MODERN LANGUAGE JOURNAL, 1968, 52 (03): : 148 - 152
  • [35] Synthesizing Near Native-accented Speech for a Non-native Speaker by Imitating the Pronunciation and Prosody of a Native Speaker
    Chung, Raymond
    Mak, Brian
    [J]. INTERSPEECH 2022, 2022, : 4302 - 4306
  • [36] AISHELL-1: AN OPEN-SOURCE MANDARIN SPEECH CORPUS AND A SPEECH RECOGNITION BASELINE
    Bu, Hui
    Du, Jiayu
    Na, Xingyu
    Wu, Bengu
    Zheng, Hao
    [J]. 2017 20TH CONFERENCE OF THE ORIENTAL CHAPTER OF THE INTERNATIONAL COORDINATING COMMITTEE ON SPEECH DATABASES AND SPEECH I/O SYSTEMS AND ASSESSMENT (O-COCOSDA), 2017, : 58 - 62
  • [37] IMITATION OF ENGLISH VOWEL DURATION UPON EXPOSURE TO NATIVE AND NON-NATIVE SPEECH
    Zajac, Magdalena
    Rojczyk, Arkadiusz
    [J]. POZNAN STUDIES IN CONTEMPORARY LINGUISTICS, 2014, 50 (04): : 495 - 514
  • [38] Synthesized speech intelligibility among native speakers and non-native speakers of English
    Alamsaputra, Diane Mayasari
    Kohnert, Kathryn J.
    Munson, Benjamin
    Reichle, Joe
    [J]. AUGMENTATIVE AND ALTERNATIVE COMMUNICATION, 2006, 22 (04) : 258 - 268
  • [39] Towards defining a valid assessment criterion of pronunciation proficiency in non-native English-speaking graduate students
    Isaacs, Talia
    [J]. CANADIAN MODERN LANGUAGE REVIEW-REVUE CANADIENNE DES LANGUES VIVANTES, 2008, 64 (04): : 555 - 580
  • [40] Native and non-native talkers' mutual speech intelligibility of English focus sentences
    Lee, Joo-Kyeong
    [J]. LINGUISTIC RESEARCH, 2014, 31 (03) : 441 - 463