L2-ARCTIC: A Non-Native English Speech Corpus

Cited by: 59
Authors
Zhao, Guanlong [1]
Sonsaat, Sinem [2]
Silpachai, Alif [2]
Lucic, Ivana [2]
Chukharev-Hudilainen, Evgeny [2]
Levis, John [2]
Gutierrez-Osuna, Ricardo [1]
Affiliations
[1] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA
[2] Iowa State Univ, Dept English, Ames, IA USA
Keywords
speech corpus; voice conversion; accent conversion; mispronunciation detection; STRESS;
DOI
10.21437/Interspeech.2018-1110
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we introduce L2-ARCTIC, a speech corpus of non-native English that is intended for research in voice conversion, accent conversion, and mispronunciation detection. This initial release includes recordings from ten non-native speakers of English whose first languages (L1s) are Hindi, Korean, Mandarin, Spanish, and Arabic, each L1 containing recordings from one male and one female speaker. Each speaker recorded approximately one hour of read speech from the Carnegie Mellon University ARCTIC prompts, from which we generated orthographic and forced-aligned phonetic transcriptions. In addition, we manually annotated 150 utterances per speaker to identify three types of mispronunciation errors: substitutions, deletions, and additions, making it a valuable resource not only for research in voice conversion and accent conversion but also in computer-assisted pronunciation training. The corpus is publicly accessible at https://psi.engr.tamu.edu/l2-arctic-corpus/.
Pages: 2783-2787
Number of pages: 5