NON-NATIVE SPEECH CORPORA FOR THE DEVELOPMENT OF COMPUTER ASSISTED PRONUNCIATION TRAINING SYSTEMS

被引:0
|
作者
Carranza, M. [1 ]
Cucchiarini, C. [2 ]
Burgos, P. [2 ]
Strik, H. [2 ]
机构
[1] Univ Autonoma Barcelona, E-08193 Barcelona, Spain
[2] Radboud Univ Nijmegen, NL-6525 ED Nijmegen, Netherlands
关键词
Non-native speech corpora; transcription; L2 pronunciation acquisition; computer assisted pronunciation training; speech technology;
D O I
暂无
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
In this paper we discuss the need for compiling non-native speech corpora for the development of Computer Assisted Pronunciation Training (CAPT) applications that address specific language pairs. Learner's L1-oriented CAPT tools enhanced with Automatic Speech Recognition (ASR) technology can perform better in automatically identifying common errors of speakers of a specific L1 when learning an L2. Nevertheless, the adaptation of an ASR system to non-native speech is a complex and time-consuming task which demands large quantities of speech data annotated at different transcription levels, and this data is generally not easily available for developers. Background research on CAPT development is presented by reporting on various projects that make use of non-native corpora for developing CAPT applications. We then present two different studies which share the objective of compiling an L1-L2 specific non-native corpus with the purpose of developing a CAPT system for the addressed language pair. The results of the two studies show that specific language combinations trigger specific pronunciations errors and these errors should be conveniently described in order to incorporate this information into the CAPT system and provide students with meaningful and effective feedback on their specific mispronunciations. Procedures for the compilation, annotation and transcription of non-native speech corpora that will improve the reusability and exchange of the databases will be also addressed. We discuss some possibilities for facilitating this task by means of using materials already provided with the orthographic transcription, which can be employed to automatically generate the phonemic transcription, or obtaining transcriptions by other methods such as crowdsourcing. These procedures are less time-consuming and should make it easier to develop effective applications. Enhancing awareness of the need for learner's speech databases is another priority that should be addressed, not only for research purposes, but also for the development of applications.
引用
收藏
页码:3624 / 3633
页数:10
相关论文
共 50 条
  • [1] Pronunciation accuracy and intelligibility of non-native speech
    Loukina, Anastassia
    Lopez, Melissa
    Evanini, Keelan
    Suenderinann-Oeft, David
    Ivanov, Alexei V.
    Zechner, Klaus
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1917 - 1921
  • [2] Improving Pronunciation Modeling for Non-Native Speech Recognition
    Tan, Tien-Ping
    Besacier, Laurent
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1801 - 1804
  • [3] Unsupervised pronunciation grammar generation for non-native speech recognition
    Huang, Chien-Lin
    Wu, Chung-Hsien
    Chen, Yi
    Hsu, Chin-Shun
    Lee, Kuei-Ming
    [J]. TENCON 2007 - 2007 IEEE REGION 10 CONFERENCE, VOLS 1-3, 2007, : 452 - +
  • [4] Combined Acoustic and Pronunciation Modelling for Non-Native Speech Recognition
    Bouselmi, G.
    Fohr, D.
    Illina, I.
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1209 - +
  • [5] Phonological feature-based speech recognition system for pronunciation training in non-native language learning
    Arora, Vipul
    Lahiri, Aditi
    Reetz, Henning
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (01): : 98 - 108
  • [7] A speaker adaptation method for non-native speech using learners' native utterances for computer-assisted language learning systems
    Ohkawa, Yuichi
    Suzuki, Motoyuki
    Ogasawara, Hirokazu
    Ito, Akinori
    Makino, Shozo
    [J]. SPEECH COMMUNICATION, 2009, 51 (10) : 875 - 882
  • [8] Non-native pronunciation variants of city names as a problem for speech technology applications
    Schaden, S
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2003, 2807 : 229 - 236
  • [9] A Hybrid Acoustic and Pronunciation Model Adaptation Approach for Non-native Speech Recognition
    Oh, Yoo Rhee
    Kim, Hong Kook
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09): : 2379 - 2387
  • [10] MLLR/MAP Adaptation Using Pronunciation Variation for Non-native Speech Recognition
    Oh, Yoo Rhee
    Kim, Hong Kook
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 216 - 221