Automatic pronunciation scoring of words and sentences independent from the non-native's first language

被引:53
|
作者
Cincarek, Tobias [1 ]
Gruhn, Rainer [1 ]
Hacker, Christian [2 ]
Noeth, Elmar [2 ]
Nakamura, Satoshi [1 ]
机构
[1] ATR Spoken Language Translat Res Labs, Keihanna Sci City 6190288, Japan
[2] Univ Erlangen Nurnberg, Inst Pattern Recognit, D-8520 Erlangen, Germany
来源
COMPUTER SPEECH AND LANGUAGE | 2009年 / 23卷 / 01期
关键词
Non-native speech; Pronunciation assessment; Sentence scoring; Word scoring; Mispronunciation detection; Phoneme mispronunciation statistic;
D O I
10.1016/j.csl.2008.03.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes ail approach for automatic scoring of pronunciation quality for non-native speech. It is applicable regardless of the foreign language Student's mother tongue. Sentences and words are considered as scoring units. Additionally, mispronunciation and phoneme confusion statistics for the target language phoneme set are derived from human annotations and word level scoring results using a Markov chain model of mispronunciation detection. The proposed methods call be employed for building a part of the scoring module of a system for computer assisted pronunciation training (CAPT). Methods from pattern and speech recognition are applied to develop appropriate feature sets for sentence and word level scoring. Besides features well-known from and approved in previous research, e.g. phoneme accuracy, posterior score, duration score and recognition accuracy, new features such as high-level phoneme confidence measures are identified. The proposed method is evaluated with native English speech, non-native English speech from German, French, Japanese, Indonesian and Chinese adults and non-native speech from German school children, The speech data are annotated with tags for mispronounced words and sentence level ratings by native English teachers. Experimental results show, that the reliability of automatic sentence level scoring by the system is almost as high as the average human evaluator. Furthermore, a good performance for detecting mispronounced words is achieved. In a validation experiment, it could also be verified, that the system gives the highest pronunciation quality scores to 90% of native speakers' utterances. Automatic error diagnosis based oil a automatically derived phoneme mispronunciation statistic showed reasonable results for five non-native speaker groups. The statistics call be exploited in order to provide the non-native feedback oil mispronounced phonemes. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:65 / 88
页数:24
相关论文
共 50 条
  • [41] The true cost of science's language barrier for non-native English speakers
    Lenharo, Mariana
    NATURE, 2023, 619 (7971) : 678 - 679
  • [42] Neural Representations of Non-native Speech Reflect Proficiency and Interference from Native Language Knowledge
    Brodbeck, Christian
    Kandylaki, Katerina Danae
    Scharenborg, Odette
    JOURNAL OF NEUROSCIENCE, 2024, 44 (01):
  • [43] Maintaining Antarctica's isolation from non-native species
    Bergstrom, Dana M.
    TRENDS IN ECOLOGY & EVOLUTION, 2022, 37 (01) : 5 - 9
  • [44] Spoken words activate native and non-native letter-to-sound mappings: Evidence from eye tracking
    Marian, Viorica
    Bartolotti, James
    Daniel, Natalia L.
    Hayakawa, Sayuri
    BRAIN AND LANGUAGE, 2021, 223
  • [45] Do non-native languages have an effect on word order processing in first language Turkish?
    Cedden, Gulay
    Aydin, Ozgur
    INTERNATIONAL JOURNAL OF BILINGUALISM, 2019, 23 (04) : 804 - 816
  • [46] Using augmented reality with speech input for non-native children's language learning
    Dalim, Che Samihah Che
    Sunar, Mohd Shahrizal
    Dey, Arindam
    Billinghurst, Mark
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2020, 134 : 44 - 64
  • [47] Counting 'uhm's: How tracking the distribution of native and non-native disfluencies influences online language comprehension
    Bosker, Hans Rutger
    van Os, Marjolein
    Does, Rik
    van Bergen, Geertje
    JOURNAL OF MEMORY AND LANGUAGE, 2019, 106 : 189 - 202
  • [48] ETLT 2021: Shared Task on Automatic Speech Recognition for Non-Native Children's Speech
    Gretter, R.
    Matassoni, Marco
    Falavigna, D.
    Misra, A.
    Leong, C. W.
    Knill, K.
    Wang, L.
    INTERSPEECH 2021, 2021, : 3845 - 3849
  • [49] Using Fluency Representation Learned from Sequential Raw Features for Improving Non-native Fluency Scoring
    Fu, Kaiqi
    Gao, Shaojun
    Tian, Xiaohai
    Li, Wei
    Ma, Zejun
    INTERSPEECH 2022, 2022, : 4337 - 4341
  • [50] Non-native language exposure promotes children's willingness to accept labels in two languages
    Rojo, Dolly P.
    Echols, Catharine H.
    JOURNAL OF COGNITION AND DEVELOPMENT, 2018, 19 (01) : 107 - 118