Automatic pronunciation scoring of words and sentences independent from the non-native's first language

被引:53
|
作者
Cincarek, Tobias [1 ]
Gruhn, Rainer [1 ]
Hacker, Christian [2 ]
Noeth, Elmar [2 ]
Nakamura, Satoshi [1 ]
机构
[1] ATR Spoken Language Translat Res Labs, Keihanna Sci City 6190288, Japan
[2] Univ Erlangen Nurnberg, Inst Pattern Recognit, D-8520 Erlangen, Germany
来源
COMPUTER SPEECH AND LANGUAGE | 2009年 / 23卷 / 01期
关键词
Non-native speech; Pronunciation assessment; Sentence scoring; Word scoring; Mispronunciation detection; Phoneme mispronunciation statistic;
D O I
10.1016/j.csl.2008.03.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes ail approach for automatic scoring of pronunciation quality for non-native speech. It is applicable regardless of the foreign language Student's mother tongue. Sentences and words are considered as scoring units. Additionally, mispronunciation and phoneme confusion statistics for the target language phoneme set are derived from human annotations and word level scoring results using a Markov chain model of mispronunciation detection. The proposed methods call be employed for building a part of the scoring module of a system for computer assisted pronunciation training (CAPT). Methods from pattern and speech recognition are applied to develop appropriate feature sets for sentence and word level scoring. Besides features well-known from and approved in previous research, e.g. phoneme accuracy, posterior score, duration score and recognition accuracy, new features such as high-level phoneme confidence measures are identified. The proposed method is evaluated with native English speech, non-native English speech from German, French, Japanese, Indonesian and Chinese adults and non-native speech from German school children, The speech data are annotated with tags for mispronounced words and sentence level ratings by native English teachers. Experimental results show, that the reliability of automatic sentence level scoring by the system is almost as high as the average human evaluator. Furthermore, a good performance for detecting mispronounced words is achieved. In a validation experiment, it could also be verified, that the system gives the highest pronunciation quality scores to 90% of native speakers' utterances. Automatic error diagnosis based oil a automatically derived phoneme mispronunciation statistic showed reasonable results for five non-native speaker groups. The statistics call be exploited in order to provide the non-native feedback oil mispronounced phonemes. (C) 2008 Elsevier Ltd. All rights reserved.
引用
下载
收藏
页码:65 / 88
页数:24
相关论文
共 50 条
  • [1] EXPLOITING INFORMATION FROM NATIVE DATA FOR NON-NATIVE AUTOMATIC PRONUNCIATION ASSESSMENT
    Lin, Binghuai
    Wang, Liyuan
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 708 - 714
  • [2] A mouse with a roof?: effects of phonological neighbors on processing of words in sentences in a non-native language
    Rueschemeyer, Shirley-Ann
    Nojack, Agnes
    Limbach, Maxi
    BRAIN AND LANGUAGE, 2008, 104 (02) : 132 - 144
  • [3] Automatic pronunciation modelling for multiple non-native accents
    Goronzy, S
    Eisele, K
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 123 - 128
  • [4] Ambiguous words in sentences:: Brain indices for native and non-native disambiguation
    Elston-Guettler, Kerrie E.
    Friederici, Angela D.
    NEUROSCIENCE LETTERS, 2007, 414 (01) : 85 - 89
  • [5] Automatic text-independent pronunciation scoring of foreign language student speech
    Neumeyer, L
    Franco, H
    Weintraub, M
    Price, P
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1457 - 1460
  • [6] Recognizing non-native spoken words in background noise increases interference from the native language
    Florian Hintz
    Cesko C. Voeten
    Odette Scharenborg
    Psychonomic Bulletin & Review, 2023, 30 : 1549 - 1563
  • [7] Recognizing non-native spoken words in background noise increases interference from the native language
    Hintz, Florian
    Voeten, Cesko C.
    Scharenborg, Odette
    PSYCHONOMIC BULLETIN & REVIEW, 2023, 30 (04) : 1549 - 1563
  • [8] Evaluating Different Non-native Pronunciation Scoring Metrics with the Japanese Speakers of the SAMPLE Corpus
    Alvarez, Vandria Alvarez
    Escudero Mancebo, David
    Gonzalez Ferreras, Cesar
    Cardenoso Payo, Valentin
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2016, 2016, 10077 : 205 - 214
  • [9] Automatic scoring of non-native spontaneous speech in tests of spoken English
    Zechner, Klaus
    Higgins, Derrick
    Xi, Xiaoming
    Williamson, David M.
    SPEECH COMMUNICATION, 2009, 51 (10) : 883 - 895
  • [10] Automatic Speech Recognition and Pronunciation Error Detection of Dutch Non-native Speech: cumulating speech resources in a pluricentric language
    Wei, X.
    Cucchiarini, C.
    van Hout, R.
    Strik, H.
    SPEECH COMMUNICATION, 2022, 144 : 1 - 9