Automatic pronunciation scoring of words and sentences independent from the non-native's first language

被引:53
|
作者
Cincarek, Tobias [1 ]
Gruhn, Rainer [1 ]
Hacker, Christian [2 ]
Noeth, Elmar [2 ]
Nakamura, Satoshi [1 ]
机构
[1] ATR Spoken Language Translat Res Labs, Keihanna Sci City 6190288, Japan
[2] Univ Erlangen Nurnberg, Inst Pattern Recognit, D-8520 Erlangen, Germany
来源
COMPUTER SPEECH AND LANGUAGE | 2009年 / 23卷 / 01期
关键词
Non-native speech; Pronunciation assessment; Sentence scoring; Word scoring; Mispronunciation detection; Phoneme mispronunciation statistic;
D O I
10.1016/j.csl.2008.03.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes ail approach for automatic scoring of pronunciation quality for non-native speech. It is applicable regardless of the foreign language Student's mother tongue. Sentences and words are considered as scoring units. Additionally, mispronunciation and phoneme confusion statistics for the target language phoneme set are derived from human annotations and word level scoring results using a Markov chain model of mispronunciation detection. The proposed methods call be employed for building a part of the scoring module of a system for computer assisted pronunciation training (CAPT). Methods from pattern and speech recognition are applied to develop appropriate feature sets for sentence and word level scoring. Besides features well-known from and approved in previous research, e.g. phoneme accuracy, posterior score, duration score and recognition accuracy, new features such as high-level phoneme confidence measures are identified. The proposed method is evaluated with native English speech, non-native English speech from German, French, Japanese, Indonesian and Chinese adults and non-native speech from German school children, The speech data are annotated with tags for mispronounced words and sentence level ratings by native English teachers. Experimental results show, that the reliability of automatic sentence level scoring by the system is almost as high as the average human evaluator. Furthermore, a good performance for detecting mispronounced words is achieved. In a validation experiment, it could also be verified, that the system gives the highest pronunciation quality scores to 90% of native speakers' utterances. Automatic error diagnosis based oil a automatically derived phoneme mispronunciation statistic showed reasonable results for five non-native speaker groups. The statistics call be exploited in order to provide the non-native feedback oil mispronounced phonemes. (C) 2008 Elsevier Ltd. All rights reserved.
引用
下载
收藏
页码:65 / 88
页数:24
相关论文
共 50 条
  • [21] The development of language from non-native linguistic input
    Ross, DS
    Newport, EL
    PROCEEDINGS OF THE 20TH ANNUAL BOSTON UNIVERSITY CONFERENCE ON LANGUAGE DEVELOPMENT, VOLS 1 AND 2, 1996, : 634 - 645
  • [22] Learning unfamiliar words and perceiving non-native vowels in a second language: Insights from eye tracking
    Desmeules-Trudel, Felix
    Joanisse, Marc F.
    ACTA PSYCHOLOGICA, 2022, 226
  • [23] Automatic Pronunciation Evaluation of Non-Native Mandarin Tone by Using Multi-level Confidence Measures
    Lin, Ju
    Xie, Yanlu
    Zhang, Jinsong
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2666 - +
  • [24] Language Dependency of /s/ Production: Native Dutch Versus Non-Native English
    de Boer, Meike M.
    Heeren, Willemijn F. L.
    LANGUAGE AND SPEECH, 2024,
  • [25] Phonological feature-based speech recognition system for pronunciation training in non-native language learning
    Arora, Vipul
    Lahiri, Aditi
    Reetz, Henning
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2018, 143 (01): : 98 - 108
  • [26] Spanish learners of Portuguese as a non-native language: grammatical gender assignment in suffixed words
    Ferreira, Tania Santos
    Rio-Torto, Graca
    ESTUDOS DE LINGUISTICA GALEGA, 2023, 15
  • [28] The Impact of Non-Native Language Input on Bilingual Children's Language Skills
    Buac, Milijana
    Kaushanskaya, Margarita
    LANGUAGES, 2023, 8 (04)
  • [29] Processing figurative language: Evidence from native and non-native speakers of English
    Alkhammash, Reem
    FRONTIERS IN PSYCHOLOGY, 2022, 13
  • [30] Enhancing FL Learner's Perception of Non-native English Pronunciation with a Telecollaboration Project Work
    Casan Pitarch, Ricardo
    Angel Candel-Mora, Miguel
    LFE-REVISTA DE LENGUAS PARA FINES ESPECIFICOS, 2022, 28 (01): : 42 - 60