Combination of machine scores for automatic grading of pronunciation quality

被引:64
|
作者
Franco, H [1 ]
Neumeyer, L [1 ]
Digalakis, V [1 ]
Ronen, O [1 ]
机构
[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA
关键词
automatic pronunciation scoring; combination of scores; hidden Markov models; speech recognition; pronunciation quality assessment; language instruction systems; computer aided language learning;
D O I
10.1016/S0167-6393(99)00045-X
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work is part of an effort aimed at developing computer-based systems for language instruction; we address the task of grading the pronunciation quality of the speech of a student of a foreign language. The automatic grading system uses SRI's Decipher(TM) continuous speech recognition system to generate phonetic segmentations. Based on these segmentations and probabilistic models we produce different pronunciation scores for individual or groups of sentences that can be used as predictors of the pronunciation quality. Different types of these machine scores can be combined to obtain a better prediction of the overall pronunciation quality. In this paper we review some of the best-performing machine scores and discuss the application of several methods based on linear and nonlinear mapping and combination of individual machine scores to predict the pronunciation quality grade that a human expert would have given. We evaluate these methods in a database that consists of pronunciation-quality-graded speech from American students speaking French. With predictors based on spectral match and on durational characteristics, we find that the combination of scores improved the prediction of the human grades and that nonlinear mapping and combination methods performed better than linear ones. Characteristics of the different nonlinear methods studied are discussed. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:121 / 130
页数:10
相关论文
共 50 条
  • [41] Comparison between automatic and manual quality grading of sawn softwood
    Lycken, A
    FOREST PRODUCTS JOURNAL, 2006, 56 (04) : 13 - 18
  • [42] Design and experiment of automatic quality grading equipment in grain purchase
    Zhao B.
    Mao W.
    Hu X.
    Zhang X.
    Nongye Jixie Xuebao/Transactions of the Chinese Society of Agricultural Machinery, 2010, 41 (SUPPL. 1): : 268 - 271+276
  • [43] Exponentiated Combination of Two Scores for Image Quality Assessment
    Isono, Kaoru
    Shimamura, Tetsuya
    2013 IEEE INTERNATIONAL CONFERENCE OF IEEE REGION 10 (TENCON), 2013,
  • [44] Optimized Three Scores Combination for Image Quality Assessment
    Ishiyama, Kei
    Sugiura, Yosuke
    Shimamura, Tetsuya
    2016 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS), 2016, : 5 - 8
  • [45] Combination of wavelet analysis and color applied to automatic color grading of ceramic tiles
    Ai, JY
    Liu, D
    Zhu, XF
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 235 - 238
  • [46] Different aspects of expert pronunciation quality ratings and their relation to scores produced by speech recognition algorithms
    Cucchiarini, C
    Strik, H
    Boves, L
    SPEECH COMMUNICATION, 2000, 30 (2-3) : 109 - 119
  • [47] Cow Milk Quality Grading using Machine Learning Methods
    Neware, Shubhangi
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01): : 1 - 7
  • [48] AUTOMATIC PRONUNCIATION ERROR DETECTION BASED ON LINGUISTIC KNOWLEDGE AND PRONUNCIATION SPACE
    Xu, Shuang
    Jiang, Jie
    Chen, Zhenbiao
    Xu, Bo
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4841 - 4844
  • [49] Automatic quality estimation for ASR system combination
    Jalalvand, Shahab
    Negri, Matteo
    Falavigna, Daniele
    Matassoni, Marco
    Turchi, Marco
    COMPUTER SPEECH AND LANGUAGE, 2018, 47 : 214 - 239
  • [50] Machine Vision-based Apple External Quality Grading
    Nie, Maoyong
    Zhao, Qinjun
    Xu, Yuan
    Shen, Tao
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 5961 - 5966