Combination of machine scores for automatic grading of pronunciation quality

被引:64
|
作者
Franco, H [1 ]
Neumeyer, L [1 ]
Digalakis, V [1 ]
Ronen, O [1 ]
机构
[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA
关键词
automatic pronunciation scoring; combination of scores; hidden Markov models; speech recognition; pronunciation quality assessment; language instruction systems; computer aided language learning;
D O I
10.1016/S0167-6393(99)00045-X
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work is part of an effort aimed at developing computer-based systems for language instruction; we address the task of grading the pronunciation quality of the speech of a student of a foreign language. The automatic grading system uses SRI's Decipher(TM) continuous speech recognition system to generate phonetic segmentations. Based on these segmentations and probabilistic models we produce different pronunciation scores for individual or groups of sentences that can be used as predictors of the pronunciation quality. Different types of these machine scores can be combined to obtain a better prediction of the overall pronunciation quality. In this paper we review some of the best-performing machine scores and discuss the application of several methods based on linear and nonlinear mapping and combination of individual machine scores to predict the pronunciation quality grade that a human expert would have given. We evaluate these methods in a database that consists of pronunciation-quality-graded speech from American students speaking French. With predictors based on spectral match and on durational characteristics, we find that the combination of scores improved the prediction of the human grades and that nonlinear mapping and combination methods performed better than linear ones. Characteristics of the different nonlinear methods studied are discussed. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:121 / 130
页数:10
相关论文
共 50 条
  • [1] New machine scores and their combinations for automatic mandarin phonetic pronunciation quality assessment
    Pan, Fuping
    Zhao, Qingwei
    Yan, Yonghong
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS: KES 2007 - WIRN 2007, PT I, PROCEEDINGS, 2007, 4692 : 821 - +
  • [2] Machine vision system for automatic quality grading of fruit
    Blasco, J
    Aleixos, N
    Moltó, E
    BIOSYSTEMS ENGINEERING, 2003, 85 (04) : 415 - 423
  • [3] Automatic assessment of pronunciation quality
    Dong, B
    Zhao, QW
    Zhang, JP
    Yan, YH
    2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 137 - 140
  • [4] Automatic scoring of pronunciation quality
    Neumeyer, L
    Franco, H
    Digalakis, V
    Weintraub, M
    SPEECH COMMUNICATION, 2000, 30 (2-3) : 83 - 93
  • [5] AUTOMATIC MACHINE GRADING PROGRAMS
    FORSYTHE, GE
    COMMUNICATIONS OF THE ACM, 1964, 7 (07) : 401 - 401
  • [6] Automatic Scoring of Pronunciation Quality with Hybrid Measure
    Dong, Bin
    Ge, Fengpei
    Pan, Fuping
    Chan, Shui-duen
    2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL III, PROCEEDINGS, 2009, : 381 - +
  • [7] A Machine Learning-based Approach for Automatic Grading and Quality Inspection of Indian Mangoes
    Bagchi, Sourav
    Aditya, Janumpally Varun
    Kumari, Sneha
    Dhanraj, Malla
    Jenamani, Mamata
    2023 IEEE 2ND INDUSTRIAL ELECTRONICS SOCIETY ANNUAL ON-LINE CONFERENCE, ONCON, 2023,
  • [8] AUTOMATIC LUMBER QUALITY GRADING IN PRACTICE
    JUVONEN, R
    PAPERI JA PUU-PAPER AND TIMBER, 1986, 68 (03): : 149 - 150
  • [9] Pronunciation quality evaluation of sentences by combining word based scores
    Wuth, Jorge
    Becerra Yoma, Nestor
    Benavides, Leopoldo
    Vivanco, Hiram
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1278 - 1281
  • [10] Design and experiment on automatic grading machine for kiwi
    Zuo, Xingjian
    Wu, Guangwei
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2014, 45 : 287 - 295