Combination of machine scores for automatic grading of pronunciation quality

被引:64
|
作者
Franco, H [1 ]
Neumeyer, L [1 ]
Digalakis, V [1 ]
Ronen, O [1 ]
机构
[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA
关键词
automatic pronunciation scoring; combination of scores; hidden Markov models; speech recognition; pronunciation quality assessment; language instruction systems; computer aided language learning;
D O I
10.1016/S0167-6393(99)00045-X
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work is part of an effort aimed at developing computer-based systems for language instruction; we address the task of grading the pronunciation quality of the speech of a student of a foreign language. The automatic grading system uses SRI's Decipher(TM) continuous speech recognition system to generate phonetic segmentations. Based on these segmentations and probabilistic models we produce different pronunciation scores for individual or groups of sentences that can be used as predictors of the pronunciation quality. Different types of these machine scores can be combined to obtain a better prediction of the overall pronunciation quality. In this paper we review some of the best-performing machine scores and discuss the application of several methods based on linear and nonlinear mapping and combination of individual machine scores to predict the pronunciation quality grade that a human expert would have given. We evaluate these methods in a database that consists of pronunciation-quality-graded speech from American students speaking French. With predictors based on spectral match and on durational characteristics, we find that the combination of scores improved the prediction of the human grades and that nonlinear mapping and combination methods performed better than linear ones. Characteristics of the different nonlinear methods studied are discussed. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:121 / 130
页数:10
相关论文
共 50 条
  • [31] System combination for improved automatic generation of N-best proper nouns pronunciation
    Duncan, R
    IEEE SOUTHEASTCON 2001: ENGINEERING THE FUTURE, PROCEEDINGS, 2001, : 208 - 212
  • [32] Attribution Scores of BERT-Based SQL-Query Automatic Grading for Explainability
    Sooksatra, Korn
    Khanal, Bikram
    Rivas, Pablo
    Schwartz, Donald R.
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 213 - 220
  • [33] Automatic grading of Bi-colored apples by multispectral machine vision
    Unay, Devrim
    Gosselin, Bernard
    Kleynen, Olivier
    Leemans, Vincent
    Destain, Marie-France
    Debeir, Olivier
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2011, 75 (01) : 204 - 212
  • [34] Design and Implementation of Machine Learning Algorithms in Automatic Grading of Students' Assignments
    Chen, Duo
    Xu, Fang
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (03) : 899 - 919
  • [35] Automatic Gleason grading of prostate cancer using SLIM and machine learning
    Nguyen, Tan H.
    Sridharan, Shamira
    Marcias, Virgilia
    Balla, Andre K.
    Do, Minh N.
    Popescu, Gabriel
    QUANTITATIVE PHASE IMAGING II, 2016, 9718
  • [36] Machine Learning Approach for Automatic Short Answer Grading: A Systematic Review
    Galhardi, Lucas Busatta
    Brancher, Jacques Duilio
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2018, 2018, 11238 : 380 - 391
  • [37] Automatic color grading model of foie gras based on machine vision
    Bin, Pang
    Tai-Lian, Liu
    Acta Technica CSAV (Ceskoslovensk Akademie Ved), 2017, 62 (02): : 455 - 464
  • [38] Automatic Shape Grading of Pearl Using Machine Vision based Measurement
    Cao, Yanlong
    Zheng, Huawen
    Yang, Jiangxin
    He, Yuanfeng
    MEASUREMENT TECHNOLOGY AND INTELLIGENT INSTRUMENTS IX, 2010, 437 : 389 - 392
  • [39] Automatic Pronunciation Evaluation of Singing
    Gupta, Chitralekha
    Li, Haizhou
    Wang, Ye
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1507 - 1511
  • [40] Automatic Pronunciation Evaluation and Classification
    Deshmukh, Om D.
    Joshi, Sachindra
    Verma, Ashish
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1721 - 1724