Combination of machine scores for automatic grading of pronunciation quality

Cited by: 64

Authors:

Franco, H [1]

Neumeyer, L [1]

Digalakis, V [1]

Ronen, O [1]

Affiliation:

[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA

Source:

SPEECH COMMUNICATION | 2000 / Vol. 30 / Issue 2-3

Keywords:

automatic pronunciation scoring; combination of scores; hidden Markov models; speech recognition; pronunciation quality assessment; language instruction systems; computer aided language learning;

DOI:

10.1016/S0167-6393(99)00045-X

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

This work is part of an effort aimed at developing computer-based systems for language instruction; we address the task of grading the pronunciation quality of the speech of a student of a foreign language. The automatic grading system uses SRI's Decipher™ continuous speech recognition system to generate phonetic segmentations. Based on these segmentations and probabilistic models we produce different pronunciation scores for individual or groups of sentences that can be used as predictors of the pronunciation quality. Different types of these machine scores can be combined to obtain a better prediction of the overall pronunciation quality. In this paper we review some of the best-performing machine scores and discuss the application of several methods based on linear and nonlinear mapping and combination of individual machine scores to predict the pronunciation quality grade that a human expert would have given. We evaluate these methods in a database that consists of pronunciation-quality-graded speech from American students speaking French. With predictors based on spectral match and on durational characteristics, we find that the combination of scores improved the prediction of the human grades and that nonlinear mapping and combination methods performed better than linear ones. Characteristics of the different nonlinear methods studied are discussed. © 2000 Elsevier Science B.V. All rights reserved.
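
The abstract mentions combining several machine scores linearly to predict a human grade, but gives no formulas. As a rough illustration only, the sketch below fits a least-squares linear combination of two machine scores to a set of grades. It is not the authors' method: the function names, the two-score setup, and the data are all hypothetical and synthetic, chosen only to show the general idea of score combination. The nonlinear methods the abstract refers to are not covered here.

```python
from typing import List, Sequence


def solve_linear_system(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix, so row operations apply to both sides at once.
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system: predictors are collinear")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_linear_combination(
    scores: Sequence[Sequence[float]], grades: Sequence[float]
) -> List[float]:
    """Return least-squares weights (bias first) mapping scores to grades."""
    rows = [[1.0] + list(s) for s in scores]  # prepend a bias term
    k = len(rows[0])
    # Normal equations: (A^T A) w = A^T g
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    atg = [sum(r[i] * g for r, g in zip(rows, grades)) for i in range(k)]
    return solve_linear_system(ata, atg)


def predict(weights: Sequence[float], score: Sequence[float]) -> float:
    """Apply the fitted linear combination to one vector of machine scores."""
    return weights[0] + sum(w * s for w, s in zip(weights[1:], score))


# Synthetic data (an assumption, not from the paper): two machine scores per
# speaker, with grades generated from a known linear rule so the fit can be
# checked against it.
machine_scores = [(0.2, 0.1), (0.4, 0.5), (0.6, 0.3), (0.8, 0.9), (1.0, 0.7)]
human_grades = [1.0 + 2.0 * a + 1.0 * b for a, b in machine_scores]

weights = fit_linear_combination(machine_scores, human_grades)
```

Because the synthetic grades follow an exact linear rule, the fitted weights recover that rule; with real human grades the fit would only approximate them, which is the setting in which the abstract reports nonlinear methods doing better.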

Pages: 121-130

Page count: 10
