Automatic text-independent pronunciation scoring of foreign language student speech

被引:0
|
作者
Neumeyer, L
Franco, H
Weintraub, M
Price, P
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
SRI International is currently involved in he development of a new generation of software systems for automatic scoring of pronunciation as part of the Voice Interactive Language Training System (VILTS) project. This paper describes the goals of the VILTS system, the speech corpus, and the algorithm development. The automatic grading system uses SRI's Decipher(TM) continuous speech recognition system [1] to generate phonetic segmentations that are used to produce pronunciation scores at the end of each lesson. The scores produced by the system are similar to those of expert human listeners, Unlike previous approaches in which models were built for specific sentences or phrases, we present a new family of algorithms designed to perform well even when knowledge of the exact text to be used is nor available.
引用
收藏
页码:1457 / 1460
页数:4
相关论文
共 50 条
  • [1] Text-Independent Automatic Accent Identification System for Kannada Language
    Soorajkumar, R.
    Girish, G. N.
    Ramteke, Pravin B.
    Joshi, Shreyas S.
    Koolagudi, Shashidhar G.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2, 2017, 469 : 411 - 418
  • [2] Automatic Text-Independent Artifact Detection, Localization, and Classification in Synthetic Speech
    Pribil, Jiri
    Pribilova, Anna
    Matousek, Jindrich
    [J]. RADIOENGINEERING, 2017, 26 (04) : 1151 - 1160
  • [3] Automatic pronunciation scoring for language instruction
    Franco, H
    Neumeyer, L
    Kim, Y
    Ronen, O
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1471 - 1474
  • [4] A Text-Independent Method for Estimating Pronunciation Quality of Chinese Students
    Huang, Guimin
    Li, Huijuan
    Zhou, Rong
    Zhou, Ya
    [J]. INFORMATION TECHNOLOGY AND INTELLIGENT TRANSPORTATION SYSTEMS, VOL 2, 2017, 455 : 201 - 211
  • [5] AUTOMATIC ARABIC PRONUNCIATION SCORING FOR LANGUAGE INSTRUCTION
    Dahan, Hassan
    Hussin, Abdul
    Razak, Zaidi
    Odelha, Mourad
    [J]. EDULEARN11: 3RD INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2011, : 145 - 150
  • [6] Language dependency in text-independent speaker verification
    Auckenthaler, R
    Carey, MJ
    Mason, JSD
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 441 - 444
  • [7] Text-Dependent Versus Text-Independent Speech Emotion Recognition
    Nayak, Biswajit
    Pradhan, Manoj Kumar
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 1, 2016, 379 : 153 - 161
  • [8] Robust local scoring function for text-independent speaker verification
    Liu, Ming
    Huang, Thomas S.
    Zhang, Zhengyou
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 1146 - +
  • [9] Effect of speech coding on text-independent speaker identification
    Porwal, G
    Patil, HA
    Basu, TK
    [J]. 2005 International Conference on Intelligent Sensing and Information Processing, Proceedings, 2005, : 415 - 420
  • [10] Dynamic Speech Parameterization for Text-Independent Phone Segmentation
    Cherniz, Analia S.
    Torres, Maria E.
    Rufiner, Hugo L.
    [J]. 2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 4044 - 4047