AUTOMATIC EVALUATION OF ENGLISH PRONUNCIATION BASED ON SPEECH RECOGNITION TECHNIQUES

被引:0
|
作者
HAMADA, H
MIKI, S
NAKATSU, R
机构
关键词
SPEECH PROCESSING; PRONUNCIATION; SPEECH RECOGNITION; SPEAKER ADAPTATION; EDUCATION; ENGLISH;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A new method is proposed for automatically evaluating the English pronunciation quality of non-native speakers. It is assumed that pronunciation can be rated using three criteria: the static characteristics of phonetic spectra, the dynamic structure of spectrum sequences, and the prosodic characteristics of utterances. The evaluation uses speech recognition techniques to compare the English words pronounced by a non-native speaker with those pronounced by a native speaker. Three evaluation measures are proposed to rate pronunciation quality. (1) The standard deviation of the mapping vectors, which map the codebook vectors of the non-native speaker onto the vector space of the native speaker, is used to evaluate the static phonetic spectra characteristics. (2) The spectral distance between words pronounced by the non-native speaker and those pronounced by the native speaker obtained by the DTW method is used to evaluate the dynamic characteristics of spectral sequences. (3) The differences in fundamental frequency and speech power between the pronunciation of the native and non-native speaker are used as the criteria for evaluating prosodic characteristics. Evaluation experiments are carried out using 441 words spoken by 10 Japanese speakers and 10 native speakers. One half of the 441 words was used to evaluate static phonetic spectra characteristics, and the other half was used to evaluate the dynamic characteristics of spectral sequences, as well as the prosodic characteristics. Based on the experimental results, the correlation between the evaluation scores and the scores determined by human judgement is found to be 0.90.
引用
收藏
页码:352 / 359
页数:8
相关论文
共 50 条
  • [1] Improving English Pronunciation via Automatic Speech Recognition Technology
    Li, Meihui
    Han, Meiting
    Chen, Zejia
    Mo, Yiling
    Chen, Xiujuan
    Liu, Xiaobin
    [J]. 2017 INTERNATIONAL SYMPOSIUM ON EDUCATIONAL TECHNOLOGY (ISET 2017), 2017, : 224 - 228
  • [2] Improving English pronunciation via automatic speech recognition technology
    Liu, Xiaobin
    Xu, Manfei
    Li, Meihui
    Han, Meiting
    Chen, Zejia
    Mo, Yiling
    Chen, Xiujuan
    Liu, Minjia
    [J]. INTERNATIONAL JOURNAL OF INNOVATION AND LEARNING, 2019, 25 (02) : 126 - 140
  • [3] English Speech Recognition and Pronunciation Quality Evaluation Model Based on Neural Network
    Wang, Li
    [J]. SCIENTIFIC PROGRAMMING, 2022, 2022
  • [4] English Speech Recognition and Pronunciation Quality Evaluation Model Based on Neural Network
    Wang, Li
    [J]. SCIENTIFIC PROGRAMMING, 2022, 2022
  • [5] English Speech Recognition and Pronunciation Quality Evaluation Model Based on Neural Network
    Wang, Li
    [J]. SCIENTIFIC PROGRAMMING, 2022, 2022
  • [6] Automatic evaluation of Dutch pronunciation by using speech recognition technology
    Cucchiarini, C
    Strik, H
    Boves, L
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 622 - 629
  • [7] AUTOMATIC PRONUNCIATION VERIFICATION FOR SPEECH RECOGNITION
    Rao, Kanishka
    Peng, Fuchun
    Beaufays, Francoise
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5162 - 5166
  • [8] Automatic Speech Recognition and Pronunciation Training
    Xiao, Wenqi
    [J]. PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON EDUCATION, ECONOMICS AND MANAGEMENT RESEARCH (ICEEMR 2018), 2018, 182 : 466 - 468
  • [9] Development English Pronunciation Practicing System Based on Speech Recognition
    Ngoc Hoang Phan
    Thi Thu Trang Bui
    Spitsyn, V. G.
    [J]. CONTEXT-AWARE SYSTEMS AND APPLICATIONS, AND NATURE OF COMPUTATION AND COMMUNICATION, 2019, 298 : 157 - 166
  • [10] Research on English pronunciation training based on intelligent speech recognition
    Cai J.
    Liu Y.
    [J]. International Journal of Speech Technology, 2018, 21 (3) : 633 - 640