A Two-pass Framework of Mispronunciation Detection & Diagnosis for Computer-aided Pronunciation Training

Authors
Qian, Xiaojun [1]
Meng, Helen [1]
Soong, Frank [2]
Affiliations
[1] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Discipline classification code
0808; 0809
Abstract
This paper presents a two-pass framework of mispronunciation detection and diagnosis (MD&D), in which detection is followed by diagnosis, without the need for explicit error pattern modeling, so that the main effort can be devoted to improving acoustic modeling by discriminative training (or by applying alternative models such as neural nets). The framework instantiates a set of anti-phones and a filler model in addition to the original phone model set, and crafts a general and compact phone error detection network. The detection network guarantees full coverage of all possible error patterns while maximally exploiting the constraint offered by the text prompt. Specifically, it includes anti-phones to detect substitutions, a filler model to detect insertions, and skips to detect deletions, so there are no prior assumptions on the possible form of error patterns. The subsequent diagnosis step expands the detected insertions and substitutions into phone networks, after which another recognition pass reveals the true identities of the detected errors. The crux of the trick is to bring down the modeling and recognition granularity in the detection pass. Discriminative training (DT) of the detection and diagnosis models, by minimizing the two expected full-sequence phone-level errors in the respective passes, reduces the overall phone-level MD&D error by 40% relative. In particular, visualization of the models in the framework shows that discriminative training effectively separates the canonical phones from their anti-phones.
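The detection network described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact topology, so the layout below is an assumption: one node per prompt position, with a canonical-phone arc, an anti-phone arc, and a skip arc between consecutive nodes, plus a filler self-loop at every node. The names `build_detection_network`, `label_path`, `FILLER`, `SKIP`, and the `anti-` prefix are hypothetical and do not come from the paper.

```python
from typing import List, Tuple

Arc = Tuple[int, int, str]  # (source node, target node, label)

FILLER = "<filler>"  # hypothetical label for the filler model (insertions)
SKIP = "<eps>"       # hypothetical label for a skip arc (deletions)


def anti(phone: str) -> str:
    """Hypothetical naming scheme for the anti-phone of a canonical phone."""
    return f"anti-{phone}"


def build_detection_network(canonical: List[str]) -> List[Arc]:
    """Build an assumed detection network for one text prompt.

    Node i sits just before canonical phone i; node len(canonical) is final.
    """
    arcs: List[Arc] = []
    for i, phone in enumerate(canonical):
        arcs.append((i, i, FILLER))            # insertion before phone i
        arcs.append((i, i + 1, phone))         # correct pronunciation
        arcs.append((i, i + 1, anti(phone)))   # substitution (identity unknown)
        arcs.append((i, i + 1, SKIP))          # deletion
    n = len(canonical)
    arcs.append((n, n, FILLER))                # insertion after the last phone
    return arcs


def label_path(canonical: List[str], path: List[str]) -> List[Tuple[str, str]]:
    """Turn a label sequence through the network into detection decisions."""
    decisions: List[Tuple[str, str]] = []
    pos = 0
    for label in path:
        if label == FILLER:
            decisions.append(("insertion", ""))
            continue
        if pos >= len(canonical):
            raise ValueError("path is longer than the prompt")
        phone = canonical[pos]
        if label == phone:
            decisions.append(("correct", phone))
        elif label == anti(phone):
            decisions.append(("substitution", phone))
        elif label == SKIP:
            decisions.append(("deletion", phone))
        else:
            raise ValueError(f"label {label!r} not allowed at position {pos}")
        pos += 1
    if pos != len(canonical):
        raise ValueError("path does not cover the whole prompt")
    return decisions
```

Under this assumed layout the network grows linearly with the prompt length (four arcs per phone plus one trailing filler loop), and any mix of substitutions, insertions, and deletions corresponds to some path. The detection pass only flags where an error occurred; identifying which phone was actually spoken is left to the second, diagnosis pass.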
Pages: 384-387
Page count: 4