A Two-pass Framework of Mispronunciation Detection & Diagnosis for Computer-aided Pronunciation Training

Cited by: 0
Authors
Qian, Xiaojun [1]
Meng, Helen [1]
Soong, Frank [2]
Affiliations
[1] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
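Keywords
DOI
Not available
Chinese Library Classification number
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Discipline classification code
0808; 0809;
Abstract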
This paper presents a two-pass framework for mispronunciation detection and diagnosis (MD&D): detection followed by diagnosis. The framework needs no explicit error pattern modeling, so the main effort can be devoted to improving acoustic modeling through discriminative training (or through alternative models such as neural nets). It instantiates a set of anti-phones and a filler model in addition to the original phone model set, and builds a general and compact phone error detection network. The detection network guarantees full coverage of all possible error patterns while maximally exploiting the constraint offered by the text prompt. Specifically, it includes anti-phones to detect substitutions, a filler model to detect insertions, and skips to detect deletions, so there are no prior assumptions about the possible form of the error patterns. The subsequent diagnosis step expands the detected insertions and substitutions into phone networks, after which another recognition pass reveals the true identities of the detected errors. The crux of the approach is to reduce the modeling and recognition granularity in the detection pass. Discriminative training (DT) of the detection and diagnosis models, which minimizes the expected full-sequence phone-level error in each of the two passes, reduces the overall phone-level MD&D error by a relative 40%. In particular, visualization of the models in the framework shows that discriminative training effectively separates the canonical phones from their anti-phones.
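The detection network described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the `Arc` type, the `anti-` and `<filler>` label names, and both function names are hypothetical, and the candidate sets returned for the diagnosis pass are an assumption about what "expanding into phone networks" could mean. The sketch only shows how a text prompt's canonical phone sequence might be turned into a network with one anti-phone arc per phone (substitution), one skip arc per phone (deletion), and a filler self-loop at every state (insertion).

```python
from typing import List, NamedTuple, Optional


class Arc(NamedTuple):
    src: int               # source state
    dst: int               # destination state
    label: Optional[str]   # model name; None marks a skip (epsilon) arc
    kind: str              # "correct", "substitution", "deletion" or "insertion"


def build_detection_network(canonical: List[str]) -> List[Arc]:
    """Build detection arcs over states 0..len(canonical) (hypothetical layout)."""
    arcs: List[Arc] = []
    for i, phone in enumerate(canonical):
        arcs.append(Arc(i, i, "<filler>", "insertion"))              # filler self-loop
        arcs.append(Arc(i, i + 1, phone, "correct"))                 # canonical phone
        arcs.append(Arc(i, i + 1, "anti-" + phone, "substitution"))  # anti-phone
        arcs.append(Arc(i, i + 1, None, "deletion"))                 # skip arc
    # Allow an insertion after the last canonical phone as well.
    arcs.append(Arc(len(canonical), len(canonical), "<filler>", "insertion"))
    return arcs


def diagnosis_candidates(kind: str, canonical_phone: str,
                         phone_set: List[str]) -> List[str]:
    """Candidate phones for the second pass (assumed expansion rule)."""
    if kind == "substitution":
        # Any phone other than the canonical one may have been produced.
        return [p for p in phone_set if p != canonical_phone]
    if kind == "insertion":
        # An inserted segment may be any phone.
        return list(phone_set)
    # Correct phones and deletions need no further diagnosis.
    return []


network = build_detection_network(["k", "ae", "t"])
```

Under this layout the network grows linearly with the prompt length (four arcs per canonical phone plus one final filler loop), which is one way to read the abstract's claim that the network is compact yet covers every combination of substitutions, insertions, and deletions.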
Pages: 384-387
Page count: 4