Phonological Feature Based Mispronunciation Detection and Diagnosis using Multi-Task DNNs and Active Learning

被引:8
|
作者
Arora, Vipul [1 ]
Lahiri, Aditi [1 ]
Reetz, Henning [2 ]
机构
[1] Univ Oxford, Fac Linguist Philol & Phonet, Oxford, England
[2] Goethe Univ, Frankfurt, Germany
基金
欧洲研究理事会;
关键词
computer-aided pronunciation training; phonological features; multi-task DNNs; active learning; ACOUSTIC MODELS; SPEECH;
D O I
10.21437/Interspeech.2017-1350
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a phonological feature based computer aided pronunciation training system for the learners of a new language (L2). Phonological features allow analysing the learners' mispronunciations systematically and rendering the feedback more effectively. The proposed acoustic model consists of a multi-task deep neural network, which uses a shared representation for estimating the phonological features and HMM state probabilities. Moreover, an active learning based scheme is proposed to efficiently deal with the cost of annotation, which is done by expert teachers, by selecting the most informative samples for annotation. Experimental evaluations are carried out for German and Italian native-speakers speaking English. For mispronunciation detection, the proposed feature-based system outperforms conventional GOP measure and classifier based methods, while providing more detailed diagnosis. Evaluations also demonstrate the advantage of active learning based sampling over random sampling.
引用
收藏
页码:1432 / 1436
页数:5
相关论文
共 50 条
  • [41] Multi-fault diagnosis for gearboxes based on multi-task deep learning
    Zhao X.
    Wu J.
    Qian C.
    Zhang Y.
    Wang L.
    Zhendong yu Chongji/Journal of Vibration and Shock, 2019, 38 (23): : 271 - 278
  • [42] Probabilistic Joint Feature Selection for Multi-task Learning
    Xiong, Tao
    Bi, Jinbo
    Rao, Bharat
    Cherkassky, Vladimir
    PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 332 - +
  • [43] Multi-Task Adversarial Network for Disentangled Feature Learning
    Liu, Yang
    Wang, Zhaowen
    Jin, Hailin
    Wassell, Ian
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3743 - 3751
  • [44] Deep sparse multi-task learning for feature selection in Alzheimer’s disease diagnosis
    Heung-Il Suk
    Seong-Whan Lee
    Dinggang Shen
    Brain Structure and Function, 2016, 221 : 2569 - 2587
  • [45] Deep sparse multi-task learning for feature selection in Alzheimer's disease diagnosis
    Suk, Heung-Il
    Lee, Seong-Whan
    Shen, Dinggang
    BRAIN STRUCTURE & FUNCTION, 2016, 221 (05): : 2569 - 2587
  • [46] Prognosis and Diagnosis of Parkinson's Disease Using Multi-Task Learning
    Emrani, Saba
    McGuirk, Anya
    Xiao, Wei
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1457 - 1466
  • [47] Structured feature selection and task relationship inference for multi-task learning
    Hongliang Fei
    Jun Huan
    Knowledge and Information Systems, 2013, 35 : 345 - 364
  • [48] Structured feature selection and task relationship inference for multi-task learning
    Fei, Hongliang
    Huan, Jun
    KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 35 (02) : 345 - 364
  • [49] Feature Extraction of Laser Machining Data by Using Deep Multi-Task Learning
    Zhang, Quexuan
    Wang, Zexuan
    Wang, Bin
    Ohsawa, Yukio
    Hayashi, Teruaki
    INFORMATION, 2020, 11 (08)
  • [50] Robust Feature Representation Using Multi-Task Learning for Human Activity Recognition
    Azadi, Behrooz
    Haslgruebler, Michael
    Anzengruber-Tanase, Bernhard
    Sopidis, Georgios
    Ferscha, Alois
    SENSORS, 2024, 24 (02)