Phonological Feature Based Mispronunciation Detection and Diagnosis using Multi-Task DNNs and Active Learning

被引:8
|
作者
Arora, Vipul [1 ]
Lahiri, Aditi [1 ]
Reetz, Henning [2 ]
机构
[1] Univ Oxford, Fac Linguist Philol & Phonet, Oxford, England
[2] Goethe Univ, Frankfurt, Germany
基金
欧洲研究理事会;
关键词
computer-aided pronunciation training; phonological features; multi-task DNNs; active learning; ACOUSTIC MODELS; SPEECH;
D O I
10.21437/Interspeech.2017-1350
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a phonological feature based computer aided pronunciation training system for the learners of a new language (L2). Phonological features allow analysing the learners' mispronunciations systematically and rendering the feedback more effectively. The proposed acoustic model consists of a multi-task deep neural network, which uses a shared representation for estimating the phonological features and HMM state probabilities. Moreover, an active learning based scheme is proposed to efficiently deal with the cost of annotation, which is done by expert teachers, by selecting the most informative samples for annotation. Experimental evaluations are carried out for German and Italian native-speakers speaking English. For mispronunciation detection, the proposed feature-based system outperforms conventional GOP measure and classifier based methods, while providing more detailed diagnosis. Evaluations also demonstrate the advantage of active learning based sampling over random sampling.
引用
收藏
页码:1432 / 1436
页数:5
相关论文
共 50 条
  • [31] An efficient active learning method for multi-task learning
    Xiao, Yanshan
    Chang, Zheng
    Liu, Bo
    KNOWLEDGE-BASED SYSTEMS, 2020, 190
  • [32] Deep Chessboard Corner Detection Using Multi-task Learning
    Yoon, Hyunse
    Lee, Seongmin
    Kang, Jiwoo
    Lee, Sanghoon
    IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
  • [33] Multi-Task Network Anomaly Detection using Federated Learning
    Zhao, Ying
    Chen, Junjun
    Wu, Di
    Teng, Jian
    Yu, Shui
    SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, : 273 - 279
  • [34] Fetal Cardiac Structure Detection Using Multi-task Learning
    He, Jie
    Yang, Lei
    Zhu, Yunping
    Li, Donglian
    Ding, Zhixing
    Lu, Yuhuan
    Liang, Bocheng
    Li, Shengli
    ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT II, ICIC 2024, 2024, 14882 : 405 - 419
  • [35] Multi-Task Active Learning with Output Constraints
    Zhang, Yi
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 667 - 672
  • [36] A multi-task based deep learning approach for intrusion detection
    Liu, Qigang
    Wang, Deming
    Jia, Yuhang
    Luo, Suyuan
    Wang, Chongren
    KNOWLEDGE-BASED SYSTEMS, 2022, 238
  • [37] NONPARAMETRIC BAYESIAN FEATURE SELECTION FOR MULTI-TASK LEARNING
    Li, Hui
    Liao, Xuejun
    Carin, Lawrence
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2236 - 2239
  • [38] Multi-task feature learning-based improved supervised descent method for facial landmark detection
    Peng Bian
    Zhengnan Xie
    Yi Jin
    Signal, Image and Video Processing, 2018, 12 : 17 - 24
  • [39] Multi-task feature learning-based improved supervised descent method for facial landmark detection
    Bian, Peng
    Xie, Zhengnan
    Jin, Yi
    SIGNAL IMAGE AND VIDEO PROCESSING, 2018, 12 (01) : 17 - 24
  • [40] Unsupervised Multi-Task Feature Learning on Point Clouds
    Hassani, Kaveh
    Haley, Mike
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8159 - 8170