Phonological Feature Based Mispronunciation Detection and Diagnosis using Multi-Task DNNs and Active Learning

被引:8
|
作者
Arora, Vipul [1 ]
Lahiri, Aditi [1 ]
Reetz, Henning [2 ]
机构
[1] Univ Oxford, Fac Linguist Philol & Phonet, Oxford, England
[2] Goethe Univ, Frankfurt, Germany
基金
欧洲研究理事会;
关键词
computer-aided pronunciation training; phonological features; multi-task DNNs; active learning; ACOUSTIC MODELS; SPEECH;
D O I
10.21437/Interspeech.2017-1350
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a phonological feature based computer aided pronunciation training system for the learners of a new language (L2). Phonological features allow analysing the learners' mispronunciations systematically and rendering the feedback more effectively. The proposed acoustic model consists of a multi-task deep neural network, which uses a shared representation for estimating the phonological features and HMM state probabilities. Moreover, an active learning based scheme is proposed to efficiently deal with the cost of annotation, which is done by expert teachers, by selecting the most informative samples for annotation. Experimental evaluations are carried out for German and Italian native-speakers speaking English. For mispronunciation detection, the proposed feature-based system outperforms conventional GOP measure and classifier based methods, while providing more detailed diagnosis. Evaluations also demonstrate the advantage of active learning based sampling over random sampling.
引用
收藏
页码:1432 / 1436
页数:5
相关论文
共 50 条
  • [21] Multi-task Feature Learning for Social Recommendation
    Zhang, Yuanyuan
    Sun, Maosheng
    Zhang, Xiaowei
    Zhang, Yonglong
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE GRAPH EMPOWERS NEW INFRASTRUCTURE CONSTRUCTION, 2021, 1466 : 240 - 252
  • [22] Deep Asymmetric Multi-task Feature Learning
    Lee, Hae Beom
    Yang, Eunho
    Hwang, Sung Ju
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [23] Efficient Multi-Task Feature Learning with Calibration
    Gong, Pinghua
    Zhou, Jiayu
    Fan, Wei
    Ye, Jieping
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 761 - 770
  • [24] Multi-Task Model and Feature Joint Learning
    Li, Ya
    Tian, Xinmei
    Liu, Tongliang
    Tao, Dacheng
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3643 - 3649
  • [25] Fault Detection and Diagnosis of Air Handling Units Based On Adversarial Multi-task Learning
    Tan, Jingwen
    Li, Dan
    Zheng, Zibin
    Ng, See-King
    2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 89 - 98
  • [26] Online Performance Prediction of Perception DNNs by Multi-Task Learning With Depth Estimation
    Klingner, Marvin
    Fingscheidt, Tim
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (07) : 4670 - 4683
  • [27] Multi-Task Networks With Universe, Group, and Task Feature Learning
    Pentyala, Shiva
    Liu, Mengwen
    Dreyer, Markus
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 820 - 830
  • [28] Classification of Neurological Gait Disorders Using Multi-task Feature Learning
    Papavasileiou, Ioannis
    Zhang, Wenlong
    Wang, Xin
    Bi, Jinbo
    Zhang, Li
    Han, Song
    2017 IEEE/ACM SECOND INTERNATIONAL CONFERENCE ON CONNECTED HEALTH - APPLICATIONS, SYSTEMS AND ENGINEERING TECHNOLOGIES (CHASE), 2017, : 195 - 204
  • [29] Sacroiliitis diagnosis based on interpretable features and multi-task learning
    Liu, Lei
    Zhang, Haoyu
    Zhang, Weifeng
    Mei, Wei
    Huang, Ruibin
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (04):
  • [30] Multi-Feature and Multi-Modal Mispronunciation Detection and Diagnosis Method Based on the Squeezeformer Encoder
    Guo, Shen
    Kadeer, Zaokere
    Wumaier, Aishan
    Wang, Liejun
    Fan, Cong
    IEEE ACCESS, 2023, 11 : 66245 - 66256