Phonological Feature Based Mispronunciation Detection and Diagnosis using Multi-Task DNNs and Active Learning

被引:8
|
作者
Arora, Vipul [1 ]
Lahiri, Aditi [1 ]
Reetz, Henning [2 ]
机构
[1] Univ Oxford, Fac Linguist Philol & Phonet, Oxford, England
[2] Goethe Univ, Frankfurt, Germany
基金
欧洲研究理事会;
关键词
computer-aided pronunciation training; phonological features; multi-task DNNs; active learning; ACOUSTIC MODELS; SPEECH;
D O I
10.21437/Interspeech.2017-1350
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a phonological feature based computer aided pronunciation training system for the learners of a new language (L2). Phonological features allow analysing the learners' mispronunciations systematically and rendering the feedback more effectively. The proposed acoustic model consists of a multi-task deep neural network, which uses a shared representation for estimating the phonological features and HMM state probabilities. Moreover, an active learning based scheme is proposed to efficiently deal with the cost of annotation, which is done by expert teachers, by selecting the most informative samples for annotation. Experimental evaluations are carried out for German and Italian native-speakers speaking English. For mispronunciation detection, the proposed feature-based system outperforms conventional GOP measure and classifier based methods, while providing more detailed diagnosis. Evaluations also demonstrate the advantage of active learning based sampling over random sampling.
引用
收藏
页码:1432 / 1436
页数:5
相关论文
共 50 条
  • [1] A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-task Learning
    Ryu, Hyungshin
    Kim, Sunhee
    Chung, Minhwa
    INTERSPEECH 2023, 2023, : 959 - 963
  • [2] Multi-Task Based Mispronunciation Detection of Children Speech Using Multi-Lingual Information
    Wei, Linxuan
    Dong, Wenwei
    Lin, Binghuai
    Zhang, Jinsong
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1791 - 1794
  • [3] Multi-Task Learning for Mispronunciation Detection on Singapore Children's Mandarin Speech
    Tong, Rong
    Chen, Nancy E.
    Ma, Bin
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2193 - 2197
  • [4] Multi-task Feature Learning Based Anomaly Detection of Network Dataflow
    Ren Hui-feng
    Yan Feng
    Dong Qing-chao
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4144 - 4147
  • [5] Deep Reinforcement Learning Based Multi-Task Automated Channel Pruning for DNNs
    Ma, Xiaodong
    Fang, Weiwei
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [6] Convex multi-task feature learning
    Andreas Argyriou
    Theodoros Evgeniou
    Massimiliano Pontil
    Machine Learning, 2008, 73 : 243 - 272
  • [7] Convex multi-task feature learning
    Argyriou, Andreas
    Evgeniou, Theodoros
    Pontil, Massimiliano
    MACHINE LEARNING, 2008, 73 (03) : 243 - 272
  • [8] Multi-Task Feature Interaction Learning
    Lin, Kaixiang
    Xu, Jianpeng
    Baytas, Inci M.
    Ji, Shuiwang
    Zhou, Jiayu
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1735 - 1744
  • [9] Learning Task Relational Structure for Multi-Task Feature Learning
    Wang, De
    Nie, Feiping
    Huang, Heng
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1239 - 1244
  • [10] Bearing Fault Diagnosis based on Multi-task Learning
    Mao, Wentao
    He, Jianliang
    Feng, Wushi
    Tian, Siyu
    2018 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-CHONGQING 2018), 2018, : 358 - 363