Pronunciation Error Detection using DNN Articulatory Model based on Multi-lingual and Multi-task Learning

被引:0
|
作者
Duan, Richeng [1 ]
Kawahara, Tatsuya [1 ]
Dantsuji, Masatake [2 ]
Zhang, Jinsong [3 ]
机构
[1] Kyoto Univ, Sch Informat, Sakyo Ku, Kyoto 6068501, Japan
[2] Kyoto Univ, Acad Ctr Comp & Media Studies, Kyoto, Japan
[3] Beijing Language & Culture Univ, Sch Informat Sci, Beijing, Peoples R China
关键词
CAPT; pronunciation error detection; articulation modeling; multi-lingual learning; multi-task learning; MISPRONUNCIATION DETECTION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Aiming at detecting pronunciation errors produced by second language learners and providing corrective feedbacks related with articulation, we address effective articulatory models based on deep neural network (DNN). Articulatory attributes are defined for manner and place of articulation. In order to efficiently train these models of non-native speech without using such data, which is difficult to collect in a large scale, we propose a multi-lingual learning method, in which the speech database of the target language (L2) and the native language (L1) of the learners are combined. We also investigate multi-task learning methods by tuning the weights of the secondary task. These methods are applied to Mandarin Chinese pronunciation learning by Japanese native speakers. Effects of the multi-lingual and multi-task learning methods are confirmed in the attribute classification and pronunciation error detection.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Multi-Task Clustering with Model Relation Learning
    Zhang, Xiaotong
    Zhang, Xianchao
    Liu, Han
    Luo, Jiebo
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3132 - 3140
  • [42] Multi-Task Model and Feature Joint Learning
    Li, Ya
    Tian, Xinmei
    Liu, Tongliang
    Tao, Dacheng
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3643 - 3649
  • [43] Multi-population genomic prediction using a multi-task Bayesian learning model
    Chen, Liuhong
    Li, Changxi
    Miller, Stephen
    Schenkel, Flavio
    [J]. BMC GENETICS, 2014, 15
  • [44] Multi-task CNN Model for Action Detection
    Chen, Xin
    Han, Yahong
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [45] Multi-population genomic prediction using a multi-task Bayesian learning model
    Liuhong Chen
    Changxi Li
    Stephen Miller
    Flavio Schenkel
    [J]. BMC Genetics, 15
  • [46] A Multi-Task Learning Framework for Multi-Target Stance Detection
    Li, Yingjie
    Caragea, Cornelia
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2320 - 2326
  • [47] Joint Disaster Classification and Victim Detection using Multi-Task Learning
    Tham, Mau-Luen
    Wong, Yi Jie
    Kwan, Ban Hoe
    Owada, Yasunori
    Sein, Myint Myint
    Chang, Yoong Choon
    [J]. 2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 407 - 412
  • [48] Debunking multi-lingual social media posts using deep learning
    Kotiyal B.
    Pathak H.
    Singh N.
    [J]. International Journal of Information Technology, 2023, 15 (5) : 2569 - 2581
  • [49] Sentiment Analysis and Sarcasm Detection using Deep Multi-Task Learning
    Yik Yang Tan
    Chee-Onn Chow
    Jeevan Kanesan
    Joon Huang Chuah
    YongLiang Lim
    [J]. Wireless Personal Communications, 2023, 129 : 2213 - 2237
  • [50] Multi-lingual Scene Text Detection Based on Fully Convolutional Networks
    Liu, Shaohua
    Shang, Yan
    Han, Jizhong
    Wang, Xi
    Gao, Hongchao
    Liu, Dongqin
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 423 - 432