Pronunciation Error Detection using DNN Articulatory Model based on Multi-lingual and Multi-task Learning

被引:0
|
作者
Duan, Richeng [1 ]
Kawahara, Tatsuya [1 ]
Dantsuji, Masatake [2 ]
Zhang, Jinsong [3 ]
机构
[1] Kyoto Univ, Sch Informat, Sakyo Ku, Kyoto 6068501, Japan
[2] Kyoto Univ, Acad Ctr Comp & Media Studies, Kyoto, Japan
[3] Beijing Language & Culture Univ, Sch Informat Sci, Beijing, Peoples R China
关键词
CAPT; pronunciation error detection; articulation modeling; multi-lingual learning; multi-task learning; MISPRONUNCIATION DETECTION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Aiming at detecting pronunciation errors produced by second language learners and providing corrective feedbacks related with articulation, we address effective articulatory models based on deep neural network (DNN). Articulatory attributes are defined for manner and place of articulation. In order to efficiently train these models of non-native speech without using such data, which is difficult to collect in a large scale, we propose a multi-lingual learning method, in which the speech database of the target language (L2) and the native language (L1) of the learners are combined. We also investigate multi-task learning methods by tuning the weights of the secondary task. These methods are applied to Mandarin Chinese pronunciation learning by Japanese native speakers. Effects of the multi-lingual and multi-task learning methods are confirmed in the attribute classification and pronunciation error detection.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Multi-lingual and Multi-task DNN Learning for Articulatory Error Detection
    Duan, Richeng
    Kawahara, Tatsuya
    Dantsuji, Masatake
    Zhang, Jinsong
    [J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [2] Multi-Task Based Mispronunciation Detection of Children Speech Using Multi-Lingual Information
    Wei, Linxuan
    Dong, Wenwei
    Lin, Binghuai
    Zhang, Jinsong
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1791 - 1794
  • [3] DNN-Based Voice Activity Detection with Multi-Task Learning
    Kang, Tae Gyoon
    Kim, Nam Soo
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (02): : 550 - 553
  • [4] Exploring Multi-lingual, Multi-task, and Adversarial Learning for Low-resource Sentiment Analysis
    Mamta
    Ekbal, Asif
    Bhattacharyya, Pushpak
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)
  • [5] One "Ruler" for All Languages: Multi-Lingual Dialogue Evaluation with Adversarial Multi-Task Learning
    Tong, Xiaowei
    Fu, Zhenxin
    Shang, Mingyue
    Zhao, Dongyan
    Yan, Rui
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4432 - 4438
  • [6] A Multi-lingual Multi-task Architecture for Low-resource Sequence Labeling
    Lin, Ying
    Yang, Shengqi
    Stoyanov, Veselin
    Ji, Heng
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 799 - 809
  • [7] Multi-task Learning for Acoustic Modeling Using Articulatory Attributes
    Lee, Yueh-Ting
    Chen, Xuan-Bo
    Lee, Hung-Shin
    Jang, Jyh-Shing Roger
    Wang, Hsin-Min
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 855 - 861
  • [8] MULTI-LINGUAL MULTI-TASK SPEECH EMOTION RECOGNITION USING WAV2VEC 2.0
    Sharma, Mayank
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6907 - 6911
  • [9] Adversarial Training for Multi-task and Multi-lingual Joint Modeling of Utterance Intent Classification
    Masumura, Ryo
    Shinohara, Yusuke
    Higashinaka, Ryuichiro
    Aono, Yushi
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 633 - 639
  • [10] MULTI-TASK LEARNING WITH LOCALIZED GENERALIZATION ERROR MODEL
    Li, Wendi
    Zhu, Yi
    Wang, Ting
    Ng, Wing W. Y.
    [J]. PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2019, : 380 - 387