A STUDY OF RANK-CONSTRAINED MULTILINGUAL DNNS FOR LOW-RESOURCE ASR

被引:0
|
作者
Sahraeian, Reza [1 ]
Van Compernolle, Dirk [1 ]
机构
[1] KU Leuven ESAT, Kasteelpk Arenberg 10, B-3001 Heverlee, Belgium
关键词
Multilingual deep neural network; low-rank factorization; low-resource ASR; NEURAL-NETWORKS;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Multilingual Deep Neural Networks (DNNs) have been successfully used to exploit out-of-language data to improve under-resourced ASR. In this paper, we improve on a multilingual DNN by utilizing low-rank factorization (LRF) of weight matrices via Singular Value Decomposition (SVD) to sparsify a multilingual DNN. LRF was previously used for monolingual DNNs, yielding large computational savings without a significant loss in recognition accuracy. In this work, we show that properly applying LRF on a multilingual DNN can improve recognition accuracy for multiple low-resource ASR configurations. First, only the final weight layer is factorized. Since the output weight layer needs to be trained with language specific data, reducing the number of parameters is beneficial for under-resourced languages. It is common in multilingual DNN speech recognition, to further adapt the full neural network through retraining of the multilingual DNN on target language data. Again we observe that in low-resource situations, this adaptation can bring significant improvement if LRF is applied to all hidden layers. We demonstrate the positive effect of LRF in two very different scenarios: one is a phone recognition task for two related languages and the other is a word recognition task using five different languages from the GlobalPhone dataset.
引用
收藏
页码:5420 / 5424
页数:5
相关论文
共 50 条
  • [41] Combining Simple but Novel Data Augmentation Methods for Improving Low-Resource ASR
    Damania, Ronit
    Homan, Christopher
    Prud'hommeaux, Emily
    [J]. INTERSPEECH 2022, 2022, : 4890 - 4894
  • [42] MGB-3 BUT SYSTEM: LOW-RESOURCE ASR ON EGYPTIAN YOUTUBE DATA
    Vesely, Karel
    Murali, Baskar Karthick
    Diez, Mireia
    Benes, Karel
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 368 - 373
  • [43] The Low-Resource Double Bind: An Empirical Study of Pruning for Low-Resource Machine Translation
    Ahia, Orevaoghene
    Kreutzer, Julia
    Hooker, Sara
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 3316 - 3333
  • [44] Bottleneck Low-rank Transformers for Low-resource Spoken Language Understanding
    Wang, Pu
    Van Hamme, Hugo
    [J]. INTERSPEECH 2022, 2022, : 1248 - 1252
  • [45] Low Resource Language Adaptation using Two-stage Regularization for Multilingual ASR
    Kwok, Chin Yuen
    Yip, Jia Qi
    Chng, Eng Siong
    [J]. 2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 332 - 337
  • [46] Few-shot Controllable Style Transfer for Low-Resource Multilingual Settings
    Krishna, Kalpesh
    Nathani, Deepak
    Garcia, Xavier
    Samanta, Bidisha
    Talukdar, Partha
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 7439 - 7468
  • [47] Extremely Low-resource Multilingual Neural Machine Translation for Indic Mizo Language
    Lalrempuii C.
    Soni B.
    [J]. International Journal of Information Technology, 2023, 15 (8) : 4275 - 4282
  • [48] Rapid Update of Multilingual Deep Neural Network for Low-Resource Keyword Search
    Ni, Chongjia
    Wang, Lei
    Leung, Cheung-Chi
    Rao, Feng
    Lug, Li
    Ma, Bin
    Li, Haizhou
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3698 - 3702
  • [49] The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation
    Goyal, Naman
    Gao, Cynthia
    Chaudhary, Vishrav
    Chen, Peng-Jen
    Wenzek, Guillaume
    Ju, Da
    Krishnan, Sanjana
    Ranzato, Marc'Aurelio
    Guzman, Francisco
    Fan, Angela
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 522 - 538
  • [50] Multilingual Recurrent Neural Networks with Residual Learning for Low-Resource Speech Recognition
    Zhou, Shiyu
    Zhao, Yuanyuan
    Xu, Shuang
    Xu, Bo
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 704 - 708