A STUDY OF RANK-CONSTRAINED MULTILINGUAL DNNS FOR LOW-RESOURCE ASR

Authors
Sahraeian, Reza [1]
Van Compernolle, Dirk [1]
Affiliations
[1] KU Leuven ESAT, Kasteelpk Arenberg 10, B-3001 Heverlee, Belgium
Keywords
Multilingual deep neural network; low-rank factorization; low-resource ASR; neural networks
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Multilingual Deep Neural Networks (DNNs) have been successfully used to exploit out-of-language data to improve under-resourced ASR. In this paper, we improve on a multilingual DNN by applying low-rank factorization (LRF) of its weight matrices via Singular Value Decomposition (SVD) to sparsify the network. LRF was previously used for monolingual DNNs, yielding large computational savings without a significant loss in recognition accuracy. In this work, we show that properly applying LRF to a multilingual DNN can improve recognition accuracy for multiple low-resource ASR configurations. First, only the final weight layer is factorized. Since the output weight layer needs to be trained with language-specific data, reducing the number of parameters is beneficial for under-resourced languages. It is common in multilingual DNN speech recognition to further adapt the full neural network by retraining the multilingual DNN on target-language data. Again, we observe that in low-resource situations this adaptation can bring significant improvement if LRF is applied to all hidden layers. We demonstrate the positive effect of LRF in two very different scenarios: a phone recognition task for two related languages, and a word recognition task using five different languages from the GlobalPhone dataset.
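As a rough illustration of the low-rank factorization the abstract describes, the sketch below splits one weight matrix into two thinner matrices with a truncated SVD and compares parameter counts. It is not the authors' implementation: the function names `low_rank_factorize` and `param_count`, the layer sizes (256 x 600), and the rank (32) are all assumptions chosen for the example, since the abstract gives no such values.

```python
import numpy as np


def low_rank_factorize(W, k):
    """Split W (m x n) into A (m x k) and B (k x n) using a truncated SVD."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :k] * s[:k]  # keep the k leading left vectors, scaled by their singular values
    B = Vt[:k, :]         # keep the k leading right vectors
    return A, B


def param_count(m, n, k=None):
    """Weights in a full m x n layer, or in its rank-k factorization when k is given."""
    return m * n if k is None else k * (m + n)


# Hypothetical sizes: 256 hidden units feeding 600 output targets, rank 32.
m, n, k = 256, 600, 32
rng = np.random.default_rng(0)
W = rng.standard_normal((m, n))

A, B = low_rank_factorize(W, k)
full_params = param_count(m, n)
factored_params = param_count(m, n, k)
approx_error = np.linalg.norm(W - A @ B)  # Frobenius norm of the truncation error

print(A.shape, B.shape, full_params, factored_params)
```

The factorized layer stores k(m + n) weights instead of mn, so it is smaller whenever k < mn / (m + n). In a network, the single layer with weights W would be replaced by two consecutive linear layers with weights A and B, which is where the reduction in language-specific parameters would come from.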
Pages: 5420-5424
Page count: 5