Acoustic Modeling with Bootstrap and Restructuring for Low-resourced Languages

被引:0
|
作者
Cui, Xiaodong [1 ]
Xue, Jian [1 ]
Dognin, Pierre L. [1 ]
Chaudhari, Upendra V. [1 ]
Zhou, Bowen [1 ]
机构
[1] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
speech recognition; bootstrap; model restructuring; low-resourced language;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper investigates an acoustic modeling approach for low-resourced languages based on bootstrap and model restructuring. The approach first creates an acoustic model with redundancy by averaging over bootstrapped models from resampled subsets of sparse training data, which is followed by model restructuring to scale down the model to a desired cardinality. A variety of techniques for Gaussian clustering and model refinement are discussed for the model restructuring. LVCSR experiments are carried out on Pashto language with up to 105 hours of training data. The proposed approach is shown to yield more robust acoustic models given sparse training data and obtain superior performance over the traditional training procedure.
引用
收藏
页码:2974 / 2977
页数:4
相关论文
共 50 条
  • [1] Hidden Markov Acoustic Modeling With Bootstrap and Restructuring for Low-Resourced Languages
    Cui, Xiaodong
    Xue, Jian
    Chen, Xin
    Olsen, Peder A.
    Dognin, Pierre L.
    Chaudhari, Upendra V.
    Hershey, John R.
    Zhou, Bowen
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (08): : 2252 - 2264
  • [2] Multilingual Neural Semantic Parsing for Low-Resourced Languages
    Xia, Menglin
    Monti, Emilio
    [J]. 10TH CONFERENCE ON LEXICAL AND COMPUTATIONAL SEMANTICS (SEM 2021), 2021, : 185 - 194
  • [3] Neural Machine Translation for Low-Resourced Indian Languages
    Choudhary, Himanshu
    Rao, Shivansh
    Rohilla, Rajesh
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3610 - 3615
  • [4] Surface Realization Architecture for Low-resourced African Languages
    Mahlaza, Zola
    Keet, C. Maria
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (03)
  • [5] A Linguistics-Driven Approach to Statistical Parsing for Low-Resourced Languages
    Boonkwan, Prachya
    Supnithi, Thepchai
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (05): : 1045 - 1052
  • [6] Towards Mental Health Analysis in Social Media for Low-resourced Languages
    Garg, Muskan
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (03)
  • [7] Uniform Multilingual Multi-Speaker Acoustic Model for Statistical Parametric Speech Synthesis of Low-Resourced Languages
    Gutkin, Alexander
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2183 - 2187
  • [8] Diabetes in low-resourced countries
    Ashwal, Eran
    Hadar, Eran
    Hod, Moshe
    [J]. BEST PRACTICE & RESEARCH CLINICAL OBSTETRICS & GYNAECOLOGY, 2015, 29 (01) : 91 - 101
  • [9] ASR DOMAIN ADAPTATION METHODS FOR LOW-RESOURCED LANGUAGES: APPLICATION TO ROMANIAN LANGUAGE
    Cucu, Horia
    Besacier, Laurent
    Burileanu, Corneliu
    Buzo, Andi
    [J]. 2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 1648 - 1652
  • [10] Low-resourced Languages and Online Knowledge Repositories: A Need-Finding Study
    Nigatu, Hellina Hailu
    Canny, John
    Chasins, Sarah E.
    [J]. PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024), 2024,