Common latent representation learning for low-resourced spoken language identification

被引:0
|
作者
Chen, Chen [1 ,2 ]
Bu, Yulin [1 ]
Chen, Yong [1 ]
Chen, Deyun [1 ,2 ]
机构
[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin 150080, Heilongjiang, Peoples R China
[2] Harbin Univ Sci & Technol, Postdoctoral Res Stn Comp Sci & Technol, Harbin 150080, Heilongjiang, Peoples R China
基金
黑龙江省自然科学基金; 中国博士后科学基金; 中国国家自然科学基金;
关键词
Spoken language identification; Total variability space; I-vector; Common latent representation learning; RECOGNITION; SPEECH;
D O I
10.1007/s11042-023-16865-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The i-vector method is one of the mainstream methods in spoken language identification (SLID). It estimates the total variability space (TVS) to obtain a low-rank representation which can characterize the language, called the i-vector. However, on small-scale datasets, low learning resources can significantly degrade the performance of SLID system. Therefore, it is necessary to improve the performance of SLID system in low-resourced condition. In this paper, we propose a common latent representation learning (CLRL) method to learn the TVS, which introduces prior information to address the lack of information in low-resourced condition. The prior information includes category label and parameter prior hypothesis. The CLRL method is evaluated on the OLR2020 dataset. Compared with other state-of-the-art methods, the CLRL method shows better performance on all datasets of different data scales. Moreover, the CLRL method can effectively improve the performance of the SLID system on low-resourced/small-scale datasets.
引用
收藏
页码:34515 / 34535
页数:21
相关论文
共 50 条
  • [1] Common latent representation learning for low-resourced spoken language identification
    Chen Chen
    Yulin Bu
    Yong Chen
    Deyun Chen
    Multimedia Tools and Applications, 2024, 83 : 34515 - 34535
  • [2] Common latent representation learning for low-resourced spoken language identification
    Chen, Chen
    Bu, Yulin
    Chen, Yong
    Chen, Deyun
    Multimedia Tools and Applications, 2024, 83 (12) : 34515 - 34535
  • [3] Wavelet Scattering Transform for Improving Generalization in Low-Resourced Spoken Language Identification
    Dey, Spandan
    Singh, Premjeet
    Saha, Goutam
    INTERSPEECH 2023, 2023, : 1953 - 1957
  • [4] INTENT RECOGNITION AND UNSUPERVISED SLOT IDENTIFICATION FOR LOW-RESOURCED SPOKEN DIALOG SYSTEMS
    Gupta, Akshat
    Deng, Olivia
    Kushwaha, Akruti
    Mittal, Saloni
    Zeng, William
    Rallabandi, Sai Krishna
    Black, Alan W.
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 853 - 860
  • [5] An Automatic Summarizer for a Low-Resourced Language
    Pattnaik, Sagarika
    Nayak, Ajit Kumar
    ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, 2020, 1082 : 285 - 295
  • [6] Topic and Keyword Identification for Low-resourced Speech Using Cross-Language Transfer Learning
    Chen, Wenda
    Hasegawa-Johnson, Mark
    Chen, Nancy F.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2047 - 2051
  • [7] GeezSwitch: Language Identification in Typologically Related Low-resourced East African Languages
    Gaim, Fitsum
    Yang, Wonsuk
    Park, Jong C.
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6578 - 6584
  • [8] Performance of Recent Large Language Models for a Low-Resourced Language
    Jayakody, Ravindu
    Dias, Gihan
    2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 162 - 167
  • [9] Multilingual broad phoneme recognition and language-independent spoken term detection for low-resourced languages
    Deekshitha, G.
    Mary, Leena
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (09) : 7313 - 7323
  • [10] A Spell Checker for a Low-resourced and Morphologically Rich Language
    Octaviano, Manolito, Jr.
    Borra, Allan
    TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 1853 - 1856