Common latent representation learning for low-resourced spoken language identification

被引:0
|
作者
Chen, Chen [1 ,2 ]
Bu, Yulin [1 ]
Chen, Yong [1 ]
Chen, Deyun [1 ,2 ]
机构
[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin 150080, Heilongjiang, Peoples R China
[2] Harbin Univ Sci & Technol, Postdoctoral Res Stn Comp Sci & Technol, Harbin 150080, Heilongjiang, Peoples R China
基金
黑龙江省自然科学基金; 中国博士后科学基金; 中国国家自然科学基金;
关键词
Spoken language identification; Total variability space; I-vector; Common latent representation learning; RECOGNITION; SPEECH;
D O I
10.1007/s11042-023-16865-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The i-vector method is one of the mainstream methods in spoken language identification (SLID). It estimates the total variability space (TVS) to obtain a low-rank representation which can characterize the language, called the i-vector. However, on small-scale datasets, low learning resources can significantly degrade the performance of SLID system. Therefore, it is necessary to improve the performance of SLID system in low-resourced condition. In this paper, we propose a common latent representation learning (CLRL) method to learn the TVS, which introduces prior information to address the lack of information in low-resourced condition. The prior information includes category label and parameter prior hypothesis. The CLRL method is evaluated on the OLR2020 dataset. Compared with other state-of-the-art methods, the CLRL method shows better performance on all datasets of different data scales. Moreover, the CLRL method can effectively improve the performance of the SLID system on low-resourced/small-scale datasets.
引用
收藏
页码:34515 / 34535
页数:21
相关论文
共 50 条
  • [21] Using Annotation Projection for Semantic Role Labeling of Low-Resourced Language: Sinhala
    Gunasekara, Sandun
    Chathura, Dulanjaya
    Jeewantha, Chamoda
    Dias, Gihan
    2020 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2020), 2020, : 98 - 103
  • [22] Question-Answering in a Low-resourced Language: Benchmark Dataset and Models for Tigrinya
    Gaim, Fitsum
    Yang, Wonsuk
    Park, Hancheol
    Park, Jong C.
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 11857 - 11870
  • [23] Language Model Data Augmentation for Keyword Spotting in Low-Resourced Training Conditions
    Gorin, Arseniy
    Lileikyte, Rasa
    Huang, Guangpu
    Lamel, Lori
    Gauvain, Jean-Luc
    Laurent, Antoine
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 775 - 779
  • [24] Toward the Development of Large-Scale Word Embedding for Low-Resourced Language
    Nazir, Shahzad
    Asif, Muhammad
    Sahi, Shahbaz Ahmad
    Ahmad, Shahbaz
    Ghadi, Yazeed Yasin
    Aziz, Muhammad Haris
    IEEE ACCESS, 2022, 10 : 54091 - 54097
  • [25] Case Study on Data Collection of Kreol Morisien, a Low-Resourced Creole Language
    Bastien, David Joshen
    Chumroo, Vijay Prakash
    Bastien, Johan Patrice
    2022 IST-AFRICA CONFERENCE, 2022,
  • [26] Leveraging Large Language Models in Low-resourced Language NLP: A spaCy Implementation for Modern Tibetan
    Kyogoku, Yuki
    Erhard, Franz Xaver
    Engels, James
    Barnett, Robert
    REVUE D ETUDES TIBETAINES, 2025, (74):
  • [27] Analysis of Automatic Evaluation Metric on Low-Resourced Language: BERTScore vs BLEU Score
    Datta, Goutam
    Joshi, Nisheeth
    Gupta, Kusum
    SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 155 - 162
  • [28] Explainable Pre-Trained Language Models for Sentiment Analysis in Low-Resourced Languages
    Mabokela, Koena Ronny
    Primus, Mpho
    Celik, Turgay
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (11)
  • [29] END-TO-END CODE-SWITCHING ASR FOR LOW-RESOURCED LANGUAGE PAIRS
    Yue, Xianghu
    Lee, Grandee
    Yilmaz, Emre
    Deng, Fang
    Li, Haizhou
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 972 - 979
  • [30] Enabling Spoken Dialogue Systems for Low-Resourced Languages-End-to-End Dialect Recognition for North Sami
    Trung Ngo Trong
    Jokinen, Kristiina
    Hautamaki, Ville
    9TH INTERNATIONAL WORKSHOP ON SPOKEN DIALOGUE SYSTEM TECHNOLOGY, 2019, 579 : 221 - 235