Multilingual broad phoneme recognition and language-independent spoken term detection for low-resourced languages

被引:1
|
作者
Deekshitha, G. [1 ]
Mary, Leena [2 ]
机构
[1] APJ Abdul Kalam Technol Univ, Rajiv Gandhi Inst Technol, Ctr Adv Signal Proc CASP, Dept Elect & Commun Engn, Kottayam 686501, Kerala, India
[2] Rajiv Gandhi Inst Technol, Dept Elect & Commun Engn, Kottayam 686501, Kerala, India
关键词
Spoken term detection; Keyword spotting; Broad phoneme classification; Multilingual audio search; Low-resourced languages; Posteriorgrams; ACOUSTIC WORD EMBEDDINGS; QUERY;
D O I
10.1016/j.jksuci.2021.08.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Language-independent spoken term detection (LI-STD) refers to the process of locating the occurrences of spoken queries from speech databases of any language. This paper alization of a multilingual broad pho-neme classifier (BPC) and its application for the development of an LI-STD system. This work proposes a multi-stage architecture to address the task of LI-STD for low-resourced languages, where there is limited amount of labelled training data. The proposed LI-STD system contains three stages; one label sequence matching stage and two template matching stages. A deep neural network (DNN) based BPC trained using 16 handcrafted, signal-based features is the backbone of the proposed LI-STD system. In LI-STD system, stage 1 performs a broad phoneme sequence matching, while stage 2 and 3 perform template matching on posteriorgram and feature sequence, respectively. Concatenation of multiple stages results in search space reduction for the later computationally intensive template matching stages. In order to adapt to a new/unseen language, the BPC gets retrained using selected broad phoneme labelled data of the language generated by itself. The effectiveness of the proposed system is demonstrated on a set of low-resourced Indian languages. (c) 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:7313 / 7323
页数:11
相关论文
共 31 条
  • [1] Speaker independent recognition of low-resourced multilingual Arabic spoken words through hybrid fusion
    Mehra, Sunakshi
    Ranga, Virender
    Agarwal, Ritu
    Susan, Seba
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024,
  • [2] Multilingual Neural Semantic Parsing for Low-Resourced Languages
    Xia, Menglin
    Monti, Emilio
    [J]. 10TH CONFERENCE ON LEXICAL AND COMPUTATIONAL SEMANTICS (SEM 2021), 2021, : 185 - 194
  • [3] Multilingual Query by Example Spoken Term Detection for Under-Resourced Languages
    Buzo, Andi
    Cucu, Horia
    Safta, Mihai
    Burileanu, Corneliu
    [J]. 2013 7TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN - COMPUTER DIALOGUE (SPED), 2013,
  • [4] A Language-Independent Approach to Identify the Named Entities in Under-Resourced Languages and Clustering Multilingual Documents
    Kumar, N. Kiran
    Santosh, G. S. K.
    Varma, Vasudeva
    [J]. MULTILINGUAL AND MULTIMODAL INFORMATION ACCESS EVALUATION, 2011, 6941 : 74 - 82
  • [5] Common latent representation learning for low-resourced spoken language identification
    Chen, Chen
    Bu, Yulin
    Chen, Yong
    Chen, Deyun
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 34515 - 34535
  • [6] Common latent representation learning for low-resourced spoken language identification
    Chen, Chen
    Bu, Yulin
    Chen, Yong
    Chen, Deyun
    [J]. Multimedia Tools and Applications, 2024, 83 (12) : 34515 - 34535
  • [7] Common latent representation learning for low-resourced spoken language identification
    Chen Chen
    Yulin Bu
    Yong Chen
    Deyun Chen
    [J]. Multimedia Tools and Applications, 2024, 83 : 34515 - 34535
  • [8] Enabling Spoken Dialogue Systems for Low-Resourced Languages-End-to-End Dialect Recognition for North Sami
    Trung Ngo Trong
    Jokinen, Kristiina
    Hautamaki, Ville
    [J]. 9TH INTERNATIONAL WORKSHOP ON SPOKEN DIALOGUE SYSTEM TECHNOLOGY, 2019, 579 : 221 - 235
  • [9] INTENT RECOGNITION AND UNSUPERVISED SLOT IDENTIFICATION FOR LOW-RESOURCED SPOKEN DIALOG SYSTEMS
    Gupta, Akshat
    Deng, Olivia
    Kushwaha, Akruti
    Mittal, Saloni
    Zeng, William
    Rallabandi, Sai Krishna
    Black, Alan W.
    [J]. 2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 853 - 860
  • [10] ASR DOMAIN ADAPTATION METHODS FOR LOW-RESOURCED LANGUAGES: APPLICATION TO ROMANIAN LANGUAGE
    Cucu, Horia
    Besacier, Laurent
    Burileanu, Corneliu
    Buzo, Andi
    [J]. 2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 1648 - 1652