End-to-End Spoken Language Understanding: Bootstrapping in Low Resource Scenarios

Cited by: 20
Authors
Bhosale, Swapnil [1]
Sheikh, Imran [1]
Dumpala, Sri Harsha [1]
Kopparapu, Sunil Kumar [1]
Affiliations
[1] TCS Res & Innovat Mumbai, Mumbai, Maharashtra, India
Source
Keywords
SLU; intent classification; low resource
DOI
10.21437/Interspeech.2019-2366
Chinese Library Classification
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification codes
100104; 100213
Abstract
End-to-end Spoken Language Understanding (SLU) systems, which operate without speech-to-text conversion, are more promising in low-resource scenarios. They can be more effective when there is not enough labeled data to train reliable speech recognition and language understanding systems, or where running SLU on the edge is preferred over cloud-based services. In this paper, we present an approach for bootstrapping end-to-end SLU in low-resource scenarios. We show that incorporating layers extracted from pre-trained acoustic models, instead of using the typical Mel filter bank features, leads to better-performing SLU models. Moreover, the layers extracted from a model pre-trained on one language perform well even (a) for SLU tasks in a different language and (b) on utterances from speakers with speech disorders.
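The idea of reusing layers from a pre-trained acoustic model can be illustrated with a toy sketch. This is not the authors' architecture, which the abstract does not specify. Everything here is an assumption made for illustration: the function and class names are hypothetical, randomly initialised weights stand in for layers that would really be copied from a pre-trained acoustic model, and a single softmax layer stands in for the intent classifier. The sketch only shows the general pattern: Mel filter bank frames pass through frozen layers, the outputs are pooled over time, and only the small classifier on top is trained.

```python
import math
import random


def make_frozen_layers(in_dim, hidden_dim, n_layers, seed=0):
    """Hypothetical stand-in for layers copied from a pre-trained acoustic model.

    Real weights would be loaded from a trained model; random ones are used
    here only so the sketch runs on its own.
    """
    rng = random.Random(seed)
    layers, dim = [], in_dim
    for _ in range(n_layers):
        scale = 1.0 / math.sqrt(dim)
        layers.append([[rng.gauss(0.0, scale) for _ in range(dim)]
                       for _ in range(hidden_dim)])
        dim = hidden_dim
    return layers


def extract_features(frames, layers):
    """Run each filter bank frame through the frozen layers, then mean-pool over time."""
    outputs = []
    for frame in frames:
        h = frame
        for weights in layers:
            h = [math.tanh(sum(w * x for w, x in zip(row, h))) for row in weights]
        outputs.append(h)
    return [sum(column) / len(outputs) for column in zip(*outputs)]


def softmax(logits):
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [v / total for v in exps]


class IntentClassifier:
    """Softmax layer trained on top of the frozen features (the only trainable part)."""

    def __init__(self, in_dim, n_intents):
        self.w = [[0.0] * in_dim for _ in range(n_intents)]
        self.b = [0.0] * n_intents

    def predict_proba(self, x):
        logits = [sum(wi * xi for wi, xi in zip(row, x)) + b
                  for row, b in zip(self.w, self.b)]
        return softmax(logits)

    def train_step(self, x, label, lr=0.1):
        """One cross-entropy gradient step; the frozen layers are never updated."""
        probs = self.predict_proba(x)
        for k, p in enumerate(probs):
            grad = p - (1.0 if k == label else 0.0)
            self.w[k] = [wi - lr * grad * xi for wi, xi in zip(self.w[k], x)]
            self.b[k] -= lr * grad


if __name__ == "__main__":
    rng = random.Random(42)
    # Synthetic "utterance": 20 frames of 40 filter bank values (made-up data).
    utterance = [[rng.random() for _ in range(40)] for _ in range(20)]
    frozen = make_frozen_layers(in_dim=40, hidden_dim=16, n_layers=2)
    features = extract_features(utterance, frozen)
    classifier = IntentClassifier(in_dim=16, n_intents=3)
    classifier.train_step(features, label=2)
    print(classifier.predict_proba(features))
```

Because only the classifier's parameters are updated, the number of trainable weights stays small, which is the property that makes this pattern attractive when labeled SLU data is scarce.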
Pages: 1188-1192
Page count: 5