Capsule Networks for Low Resource Spoken Language Understanding

被引:21
|
作者
Renkens, Vincent [1 ]
Van Hamme, Hugo [1 ]
机构
[1] KULeuven, Dept Elect Engn, ESAT, Kasteelpk Arenberg 10,Bus 2441, B-3001 Leuven, Belgium
关键词
Spoken Language Understanding; Capsule Networks; Deep Learning; Low Resource;
D O I
10.21437/Interspeech.2018-1013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Designing a spoken language understanding system for command-and-control applications can be challenging because of a wide variety of domains and users or because of a lack of training data. In this paper we discuss a system that learns from scratch from user demonstrations. This method has the advantage that the same system can be used for many domains and users without modifications and that no training data is required prior to deployment. The user is required to train the system, so for a user friendly experience it is crucial to minimize the required amount of data. In this paper we investigate whether a capsule network can make efficient use of the limited amount of available training data. We compare the proposed model to an approach based on Non-negative Matrix Factorisation which is the state-of-the-art in this setting and another deep learning approach that was recently introduced for end-to-end spoken language understanding. We show that the proposed model outperforms the baseline models for three command-and-control applications: controlling a small robot, a vocally guided card game and a home automation task.
引用
收藏
页码:601 / 605
页数:5
相关论文
共 50 条
  • [41] Combining classifiers for spoken language understanding
    Karahan, M
    Hakkani-Tür, D
    Riccardi, G
    Tur, G
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 589 - 594
  • [42] Discriminative Models for Spoken Language Understanding
    Wang, Ye-Yi
    Acero, Alex
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2426 - 2429
  • [43] Compositional Generalization in Spoken Language Understanding
    Ray, Avik
    Shen, Yilin
    Jin, Hongxia
    INTERSPEECH 2023, 2023, : 750 - 754
  • [44] Understanding spoken language through TalkBank
    Brian MacWhinney
    Behavior Research Methods, 2019, 51 : 1919 - 1927
  • [45] PARSING COORDINATION FOR SPOKEN LANGUAGE UNDERSTANDING
    Agarwal, Sanchit
    Goel, Rahul
    Chung, Tagyoung
    Sethi, Abhishek
    Mandal, Arindam
    Matsoukas, Spyros
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 677 - 684
  • [46] Understanding spoken language through TalkBank
    MacWhinney, Brian
    BEHAVIOR RESEARCH METHODS, 2019, 51 (04) : 1919 - 1927
  • [47] APHASIC DIFFICULTIES UNDERSTANDING SPOKEN LANGUAGE
    SCHUELL, H
    NEUROLOGY, 1953, 3 (03) : 176 - 184
  • [48] Recent advances in spoken language understanding
    De Mori, Renato
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2007, 4629 : 14 - 14
  • [49] TechWare: Spoken language understanding resources
    Conversational Systems Research Center, Microsoft Research, Mountain View, CA, United States
    不详
    IEEE Signal Process Mag, 2013, 3 (187-189):
  • [50] Spoken language understanding for social robotics
    Romero-Gonzalez, Cristina
    Martinez-Gomez, Jesus
    Garcia-Varea, Ismael
    2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2020), 2020, : 152 - 157