INTENT RECOGNITION AND UNSUPERVISED SLOT IDENTIFICATION FOR LOW-RESOURCED SPOKEN DIALOG SYSTEMS

Cited by: 1
Authors
Gupta, Akshat [1]
Deng, Olivia [1]
Kushwaha, Akruti [1]
Mittal, Saloni [1]
Zeng, William [1]
Rallabandi, Sai Krishna [1]
Black, Alan W. [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Keywords
Intent Recognition; Spoken Language Understanding; Transformers; Low-resourced; Multilingual
DOI
10.1109/ASRU51503.2021.9688264
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Intent recognition and slot identification are crucial components of spoken language understanding (SLU) systems. In this paper, we present a novel approach to both tasks in the context of low-resourced and unwritten languages. We use an acoustic-based SLU system that converts speech to its phonetic transcription using a universal phone recognition system. We build a word-free natural language understanding module that performs intent recognition and slot identification from these phonetic transcriptions. Our proposed SLU system performs competitively in resource-rich scenarios and significantly outperforms existing approaches as the amount of available data decreases. We train both recurrent and transformer-based neural networks and test our system on five natural speech datasets in five different languages. We observe more than 10% improvement for intent classification in Tamil and more than 5% improvement for intent classification in Sinhala. Additionally, we present a novel approach to unsupervised slot identification using normalized attention scores. This approach can be used for unsupervised slot labelling, for data augmentation, and to generate data for a new slot in a one-shot way from only one speech recording.
Pages: 853-860
Number of pages: 8