INTENT RECOGNITION AND UNSUPERVISED SLOT IDENTIFICATION FOR LOW-RESOURCED SPOKEN DIALOG SYSTEMS

Cited by: 1

Authors:

Gupta, Akshat [1]

Deng, Olivia [1]

Kushwaha, Akruti [1]

Mittal, Saloni [1]

Zeng, William [1]

Rallabandi, Sai Krishna [1]

Black, Alan W. [1]

Affiliations:

[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

Source:

2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU) | 2021

Keywords:

Intent Recognition; Spoken Language Understanding; Transformers; low-resourced; Multilingual;

DOI:

10.1109/ASRU51503.2021.9688264

Chinese Library Classification:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Intent Recognition and Slot Identification are crucial components in spoken language understanding (SLU) systems. In this paper, we present a novel approach towards both these tasks in the context of low-resourced and unwritten languages. We use an acoustic-based SLU system that converts speech to its phonetic transcription using a universal phone recognition system. We build a word-free natural language understanding module that performs intent recognition and slot identification from these phonetic transcriptions. Our proposed SLU system performs competitively for resource-rich scenarios and significantly outperforms existing approaches as the amount of available data reduces. We train both recurrent and transformer-based neural networks and test our system on five natural speech datasets in five different languages. We observe more than 10% improvement for intent classification in Tamil and more than 5% improvement for intent classification in Sinhala. Additionally, we present a novel approach towards unsupervised slot identification using normalized attention scores. This approach can be used for unsupervised slot labelling, data augmentation and to generate data for a new slot in a one-shot way with only one speech recording.
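
Illustration (not from the paper): the abstract does not specify how attention scores are normalized or turned into slots. The sketch below is one plausible reading, under the assumptions that each phone token in a transcription has a single attention score, that scores are min-max normalized, and that contiguous runs of phones above a threshold are treated as slot candidates. The function names, the threshold value, and the toy scores are all hypothetical.

```python
from typing import List


def normalize(scores: List[float]) -> List[float]:
    """Min-max scale raw attention scores to the range [0, 1]."""
    if not scores:
        return []
    lo, hi = min(scores), max(scores)
    if hi == lo:
        # All scores equal: no token stands out, so return zeros.
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def candidate_spans(phones: List[str], scores: List[float],
                    threshold: float = 0.5) -> List[List[str]]:
    """Return maximal contiguous runs of phones whose normalized
    attention score is at least `threshold` (an assumed cutoff)."""
    spans: List[List[str]] = []
    current: List[str] = []
    for phone, score in zip(phones, normalize(scores)):
        if score >= threshold:
            current.append(phone)
        elif current:
            spans.append(current)
            current = []
    if current:
        spans.append(current)
    return spans


# Toy phone sequence with made-up attention scores.
phones = ["t", "er", "n", "aa", "n", "l", "ay", "t"]
scores = [0.1, 0.1, 0.1, 0.2, 0.2, 0.9, 0.8, 0.7]
spans = candidate_spans(phones, scores)
print(spans)
```

In this toy input, the highly attended run of phones at the end of the utterance is returned as a single candidate span; a real system would take the scores from the trained intent classifier rather than from hand-written values.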


Pages: 853-860

Page count: 8
