Encoding Syntactic Knowledge in Transformer Encoder for Intent Detection and Slot Filling

被引：0

作者：

Wang, Jixuan ^{[1
,2
,3
]}

Wei, Kai ^{[3
]}

Radfar, Martin ^{[3
]}

Zhang, Weiwei ^{[3
]}

Chung, Clement ^{[3
]}

机构：

[1] Univ Toronto, Toronto, ON, Canada

[2] Vector Inst, Toronto, ON, Canada

[3] Amazon Alexa, Pittsburgh, PA 15205 USA

来源：

THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2021年 / 35卷

关键词：

NEURAL-NETWORKS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a novel Transformer encoder-based architecture with syntactical knowledge encoded for intent detection and slot filling. Specifically, we encode syntactic knowledge into the Transformer encoder by jointly training it to predict syntactic parse ancestors and part-of-speech of each token via multi-task learning. Our model is based on self-attention and feed-forward layers and does not require external syntactic information to be available at inference time. Experiments show that on two benchmark datasets, our models with only two Transformer encoder layers achieve state-of-the-art results. Compared to the previously best performed model without pre-training, our models achieve absolute F1 score and accuracy improvement of 1.59% and 0.85% for slot filling and intent detection on the SNIPS dataset, respectively. Our models also achieve absolute F1 score and accuracy improvement of 0.1% and 0.34% for slot filling and intent detection on the ATIS dataset, respectively, over the previously best performed model. Furthermore, the visualization of the self-attention weights illustrates the benefits of incorporating syntactic information during training.

引用

页码：13943 / 13951

页数：9

共 50 条

[41] Joint intent detection and slot filling using weighted finite state transducer and BERT
Abro, Waheed Ahmed
Qi, Guilin
Aamir, Muhammad
Ali, Zafar
APPLIED INTELLIGENCE, 2022, 52 (15) : 17356 - 17370
[42] Joint Training Model of Intent Detection and Slot Filling for Multi Granularity Implicit Guidance
Li, Bin
Wang, Weihua
Bao, Feilong
2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 271 - 274
[43] A Novel Bi-directional Interrelated Model for Joint Intent Detection and Slot Filling
E, Haihong
Niu, Peiqing
Chen, Zhongfu
Song, Meina
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5467 - 5471
[44] CONVOLUTIONAL NEURAL NETWORK BASED TRIANGULAR CRF FOR JOINT INTENT DETECTION AND SLOT FILLING
Xu, Puyang
Sarikaya, Ruhi
2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 78 - 83
[45] Joint intent detection and slot filling using weighted finite state transducer and BERT
Waheed Ahmed Abro
Guilin Qi
Muhammad Aamir
Zafar Ali
Applied Intelligence, 2022, 52 : 17356 - 17370
[46] Joint agricultural intent detection and slot filling based on enhanced heterogeneous attention mechanism
Hao, Xia
Wang, Lu
Zhu, Hongmei
Guo, Xuchao
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 207
[47] Intent Recognition and Slot Filling Joint Model for Question-Answering of Hazardous Chemical Knowledge
Li, Na
Liu, Enxiao
Zhao, Zhibin
Li, Fengyun
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4972 - 4976
[48] Intent Classification and Slot Filling for Turkish Dialogue Systems
Sahinuc, Furkan
Yucesoy, Veysel
Koc, Aykut
2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
[49] A GRAPH ATTENTION INTERACTIVE REFINE FRAMEWORK WITH CONTEXTUAL REGULARIZATION FOR JOINTING INTENT DETECTION AND SLOT FILLING
Zhu, Zhanbiao
Huang, Peijie
Huang, Haojing
Liu, Shudong
Lao, Leyi
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7617 - 7621
[50] Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling
Liu, Bing
Lane, Ian
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 685 - 689

← 1 2 3 4 5 →