QA-Driven Zero-shot Slot Filling with Weak Supervision Pretraining

Cited by: 0

Authors:

Du, Xinya [1, 2]

He, Luheng [2]

Li, Qi [3]

Yu, Dian [2, 4]

Pasupat, Panupong [2]

Zhang, Yuan [2]

Affiliations:

[1] Cornell Univ, Ithaca, NY 14853 USA

[2] Google Res, Mountain View, CA 94043 USA

[3] Google Assistant, Mountain View, CA USA

[4] Univ Calif Davis, Davis, CA 95616 USA

Source:

ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2 | 2021

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Slot-filling is an essential component for building task-oriented dialog systems. In this work, we focus on the zero-shot slot-filling problem, where the model needs to predict slots and their values, given utterances from new domains without training on the target domain. Prior methods directly encode slot descriptions to generalize to unseen slot types. However, raw slot descriptions are often ambiguous and do not encode enough semantic information, limiting the models' zero-shot capability. To address this problem, we introduce QA-driven slot filling (QASF), which extracts slot-filler spans from utterances with a span-based QA model. We use a linguistically motivated questioning strategy to turn descriptions into questions, allowing the model to generalize to unseen slot types. Moreover, our QASF model can benefit from weak supervision signals from QA pairs synthetically generated from unlabeled conversations. Our full system substantially outperforms baselines by over 5% on the SNIPS benchmark.


Pages: 654-664

Page count: 11
