QA-Driven Zero-shot Slot Filling with Weak Supervision Pretraining

被引:0
|
作者
Du, Xinya [1 ,2 ]
He, Luheng [2 ]
Li, Qi [3 ]
Yu, Dian [2 ,4 ]
Pasupat, Panupong [2 ]
Zhang, Yuan [2 ]
机构
[1] Cornell Univ, Ithaca, NY 14853 USA
[2] Google Res, Mountain View, CA 94043 USA
[3] Google Assistant, Mountain View, CA USA
[4] Univ Calif Davis, Davis, CA 95616 USA
来源
ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2 | 2021年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Slot-filling is an essential component for building task-oriented dialog systems. In this work, we focus on the zero-shot slot-filling problem, where the model needs to predict slots and their values, given utterances from new domains without training on the target domain. Prior methods directly encode slot descriptions to generalize to unseen slot types. However, raw slot descriptions are often ambiguous and do not encode enough semantic information, limiting the models' zero-shot capability. To address this problem, we introduce QA-driven slot filling (QASF), which extracts slot-filler spans from utterances with a span-based QA model. We use a linguistically motivated questioning strategy to turn descriptions into questions, allowing the model to generalize to unseen slot types. Moreover, our QASF model can benefit from weak supervision signals from QA pairs synthetically generated from unlabeled conversations. Our full system substantially outperforms baselines by over 5% on the SNIPS benchmark.
引用
收藏
页码:654 / 664
页数:11
相关论文
共 50 条
  • [31] Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers
    Gong, Linyuan
    Xiong, Chenyan
    Liu, Xiaodong
    Bajaj, Payal
    Xie, Yiqing
    Cheung, Alvin
    Gao, Jianfeng
    Song, Xia
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 12933 - 12950
  • [32] Normalization Driven Zero-shot Multi-Speaker Speech Synthesis
    Kumar, Neeraj
    Goel, Srishti
    Narang, Ankur
    Lall, Brejesh
    INTERSPEECH 2021, 2021, : 1354 - 1358
  • [33] Combining Cross-lingual and Cross-task Supervision for Zero-Shot Learning
    Pikuliak, Matus
    Simko, Marian
    TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 162 - 170
  • [34] Zero-Shot Cross-Modal Retrieval for Remote Sensing Images With Minimal Supervision
    Chaudhuri, Ushasi
    Bose, Rupak
    Banerjee, Biplab
    Bhattacharya, Avik
    Datcu, Mihai
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [35] Deep supervision network with contrastive learning for zero-shot sketch-based retrieval
    Shu, Zhenqiu
    Zhuo, Guangyao
    Yu, Jun
    Yu, Zhengtao
    APPLIED SOFT COMPUTING, 2024, 167
  • [36] An Empirical Investigation of Word Alignment Supervision for Zero-Shot Multilingual Neural Machine Translation
    Raganato, Alessandro
    Vazquez, Raul
    Creutz, Mathias
    Tiedemann, Jorg
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8449 - 8456
  • [37] Task-Driven Modular Networks for Zero-Shot Compositional Learning
    Purushwalkam, Senthil
    Nickel, Maximilian
    Gupta, Abhinav
    Ranzato, Marc'Aurelio
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3592 - 3601
  • [38] Leveraging Slot Descriptions for Zero-Shot Cross-Domain Dialogue State Tracking
    Lin, Zhaojiang
    Liu, Bing
    Moon, Seungwhan
    Crook, Paul
    Zhou, Zhenpeng
    Wang, Zhiguang
    Yu, Zhou
    Madotto, Andrea
    Cho, Eunjoon
    Subba, Rajen
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5640 - 5648
  • [39] Analyze, Generate and Refine: Query Expansion with LLMs for Zero-Shot Open-Domain QA
    Chen, Xinran
    Chen, Xuanang
    He, Ben
    Wen, Tengfei
    Sun, Le
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 11908 - 11922
  • [40] u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality
    Hsu, Wei-Ning
    Shi, Bowen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,