Addressing Resource and Privacy Constraints in Semantic Parsing Through Data Augmentation

被引:0
|
作者
Yang, Kevin [1 ]
Deng, Olivia [2 ]
Chen, Charles [2 ]
Shin, Richard [2 ]
Roy, Subhro [2 ]
Van Durme, Benjamin [2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Microsoft Semant Machines, Redmond, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a novel setup for low-resource task-oriented semantic parsing which incorporates several constraints that may arise in real-world scenarios: (1) lack of similar datasets/models from a related domain, (2) inability to sample useful logical forms directly from a grammar, and (3) privacy requirements for unlabeled natural utterances. Our goal is to improve a low-resource semantic parser using utterances collected through user interactions. In this highly challenging but realistic setting, we investigate data augmentation approaches involving generating a set of structured canonical utterances corresponding to logical forms, before simulating corresponding natural language and filtering the resulting pairs. We find that such approaches are effective despite our restrictive setup: in a low-resource setting on the complex SMCalFlow calendaring dataset (Andreas et al., 2020), we observe 33% relative improvement over a non-data-augmented baseline in top-1 match.
引用
收藏
页码:3685 / 3695
页数:11
相关论文
共 50 条
  • [1] Text augmentation for semantic frame induction and parsing
    Anwar, Saba
    Shelmanov, Artem
    Arefyev, Nikolay
    Panchenko, Alexander
    Biemann, Chris
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2024, 58 (02) : 363 - 408
  • [2] Controllable Semantic Parsing via Retrieval Augmentation
    Pasupat, Panupong
    Zhang, Yuan
    Guu, Kelvin
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7683 - 7698
  • [3] Applying Semantic Parsing to Question Answering Over Linked Data: Addressing the Lexical Gap
    Hakimov, Sherzod
    Unger, Christina
    Walter, Sebastian
    Cimiano, Philipp
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2015, 2015, 9103 : 103 - 109
  • [4] A Linguistic Resource for Semantic Parsing of Motion Events
    Roberts, Kirk
    Gullapalli, Srikanth
    Bejan, Cosmin Adrian
    Harabagiu, Sanda
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 3293 - 3299
  • [5] Data Recombination for Neural Semantic Parsing
    Jia, Robin
    Liang, Percy
    [J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 12 - 22
  • [6] Learning to Synthesize Data for Semantic Parsing
    Wang, Bailin
    Yin, Wenpeng
    Lin, Xi Victoria
    Xiong, Caiming
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 2760 - 2766
  • [7] Constraints based Web Service Semantic Augmentation
    Hu, Xiaocao
    Feng, Zhiyong
    Chen, Shizhan
    [J]. 2014 IEEE 21ST INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2014), 2014, : 702 - 703
  • [8] Comparing Knowledge-Intensive and Data-Intensive Models for English Resource Semantic Parsing
    Cao, Junjie
    Lin, Zi
    Sun, Weiwei
    Wan, Xiaojun
    [J]. COMPUTATIONAL LINGUISTICS, 2021, 47 (01) : 43 - 68
  • [9] The Power of Prompt Tuning for Low-Resource Semantic Parsing
    Schucher, Nathan
    Reddy, Siva
    de Vries, Harm
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 148 - 156
  • [10] Low-Resource Compositional Semantic Parsing with Concept Pretraining
    Rongali, Subendhu
    Sridhar, Mukund
    Khan, Haidar
    Arkoudas, Konstantine
    Hamza, Wael
    McCallum, Andrew
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1410 - 1419