Addressing Resource and Privacy Constraints in Semantic Parsing Through Data Augmentation

被引:0
|
作者
Yang, Kevin [1 ]
Deng, Olivia [2 ]
Chen, Charles [2 ]
Shin, Richard [2 ]
Roy, Subhro [2 ]
Van Durme, Benjamin [2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Microsoft Semant Machines, Redmond, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a novel setup for low-resource task-oriented semantic parsing which incorporates several constraints that may arise in real-world scenarios: (1) lack of similar datasets/models from a related domain, (2) inability to sample useful logical forms directly from a grammar, and (3) privacy requirements for unlabeled natural utterances. Our goal is to improve a low-resource semantic parser using utterances collected through user interactions. In this highly challenging but realistic setting, we investigate data augmentation approaches involving generating a set of structured canonical utterances corresponding to logical forms, before simulating corresponding natural language and filtering the resulting pairs. We find that such approaches are effective despite our restrictive setup: in a low-resource setting on the complex SMCalFlow calendaring dataset (Andreas et al., 2020), we observe 33% relative improvement over a non-data-augmented baseline in top-1 match.
引用
收藏
页码:3685 / 3695
页数:11
相关论文
共 50 条
  • [41] Implicit Semantic Data Augmentation for Hand Pose Estimation
    Seo, Kyeongeun
    Cho, Hyeonjoong
    Choi, Daewoong
    Park, Ju-Derk
    [J]. IEEE ACCESS, 2022, 10 : 84680 - 84688
  • [42] Data augmentation for sentiment classification with semantic preservation and diversity
    Chao, Guoqing
    Liu, Jingyao
    Wang, Mingyu
    Chu, Dianhui
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [43] THDA: Treasure Hunt Data Augmentation for Semantic Navigation
    Maksymets, Oleksandr
    Cartillier, Vincent
    Gokaslan, Aaron
    Wijmans, Erik
    Galuba, Wojciech
    Lee, Stefan
    Batra, Dhruv
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15354 - 15363
  • [44] AMUSE: Multilingual Semantic Parsing for Question Answering over Linked Data
    Hakimov, Sherzod
    Jebbara, Soufian
    Cimiano, Philipp
    [J]. SEMANTIC WEB - ISWC 2017, PT I, 2017, 10587 : 329 - 346
  • [45] Mining Twitter Data with Resource Constraints
    Valkanas, George
    Katakis, Ioannis
    Gunopulos, Dimitrios
    Stefanidis, Antony
    [J]. 2014 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2014, : 157 - 164
  • [46] ENFORCEMENT OF INTEGRITY CONSTRAINTS IN A SEMANTIC DATA MODEL
    SUDKAMP, N
    KANDZIA, P
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1989, 385 : 313 - 328
  • [47] Data Utilization Versus Privacy Protection in Semantic Communications
    Zhao, Lindong
    Wu, Dan
    Zhou, Liang
    [J]. IEEE WIRELESS COMMUNICATIONS, 2023, 30 (03) : 44 - 50
  • [48] Semantic Disclosure Control: semantics meets data privacy
    Batet, Montserrat
    Sanchez, David
    [J]. ONLINE INFORMATION REVIEW, 2018, 42 (03) : 290 - 303
  • [49] Addressing the Privacy, Security, Risk, and Operations Aspects of the Data Ecosystem
    Prasad, Mathura
    [J]. ISACA Journal, 2024, 2 : 35 - 39
  • [50] Enhancing resource utilization and privacy in IoT data placement through fuzzy logic and PSO optimization
    Dhanushkodi, Kavitha
    Kumar, Raushan
    Mittal, Pratyush
    Das, Saumye Saran
    Suryavenu, Neelam Naga Saivenkata
    Venkataramani, Kiruthika
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (09): : 12603 - 12626