Addressing Resource and Privacy Constraints in Semantic Parsing Through Data Augmentation

被引:0
|
作者
Yang, Kevin [1 ]
Deng, Olivia [2 ]
Chen, Charles [2 ]
Shin, Richard [2 ]
Roy, Subhro [2 ]
Van Durme, Benjamin [2 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] Microsoft Semant Machines, Redmond, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a novel setup for low-resource task-oriented semantic parsing which incorporates several constraints that may arise in real-world scenarios: (1) lack of similar datasets/models from a related domain, (2) inability to sample useful logical forms directly from a grammar, and (3) privacy requirements for unlabeled natural utterances. Our goal is to improve a low-resource semantic parser using utterances collected through user interactions. In this highly challenging but realistic setting, we investigate data augmentation approaches involving generating a set of structured canonical utterances corresponding to logical forms, before simulating corresponding natural language and filtering the resulting pairs. We find that such approaches are effective despite our restrictive setup: in a low-resource setting on the complex SMCalFlow calendaring dataset (Andreas et al., 2020), we observe 33% relative improvement over a non-data-augmented baseline in top-1 match.
引用
收藏
页码:3685 / 3695
页数:11
相关论文
共 50 条
  • [31] Your Data Is My Data: A Framework for Addressing Interdependent Privacy Infringements
    Kamleitner, Bernadette
    Mitchell, Vince
    [J]. JOURNAL OF PUBLIC POLICY & MARKETING, 2019, 38 (04) : 433 - 450
  • [32] Selling Data at an Auction under Privacy Constraints
    Zhang, Mengxiao
    Beltran, Fernando
    Liu, Jiamou
    [J]. CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 669 - 678
  • [33] Classification of credit scoring data with privacy constraints
    Stecking, Ralf
    Schebesch, Klaus B.
    [J]. INTELLIGENT DATA ANALYSIS, 2015, 19 : S3 - S18
  • [34] On Variable Constraints in Privacy Preserving Data Mining
    Aggarwal, Charu C.
    Yu, Philip S.
    [J]. PROCEEDINGS OF THE FIFTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2005, : 115 - 125
  • [35] Parsing of Data on Real Estate Objects from Network Resource
    Cherkesov, Vyacheslav
    Malikov, Vitaliy
    Golubev, Alexey
    Parygin, Danila
    Smykovskaya, Tatiana
    [J]. PROCEEDINGS OF THE IV INTERNATIONAL RESEARCH CONFERENCE INFORMATION TECHNOLOGIES IN SCIENCE, MANAGEMENT, SOCIAL SPHERE AND MEDICINE (ITSMSSM 2017), 2017, 72 : 385 - 388
  • [36] Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing
    Chen, Xilun
    Ghoshal, Asish
    Mehdad, Yashar
    Zettlemoyer, Luke
    Gupta, Sonal
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 5090 - 5100
  • [37] Addressing privacy concerns through the health insurance portability and accountability act privacy rule
    Hoffman, Sharona
    [J]. AMERICAN JOURNAL OF BIOETHICS, 2007, 7 (03): : 48 - 49
  • [38] Context-Dependent Semantic Parsing over Temporally Structured Data
    Chen, Charles
    Bunescu, Razvan
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3576 - 3585
  • [39] Privacy Through Data Recolouring
    D'Acquisto, Giuseppe
    Mazzoccoli, Alessandro
    Ciminelli, Fabio
    Naldi, Maurizio
    [J]. PRIVACY TECHNOLOGIES AND POLICY, APF 2020, 2020, 12121 : 61 - 72
  • [40] Object Insertion Based Data Augmentation for Semantic Segmentation
    Yuan Ren
    Zhao, Siyan
    Liu Bingbing
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022,