Crowdsourcing Syntactically Diverse Paraphrases with Diversity-Aware Prompts and Workflows

被引:1
|
作者
Ramirez, Jorge [1 ]
Baez, Marcos [1 ]
Berro, Auday [1 ]
Benatallah, Boualem [2 ]
Casati, Fabio [3 ]
机构
[1] LIRIS Univ Claude Bernard Lyon 1, Villeurbanne, France
[2] Univ New South Wales, Kensington, NSW, Australia
[3] ServiceNow, Santa Clara, CA USA
关键词
Crowdsourcing; Paraphrasing; Diversity; Task-oriented bots;
D O I
10.1007/978-3-031-07472-1_15
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Task-oriented bots (or simply bots) enable humans to perform tasks in natural language. For example, to book a restaurant or check the weather. Crowdsourcing has become a prominent approach to build datasets for training and evaluating task-oriented bots, where the crowd grows an initial seed of utterances through paraphrasing, i.e., reformulating a given seed into semantically equivalent sentences. In this context, the resulting diversity is a relevant dimension of high-quality datasets, as diverse paraphrases capture the many ways users may express an intent. Current techniques, however, are either based on the assumption that crowd-powered paraphrases are naturally diverse or focus only on lexical diversity. In this paper, we address an overlooked aspect of diversity and introduce an approach for guiding the crowdsourcing process towards paraphrases that are syntactically diverse. We introduce a workflow and novel prompts that are informed by syntax patterns to elicit paraphrases avoiding or incorporating desired syntax. Our empirical analysis indicates that our approach yields higher syntactic diversity, syntactic novelty and more uniform pattern distribution than state-of-the-art baselines, albeit incurring on higher task effort.
引用
收藏
页码:253 / 269
页数:17
相关论文
共 50 条
  • [31] Diversity-Aware Weighted Majority Vote Classifier for Imbalanced Data
    Goyal, Anil
    Khiari, Jihed
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [32] APIRec: deep knowledge and diversity-aware web API recommendation
    Song, Fanfei
    Wang, Bin
    Xie, Xinqiang
    Pu, Rong
    Zhang, Qingbo
    Wang, Wei
    SERVICE ORIENTED COMPUTING AND APPLICATIONS, 2024,
  • [33] A Theoretical and Empirical Study of Diversity-Aware Mutation Adequacy Criterion
    Shin, Donghwan
    Yoo, Shin
    Bae, Doo-Hwan
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2018, 44 (10) : 914 - 931
  • [34] Slowfast Diversity-aware Prototype Learning for Egocentric Action Recognition
    Dai, Guangzhao
    Shu, Xiangbo
    Yan, Rui
    Huang, Peng
    Tang, Jinhui
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7549 - 7558
  • [35] Diversity-Aware k-median: Clustering with Fair Center Representation
    Thejaswi, Suhas
    Ordozgoiti, Bruno
    Gionis, Aristides
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 765 - 780
  • [36] A Location- and Diversity-Aware News Feed System for Mobile Users
    Xu, Wenjian
    Chow, Chi-Yin
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2016, 9 (06) : 846 - 861
  • [37] Diversity-Aware Marine Predators Algorithm for Task Scheduling in Cloud Computing
    Chen, Dujing
    Zhang, Yanyan
    ENTROPY, 2023, 25 (02)
  • [38] Effects of Individual Traits on Diversity-Aware Music Recommender User Interfaces
    Jin, Yucheng
    Tintarev, Nava
    Verbert, Katrien
    PROCEEDINGS OF THE 26TH CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION (UMAP'18), 2018, : 291 - 299
  • [39] Diversity-aware classifier ensemble selection via f-score
    Visentini, Ingrid
    Snidaro, Lauro
    Foresti, Gian Luca
    INFORMATION FUSION, 2016, 28 : 24 - 43
  • [40] Diversity-Aware Mutation Adequacy Criterion for Improving Fault Detection Capability
    Shin, Donghwan
    Yoo, Shin
    Bae, Doo-Hwan
    2016 IEEE NINTH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS (ICSTW), 2016, : 122 - 131