Cross-Task Crowdsourcing

被引:0
|
作者
Mo, Kaixiang [1 ]
Zhong, Erheng [1 ]
Yang, Qiang [1 ,2 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Huawei Noahs Ark Lab, Shatin, Hong Kong, Peoples R China
关键词
Crowdsourcing; Transfer Learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowdsourcing is an effective method for collecting labeled data for various data mining tasks. It is critical to ensure the veracity of the produced data because responses collected from different users may be noisy and unreliable. Previous works solve this veracity problem by estimating both the user ability and question difficulty based on the knowledge in each task individually. In this case, each single task needs large amounts of data to provide accurate estimations. However, in practice, budgets provided by customers for a given target task may be limited, and hence each question can be presented to only a few users where each user can answer only a few questions. This data sparsity problem can cause previous approaches to perform poorly due to the overfitting problem on rare data and eventually damage the data veracity. Fortunately, in real-world applications, users can answer questions from multiple historical tasks. For example, one can annotate images as well as label the sentiment of a given title. In this paper, we employ transfer learning, which borrows knowledge from auxiliary historical tasks to improve the data veracity in a given target task. The motivation is that users have stable characteristics across different crowdsourcing tasks and thus data from different tasks can be exploited collectively to estimate users' abilities in the target task. We propose a hierarchical Bayesian model, TLC (Transfer Learning for Crowdsourcing), to implement this idea by considering the overlapping users as a bridge. In addition, to avoid possible negative impact, TLC introduces task-specific factors to model task differences. The experimental results show that TLC significantly improves the accuracy over several state-of-the-art non-transfer-learning approaches under very limited budget in various labeling tasks.
引用
收藏
页码:677 / 685
页数:9
相关论文
共 50 条
  • [1] Cross-Task Generalization via Natural Language Crowdsourcing Instructions
    Mishra, Swaroop
    Khashabi, Daniel
    Baral, Chitta
    Hajishirzi, Hannaneh
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 3470 - 3487
  • [2] Cross-task strategic effects
    Rastle, K
    Kinoshita, S
    Lupker, SJ
    Coltheart, M
    [J]. MEMORY & COGNITION, 2003, 31 (06) : 867 - 876
  • [3] Cross-task strategic effects
    Kathleen Rastle
    Sachiko Kinoshita
    Stephen J. Lupker
    Max Coltheart
    [J]. Memory & Cognition, 2003, 31 : 867 - 876
  • [4] CROSS-TASK VALIDATION OF FUNCTIONAL MEASUREMENT
    ANDERSON, NH
    [J]. PERCEPTION & PSYCHOPHYSICS, 1972, 12 (05): : 389 - &
  • [5] The costs and benefits of cross-task priming
    Florian Waszak
    Bernhard Hommel
    [J]. Memory & Cognition, 2007, 35 : 1175 - 1186
  • [6] Age Differences in Cross-Task Bleeding
    Nicosia, Jessica
    Balota, David
    [J]. PSYCHOLOGY AND AGING, 2020, 35 (06) : 881 - 893
  • [7] The costs and benefits of cross-task priming
    Waszak, Florian
    Hommel, Bernhard
    [J]. MEMORY & COGNITION, 2007, 35 (05) : 1175 - 1186
  • [8] CROSS-TASK FACILITATION IN SEMANTIC MEMORY
    MACLEOD, CM
    VOUMVAKIS, S
    [J]. BULLETIN OF THE PSYCHONOMIC SOCIETY, 1980, 16 (03) : 153 - 153
  • [9] CROSS-TASK CROSS-TALK IN MEMORY AND PERCEPTION
    DUTTA, A
    SCHWEICKERT, R
    CHOI, S
    PROCTOR, RW
    [J]. ACTA PSYCHOLOGICA, 1995, 90 (1-3) : 49 - 62
  • [10] Mental Rotation: Cross-Task Training and Generalization
    Stransky, Debi
    Wilcox, Laurie M.
    Dubrowski, Adam
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-APPLIED, 2010, 16 (04) : 349 - 360