Cross-Task Crowdsourcing

被引：0

作者：

Mo, Kaixiang ^{[1
]}

Zhong, Erheng ^{[1
]}

Yang, Qiang ^{[1
,2
]}

机构：

[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China

[2] Huawei Noahs Ark Lab, Shatin, Hong Kong, Peoples R China

来源：

19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13) | 2013年

关键词：

Crowdsourcing; Transfer Learning;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Crowdsourcing is an effective method for collecting labeled data for various data mining tasks. It is critical to ensure the veracity of the produced data because responses collected from different users may be noisy and unreliable. Previous works solve this veracity problem by estimating both the user ability and question difficulty based on the knowledge in each task individually. In this case, each single task needs large amounts of data to provide accurate estimations. However, in practice, budgets provided by customers for a given target task may be limited, and hence each question can be presented to only a few users where each user can answer only a few questions. This data sparsity problem can cause previous approaches to perform poorly due to the overfitting problem on rare data and eventually damage the data veracity. Fortunately, in real-world applications, users can answer questions from multiple historical tasks. For example, one can annotate images as well as label the sentiment of a given title. In this paper, we employ transfer learning, which borrows knowledge from auxiliary historical tasks to improve the data veracity in a given target task. The motivation is that users have stable characteristics across different crowdsourcing tasks and thus data from different tasks can be exploited collectively to estimate users' abilities in the target task. We propose a hierarchical Bayesian model, TLC (Transfer Learning for Crowdsourcing), to implement this idea by considering the overlapping users as a bridge. In addition, to avoid possible negative impact, TLC introduces task-specific factors to model task differences. The experimental results show that TLC significantly improves the accuracy over several state-of-the-art non-transfer-learning approaches under very limited budget in various labeling tasks.

引用

页码：677 / 685

页数：9

共 50 条

[1] Cross-Task Generalization via Natural Language Crowdsourcing Instructions
Mishra, Swaroop
Khashabi, Daniel
Baral, Chitta
Hajishirzi, Hannaneh
[J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 3470 - 3487
[2] Cross-task strategic effects
Rastle, K
Kinoshita, S
Lupker, SJ
Coltheart, M
[J]. MEMORY & COGNITION, 2003, 31 (06) : 867 - 876
[3] Cross-task strategic effects
Kathleen Rastle
Sachiko Kinoshita
Stephen J. Lupker
Max Coltheart
[J]. Memory & Cognition, 2003, 31 : 867 - 876
[4] CROSS-TASK VALIDATION OF FUNCTIONAL MEASUREMENT
ANDERSON, NH
[J]. PERCEPTION & PSYCHOPHYSICS, 1972, 12 (05): : 389 - &
[5] The costs and benefits of cross-task priming
Florian Waszak
Bernhard Hommel
[J]. Memory & Cognition, 2007, 35 : 1175 - 1186
[6] Age Differences in Cross-Task Bleeding
Nicosia, Jessica
Balota, David
[J]. PSYCHOLOGY AND AGING, 2020, 35 (06) : 881 - 893
[7] The costs and benefits of cross-task priming
Waszak, Florian
Hommel, Bernhard
[J]. MEMORY & COGNITION, 2007, 35 (05) : 1175 - 1186
[8] CROSS-TASK FACILITATION IN SEMANTIC MEMORY
MACLEOD, CM
VOUMVAKIS, S
[J]. BULLETIN OF THE PSYCHONOMIC SOCIETY, 1980, 16 (03) : 153 - 153
[9] CROSS-TASK CROSS-TALK IN MEMORY AND PERCEPTION
DUTTA, A
SCHWEICKERT, R
CHOI, S
PROCTOR, RW
[J]. ACTA PSYCHOLOGICA, 1995, 90 (1-3) : 49 - 62
[10] Mental Rotation: Cross-Task Training and Generalization
Stransky, Debi
Wilcox, Laurie M.
Dubrowski, Adam
[J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-APPLIED, 2010, 16 (04) : 349 - 360

← 1 2 3 4 5 →