Crowdsourced Top-k Algorithms: An Experimental Evaluation

Cited by: 31
Authors
Zhang, Xiaohang [1]
Li, Guoliang [1]
Feng, Jianhua [1]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci, Tsinghua Natl Lab Informat Sci & Technol TNList, Beijing, Peoples R China
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2016 / Vol. 9 / Issue 08
Keywords
DOI
10.14778/2921558.2921559
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Crowdsourced top-k computation has attracted significant attention recently, thanks to emerging crowdsourcing platforms such as Amazon Mechanical Turk and CrowdFlower. Crowdsourced top-k algorithms ask the crowd to compare objects and infer the top-k objects from the crowdsourced comparison results. The crowd may return incorrect answers, and traditional top-k algorithms cannot tolerate such errors. To address this problem, the database and machine-learning communities have independently studied the crowdsourced top-k problem. The database community has proposed heuristic-based solutions, while the machine-learning community has proposed learning-based methods (e.g., maximum likelihood estimation). However, these two types of techniques have not been compared systematically under the same experimental framework, so it is difficult for a practitioner to decide which algorithm to adopt. Furthermore, the experimental evaluations in existing studies have several weaknesses: some methods assume the crowd returns high-quality results, and some algorithms have been tested only in simulated experiments. To address these limitations, in this paper we present a comprehensive comparison of crowdsourced top-k algorithms. Using various synthetic and real datasets, we evaluate each algorithm in terms of result quality and efficiency on real crowdsourcing platforms. We reveal the characteristics of the different techniques and provide guidelines on selecting appropriate algorithms for various scenarios.
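To make the problem setting in the abstract concrete, the following is a minimal sketch of one simple heuristic: collect several noisy votes per pair of objects, take the majority answer for each pair, and rank objects by the number of pairs they win. It is not one of the algorithms evaluated in the paper. The function names, the fixed worker accuracy, and the simulated crowd are all assumptions made for illustration.

```python
import random
from collections import defaultdict
from itertools import combinations


def simulate_vote(a, b, true_score, accuracy, rng):
    """Hypothetical worker: picks the truly better object with probability `accuracy`."""
    better, worse = (a, b) if true_score[a] >= true_score[b] else (b, a)
    return better if rng.random() < accuracy else worse


def top_k_by_wins(objects, true_score, k, votes_per_pair=5, accuracy=0.8, seed=0):
    """Rank objects by the number of pairwise majority votes they win; return the first k.

    `true_score` is used only to simulate the crowd; a real deployment
    would replace `simulate_vote` with answers from a crowdsourcing platform.
    """
    rng = random.Random(seed)
    wins = defaultdict(int)
    for a, b in combinations(objects, 2):
        tally = defaultdict(int)
        for _ in range(votes_per_pair):
            tally[simulate_vote(a, b, true_score, accuracy, rng)] += 1
        # Majority vote per pair; with an even number of votes, a tie goes to `b`.
        winner = a if tally[a] > tally[b] else b
        wins[winner] += 1
    # sorted() is stable, so objects with equal win counts keep their input order.
    return sorted(objects, key=lambda o: wins[o], reverse=True)[:k]


if __name__ == "__main__":
    scores = {"a": 1, "b": 5, "c": 3, "d": 4, "e": 2}
    print(top_k_by_wins(list(scores), scores, k=2, accuracy=0.8))
```

This sketch compares every pair, which costs a number of crowd tasks quadratic in the number of objects; lowering `accuracy` or `votes_per_pair` shows how worker errors can change the inferred top-k.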
Pages: 612-623
Page count: 12
Related papers
50 in total
  • [1] An Experimental Evaluation of Aggregation Algorithms for Processing Top-K Queries
    Zhu, Liang
    Ma, Qin
    Meng, Weiyi
    Yang, Mingqian
    Yuan, Fang
    [J]. CIT/IUCC/DASC/PICOM 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - UBIQUITOUS COMPUTING AND COMMUNICATIONS - DEPENDABLE, AUTONOMIC AND SECURE COMPUTING - PERVASIVE INTELLIGENCE AND COMPUTING, 2015, : 326 - 333
  • [2] Efficient Techniques for Crowdsourced Top-k Lists
    de Alfaro, Luca
    Polychronopoulos, Vassilis
    Polyzotis, Neoklis
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4801 - 4805
  • [3] Supervised Evaluation of Top-k Itemset Mining Algorithms
    Lucchese, Claudio
    Orlando, Salvatore
    Perego, Raffaele
    [J]. BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, 2015, 9263 : 82 - 94
  • [4] Evidential Top-k Queries Evaluation: Algorithms and Experiments
    Bousnina, Fatma Ezzahra
    Chebbah, Mouna
    Tobji, Mohamed Anis Bach
    Hadjali, Allel
    Ben Yaghlane, Boutheina
    [J]. INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: THEORY AND FOUNDATIONS, IPMU 2018, PT I, 2018, 853 : 407 - 417
  • [5] Top-k Algorithms and Applications
    Das, Gautam
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2009, 5463 : 789 - 792
  • [6] Parameterized top-K algorithms
    Chen, Jianer
    Kanj, Iyad A.
    Meng, Jie
    Xia, Ge
    Zhang, Fenghui
    [J]. THEORETICAL COMPUTER SCIENCE, 2013, 470 : 105 - 119
  • [7] A Toolkit for Managing Multiple Crowdsourced Top-K Queries
    Shan, Caihua
    Hou, Leong U.
    Mamoulis, Nikos
    Cheng, Reynold
    [J]. CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 3453 - 3456
  • [8] Group Formation Based on Crowdsourced Top-k Recommendation
    Gao, Yunpeng
    Cai, Wei
    Liang, Kuiyang
    [J]. WEB AND BIG DATA, 2017, 10612 : 204 - 213
  • [9] A Rating-Ranking Method for Crowdsourced Top-k Computation
    Li, Kaiyu
    Zhang, Xiaohang
    Li, Guoliang
    [J]. SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 975 - 990
  • [10] Crowdsourced Top-k Queries by Confidence-Aware Pairwise Judgments
    Kou, Ngai Meng
    Li, Yan
    Wang, Hao
    Hou, Leong U.
    Gong, Zhiguo
    [J]. SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 1415 - 1430