Crowdsourced top-k queries by pairwise preference judgments with confidence and budget control

被引:0
|
作者
Yan Li
Hao Wang
Ngai Meng Kou
Leong Hou U
Zhiguo Gong
机构
[1] University of Macau,State Key Laboratory of Internet of Things for Smart City, Department of Computer and Information Science
[2] Inception Institute of Artificial Intelligence,undefined
[3] Cainiao Smart Logistics Network Limited,undefined
来源
The VLDB Journal | 2021年 / 30卷
关键词
Crowdsourcing; Top-; query; Preference judgments; Confidence; Budget control;
D O I
暂无
中图分类号
学科分类号
摘要
Crowdsourced query processing is an emerging technique that tackles computationally challenging problems by human intelligence. The basic idea is to decompose a computationally challenging problem into a set of human-friendly microtasks (e.g., pairwise comparisons) that are distributed to and answered by the crowd. The solution of the problem is then computed (e.g., by aggregation) based on the crowdsourced answers to the microtasks. In this work, we attempt to revisit the crowdsourced processing of the top-k queries, aiming at (1) securing the quality of crowdsourced comparisons by a certain confidence level and (2) minimizing the total monetary cost. To secure the quality of each paired comparison, we employ statistical tools to estimate the confidence interval from the collected judgments of the crowd, which is then used to guide the aggregated judgment. We propose novel frameworks, SPR and SPR+\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^+$$\end{document}, to address the crowdsourced top-k queries. Both SPR and SPR+\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^+$$\end{document} are budget-aware, confidence-aware, and effective in producing high-quality top-k results. SPR requires as input a budget for each paired comparison, whereas SPR+\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^+$$\end{document} requires only a total budget for the whole top-k task. Extensive experiments, conducted on four real datasets, demonstrate that our proposed methods outperform the other existing top-k processing techniques by a visible difference.
引用
收藏
页码:189 / 213
页数:24
相关论文
共 50 条
  • [21] Top-k queries on temporal data
    Feifei Li
    Ke Yi
    Wangchao Le
    The VLDB Journal, 2010, 19 : 715 - 733
  • [22] Continuous Top-k Dominating Queries
    Kontaki, Maria
    Papadopoulos, Apostolos N.
    Manolopoulos, Yannis
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (05) : 840 - 853
  • [23] Top-k Sequenced Route Queries
    Ohsawa, Yutaka
    Htoo, Htoo
    2017 18TH IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (IEEE MDM 2017), 2017, : 320 - 323
  • [24] Probabilistic Reverse Top-k Queries
    Jin, Cheqing
    Zhang, Rong
    Kang, Qiangqiang
    Zhang, Zhao
    Zhou, Aoying
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, PT I, 2014, 8421 : 406 - 419
  • [25] Top-k Dominating Queries: a Survey
    Tiakas, Eleftherios
    Papadopoulos, Apostolos N.
    Manolopoulos, Yannis
    2015 12TH IEEE INTERNATIONAL CONFERENCE ON PROGRAMMING AND SYSTEMS (ISPS), 2015,
  • [26] Geometric Approaches for Top-k Queries
    Mouratidis, Kyriakos
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (12): : 1985 - 1987
  • [27] Approximate distributed top-k queries
    Patt-Shamir, Boaz
    Shafrir, Allon
    DISTRIBUTED COMPUTING, 2008, 21 (01) : 1 - 22
  • [28] Efficient Techniques for Crowdsourced Top-k Lists
    de Alfaro, Luca
    Polychronopoulos, Vassilis
    Polyzotis, Neoklis
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4801 - 4805
  • [29] Crowdsourced Top-k Algorithms: An Experimental Evaluation
    Zhang, Xiaohang
    Li, Guoliang
    Feng, Jianhua
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (08): : 612 - 623
  • [30] A New Approach to Process Top-k Spatial Preference Queries in a Directed Road Network
    Attique, Muhammad
    Qamar, Rizwan
    Cho, Hyung-Ju
    Chung, Tae-Sun
    PROCEEDINGS OF THE THIRD ACM SIGSPATIAL INTERNATIONAL WORKSHOP ON MOBILE GEOGRAPHIC INFORMATION SYSTEMS (MOBIGIS), 2014, : 34 - 42