Crowdsourced top-k queries by pairwise preference judgments with confidence and budget control

Cited by: 0
Authors
Yan Li
Hao Wang
Ngai Meng Kou
Leong Hou U
Zhiguo Gong
Affiliations
[1] University of Macau, State Key Laboratory of Internet of Things for Smart City, Department of Computer and Information Science
[2] Inception Institute of Artificial Intelligence
[3] Cainiao Smart Logistics Network Limited
Source
The VLDB Journal | 2021 / Volume 30
Keywords
Crowdsourcing; Top-k query; Preference judgments; Confidence; Budget control
DOI
Not available
Abstract
Crowdsourced query processing is an emerging technique that tackles computationally challenging problems with human intelligence. The basic idea is to decompose a computationally challenging problem into a set of human-friendly microtasks (e.g., pairwise comparisons) that are distributed to and answered by the crowd. The solution to the problem is then computed (e.g., by aggregation) from the crowdsourced answers to the microtasks. In this work, we revisit the crowdsourced processing of top-k queries, aiming at (1) securing the quality of crowdsourced comparisons at a given confidence level and (2) minimizing the total monetary cost. To secure the quality of each paired comparison, we employ statistical tools to estimate a confidence interval from the collected judgments of the crowd, which is then used to guide the aggregated judgment. We propose two novel frameworks, SPR and SPR+, to address crowdsourced top-k queries. Both SPR and SPR+ are budget-aware, confidence-aware, and effective in producing high-quality top-k results. SPR requires as input a budget for each paired comparison, whereas SPR+ requires only a total budget for the whole top-k task. Extensive experiments on four real datasets demonstrate that the proposed methods outperform existing top-k processing techniques by a clear margin.
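The abstract's idea of a confidence-guided, budget-capped paired comparison can be illustrated with a minimal sketch. This is not the SPR or SPR+ algorithm: the abstract does not say which statistical tools the authors use, so the Wilson score interval below is a stand-in, and the names `wilson_interval`, `compare_pair`, `ask`, and `budget` are hypothetical. The sketch asks one worker at a time whether item A beats item B, and stops as soon as the interval for the "A wins" proportion excludes 0.5 or the per-pair budget is spent. Checking the interval after every answer inflates the nominal error rate, so the 95% level here is only approximate.

```python
import math
import random
from typing import Callable, Optional, Tuple


def wilson_interval(wins: int, n: int, z: float = 1.96) -> Tuple[float, float]:
    """Wilson score interval for the proportion wins/n (z = 1.96 is roughly 95%)."""
    if n == 0:
        return 0.0, 1.0  # no evidence yet: the proportion could be anything
    p = wins / n
    denom = 1.0 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


def compare_pair(
    ask: Callable[[], bool], budget: int, z: float = 1.96
) -> Tuple[Optional[bool], int]:
    """Collect judgments until the interval excludes 0.5 or the budget runs out.

    `ask` is an assumed stand-in for posting one microtask; it returns True
    when a worker says A beats B. Returns (verdict, judgments_used), where
    verdict is True (A wins), False (B wins), or None (undecided).
    """
    wins = 0
    for used in range(1, budget + 1):
        if ask():
            wins += 1
        low, high = wilson_interval(wins, used, z)
        if low > 0.5:
            return True, used
        if high < 0.5:
            return False, used
    return None, budget


if __name__ == "__main__":
    # Simulated crowd (an assumption): each worker is right 70% of the time
    # and A is truly better than B.
    rng = random.Random(0)
    verdict, used = compare_pair(lambda: rng.random() < 0.7, budget=30)
    print(verdict, used)
```

A per-pair `budget` corresponds loosely to the input the abstract attributes to SPR. Allocating a single total budget across all pairs, as described for SPR+, would need an additional scheduling policy that this sketch does not attempt.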
Pages: 189 - 213
Number of pages: 24
Related papers
50 in total
  • [31] Optimization of top-k spatial preference queries' execution process based on similarity of preferences
    Gorawski, Marcin
    Dowlaszewicz, Kamil
    Frontiers in Artificial Intelligence and Applications, 2008, 181 (01) : 140 - 151
  • [32] Top-k Differential Queries in Graph Databases
    Vasilyeva, Elena
    Thiele, Maik
    Bornhoevd, Christof
    Lehner, Wolfgang
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS (ADBIS 2014), 2014, 8716 : 112 - 125
  • [33] Top-k Queries for Categorized RFID Systems
    Liu, Xiulong
    Li, Keqiu
    Guo, Song
    Liu, Alex X.
    Li, Peng
    Wang, Kun
    Wu, Jie
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2017, 25 (05) : 2587 - 2600
  • [34] Efficient processing of distributed top-k queries
    Yu, HL
    Li, HG
    Wu, P
    Agrawal, D
    El Abbadi, A
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, 3588 : 65 - 74
  • [35] Processing top-k queries from samples
    Cohen, Edith
    Grossaug, Nadav
    Kaplan, Haim
    COMPUTER NETWORKS, 2008, 52 (14) : 2605 - 2622
  • [36] Top-k Dominating Queries on Incomplete Data
    Miao, Xiaoye
    Gao, Yunjun
    Zheng, Baihua
    Chen, Gang
    Cui, Huiyong
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016 : 1500 - 1501
  • [37] Top-k differential queries in graph databases
    Vasilyeva, Elena
    Thiele, Maik
    Bornhövd, Christof
    Lehner, Wolfgang
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8716 : 112 - 115
  • [38] Approximate top-k queries in sensor networks
    Patt-Shamir, Boaz
    Shafrir, Allon
    STRUCTURAL INFORMATION AND COMMUNICATION COMPLEXITY, PROCEEDINGS, 2006, 4056 : 319 - +
  • [39] CrowdK: Answering top-k queries with crowdsourcing
    Lee, Jongwuk
    Lee, Dongwon
    Hwang, Seung-won
    INFORMATION SCIENCES, 2017, 399 : 98 - 120
  • [40] Reverse spatial top-k keyword queries
    Ahmed, Pritom
    Eldawy, Ahmed
    Hristidis, Vagelis
    Tsotras, Vassilis J.
    VLDB JOURNAL, 2023, 32 (03) : 501 - 524