Adversarial Top-K Ranking

Cited by: 13
Authors
Suh, Changho [1]
Tan, Vincent Y. F. [2, 3]
Zhao, Renbo [2, 3, 4]
Affiliations
[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Taejon 305701, South Korea
[2] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117583, Singapore
[3] NUS, Dept Math, Singapore 119076, Singapore
[4] NUS, Dept Ind & Syst Engn, Singapore 117576, Singapore
Funding
National Research Foundation, Singapore
Keywords
Adversarial population; Bradley-Terry-Luce model; crowdsourcing; minimax optimality; sample complexity; top-K ranking; tensor decompositions
DOI
10.1109/TIT.2017.2659660
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We study the top-K ranking problem where the goal is to recover the set of top-K ranked items out of a large collection of items based on partially revealed preferences. We consider an adversarial crowdsourced setting where there are two population sets, and pairwise comparison samples drawn from one of the populations follow the standard Bradley-Terry-Luce model (i.e., the chance of item i beating item j is proportional to the relative score of item i to item j), while in the other population, the corresponding chance is inversely proportional to the relative score. When the relative size of the two populations is known, we characterize the minimax limit on the sample size required (up to a constant) for reliably identifying the top-K items, and demonstrate how it scales with the relative size. Moreover, by leveraging a tensor decomposition method for disambiguating mixture distributions, we extend our result to the more realistic scenario, in which the relative population size is unknown, thus establishing an upper bound on the fundamental limit of the sample size for recovering the top-K set.
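The two-population comparison model described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code. It assumes that item scores are positive numbers (called `w_i` and `w_j` here), that a comparison comes from the standard Bradley-Terry-Luce population with probability `eta`, and that in the other population the win probability is flipped to `w_j / (w_i + w_j)`. The names `eta`, `sample_comparison`, `mixture_win_probability`, and `empirical_win_rate` are hypothetical and chosen only for this illustration.

```python
import random


def sample_comparison(w_i, w_j, eta, rng):
    """Draw one pairwise comparison; return True if item i beats item j.

    Assumption: with probability eta the sample comes from the standard
    BTL population, otherwise from the population with reversed odds.
    """
    p_btl = w_i / (w_i + w_j)          # standard BTL: P(i beats j)
    from_btl_population = rng.random() < eta
    p_win = p_btl if from_btl_population else 1.0 - p_btl
    return rng.random() < p_win


def mixture_win_probability(w_i, w_j, eta):
    """Probability that i beats j, averaged over the two populations."""
    p_btl = w_i / (w_i + w_j)
    return eta * p_btl + (1.0 - eta) * (1.0 - p_btl)


def empirical_win_rate(w_i, w_j, eta, n, seed=0):
    """Fraction of n simulated comparisons in which item i beats item j."""
    rng = random.Random(seed)
    wins = sum(sample_comparison(w_i, w_j, eta, rng) for _ in range(n))
    return wins / n
```

Under these assumptions, `mixture_win_probability(3.0, 1.0, 0.5)` equals 0.5 whatever the scores are: when the two populations have equal size, the pooled comparisons carry no information about which item is better. This gives some intuition for why the required sample size depends on the relative population size.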
Pages: 2201-2225
Page count: 25