Is Top-k Sufficient for Ranking?

被引:7
|
作者
Lan, Yanyan [1 ]
Niu, Shuzi [1 ]
Guo, Jiafeng [1 ]
Cheng, Xueqi [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Learning to Rank; Top-k; Full-Order; Sufficient; JUDGMENTS;
D O I
10.1145/2505515.2505685
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, 'top-k learning to rank' has attracted much attention in the community of information retrieval. The motivation comes from the difficulty in obtaining a full-order ranking list for training, when employing reliable pairwise preference judgment. Inspired by the observation that users mainly care about top ranked search result, top-k learning to rank proposes to utilize top-k ground-truth for training, where only the total order of top k items are provided, instead of a full-order ranking list. However, it is not clear whether the underlying assumption holds, i.e. top-k ground-truth is sufficient for training. In this paper, we propose to study this problem from both empirical and theoretical aspects. Empirically, our experimental results on benchmark datasets LETOR4.0 show that the test performances of both pairwise and list-wise ranking algorithms will quickly increase to a stable value, with the growth of k in the top-k ground-truth. Theoretically, we prove that the losses of these typical ranking algorithms in top-k setting are tighter upper bounds of (1-NDCG@k), compared with that in full-order setting. Therefore, our studies reveal that learning on top-k ground-truth is surely sufficient for ranking, which lay a foundation for the new learning to rank framework.
引用
收藏
页码:1261 / 1270
页数:10
相关论文
共 50 条
  • [1] Adversarial Top-K Ranking
    Suh, Changho
    Tan, Vincent Y. F.
    Zhao, Renbo
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2017, 63 (04) : 2201 - 2225
  • [2] Top-k Ranking Bayesian Optimization
    Quoc Phong Nguyen
    Tay, Sebastian
    Low, Bryan Kian Hsiang
    Jaillet, Patrick
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9135 - 9143
  • [3] Competitive analysis of the top-K ranking problem
    Chen, Xi
    Gopi, Sivakanth
    Mao, Jieming
    Schneider, Jon
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2017, : 1245 - 1264
  • [4] Answering Definition Question: Ranking for Top-k
    Shen, Chao
    Qiu, Xipeng
    Huang, Xuanjing
    Wu, Lide
    [J]. ECAI 2008, PROCEEDINGS, 2008, 178 : 839 - +
  • [5] On Pruning for Top-K Ranking in Uncertain Databases
    Wang, Chonghai
    Yuan, Li Yan
    You, Jia-Huai
    Zaiane, Osmar R.
    Pei, Jian
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (10): : 598 - 609
  • [6] FA*IR: A Fair Top-k Ranking Algorithm
    Zehlike, Meike
    Bonchi, Francesco
    Castillo, Carlos
    Hajian, Sara
    Megahed, Mohamed
    Baeza-Yates, Ricardo
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1569 - 1578
  • [7] Top-K Ranking: An Information-theoretic Perspective
    Chen, Yuxin
    Suh, Changho
    [J]. 2015 IEEE INFORMATION THEORY WORKSHOP - FALL (ITW), 2015, : 212 - 213
  • [8] On the semantics of top-k ranking for objects with uncertain data
    Wang, Chonghai
    Yuan, Li Yan
    You, Jia-Huai
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2011, 62 (07) : 2812 - 2823
  • [9] Fair Top-k Ranking with multiple protected groups
    Zehlike, Meike
    Suehr, Tom
    Baeza-Yates, Ricardo
    Bonchi, Francesco
    Castillo, Carlos
    Hajian, Sara
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (01)
  • [10] Probabilistic top-k and ranking-aggregate queries
    Soliman, Mohamed A.
    Ilyas, Ihab F.
    Chang, Kevin Chen-Chuan
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2008, 33 (03):