LETOR: A benchmark collection for research on learning to rank for information retrieval

Cited by: 251
Authors
Qin, Tao [1]
Liu, Tie-Yan [1]
Xu, Jun [1]
Li, Hang [1]
Affiliations
[1] Microsoft Research Asia, Beijing, People's Republic of China
Source
INFORMATION RETRIEVAL | 2010 / Vol. 13 / Issue 04
Keywords
Learning to rank; Information retrieval; Benchmark datasets; Feature extraction
DOI
10.1007/s10791-009-9123-y
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
LETOR is a benchmark collection for research on learning to rank for information retrieval, released by Microsoft Research Asia. In this paper, we describe the details of the LETOR collection and show how it can be used in different kinds of research. Specifically, we describe how the document corpora and query sets in LETOR are selected, how the documents are sampled, how the learning features and meta information are extracted, and how the datasets are partitioned for comprehensive evaluation. We then compare several state-of-the-art learning to rank algorithms on LETOR, report their ranking performance, and discuss the results. After that, we discuss possible new research topics that LETOR can support in addition to algorithm comparison. We hope that this paper can help people gain a deeper understanding of LETOR and enable more interesting research projects on learning to rank and related topics.
Pages: 346 - 374
Page count: 29
Related papers
50 in total
  • [1] LETOR: A benchmark collection for research on learning to rank for information retrieval
    Tao Qin
    Tie-Yan Liu
    Jun Xu
    Hang Li
    Information Retrieval, 2010, 13 : 346 - 374
  • [2] Learning to rank for Information Retrieval
    Liu, Tie-Yan
    Foundations and Trends in Information Retrieval, 2009, 3 (03): 225 - 231
  • [3] Learning to Rank for Information Retrieval
    Liu, Tie-Yan
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010: 904 - 904
  • [4] Parallel Learning to Rank for Information Retrieval
    Wang, Shuaiqiang
    Gao, Byron J.
    Wang, Ke
    Lauw, Hady W.
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011: 1083 - 1084
  • [5] Learning to Rank for Biomedical Information Retrieval
    Xu, Bo
    Lin, Hongfei
    Lin, Yuan
    Ma, Yunlong
    Yang, Liang
    Wang, Jian
    Yang, Zhihao
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015: 464 - 469
  • [6] An evolutionary strategy with machine learning for learning to rank in information retrieval
    Osman Ali Sadek Ibrahim
    D. Landa-Silva
    Soft Computing, 2018, 22 : 3171 - 3185
  • [7] An evolutionary strategy with machine learning for learning to rank in information retrieval
    Ibrahim, Osman Ali Sadek
    Landa-Silva, D.
    SOFT COMPUTING, 2018, 22 (10) : 3171 - 3185
  • [8] Learning to Rank for Information Retrieval and Natural Language Processing
    Li H.
    Synthesis Lectures on Human Language Technologies, 2011, 4 (01): 1 - 115
  • [9] Learning to Rank for Information Retrieval and Natural Language Processing
    Candito, Marie
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2011, 52 (03): 282 - 285
  • [10] Introduction to special issue on learning to rank for information retrieval
    Liu, Tie-Yan
    Joachims, Thorsten
    Li, Hang
    Zhai, Chengxiang
    INFORMATION RETRIEVAL, 2010, 13 (03): 197 - 200