About Learning Models with Multiple Query-Dependent Features

被引:21
|
作者
Macdonald, Craig [1 ]
Santos, Rodrygo L. T. [1 ]
Ounis, Iadh [1 ]
He, Ben
机构
[1] Univ Glasgow, Glasgow G12 8QQ, Lanark, Scotland
关键词
Performance; Experimentation; Learning to rank; samples; field-based weighting models; STRATEGIES;
D O I
10.1145/2493175.2493176
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Several questions remain unanswered by the existing literature concerning the deployment of query-dependent features within learning to rank. In this work, we investigate three research questions in order to empirically ascertain best practices for learning-to-rank deployments. (i) Previous work in data fusion that pre-dates learning to rank showed that while different retrieval systems could be effectively combined, the combination of multiple models within the same system was not as effective. In contrast, the existing learning-to-rank datasets (e. g., LETOR), often deploy multiple weighting models as query-dependent features within a single system, raising the question as to whether such a combination is needed. (ii) Next, we investigate whether the training of weighting model parameters, traditionally required for effective retrieval, is necessary within a learning-to-rank context. (iii) Finally, we note that existing learning-to-rank datasets use weighting model features calculated on different fields (e. g., title, content, or anchor text), even though such weighting models have been criticized in the literature. Experiments addressing these three questions are conducted on Web search datasets, using various weighting models as query-dependent and typical query-independent features, which are combined using three learning-to-rank techniques. In particular, we show and explain why multiple weighting models should be deployed as features. Moreover, we unexpectedly find that training the weighting model's parameters degrades learned model's effectiveness. Finally, we show that computing a weighting model separately for each field is less effective than more theoretically-sound field-based weighting models.
引用
收藏
页码:1 / 39
页数:39
相关论文
共 50 条
  • [21] Approximate Shortest Distance Computing: A Query-Dependent Local Landmark Scheme
    Qiao, Miao
    Cheng, Hong
    Chang, Lijun
    Yu, Jeffrey Xu
    2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 462 - 473
  • [22] Multi-video summarization with query-dependent weighted archetypal analysis
    Ji, Zhong
    Zhang, Yuanyuan
    Pang, Yanwei
    Li, Xuelong
    Pan, Jing
    NEUROCOMPUTING, 2019, 332 : 406 - 416
  • [23] Approximate Shortest Distance Computing: A Query-Dependent Local Landmark Scheme
    Qiao, Miao
    Cheng, Hong
    Chang, Lijun
    Yu, Jeffrey Xu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (01) : 55 - 68
  • [24] A query-dependent duplicate detection approach for large scale search engines
    Ye, SZ
    Song, RH
    Wen, JR
    Ma, WY
    ADVANCED WEB TECHNOLOGIES AND APPLICATIONS, 2004, 3007 : 48 - 58
  • [25] Unsupervised Anomaly Localization Using Locally Adaptive Query-Dependent Scores
    Kawamura, Naoki
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT II, 2022, 13232 : 300 - 311
  • [26] Efficient Top-K Processing Over Query-Dependent Functions
    Guo, Lin
    Yahia, Sihem Amer
    Ramakrishnan, Raghu
    Shanmugasundaram, Jayavel
    Srivastava, Utkarsh
    Vee, Erik
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 1044 - 1055
  • [27] Image Search Reranking With Query-Dependent Click-Based Relevance Feedback
    Zhang, Yongdong
    Yang, Xiaopeng
    Mei, Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (10) : 4448 - 4459
  • [28] QoRank: A Query-Dependent Ranking Model Using LSE-Based Weighted Multiple Hyperplanes Aggregation for Information Retrieval
    Sun, Heli
    Huang, Jianbin
    Feng, Boqin
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2011, 26 (01) : 73 - 97
  • [29] Clip-based similarity measure for query-dependent clip retrieval and video summarization
    Peng, Yuxin
    Ngo, Chong-Wah
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2006, 16 (05) : 612 - 627
  • [30] Automatic topic-oriented multi-document summarization with combination of query-dependent and query-independent rankers
    Li, Sujian
    Wang, Wei
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 441 - +