Result Diversification Based on Query-Specific Cluster Ranking

被引:36
|
作者
He, Jiyin [1 ]
Meij, Edgar [1 ]
de Rijke, Maarten [1 ]
机构
[1] Univ Amsterdam, ISLA, NL-1098 XH Amsterdam, Netherlands
关键词
INFORMATION; RETRIEVAL;
D O I
10.1002/asi.21468
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Result diversification is a retrieval strategy for dealing with ambiguous or multi-faceted queries by providing documents that cover as many facets of the query as possible. We propose a result diversification framework based on query-specific clustering and cluster ranking, in which diversification is restricted to documents belonging to clusters that potentially contain a high percentage of relevant documents. Empirical results show that the proposed framework improves the performance of several existing diversification methods. The framework also gives rise to a simple yet effective cluster-based approach to result diversification that selects documents from different clusters to be included in a ranked list in a round robin fashion. We describe a set of experiments aimed at thoroughly analyzing the behavior of the two main components of the proposed diversification framework, ranking and selecting clusters for diversification. Both components have a crucial impact on the overall performance of our framework, but ranking clusters plays a more important role than selecting clusters. We also examine properties that clusters should have in order for our diversification framework to be effective. Most relevant documents should be contained in a small number of high-quality clusters, while there should be no dominantly large clusters. Also, documents from these high-quality clusters should have a diverse content. These properties are strongly correlated with the overall performance of the proposed diversification framework.
引用
收藏
页码:550 / 571
页数:22
相关论文
共 50 条
  • [1] IRanker: Query-Specific Ranking of Reviewed Items
    Shahbazi, Moloud
    Wiley, Matthew
    Hristidis, Vagelis
    [J]. 2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 211 - 214
  • [2] Query Subtopic Diversification based on Cluster Ranking and Semantic Features
    Shajalal, Md
    Ullah, Md Zia
    Chy, Abu Nowshed
    Aono, Masaki
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS - CONCEPTS, THEORY AND APPLICATION (ICAICTA), 2016,
  • [3] Query-specific Subtopic Clustering
    Kashyapi, Sumanta
    Dietz, Laura
    [J]. 2022 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL), 2022,
  • [4] Hashing Based Re-ranking of Web Images Using Query-Specific Semantic Signatures
    Dange, B. J.
    Kshirsagar, D. B.
    [J]. 2017 1ST INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND INFORMATION MANAGEMENT (ICISIM), 2017, : 164 - 170
  • [5] BERT-ER: Query-specific BERT Entity Representations for Entity Ranking
    Chatterjee, Shubham
    Dietz, Laura
    [J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1466 - 1477
  • [6] Query-Specific Visual Semantic Spaces for Web Image Re-ranking
    Wang, Xiaogang
    Liu, Ke
    Tang, Xiaoou
    [J]. 2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 857 - 864
  • [7] The Opposite of Smoothing: A Language Model Approach to Ranking Query-Specific Document Clusters
    Kurland, Oren
    Krikon, Eyal
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2011, 41 : 367 - 395
  • [8] Web Image Re-Ranking Using Query-Specific Semantic Signatures
    Wang, Xiaogang
    Qiu, Shi
    Liu, Ke
    Tang, Xiaoou
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (04) : 810 - 823
  • [9] On Query Result Diversification
    Vieira, Marcos R.
    Razente, Humberto L.
    Barioni, Maria C. N.
    Hadjieleftheriou, Marios
    Srivastava, Divesh
    Traina, Caetano, Jr.
    Tsotras, Vassilis J.
    [J]. IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 1163 - 1174
  • [10] Re-ranking search results using language models of query-specific clusters
    Kurland, Oren
    [J]. INFORMATION RETRIEVAL, 2009, 12 (04): : 437 - 460