Search Result Diversification in Short Text Streams

Cited by: 13
Authors
Liang, Shangsong [1]
Yilmaz, Emine [1, 2]
Shen, Hong [3, 4]
De Rijke, Maarten [5]
Croft, W. Bruce [6]
Affiliations
[1] UCL, Dept Comp Sci, London, England
[2] Alan Turing Inst, London, England
[3] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China
[4] Univ Adelaide, Dept Comp Sci, Adelaide, SA, Australia
[5] Univ Amsterdam, Informat Inst, Amsterdam, Netherlands
[6] Univ Massachusetts, Coll Informat & Comp Sci, Amherst, MA 01003 USA
Keywords
Diversity; ad hoc retrieval; data streams
DOI
10.1145/3057282
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We consider the problem of search result diversification for streams of short texts. Diversifying search results in short text streams is more challenging than in the case of long documents, as it is difficult to capture the latent topics of short documents. To capture the changes of topics and the probabilities of documents for a given query at a specific time in a short text stream, we propose a dynamic Dirichlet multinomial mixture topic model, called D2M3, as well as a Gibbs sampling algorithm for the inference. We also propose a streaming diversification algorithm, SDA, that integrates the information captured by D2M3 with our proposed modified version of the PM-2 (Proportionality-based diversification Method second version) diversification algorithm. We conduct experiments on a Twitter dataset and find that SDA statistically significantly outperforms state-of-the-art non-streaming retrieval methods, plain streaming retrieval methods, as well as streaming diversification methods that use other dynamic topic models.
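To make the diversification step mentioned in the abstract more concrete, the following is a minimal sketch of the standard PM-2 selection loop (quotient-based topic selection followed by seat updates). It is not the modified PM-2 used in SDA, and it does not implement D2M3: the topic weights and the per-document topic probabilities are simply assumed to be given. The function name `pm2`, its parameters, and the toy data are hypothetical and chosen only for illustration.

```python
def pm2(doc_topic, topic_weights, k, lam=0.5):
    """Sketch of standard PM-2 (not the paper's modified variant).

    doc_topic:     dict mapping doc id -> list of P(d | topic), one per topic
                   (assumed to come from some topic model).
    topic_weights: list of topic "votes", e.g. P(topic | query).
    k:             number of documents to return.
    lam:           trade-off between the selected topic and the other topics.
    """
    n_topics = len(topic_weights)
    seats = [0.0] * n_topics          # portion of the ranking each topic holds
    remaining = dict(doc_topic)       # candidates not yet ranked
    ranking = []

    while remaining and len(ranking) < k:
        # Sainte-Lague quotient: topics holding few seats get priority.
        quotients = [topic_weights[i] / (2 * seats[i] + 1)
                     for i in range(n_topics)]
        best_topic = max(range(n_topics), key=lambda i: quotients[i])

        def score(doc_id):
            p = remaining[doc_id]
            own = quotients[best_topic] * p[best_topic]
            others = sum(quotients[i] * p[i]
                         for i in range(n_topics) if i != best_topic)
            return lam * own + (1 - lam) * others

        best_doc = max(remaining, key=score)
        p = remaining.pop(best_doc)

        # Split the newly filled seat across topics in proportion to
        # how strongly the chosen document covers each of them.
        total = sum(p)
        if total > 0:
            for i in range(n_topics):
                seats[i] += p[i] / total
        ranking.append(best_doc)

    return ranking


# Toy data (hypothetical): two topics, three short documents.
docs = {"a": [0.9, 0.0], "b": [0.8, 0.0], "c": [0.0, 0.7]}
print(pm2(docs, topic_weights=[0.6, 0.4], k=2))
```

In this toy run the second slot goes to a document about the second topic even though document `b` has the higher probability under the first topic, which is the proportionality effect PM-2 is designed to produce.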
Pages: 35
Related papers
50 in total
  • [21] Search Result Diversification Based on Query Facets
    Sha Hu
    Zhi-Cheng Dou
    Xiao-Jie Wang
    Ji-Rong Wen
    Journal of Computer Science and Technology, 2015, 30 : 888 - 901
  • [22] Scalable and Efficient Web Search Result Diversification
    Naini, Kaweh Djafari
    Altingovde, Ismail Sengor
    Siberski, Wolf
    ACM TRANSACTIONS ON THE WEB, 2016, 10 (03)
  • [23] Modeling Intent Graph for Search Result Diversification
    Su, Zhan
    Dou, Zhicheng
    Zhu, Yutao
    Qin, Xubo
    Wen, Ji-Rong
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021 : 736 - 746
  • [24] The impact of result diversification on search behaviour and performance
    Maxwell, David
    Azzopardi, Leif
    Moshfeghi, Yashar
    INFORMATION RETRIEVAL JOURNAL, 2019, 22 (05) : 422 - 446
  • [25] Passage-aware Search Result Diversification
    Su, Zhan
    Dou, Zhicheng
    Zhu, Yutao
    Wen, Ji-Rong
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (05)
  • [26] Using Score Differences for Search Result Diversification
    Kharazmi, Sadegh
    Sanderson, Mark
    Scholer, Falk
    Vallet, David
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014 : 1143 - 1146
  • [27] Search Result Diversification via Data Fusion
    Wu, Shengli
    Huang, Chunlan
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014 : 827 - 830
  • [28] The impact of result diversification on search behaviour and performance
    David Maxwell
    Leif Azzopardi
    Yashar Moshfeghi
    Information Retrieval Journal, 2019, 22 : 422 - 446
  • [29] Coverage-based search result diversification
    Wei Zheng
    Xuanhui Wang
    Hui Fang
    Hong Cheng
    Information Retrieval, 2012, 15 : 433 - 457
  • [30] A Learning Approach to Hierarchical Search Result Diversification
    Zheng, Hai-Tao
    Wang, Zhuren
    Xiao, Xi
    WEB AND BIG DATA, APWEB-WAIM 2017, PT II, 2017, 10367 : 303 - 310