Search Result Diversification in Short Text Streams

被引:13
|
作者
Liang, Shangsong [1 ]
Yilmaz, Emine [1 ,2 ]
Shen, Hong [3 ,4 ]
De Rijke, Maarten [5 ]
Croft, W. Bruce [6 ]
机构
[1] UCL, Dept Comp Sci, London, England
[2] Alan Turing Inst, London, England
[3] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China
[4] Univ Adelaide, Dept Comp Sci, Adelaide, SA, Australia
[5] Univ Amsterdam, Informat Inst, Amsterdam, Netherlands
[6] Univ Massachusetts, Coll Informat & Comp Sci, Amherst, MA 01003 USA
关键词
Diversity; ad hoc retrieval; data streams;
D O I
10.1145/3057282
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of search result diversification for streams of short texts. Diversifying search results in short text streams is more challenging than in the case of long documents, as it is difficult to capture the latent topics of short documents. To capture the changes of topics and the probabilities of documents for a given query at a specific time in a short text stream, we propose a dynamic Dirichlet multinomial mixture topic model, called D2M3, as well as a Gibbs sampling algorithm for the inference. We also propose a streaming diversification algorithm, SDA, that integrates the information captured by D2M3 with our proposed modified version of the PM-2 (Proportionality-based diversification Method second version) diversification algorithm. We conduct experiments on a Twitter dataset and find that SDA statistically significantly outperforms state-of-the-art non-streaming retrieval methods, plain streaming retrieval methods, as well as streaming diversification methods that use other dynamic topic models.
引用
收藏
页数:35
相关论文
共 50 条
  • [1] Search Result Diversification
    Santos, R. L. T.
    Macdonald, C.
    Ounis, I.
    FOUNDATIONS AND TRENDS IN INFORMATION RETRIEVAL, 2015, 9 (01): : 1 - 90
  • [2] Search Result Diversification
    Drosou, Marina
    Pitoura, Evaggelia
    SIGMOD RECORD, 2010, 39 (01) : 41 - 47
  • [3] Learning for Search Result Diversification
    Zhu, Yadong
    Lan, Yanyan
    Guo, Jiafeng
    Cheng, Xueqi
    Niu, Shuzi
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 293 - 302
  • [4] Aggregated Search Result Diversification
    Santos, Rodrygo L. T.
    Macdonald, Craig
    Ounis, Iadh
    ADVANCES IN INFORMATION RETRIEVAL THEORY, 2011, 6931 : 250 - 261
  • [5] Search Result Diversification in Flickr
    Negi, Sumit
    Jaju, Abhimanyu
    Chaudhury, Santanu
    2016 8TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORKS (COMSNETS), 2016,
  • [6] A Survey on Search Result Diversification
    Dou Z.-C.
    Qin X.-B.
    Wen J.-R.
    Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (12): : 2591 - 2613
  • [7] Result diversification for tweet search
    Ozsoy, Makbule Gulcin
    Onal, Kezban Dilek
    Altingovde, Ismail Sengor
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8787 : 78 - 89
  • [8] Result Diversification for Tweet Search
    Ozsoy, Makbule Gulcin
    Onal, Kezban Dilek
    Altingovde, Ismail Sengor
    WEB INFORMATION SYSTEMS ENGINEERING, PT II, 2014, 8787 : 78 - 89
  • [9] Knowledge Enhanced Search Result Diversification
    Su, Zhan
    Dou, Zhicheng
    Zhu, Yutao
    Wen, Ji-Rong
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 1687 - 1695
  • [10] A learnable search result diversification method
    Zheng, Hai-Tao
    Han, Jinxin
    Wang, Zhuren
    Xiao, Xi
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 108 : 74 - 80