Employing Document Dependency in Blog Search

Cited by: 1
Authors
Keikha, Mostafa [1]
Crestani, Fabio [1]
Carman, Mark James [2]
Affiliations
[1] Univ Lugano, Fac Informat, Lugano, Switzerland
[2] Monash Univ, Fac IT, Clayton, Vic 3800, Australia
Keywords
All Open Access; Green;
DOI
10.1002/asi.21687
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The goal in blog search is to rank blogs according to their recurrent relevance to the topic of the query. State-of-the-art approaches view it as an expert search or resource selection problem. We investigate the effect of content-based similarity between posts on the performance of the retrieval system. We test two different approaches for smoothing (regularizing) relevance scores of posts based on their dependencies. In the first approach, we smooth term distributions describing posts by performing a random walk over a document-term graph in which similar posts are highly connected. In the second, we directly smooth scores for posts using a regularization framework that aims to minimize the discrepancy between scores for similar documents. We then extend these approaches to consider the time interval between the posts in smoothing the scores. The idea is that if two posts are temporally close, then they are good sources for smoothing each other's relevance scores. We compare these methods with the state-of-the-art approaches in blog search that employ Language Modeling-based resource selection algorithms and fusion-based methods for aggregating post relevance scores. We show performance gains over the baseline techniques which do not take advantage of the relation between posts for smoothing relevance estimates.
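The second approach described in the abstract, smoothing post scores so that similar posts receive similar scores, can be illustrated with a small sketch. The Python code below is not the authors' implementation. The quadratic objective, the cosine-similarity affinity, the exponential time decay with parameter `tau`, the weight `mu`, and the toy posts are all assumptions chosen for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two sparse term-weight dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def affinity(posts, tau=None):
    """Symmetric post-to-post weights from content similarity.

    If tau is given, each weight is multiplied by exp(-|dt| / tau), an
    assumed way to favour temporally close posts.
    """
    n = len(posts)
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            w = cosine(posts[i]["terms"], posts[j]["terms"])
            if tau is not None:
                w *= math.exp(-abs(posts[i]["day"] - posts[j]["day"]) / tau)
            W[i][j] = W[j][i] = w
    return W


def smooth_scores(y, W, mu=1.0, iters=200):
    """Approximately minimise
        sum_i (f_i - y_i)^2 + mu * sum_{i<j} W_ij * (f_i - f_j)^2
    with Jacobi iterations on the stationarity condition
        f_i * (1 + mu * d_i) = y_i + mu * sum_j W_ij * f_j,
    where d_i is the sum of row i of W.
    """
    n = len(y)
    deg = [sum(row) for row in W]
    f = list(y)
    for _ in range(iters):
        f = [
            (y[i] + mu * sum(W[i][j] * f[j] for j in range(n)))
            / (1.0 + mu * deg[i])
            for i in range(n)
        ]
    return f


# Hypothetical toy data: three posts with term weights and a posting day.
posts = [
    {"terms": {"jazz": 1.0, "music": 1.0}, "day": 0},
    {"terms": {"jazz": 1.0, "concert": 1.0}, "day": 1},
    {"terms": {"tax": 1.0, "law": 1.0}, "day": 2},
]
y = [1.0, 0.0, 0.0]  # initial relevance scores from some retrieval model
W = affinity(posts, tau=7.0)
f = smooth_scores(y, W, mu=1.0)
print(f)
```

In this toy example the second post shares a term with the first and so picks up part of its score, while the unrelated third post is left unchanged. Larger `mu` pulls the scores of connected posts closer together, and smaller `tau` restricts smoothing to posts published close in time.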
Pages: 354-365
Page count: 12