Caching Search Engine Results over Incremental Indices

被引:0
|
作者
Blanco, Roi [1 ]
Bortnikov, Edward
Junqueira, Flavio P. [1 ]
Lempel, Ronny
Telloli, Luca
Zaragoza, Hugo [1 ]
机构
[1] Yahoo Res, Barcelona, Spain
关键词
Search engine caching; Real-time indexing;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A Web search engine must update its index periodically to incorporate changes to the Web. We argue in this paper that index updates fundamentally impact the design of search engine result caches, a performance-critical component of modern search engines. Index updates lead to the problem of cache invalidation: invalidating cached entries of queries whose results have changed. Naive approaches, such as flushing the entire cache upon every index update, lead to poor performance and in fact, render caching futile when the frequency of updates is high. Solving the invalidation problem efficiently corresponds to predicting accurately which queries will produce different results if re-evaluated, given the actual changes to the index. To obtain this property, we propose a framework for developing invalidation predictors and define metrics to evaluate invalidation schemes. We describe concrete predictors using this framework and compare them against a baseline that uses a cache invalidation scheme based on time-to-live (TTL). Evaluation over Wikipedia documents using a query log from the Yahoo! search engine shows that selective invalidation of cached search results can lower the number of unnecessary query evaluations by as much as 30% compared to a baseline scheme, while returning results of similar freshness. In general, our predictors enable fewer unnecessary invalidations and fewer stale results compared to a TTL-only scheme for similar freshness of results.
引用
收藏
页码:82 / 89
页数:8
相关论文
共 50 条
  • [31] Static caching for incremental computation
    Indiana Univ, Bloomington, United States
    ACM Trans Program Lang Syst, 3 (546-585):
  • [32] Improving search results with data mining in a thematic search engine
    Caramia, M
    Felici, G
    Pezzoli, A
    COMPUTERS & OPERATIONS RESEARCH, 2004, 31 (14) : 2387 - 2404
  • [33] An efficient algorithm for clustering search engine results
    Zhang, Hui
    Pang, Bin
    Xie, Ke
    Wu, Hui
    COMPUTATIONAL INTELLIGENCE AND SECURITY, 2007, 4456 : 661 - 671
  • [34] Query facet Engine for easier search results
    Radhakrishnan, Anusree
    Madhav, Minu Lalitha
    PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON CIRCUIT ,POWER AND COMPUTING TECHNOLOGIES (ICCPCT), 2017,
  • [35] SearchRank : A Method Of Ranking Results For Search Engine
    Qi, Cong
    AMCIS 2014 PROCEEDINGS, 2014,
  • [36] Query Recommendation for Improving Search Engine Results
    Zahera, Hamada M.
    El Hady, Gamal F.
    El-Wahed, Waiel. F. Abd
    WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, VOLS 1 AND 2, 2010, : 416 - +
  • [37] Query Recommendation for Improving Search Engine Results
    El-Hady, Gamal F.
    Zahera, Hamada M.
    Abd El-Wahed, W. F.
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2011, 1 (01) : 45 - 52
  • [38] An efficient algorithm for clustering search engine results
    Hui Zhang
    Bin Pang
    Ke Xie
    Hui Wu
    2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 1429 - 1434
  • [39] Enhancing Search Engine's Results with Metadata
    Escudeiro, Nuno
    Escudeiro, Paula
    ADVANCED SCIENCE LETTERS, 2014, 20 (02) : 518 - 521
  • [40] Search Engine Results Improvement- A Review
    Agrawal, Hina
    Yadav, Sunita
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION TECHNOLOGY CICT 2015, 2015, : 180 - 185