Caching for web searching

被引:0
|
作者
Kalyanasundaram, B [1 ]
Noga, J
Pruhs, K
Woeginger, G
机构
[1] Georgetown Univ, Dept Comp Sci, Washington, DC 20057 USA
[2] Graz Tech Univ, Dept Math, A-8010 Graz, Austria
[3] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15230 USA
来源
ALGORITHM THEORY - SWAT 2000 | 2000年 / 1851卷
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study web caching when the input sequence is a depth first search traversal of some tree. There are at least two good motivations for investigating tree traversal as a search technique on the WWW: First, empirical studies of people browsing and searching the WWW have shown that user access patterns commonly are nearly depth first traversals of some tree. Secondly, (as we will show in this paper) the problem of visiting all the pages on some WWW site using anchor clicks (clicks on links) and back button clicks - by far the two most common user actions - reduces to the problem of how to best cache a tree traversal sequence (up to constant factors). We show that for tree traversal sequences the optimal offline strategy can be computed efficiently. In the bit model, where the access time of a page is proportional to its size, we show that the online algorithm LRU is (1 + 1/epsilon)-competitive against an adversary with unbounded cache as long as LRU has a cache of size at least (1 + epsilon) times the size of the largest item in the input sequence. In the general model, where pages have arbitrary access times and sizes, we show that in order to be constant competitive, any online algorithm needs a cache large enough to store Omega (log n) pages; here n is the number of distinct pages in the input sequence. We provide a matching upper bound by showing that the online algorithm Landlord is constant competitive against an adversary with an unbounded cache if Landlord has a cache large enough to store the Omega (log n) largest pages. This is further theoretical evidence that Landlord is the "right" algorithm for web caching.
引用
收藏
页码:150 / 163
页数:14
相关论文
共 50 条
  • [21] Semantic caching of Web queries
    Chidlovskii, B
    Borghoff, UM
    VLDB JOURNAL, 2000, 9 (01): : 2 - 17
  • [22] Probabilistic methods for web caching
    Starobinski, D
    Tse, D
    PERFORMANCE EVALUATION, 2001, 46 (2-3) : 125 - 137
  • [23] A scalable architecture for web caching
    Polusani, S
    Han, YJ
    Park, EK
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XIV, PROCEEDINGS: COMPUTER AND INFORMATION SYSTEMS, TECHNOLOGIES AND APPLICATIONS, 2004, : 185 - 190
  • [24] Semantic caching for Web services
    Seltzsam, S
    Holzhauser, R
    Kemper, A
    SERVICE-ORIENTED COMPUTING - ICSOC 2005, PROCEEDINGS, 2005, 3826 : 324 - 340
  • [25] Web caching with consistent hashing
    Karger, David
    Sherman, Alex
    Berkheimer, Andy
    Bogstad, Bill
    Dhanidina, Rizwan
    Iwamoto, Ken
    Kim, Brian
    Matkins, Luke
    Yerushalmi, Yoav
    Computer Networks, 1999, 31 (11): : 1203 - 1213
  • [26] Measurements of web caching and applications
    Shi, Lei
    Wei, Lin
    Ye, Hai-Qin
    Shi, Yun
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1587 - +
  • [27] Secure information caching on the web
    Hsiao, HI
    Brannon, KW
    ADVANCED WEB TECHNOLOGIES AND APPLICATIONS, 2004, 3007 : 24 - 35
  • [28] Web caching with consistent hashing
    Karger, D
    Sherman, A
    Berkheimer, A
    Bogstad, B
    Dhanidina, R
    Iwamoto, K
    Kim, B
    Matkins, L
    Yerushalmi, Y
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL WORLD WIDE WEB CONFERENCE, 1999, : 125 - 135
  • [29] Transparent distributed Web caching
    Liang, H
    Hassanein, H
    Martin, P
    LCN 2001: 26TH ANNUAL IEEE CONFERENCE ON LOCAL COMPUTER NETWORKS, PROCEEDINGS, 2001, : 225 - 233
  • [30] Evolutionary techniques for Web caching
    Vakali, A
    DISTRIBUTED AND PARALLEL DATABASES, 2002, 11 (01) : 93 - 116