Search engines and Web dynamics

被引:39
|
作者
Risvik, KM [1 ]
Michelsen, R [1 ]
机构
[1] Fast Search & Transfer ASA, NO-0120 Oslo, Norway
关键词
dynamic information retrieval; indexing; document crawling; scalable architecture; algorithms; scheduling;
D O I
10.1016/S1389-1286(02)00213-X
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we study several dimensions of Web dynamics in the context of large-scale Internet search engines. Both growth and update dynamics clearly represent big challenges for search engines. We show how the problems arise in all components of a reference search engine model. Furthermore, we use the FAST Search Engine architecture as a case study for showing some possible solutions for Web dynamics and search engines. The focus is to demonstrate solutions that work in practice for real systems. The service is running live at www.alltheweb.com and major portals worldwide with more than 30 million queries a day, about 700 million full-text documents, a crawl base of 1.8 billion documents, updated every I I days, at a rate of 400 documents/second. We discuss future evolution of the Web, and some important issues for search engines will be scheduling and query execution as well as increasingly heterogeneous architectures to handle the dynamic Web. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:289 / 302
页数:14
相关论文
共 50 条
  • [21] Defining a session on web search engines
    Jansen, Bernard J.
    Spink, Amanda
    Blakely, Chris
    Koshman, Sherry
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2007, 58 (06): : 862 - 871
  • [22] An introduction to search engines and Web navigation
    Ng, Wilfred
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (01) : 290 - 292
  • [23] Staleness among Web search engines
    Koehler, Wallace
    Searcher:Magazine for Database Professionals, 1998, 6 (07):
  • [24] Searching for people on Web search engines
    Spink, A
    Jansen, BJ
    Pedersen, J
    JOURNAL OF DOCUMENTATION, 2004, 60 (03) : 266 - 278
  • [25] Web search engines: Part 2
    Hawking, David
    COMPUTER, 2006, 39 (08) : 88 - 90
  • [26] How search engines organize the web
    Zurawski, L
    CONTROL ENGINEERING, 1999, 46 (03) : 62 - 62
  • [27] Using search engines and Web directories
    Pealer, LN
    JOURNAL OF SCHOOL HEALTH, 1998, 68 (08) : 346 - 347
  • [28] Authoritative guide to Web search engines
    Scales, BJ
    JOURNAL OF ACADEMIC LIBRARIANSHIP, 1998, 24 (03): : 248 - 248
  • [29] Algorithmic challenges in Web search engines
    Baeza-Yates, R
    LATIN 2006: THEORETICAL INFORMATICS, 2006, 3887 : 1 - 7
  • [30] Search engines and web information retrieval
    López-Ortiz, A
    COMBINATORIAL AND ALGORITHMIC ASPECTS OF NETWORKING, 2005, 3405 : 183 - 191