TOWARDS A DISTRIBUTED SEARCH ENGINE

被引:0
|
作者
Baeza-Yates, Ricardo [1 ]
机构
[1] Yahoo Res, Barcelona, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distributed search engines are often more complex to implement compared to centralized engines. Distributing a search engine across multiple sites, however, has several advantages. In particular, it enables the utilization of less computer resources and the exploitation of data and user locality. In this presentation we show the feasibility of distributed Web search engines, by proposing a model for assessing the total cost of a distributed Web-search engine that includes the computational costs as well as the communication cost among all distributed sites. Using examples, we show that a distributed Web search engine can be more cost effective than a centralized one, if there is a large percentage of local queries, which is usually the case. We then present a query-processing algorithm that maximizes the amount of queries answered locally, without sacrificing the quality of the results, by using caching and partial replication. We simulate our algorithm on real document collections and real query workloads to measure the actual parameters needed for our cost model, and we show that a distributed search engine can be competitive compared to a centralized architecture with respect to cost. This is joint work with Aris Gionis, Flavio Junqueira, Vassilis Plachouras and Luca Telloli.
引用
收藏
页码:IS13 / IS13
页数:1
相关论文
共 50 条
  • [1] TOWARDS A DISTRIBUTED SEARCH ENGINE
    Baeza-Yates, Ricardo
    [J]. ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL AIDSS: ARTIFICIAL INTELLIGENCE AND DECISION SUPPORT SYSTEMS, 2008, : IS13 - IS13
  • [2] TOWARDS A DISTRIBUTED SEARCH ENGINE
    Baeza-Yates, Ricardo
    [J]. ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL ISAS-2: INFORMATION SYSTEMS ANALYSIS AND SPECIFICATION, VOL 2, 2008, : IS13 - IS13
  • [3] TOWARDS A DISTRIBUTED SEARCH ENGINE
    Baeza-Yates, Ricardo
    [J]. ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL ISAS-1: INFORMATION SYSTEMS ANALYSIS AND SPECIFICATION, VOL 1, 2008, : IS13 - IS13
  • [4] TOWARDS A DISTRIBUTED SEARCH ENGINE
    Baeza-Yates, Ricardo
    [J]. ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL DISI: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2008, : IS13 - IS13
  • [5] Towards a Distributed Search Engine
    Baeza-Yates, Ricardo
    [J]. ALGORITHMS AND COMPLEXITY, PROCEEDINGS, 2010, 6078 : 1 - 5
  • [6] TOWARDS A DISTRIBUTED SEARCH ENGINE
    Bacza-Yates, Ricardo
    [J]. ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL HCI: HUMAN-COMPUTER INTERACTION, 2008, : IS13 - IS13
  • [7] Towards a fully distributed P2P web search engine
    Zhou, J
    Li, K
    Tang, L
    [J]. 10TH IEEE INTERNATIONAL WORKSHOP ON FUTURE TRENDS OF DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2004, : 332 - 338
  • [8] Towards an EEG Search Engine
    Bigdely-Shamlo, Nima
    Kreutz-Delgado, Ken
    Kothe, Christian
    Makeig, Scott
    [J]. 2013 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2013, : 25 - 28
  • [9] Redundancy of meta search servers in a distributed search engine
    Sato, N
    Udagawa, M
    Uehara, M
    Sakai, Y
    Mori, H
    [J]. AINA 2003: 17TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, 2003, : 400 - 407
  • [10] Scalability and reliability in a distributed search engine
    Sato, N
    Udagawa, M
    Uehara, M
    Sakai, Y
    Mori, H
    [J]. NINTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 2002, : 57 - 62