TOWARDS A DISTRIBUTED SEARCH ENGINE

被引:0
|
作者
Baeza-Yates, Ricardo [1 ]
机构
[1] Yahoo Res, Barcelona, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distributed search engines are often more complex to implement compared to centralized engines. Distributing a search engine across multiple sites, however, has several advantages. In particular, it enables the utilization of less computer resources and the exploitation of data and user locality. In this presentation we show the feasibility of distributed Web search engines, by proposing a model for assessing the total cost of a distributed Web-search engine that includes the computational costs as well as the communication cost among all distributed sites. Using examples, we show that a distributed Web search engine can be more cost effective than a centralized one, if there is a large percentage of local queries, which is usually the case. We then present a query-processing algorithm that maximizes the amount of queries answered locally, without sacrificing the quality of the results, by using caching and partial replication. We simulate our algorithm on real document collections and real query workloads to measure the actual parameters needed for our cost model, and we show that a distributed search engine can be competitive compared to a centralized architecture with respect to cost. This is joint work with Aris Gionis, Flavio Junqueira, Vassilis Plachouras and Luca Telloli.
引用
收藏
页码:IS13 / IS13
页数:1
相关论文
共 50 条
  • [31] Towards distributed node similarity search on graphs
    Zhang, Tianming
    Gao, Yunjun
    Zheng, Baihua
    Chen, Lu
    Wen, Shiting
    Guo, Wei
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (06): : 3025 - 3053
  • [32] An information update method towards internal search engine
    Bian, Zhifan
    Li, Yukun
    Yue, Tinghai
    Lei, Pengfei
    Zhao, Dexin
    Xiao, Yingyuan
    [J]. 2015 12TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA), 2015, : 211 - 216
  • [33] Towards a Semantic Search Engine for Open Source Software
    Ben Sassi, Sihem
    [J]. SOFTWARE REUSE: BRIDGING WITH SOCIAL-AWARENESS, 2016, 9679 : 300 - 314
  • [34] Public awareness and attitudes towards search engine optimization
    Lewandowski, Dirk
    Schultheiss, Sebastian
    [J]. BEHAVIOUR & INFORMATION TECHNOLOGY, 2023, 42 (08) : 1025 - 1044
  • [35] Towards a reliable distributed Web Service Execution Engine
    Ye, Xinfeng
    [J]. ICWS 2006: IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, PROCEEDINGS, 2006, : 595 - 602
  • [36] Towards Efficient and Intelligent Internet of Things Search Engine
    Hatcher, William Grant
    Qian, Cheng
    Gao, Weichao
    Liang, Fan
    Hua, Kun
    Yu, Wei
    [J]. IEEE ACCESS, 2021, 9 : 15778 - 15795
  • [37] Search Engine Predilection towards News Media Providers
    Azzopardi, Leif
    Owens, Ciaran
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 774 - 775
  • [38] Reliable distributed search engine based on multiple meta servers
    Sato, N
    Udagawa, M
    Uehara, M
    Sakai, Y
    Mori, H
    [J]. FIRST INTERNATIONAL SYMPOSIUM ON CYBER WORLDS, PROCEEDINGS, 2002, : 79 - 84
  • [39] Distributed Search Engine Architecture Based On Topic Specific Searches
    Abudaqqa, Yousra
    Patel, Ahmed
    [J]. INTERNATIONAL CONFERENCE ON MATHEMATICS, ENGINEERING AND INDUSTRIAL APPLICATIONS 2014 (ICOMEIA 2014), 2015, 1660
  • [40] A topic-based and distributed search engine for business intelligence
    Fei, Yulian
    Zhu, Anding
    Wang, Guangmin
    [J]. DCABES 2006 PROCEEDINGS, VOLS 1 AND 2, 2006, : 913 - 917