A decentralized search engine for dynamic Web communities

Cited by: 6
Authors
Wang, Daze [1]
Tse, Quincy Chi Kwan [1]
Zhou, Ying [1]
Affiliations
[1] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
Keywords
Distributed hash table; Bloom filter; Information retrieval; Community level search; Web feed;
DOI
10.1007/s10115-009-0270-7
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Currently, most Web search engines search a corpus comprising nearly the entire content of the Web. The same centralized search service can also be provided for a single site. Nonetheless, there is little research on community-wide search. This paper presents ComSearch, a peer-to-peer (P2P) search engine designed to give small- and medium-scale online communities the ability to perform text search within the community. Communities are formed in a self-organizing manner. A P2P information retrieval (IR) system may incur unnecessary internal traffic when answering a multi-term query. In this paper, we propose several techniques to optimize multi-term query processing. Simulation results show that the proposed algorithms scale well. Compared with the baseline approach, our improved algorithm reduces the communication cost by about two orders of magnitude in the best case. We also deploy the system in a small-scale network and conduct a series of experiments to estimate the actual query response time and to investigate the data movement caused by node joins. Experimental results show that multiple data movements are quite common during network expansion. However, the percentage of multiple data movements decreases as the network stabilizes after the initial period of frequent joins. This suggests opportunities for improving P2P data movement management.
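The abstract does not say which techniques are used to cut multi-term query traffic, but the keywords list both distributed hash tables and Bloom filters. The sketch below illustrates one commonly used idea in that setting, not necessarily the paper's algorithm: the peer holding the posting list for one term sends a compact Bloom filter of it instead of the full list, the peer holding the second term replies with only the candidates that pass the filter, and the first peer discards any false positives. The names `BloomFilter` and `two_term_query`, the document IDs, and the filter parameters are all hypothetical.

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter using double hashing over a SHA-256 digest."""

    def __init__(self, num_bits=1024, num_hashes=4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0  # a Python int serves as the bit array

    def _positions(self, item):
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force an odd step
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def __contains__(self, item):
        return all((self.bits >> pos) & 1 for pos in self._positions(item))


def two_term_query(postings_a, postings_b, num_bits=1024, num_hashes=4):
    """Intersect two posting lists that live on different peers.

    Assumed message flow (illustrative only):
      1. Peer A builds a Bloom filter of its posting list and sends it to B.
      2. Peer B returns only the document IDs that pass the filter.
      3. Peer A removes false positives by checking its real posting list.
    """
    bloom = BloomFilter(num_bits, num_hashes)
    for doc_id in postings_a:
        bloom.add(doc_id)

    # Step 2, computed on peer B: a superset of the true intersection.
    candidates = {doc_id for doc_id in postings_b if doc_id in bloom}

    # Step 3, computed on peer A: exact intersection.
    set_a = set(postings_a)
    return sorted(doc_id for doc_id in candidates if doc_id in set_a)


if __name__ == "__main__":
    print(two_term_query(["d1", "d2", "d3"], ["d2", "d3", "d4"]))
```

Because a Bloom filter never produces false negatives, the final result equals the exact intersection; the filter size only affects how many extra candidates peer B sends back.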
Pages: 105-125
Page count: 21
Related papers
50 in total
  • [21] Web searching on the vivisimo search engine
    Koshman, Sherry
    Spink, Amanda
    Jansen, Bernard J.
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (14): : 1875 - 1887
  • [22] MediCrawl - A Web Search Engine For Diseases
    Trivedi, Devharsh
    Gopalakrishnan, Vaishnavi
    [J]. 2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, : 148 - 157
  • [23] Web search engine based on DNS
    Wang Liang
    Guo Yi-Ping
    Fang Ming
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2007, 30 (02) : 466 - 478
  • [24] ExpertRec: A Collaborative Web Search Engine
    Sun, Jingyu
    Chen, Junjie
    Yu, Xueli
    Zhong, Ning
    [J]. WEB INFORMATION SYSTEMS AND MINING, PT II, 2011, 6988 : 385 - +
  • [25] IMPLEMENTATION OF A SIMPLE WEB SEARCH ENGINE
    Saveluc, Diana-Alexandra
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE 'LINGUISTIC RESOURCES AND TOOLS FOR PROCESSING THE ROMANIAN LANGUAGE', 2015, 2015, : 163 - 174
  • [26] Web search engine as a bee hive
    Navrat, Pavol
    Kovacik, Martin
    [J]. 2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 694 - +
  • [27] A Framework of Web Image Search Engine
    Xu, Weiguang
    Zhang, Yafei
    Lu, Jianjiang
    Li, Ran
    Xie, Zhenghui
    [J]. FIRST IITA INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 522 - 525
  • [28] Web search engine multimedia functionality
    Tjondronegoro, Dian
    Spink, Amanda
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2008, 44 (01) : 340 - 357
  • [29] Web Search Based on Web Communities Feedback Data
    Adda, Mehdi
    Missaoui, Rokia
    Valtchev, Petko
    [J]. E-TECHNOLOGIES-INNOVATION IN AN OPEN WORLD, 2009, 26 : 169 - +
  • [30] Dynamic and Decentralized Learning of Overlapping Network Communities
    Baingana, Brian
    Giannakis, Georgios B.
    [J]. 2015 IEEE 6TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP), 2015, : 97 - 100