DSphere: A source-centric approach to crawling, indexing and searching the world wide web

Cited by: 0
Authors
Bamba, Bhuvan [1]
Liu, Ling [1]
Caverlee, James [1]
Padliya, Vaibhav [1]
Srivatsa, Mudhakar [1]
Bansal, Tushar [1]
Palekar, Mahesh [1]
Patrao, Joseph [1]
Li, Suiyang [1]
Singh, Aameek [1]
Affiliation
[1] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
Source
2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3 | 2007
Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
We describe DSPHERE, a decentralized system for crawling, indexing, searching and ranking documents on the World Wide Web. Unlike most existing search technologies, which depend heavily on a page-centric view of the Web, we advocate a source-centric view and propose a decentralized architecture for crawling, indexing and searching the Web in a distributed, source-specific fashion. A fully decentralized crawler is developed to crawl the World Wide Web, in which each peer is responsible for crawling a specific set of documents referred to as a source collection. Link analysis techniques are used for ranking documents. Traditional link analysis techniques suffer from problems such as slow refresh rates and vulnerability to Web spam. We propose a source-based link analysis approach, which computes fast and accurate ranking scores for all crawled documents.
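The abstract does not spell out the algorithms, so the following is only a minimal sketch of what a source-centric pipeline could look like, not the authors' method. It assumes that a source is identified by a URL's host name, that sources are mapped to peers by hashing, and that ranking is an ordinary power-iteration PageRank run over the collapsed source graph. All function names and parameters (`source_of`, `assign_peer`, `build_source_graph`, `rank_sources`, `damping`, `iterations`) are hypothetical.

```python
import hashlib
from collections import defaultdict
from urllib.parse import urlparse


def source_of(url):
    """Map a page URL to its source. Assumption: source = host name.
    URLs must include a scheme (e.g. http://) for the host to be parsed."""
    return urlparse(url).netloc


def assign_peer(source, num_peers):
    """Assumed assignment rule: hash the source name to pick the peer
    that crawls that source collection."""
    digest = hashlib.sha1(source.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_peers


def build_source_graph(page_links):
    """Collapse page-to-page links into weighted source-to-source edges."""
    graph = defaultdict(lambda: defaultdict(int))
    for src_page, dst_page in page_links:
        s, d = source_of(src_page), source_of(dst_page)
        graph[s]  # touch both keys so every source becomes a node
        graph[d]
        if s != d:  # links inside one source are ignored
            graph[s][d] += 1
    return graph


def rank_sources(graph, damping=0.85, iterations=50):
    """Standard power-iteration PageRank over the source graph."""
    sources = list(graph)
    n = len(sources)
    rank = {s: 1.0 / n for s in sources}
    for _ in range(iterations):
        # Rank held by sources with no outgoing edges is spread uniformly.
        dangling = sum(rank[s] for s in sources if not graph[s])
        new_rank = {s: (1 - damping) / n + damping * dangling / n for s in sources}
        for s in sources:
            total = sum(graph[s].values())
            for d, weight in graph[s].items():
                new_rank[d] += damping * rank[s] * weight / total
        rank = new_rank
    return rank


if __name__ == "__main__":
    links = [
        ("http://a.example/1", "http://b.example/1"),
        ("http://a.example/2", "http://b.example/2"),
        ("http://c.example/1", "http://b.example/1"),
        ("http://b.example/1", "http://a.example/1"),
        ("http://a.example/1", "http://a.example/2"),  # intra-source link
    ]
    graph = build_source_graph(links)
    for source, score in sorted(rank_sources(graph).items()):
        print(source, "peer", assign_peer(source, 4), "score", round(score, 3))
```

Ranking at the source level shrinks the graph from one node per page to one node per source, which is one plausible reading of how a source-based approach could refresh scores faster than page-level link analysis.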
Pages: 1490 / +
Page count: 2