Efficient Verification of Web-Content Searching Through Authenticated Web Crawlers

被引:15
|
作者
Goodrich, Michael T. [1 ]
Nguyen, Duy [2 ]
Ohrimenko, Olga [2 ]
Papamanthou, Charalampos [3 ]
Tamassia, Roberto [2 ]
Triandopoulos, Nikos [4 ,5 ]
Lopes, Cristina Videira [1 ]
机构
[1] Univ Calif Irvine, Irvine, CA 92717 USA
[2] Brown Univ, Providence, RI 02912 USA
[3] Univ Calif Berkeley, Berkeley, CA 94720 USA
[4] RSA Labs, Bedford, MA USA
[5] Boston Univ, Boston, MA 02215 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2012年 / 5卷 / 10期
基金
美国国家科学基金会;
关键词
D O I
10.14778/2336664.2336666
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of verifying the correctness and completeness of the result of a keyword search. We introduce the concept of an authenticated web crawler and present its design and prototype implementation. An authenticated web crawler is a trusted program that computes a speciallycrafted signature over the web contents it visits. This signature enables (i) the verification of common Internet queries on web pages, such as conjunctive keyword searchesthis guarantees that the output of a conjunctive keyword search is correct and complete; (ii) the verification of the content returned by such Internet queriesthis guarantees that web data is authentic and has not been maliciously altered since the computation of the signature by the crawler. In our solution, the search engine returns a cryptographic proof of the query result. Both the proof size and the verification time are proportional only to the sizes of the query description and the query result, but do not depend on the number or sizes of the web pages over which the search is performed. As we experimentally demonstrate, the prototype implementation of our system provides a low communication overhead between the search engine and the user, and fast verification of the returned results by the user.
引用
收藏
页码:920 / 931
页数:12
相关论文
共 50 条
  • [1] Using Web-Content for Retrieving Snippets
    Hendriansyah, Okky
    Firgantoro, Tri
    Adriani, Mirna
    [J]. ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 742 - 744
  • [2] Large Scale Web-Content Classification
    Deri, Luca
    Martinelli, Maurizio
    Sartiano, Daniele
    Sideri, Loredana
    [J]. 2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), 2015, : 545 - 554
  • [3] JaNeT: A framework for flexible Web-content retrieval
    Bergenti, F
    Poggi, A
    Somacher, M
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 137 - 140
  • [4] Searching for Behavioural Bugs with Stateful Test Oracles in Web Crawlers
    Beroual, Oussama
    Guerin, Francis
    Halle, Sylvain
    [J]. 2017 IEEE/ACM 10TH INTERNATIONAL WORKSHOP ON SEARCH-BASED SOFTWARE TESTING (SBST), 2017, : 7 - 13
  • [5] Visually searching the Web for content
    Smith, JR
    Chang, SF
    [J]. IEEE MULTIMEDIA, 1997, 4 (03) : 12 - 20
  • [6] Web usage for investor relationship of the joint stock corporations and design of a web-content management tool
    Tanrikulu, Zuhal
    Alacath, Aylin
    [J]. INTERNET & INFORMATION SYSTEMS IN THE DIGITAL AGE: CHALLENGES AND SOLUTIONS, 2006, : 802 - 808
  • [7] Identification and Characterization of Crawlers through Analysis of Web Logs
    Algiriyage, Nilani
    Jayasena, Sanath
    Dias, Gihan
    Perera, Amila
    Dayananda, Kushan
    [J]. 2013 8TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2013, : 150 - +
  • [8] Building an Efficient Web Portal for Students at Institutions of Higher Education Based on Web Crawlers
    Yi, Haibo
    Nie, Zhe
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS AND COMMUNICATIONS (ICCSC 2017), 2017, : 96 - 100
  • [9] Aurora: A conceptual model for Web-content adaptation to support the universal usability of Web-based services
    Huang, AW
    Sundaresan, N
    [J]. CUU 2000 CONFERENCE PROCEEDINGS, 2000, : 124 - 131
  • [10] Integrating Learning and Web-Content Management Systems on Digital Systems Teaching
    Quinones, Jorge E.
    Vera, Alexander
    Bernal, Alvaro
    [J]. 2012 IEEE 4TH COLOMBIAN WORKSHOP ON CIRCUITS AND SYSTEMS (CWCAS), 2012,