HengHa: Data Harvesting Detection on Hidden Databases

被引:1
|
作者
Wang, Shiyuan [1 ]
Agrawal, Divyakant [1 ]
El Abbadi, Amr [1 ]
机构
[1] UC Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
关键词
Security;
D O I
10.1145/1866835.1866847
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The back-end databases of web-based applications are a major data security concern to enterprises. The problem becomes more critical with the proliferation of enterprise hosted web applications in the cloud. While prior work has concentrated on malicious attacks that try to break into the database using vulnerabilities of web applications, little work has focused on the threat of data harvesting through web form interfaces, in which large collections of the underlying data can be harvested and sensitive information can be learnt by iteratively submitting legitimate queries and analyzing the returned results for designing new queries. To defend against data harvesting without compromising usability, we consider a detection approach. We summarize the characteristics of data harvesting, and propose the notions of query correlation and result coverage for data harvesting detection. We design a detection system called HengHa, in which Heng examines the correlation among queries in a session, and Ha evaluates the data coverage of the results of queries in the same session. The experimental results verify the effectiveness and efficiency of HengHa for data harvesting detection.
引用
收藏
页码:59 / 64
页数:6
相关论文
共 50 条
  • [1] Address databases for national SDI: Comparing the novel data grid approach to data harvesting and federated databases
    Coetzee, Serena
    Bishop, Judith
    [J]. INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2009, 23 (09) : 1179 - 1209
  • [2] IHWC: intelligent hidden web crawler for harvesting data in urban domains
    Sawroop Kaur
    Aman Singh
    G. Geetha
    Xiaochun Cheng
    [J]. Complex & Intelligent Systems, 2023, 9 : 3635 - 3653
  • [3] IHWC: intelligent hidden web crawler for harvesting data in urban domains
    Kaur, Sawroop
    Singh, Aman
    Geetha, G.
    Cheng, Xiaochun
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 3635 - 3653
  • [4] StegoWall: Blind statistical detection of hidden data
    Voloshynovskiy, S
    Herrigel, A
    Rytsar, Y
    Pun, A
    [J]. SECURITY AND WATERMARKING OF MULTIMEDIA CONTENTS IV, 2002, 4675 : 57 - 68
  • [5] Data and syntax centric anomaly detection for relational databases
    Sallam, Asmaa
    Fadolalkarim, Daren
    Bertino, Elisa
    Xiao, Qian
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2016, 6 (06) : 231 - 239
  • [6] Correction: IHWC: intelligent hidden web crawler for harvesting data in urban domains
    Sawroop Kaur
    Aman Singh
    G. Geetha
    Xiaochun Cheng
    [J]. Complex & Intelligent Systems, 2023, 9 : 1113 - 1113
  • [7] Harvesting image databases from the web
    Schroff, F.
    Criminisi, A.
    Zisserman, A.
    [J]. 2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 2120 - +
  • [8] Harvesting Image Databases from the Web
    Schroff, Florian
    Criminisi, Antonio
    Zisserman, Andrew
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (04) : 754 - 766
  • [9] Data Harvesting and Event Detection from Czech Twitter
    Rajtmajer, Vaclav
    Kral, Pavel
    [J]. AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART 2017), 2018, 10839 : 102 - 115
  • [10] Outlier Detection in Spatial Databases Using Clustering Data Mining
    Karmaker, Amitava
    Rahman, Syed M.
    [J]. PROCEEDINGS OF THE 2009 SIXTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, VOLS 1-3, 2009, : 1657 - +