HengHa: Data Harvesting Detection on Hidden Databases

Cited by: 1
Authors
Wang, Shiyuan [1]
Agrawal, Divyakant [1]
El Abbadi, Amr [1]
Affiliation
[1] UC Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
Keywords
Security
DOI
10.1145/1866835.1866847
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The back-end databases of web-based applications are a major data security concern for enterprises. The problem becomes more critical with the proliferation of enterprise-hosted web applications in the cloud. Prior work has concentrated on malicious attacks that try to break into the database by exploiting vulnerabilities of web applications. Little work has focused on the threat of data harvesting through web form interfaces, in which an attacker iteratively submits legitimate queries and analyzes the returned results to design new queries, and can thereby harvest large collections of the underlying data and learn sensitive information. To defend against data harvesting without compromising usability, we consider a detection approach. We summarize the characteristics of data harvesting and propose the notions of query correlation and result coverage for data harvesting detection. We design a detection system called HengHa, in which Heng examines the correlation among queries in a session, and Ha evaluates the data coverage of the results of queries in the same session. The experimental results verify the effectiveness and efficiency of HengHa for data harvesting detection.
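The two notions named in the abstract, query correlation and result coverage, can be illustrated with a minimal sketch. This is not the paper's algorithm: the abstract does not define either measure, so the sketch assumes that correlation is the mean Jaccard similarity between the predicate sets of consecutive queries in a session, and that coverage is the fraction of database tuples returned so far in that session. The class name `SessionMonitor`, both thresholds, and the rule that flags a session only when both measures are high are hypothetical choices made for illustration.

```python
from dataclasses import dataclass, field


def jaccard(a: frozenset, b: frozenset) -> float:
    """Jaccard similarity of two sets; defined as 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


@dataclass
class SessionMonitor:
    """Hypothetical per-session monitor; not the HengHa implementation."""
    db_size: int                        # assumed known number of tuples in the database
    correlation_threshold: float = 0.3  # assumed value, for illustration only
    coverage_threshold: float = 0.5     # assumed value, for illustration only
    queries: list = field(default_factory=list)
    seen_ids: set = field(default_factory=set)

    def record(self, predicates: dict, result_ids) -> None:
        """Store one query (attribute -> value) and the ids of the tuples it returned."""
        self.queries.append(frozenset(predicates.items()))
        self.seen_ids.update(result_ids)

    def query_correlation(self) -> float:
        """Assumed measure: mean Jaccard similarity of consecutive queries."""
        pairs = list(zip(self.queries, self.queries[1:]))
        if not pairs:
            return 0.0
        return sum(jaccard(a, b) for a, b in pairs) / len(pairs)

    def result_coverage(self) -> float:
        """Assumed measure: fraction of distinct tuples returned in this session."""
        return len(self.seen_ids) / self.db_size

    def is_suspicious(self) -> bool:
        """Assumed rule: flag only when both measures reach their thresholds."""
        return (self.query_correlation() >= self.correlation_threshold
                and self.result_coverage() >= self.coverage_threshold)


# Two related queries that together return 6 of 10 tuples.
monitor = SessionMonitor(db_size=10)
monitor.record({"make": "Ford", "year": 2008}, [1, 2, 3])
monitor.record({"make": "Ford", "year": 2009}, [4, 5, 6])
print(monitor.query_correlation(), monitor.result_coverage(), monitor.is_suspicious())
```

Requiring both signals reflects the abstract's framing that a harvesting session issues related queries whose results jointly cover a large part of the data; a real system would need measures and thresholds derived from the paper itself.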
Pages: 59-64
Number of pages: 6