Filtering Data Streams for Entity-Based Continuous Queries

Cited by: 11
Authors
Cheng, Reynold [1]
Kao, Ben C. M. [1]
Kwan, Alan [1]
Prabhakar, Sunil [2]
Tu, Yi-Cheng [3]
Affiliations
[1] Univ Hong Kong, Dept Comp Sci, Pokfulam, Hong Kong, Peoples R China
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[3] Univ S Florida, Dept Comp Sci & Engn, Tampa, FL 33620 USA
Keywords
Data streams; continuous queries; adaptive filters; fraction-based tolerance
DOI
10.1109/TKDE.2009.63
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The idea of allowing query users to relax their correctness requirements in order to improve the performance of a data stream management system (e.g., location-based services and sensor networks) has recently been studied. By exploiting the maximum error (or tolerance) allowed in query answers, algorithms for reducing the use of system resources have been developed. In most of these works, however, query tolerance is expressed as a numerical value, which may be difficult to specify. We observe that in many situations, users may not be concerned with the actual value of an answer, but rather with which object satisfies a query (e.g., "who is my nearest neighbor?"). In particular, an entity-based query returns only the names of objects that satisfy the query. For these queries, it is possible to specify a tolerance that is "nonvalue-based." In this paper, we study fraction-based tolerance, a type of nonvalue-based tolerance, where a user specifies the maximum fractions of a query answer that can be false positives and false negatives. We develop fraction-based tolerance for two major classes of entity-based queries: 1) nonrank-based queries (e.g., range queries) and 2) rank-based queries (e.g., k-nearest-neighbor queries). These definitions provide users with an alternative way to specify the maximum tolerance allowed in their answers. We further investigate how these definitions can be exploited in a distributed stream environment. We design adaptive filter algorithms that allow updates to be dropped conditionally at the data stream sources without affecting the overall query correctness. Extensive experimental results show that our protocols reduce the use of network and energy resources significantly.
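The notion of fraction-based tolerance described in the abstract can be illustrated with a small sketch for a range query. This is not the paper's algorithm: the abstract does not give the exact definitions, so the sketch assumes that the false-positive fraction is measured against the returned answer and the false-negative fraction against the true answer. The names `FractionTolerance`, `error_fractions`, `within_tolerance`, and `range_query`, as well as the sensor readings, are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class FractionTolerance:
    """Hypothetical container for the two user-specified bounds."""
    max_false_positive: float  # allowed share of returned names that are wrong
    max_false_negative: float  # allowed share of true names that are missing


def range_query(values: dict[str, float], low: float, high: float) -> set[str]:
    """Entity-based range query: return only the names whose value is in [low, high]."""
    return {name for name, v in values.items() if low <= v <= high}


def error_fractions(returned: set[str], truth: set[str]) -> tuple[float, float]:
    """Return (false-positive fraction, false-negative fraction).

    Assumed definitions (not taken from the paper):
      false-positive fraction = |returned - truth| / |returned|
      false-negative fraction = |truth - returned| / |truth|
    An empty denominator yields 0.0.
    """
    fp = len(returned - truth) / len(returned) if returned else 0.0
    fn = len(truth - returned) / len(truth) if truth else 0.0
    return fp, fn


def within_tolerance(returned: set[str], truth: set[str], tol: FractionTolerance) -> bool:
    """Check both fractions against the user's bounds."""
    fp, fn = error_fractions(returned, truth)
    return fp <= tol.max_false_positive and fn <= tol.max_false_negative


# Current values at the sources versus the (possibly stale) values the server
# holds because some updates were filtered out at the sources.
current = {"s1": 21.0, "s2": 25.5, "s3": 30.2, "s4": 27.0, "s5": 19.0}
server_view = {"s1": 21.0, "s2": 25.5, "s3": 29.8, "s4": 27.0, "s5": 19.0}

truth = range_query(current, 20.0, 30.0)         # s1, s2, s4
returned = range_query(server_view, 20.0, 30.0)  # s1, s2, s3, s4

fp, fn = error_fractions(returned, truth)
ok_loose = within_tolerance(returned, truth, FractionTolerance(0.3, 0.1))
ok_strict = within_tolerance(returned, truth, FractionTolerance(0.2, 0.1))
```

Here the server's answer contains one name (`s3`) that no longer satisfies the query, so under the assumed definitions a user who tolerates 30% false positives would accept the answer, while a user who tolerates only 20% would not. A filter at source `s3` could therefore suppress its update only in the first case.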
Pages: 234-248
Page count: 15