Distributed processing of continuous sliding-window k-NN queries for data stream filtering

被引:4
|
作者
Pripuzic, Kresimir [1 ]
Zarko, Ivana Podnar [1 ]
Aberer, Karl [2 ]
机构
[1] Univ Zagreb, Fac Elect & Comp Engn, Zagreb 10000, Croatia
[2] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, EPFL IC IIF LSIR, CH-1015 Lausanne, Switzerland
来源
关键词
k nearest neighbor queries; sliding windows; data streams; peer-to-peer system;
D O I
10.1007/s11280-011-0125-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A sliding-window k-NN query (k-NN/w query) continuously monitors incoming data stream objects within a sliding window to identify k closest objects to a query. It enables effective filtering of data objects streaming in at high rates from potentially distributed sources, and offers means to control the rate of object insertions into result streams. Therefore k-NN/w processing systems may be regarded as one of the prospective solutions for the information overload problem in applications that require processing of structured data in real-time, such as the Sensor Web. Existing k-NN/w processing systems are mainly centralized and cannot cope with multiple data streams, where data sources are scattered over the Internet. In this paper, we propose a solution for distributed continuous k-NN/w processing of structured data from distributed streams. We define a k-NN/w processing model for such setting, and design a distributed k-NN/w processing system on top of the Content-Addressable Network (CAN) overlay. An extensive evaluation using both real and synthetic data sets demonstrates the feasibility of the proposed solution because it balances the load among the peers, while the messaging overhead within the P2P network remains reasonable. Moreover, our results clearly show the solution is scalable for an increasing number of queries and peers.
引用
收藏
页码:465 / 494
页数:30
相关论文
共 50 条
  • [1] Distributed processing of continuous sliding-window k-NN queries for data stream filtering
    Krešimir Pripužić
    Ivana Podnar Žarko
    Karl Aberer
    [J]. World Wide Web, 2011, 14 : 465 - 494
  • [2] Efficiently processing continuous k-NN queries on data streams
    Boehm, Christian
    Ooi, Beng Chin
    Plant, Claudia
    Yan, Ying
    [J]. 2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 131 - +
  • [3] MNIST Dataset Classification Utilizing k-NN Classifier with Modified Sliding-Window Metric
    Grover, Divas
    Toghi, Behrad
    [J]. ADVANCES IN COMPUTER VISION, VOL 2, 2020, 944 : 583 - 591
  • [4] Sliding-window top-k queries on uncertain streams
    Jin, Cheqing
    Yi, Ke
    Chen, Lei
    Yu, Jeffrey Xu
    Lin, Xuemin
    [J]. VLDB JOURNAL, 2010, 19 (03): : 411 - 435
  • [5] Sketching distributed sliding-window data streams
    Papapetrou, Odysseas
    Garofalakis, Minos
    Deligiannakis, Antonios
    [J]. VLDB JOURNAL, 2015, 24 (03): : 345 - 368
  • [6] Sliding-Window Top-k Queries on Uncertain Streams
    Jin, Cheqing
    Yi, Ke
    Chen, Lei
    Yu, Jeffrey Xu
    Lin, Xuemin
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 301 - 312
  • [7] Sliding-window top-k queries on uncertain streams
    Cheqing Jin
    Ke Yi
    Lei Chen
    Jeffrey Xu Yu
    Xuemin Lin
    [J]. The VLDB Journal, 2010, 19 : 411 - 435
  • [8] Sketching distributed sliding-window data streams
    Odysseas Papapetrou
    Minos Garofalakis
    Antonios Deligiannakis
    [J]. The VLDB Journal, 2015, 24 : 345 - 368
  • [9] Incremental evaluation of sliding-window queries over data streams
    Ghanem, Thanaa M.
    Hammad, Moustafa A.
    Mokbel, Mohamed F.
    Aref, Walid G.
    Elmagarmid, Ahmed K.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (01) : 57 - 72
  • [10] Processing Class-Constraint K-NN Queries with MISP
    Milchevski, Evica
    Neffgen, Fabian
    Michel, Sebastian
    [J]. PROCEEDINGS OF THE 21ST WORKSHOP ON THE WEB AND DATABASES (WEBDB 2018), 2018,