Efficient monitoring of skyline queries over distributed data streams

被引:27
|
作者
Sun, Shengli [1 ]
Huang, Zhenghua [2 ]
Zhong, Hao [3 ]
Dai, Dongbo [4 ]
Liu, Hongbin [5 ]
Li, Jinjiu [6 ]
机构
[1] Peking Univ, Sch Software & Microelect, Beijing 100871, Peoples R China
[2] Tongji Univ, Dept Comp Sci, Sch Elect & Informat, Shanghai 200092, Peoples R China
[3] Chinese Acad Sci, Inst Software, Lab Internet Software Technol, Beijing, Peoples R China
[4] Fudan Univ, Sch Comp Sci & Technol, Shanghai 200433, Peoples R China
[5] State Grid Corp China, N China Grid China, Beijing, Peoples R China
[6] Univ Technol Sydney, Fac Engn & Informat Technol, Sydney, NSW 2007, Australia
基金
中国国家自然科学基金;
关键词
Distributed data streams; Skyline; Communication-optimal processing; Progressive refinement;
D O I
10.1007/s10115-009-0269-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data management and data mining over distributed data streams have received considerable attention within the database community recently. This paper is the first work to address skyline queries over distributed data streams, where streams derive from multiple horizontally split data sources. Skyline query returns a set of interesting objects which are not dominated by any other objects within the base dataset. Previous work is concentrated on skyline computations over static data or centralized data streams. We present an efficient and an effective algorithm called BOCS to handle this issue under a more challenging environment of distributed streams. BOCS consists of an efficient centralized algorithm GridSky and an associated communication protocol. Based on the strategy of progressive refinement in BOCS, the skyline is incrementally computed by two phases. In the first phase, local skylines on remote sites are maintained by GridSky. At each time, only skyline increments on remote sites are sent to the coordinator. In the second phase, a global skyline is obtained by integrating remote increments with the latest global skyline. A theoretical analysis shows that BOCS is communication-optimal among all algorithms which use a share-nothing strategy. Extensive experiments demonstrate that our proposals are efficient, scalable, and stable.
引用
收藏
页码:575 / 606
页数:32
相关论文
共 50 条
  • [21] Efficient Algorithms of Parallel Skyline Join over Data Streams
    Zhang, Jinchao
    Gu, JingZi
    Cheng, Shuai
    Li, Bo
    Wang, Weiping
    Meng, Dan
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT I, 2018, 11334 : 184 - 199
  • [22] Efficient mining of skyline objects in subspaces over data streams
    Huang, Zhenhua
    Sun, Shengli
    Wang, Wei
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 22 (02) : 159 - 183
  • [23] Efficient skyline computation over distributed interval data
    Li, Xiaoyong
    Ren, Kaijun
    Li, Xiaoling
    Yu, Jie
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (10):
  • [24] Optimizing monitoring queries over distributed data
    Neven, Frank
    Van de Craen, Dieter
    [J]. ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 829 - +
  • [25] A Multicore Parallelization of Continuous Skyline Queries on Data Streams
    De Matteis, Tiziano
    Di Girolamo, Salvatore
    Mencagli, Gabriele
    [J]. EURO-PAR 2015: PARALLEL PROCESSING, 2015, 9233 : 402 - 413
  • [26] Efficient processing of multiple continuous skyline queries over a data stream
    Lee, Yu Won
    Lee, Ki Yong
    Kim, Myoung Ho
    [J]. INFORMATION SCIENCES, 2013, 221 : 316 - 337
  • [27] Secure and Efficient Skyline Queries on Encrypted Data
    Liu, Jinfei
    Yang, Juncheng
    Xiong, Li
    Pei, Jian
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (07) : 1397 - 1411
  • [28] SKYPEER: Efficient subspace skyline computation over distributed data
    Vlachou, Akrivi
    Doulkeridis, Christos
    Kotidis, Yannis
    Vazirgiannis, Michalis
    [J]. 2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 391 - +
  • [29] Optimizing skyline queries over incomplete data
    Lee, Jongwuk
    Im, Hyeonseung
    You, Gae-won
    [J]. INFORMATION SCIENCES, 2016, 361 : 14 - 28
  • [30] Efficient Processing of Skyline-Join Queries over Multiple Data Sources
    Nagendra, Mithila
    Candan, K. Selcuk
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2015, 40 (02):