Efficient monitoring of skyline queries over distributed data streams

被引:27
|
作者
Sun, Shengli [1 ]
Huang, Zhenghua [2 ]
Zhong, Hao [3 ]
Dai, Dongbo [4 ]
Liu, Hongbin [5 ]
Li, Jinjiu [6 ]
机构
[1] Peking Univ, Sch Software & Microelect, Beijing 100871, Peoples R China
[2] Tongji Univ, Dept Comp Sci, Sch Elect & Informat, Shanghai 200092, Peoples R China
[3] Chinese Acad Sci, Inst Software, Lab Internet Software Technol, Beijing, Peoples R China
[4] Fudan Univ, Sch Comp Sci & Technol, Shanghai 200433, Peoples R China
[5] State Grid Corp China, N China Grid China, Beijing, Peoples R China
[6] Univ Technol Sydney, Fac Engn & Informat Technol, Sydney, NSW 2007, Australia
基金
中国国家自然科学基金;
关键词
Distributed data streams; Skyline; Communication-optimal processing; Progressive refinement;
D O I
10.1007/s10115-009-0269-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data management and data mining over distributed data streams have received considerable attention within the database community recently. This paper is the first work to address skyline queries over distributed data streams, where streams derive from multiple horizontally split data sources. Skyline query returns a set of interesting objects which are not dominated by any other objects within the base dataset. Previous work is concentrated on skyline computations over static data or centralized data streams. We present an efficient and an effective algorithm called BOCS to handle this issue under a more challenging environment of distributed streams. BOCS consists of an efficient centralized algorithm GridSky and an associated communication protocol. Based on the strategy of progressive refinement in BOCS, the skyline is incrementally computed by two phases. In the first phase, local skylines on remote sites are maintained by GridSky. At each time, only skyline increments on remote sites are sent to the coordinator. In the second phase, a global skyline is obtained by integrating remote increments with the latest global skyline. A theoretical analysis shows that BOCS is communication-optimal among all algorithms which use a share-nothing strategy. Extensive experiments demonstrate that our proposals are efficient, scalable, and stable.
引用
收藏
页码:575 / 606
页数:32
相关论文
共 50 条
  • [41] Efficient Computation of Skyline Queries on Incomplete Dynamic Data
    Wang, Hongzhi
    Yin, Shengjun
    Sun, Ming
    Wang, Ye
    Wang, Hepeng
    Li, Jianzhong
    Gao, Hong
    [J]. IEEE ACCESS, 2018, 6 : 52741 - 52753
  • [42] eSkyline: Processing Skyline Queries over Encrypted Data
    Bothe, Suvarna
    Karras, Panagiotis
    Vlachou, Akrivi
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (12): : 1338 - 1341
  • [43] A framework for multidimensional skyline queries over streaming data
    Alami, Karim
    Maabout, Sofian
    [J]. DATA & KNOWLEDGE ENGINEERING, 2020, 127
  • [44] Answering Skyline Queries over Incomplete Data with Crowdsourcing
    Miao, Xiaoye
    Gao, Yunjun
    Guo, Su
    Chen, Lu
    Yin, Jianwei
    Li, Qing
    [J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 2032 - 2033
  • [45] A Framework for Evaluating Skyline Queries over Incomplete Data
    Gulzar, Yonis
    Alwan, Ali A.
    Salleh, Norsaremah
    Al Shaikhli, Imad Fakhri
    Alvi, Syed Idrees Mairaj
    [J]. 11TH INTERNATIONAL CONFERENCE ON FUTURE NETWORKS AND COMMUNICATIONS (FNC 2016) / THE 13TH INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS AND PERVASIVE COMPUTING (MOBISPC 2016) / AFFILIATED WORKSHOPS, 2016, 94 : 191 - 198
  • [46] Efficient Computation of Skyline Queries Over a Dynamic and Incomplete Database
    Dehaki, Ghazaleh Babanejad
    Ibrahim, Hamidah
    Sidi, Fatimah
    Udzir, Nur Izura
    Alwan, Ali A.
    Gulzar, Yonis
    [J]. IEEE ACCESS, 2020, 8 : 141523 - 141546
  • [47] Efficient Algorithms for Skyline Top-K Keyword Queries on XML Streams
    Li, Lingli
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2009, 5463 : 283 - 287
  • [48] S_IDS: An efficient skyline query algorithm over incomplete data streams
    Bai, Mei
    Han, Yuxue
    Yin, Peng
    Xite, Wang
    Li, Guanyu
    Ning, Bo
    Ma, Qian
    [J]. DATA & KNOWLEDGE ENGINEERING, 2024, 149
  • [49] FIDS: Monitoring frequent items over distributed data streams
    Fuller, Robert
    Kantardzic, Mehmed
    [J]. MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2007, 4571 : 464 - +
  • [50] Towards Efficient and Privacy-Preserving Interval Skyline Queries Over Time Series Data
    Zhang, Songnian
    Ray, Suprio
    Lu, Rongxing
    Zheng, Yandong
    Guan, Yunguo
    Shao, Jun
    [J]. IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (02) : 1348 - 1363