RLC: ranking lag correlations with flexible sliding windows in data streams

被引:2
|
作者
Wu, Shanshan [1 ]
Lin, Huaizhong [1 ]
Wang, Wenxiang [1 ]
Lu, Dongming [1 ]
U, Leong Hou [2 ]
Gao, Yunjun [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Zhejiang, Peoples R China
[2] Univ Macau, Fac Sci & Technol, Macau, Peoples R China
关键词
Lag correlation; Flexible sliding windows; Data streams;
D O I
10.1007/s10044-016-0577-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lag correlation between two time series is the correlation shifted in time relative to one another. Existing work focuses on two computation models, landmark (where the lag correlation is computed over the entire stream) and sliding window (where the lag correlation is computed over the current window). However, these models may suffer from problems like result freshness (e.g., perished items in the landmark model) and parametric tuning (e.g., setting a proper length in the sliding window model). In this work, we attempt to analyze the lag correlation which is computed based on flexible sliding windows. In view of that, a new query called RLC (ranking lag correlations with flexible sliding windows in data streams) is proposed. The key challenge in answering the RLC query is that the number of windows to be analyzed will grow quadratically with the length of the stream, resulting a quadratic computation cost. To boost the computation, we employ the running sum and the geometric probing techniques to facilitate the query processing. We also present an approximate solution that further reduces the computation cost with an acceptable error rate in practice. The extensive experiments verify the efficiency of our proposed methods. We also demonstrate some lag correlations discovered from real datasets to show the practicality of this work.
引用
收藏
页码:601 / 611
页数:11
相关论文
共 50 条
  • [31] Distributed streams algorithms for sliding windows
    Gibbons, PB
    Tirthapura, S
    [J]. THEORY OF COMPUTING SYSTEMS, 2004, 37 (03) : 457 - 478
  • [32] Heavy Hitters in Streams and Sliding Windows
    Ben-Basat, Ran
    Einziger, Gil
    Friedman, Roy
    Kassner, Yaron
    [J]. IEEE INFOCOM 2016 - THE 35TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS, 2016,
  • [33] Distributed Streams Algorithms for Sliding Windows
    Phillip B. Gibbons
    Srikanta Tirthapura
    [J]. Theory of Computing Systems, 2004, 37 : 457 - 478
  • [34] Fast Discovery of Group Lag Correlations in Streams
    Sakurai, Yasushi
    Faloutsos, Christos
    Papadimitriou, Spiros
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 5 (01)
  • [35] Queueing Analysis of Continuous Queries for Uncertain Data Streams Over Sliding Windows
    Xiao, Guoqing
    Li, Kenli
    Zhou, Xu
    Li, Keqin
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2016, 30 (09)
  • [36] Finding frequent items in sliding windows over data streams using EBF
    Wang, ShuYun
    Xu, HeXiang
    Hu, YunFa
    [J]. SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 682 - +
  • [37] Mining Swarm Patterns in Sliding Windows over Moving Object Data Streams
    Bhushan, Alka
    Bellur, Umesh
    Sharma, Kuldeep
    Deshpande, Srijay
    Sarda, Nandlal L.
    [J]. 25TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2017), 2017,
  • [38] Processing sliding windows over disordered streams
    Kim, Hyeon Gyu
    Kim, Myoung Ho
    [J]. 2008 THE INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, 2008, : 25 - +
  • [39] Approximate Range Emptiness in Constant Time for IoT Data Streams over Sliding Windows
    Wang, Xiujun
    Liu, Zhi
    Yang, Yangzhao
    Shao, Xun
    Gu, Yu
    Ishihara, Susumu
    [J]. 2019 28TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND NETWORKS (ICCCN), 2019,