Framework for real-time clustering over sliding windows

被引:3
|
作者
Badiozamany, Sobhan [1 ]
Orsborn, Kjell [1 ]
Risch, Tore [1 ]
机构
[1] Uppsala Univ, Box 337, SE-75105 Uppsala, Sweden
关键词
Sliding windows; Clustering; Framework;
D O I
10.1145/2949689.2949696
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering queries over sliding windows require maintaining cluster memberships that change as windows slide. To address this, the Generic 2-phase Continuous Summarization framework (G2CS) utilizes a generation based window maintenance approach where windows are maintained over different time intervals. It provides algorithm independent and efficient sliding mechanisms for clustering queries where the clustering algorithms are defined in terms of queries over cluster data represented as temporal tables. A particular challenge for real-time detection of a high number of fastly evolving clusters is efficiently supporting smooth re-clustering in real-time, i.e. to minimize the sliding time with increasing window size and decreasing strides. To efficiently support such re-clustering for clustering algorithms where deletion of expired data is not supported, e.g. BIRCH, G2CS includes a novel window maintenance mechanism called Sliding Binary Merge (SBM), which maintains several generations of intermediate window instances and does not require decremental cluster maintenance. To improve real-time sliding performance, G2CS uses generation-based multi-dimensional indexing. Extensive performance evaluation on both synthetic and real data shows that G2CS scales substantially better than related approaches.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Sliding Alignment Windows for Real-Time Crowd Captioning
    Kazemi, Mohammad
    Lavaee, Rahman
    Naim, Iftekhar
    Gildea, Daniel
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 236 - 240
  • [2] A deadline-sensitive approach for real-time processing of sliding windows
    Wu, SS
    Yu, G
    Yu, YX
    Ou, ZY
    Yang, XH
    Gu, Y
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2005, 3739 : 566 - 577
  • [3] A Clustering Framework for Real-Time Rendering of Tree Foliage
    Rebollo, C.
    Remolar, I.
    Chover, M.
    Gumbau, J.
    Ripolles, O.
    JOURNAL OF COMPUTERS, 2007, 2 (04) : 57 - 67
  • [4] Clustering on Uncertain Data Stream over Sliding Windows
    Tu, Li
    2015 THIRD INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA, 2015, : 148 - 152
  • [5] Clustering Data Streams over Sliding Windows by DCA
    Ta Minh Thuy
    Le Thi Hoai An
    Boudjeloud-Assala, Lydia
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2013, 479 : 65 - 75
  • [6] Real-Time Data Stream Partitioning over a Sliding Window in Real-Time Spatial Big Data
    Hamdi, Sana
    Bouazizi, Emna
    Faiz, Sami
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT I, 2018, 11334 : 75 - 88
  • [7] Sliding windows: A software method suitable for real-time inspection of textile surfaces
    Anagnostopoulos, C
    Anagnostopoulos, I
    Vergados, D
    Kayafas, E
    Loumos, V
    TEXTILE RESEARCH JOURNAL, 2004, 74 (07) : 646 - 651
  • [8] An Efficient and Distributed Framework for Real-Time Trajectory Stream Clustering
    Gao, Yunjun
    Fang, Ziquan
    Xu, Jiachen
    Gong, Shenghao
    Shen, Chunhui
    Chen, Lu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (05) : 1857 - 1873
  • [9] WINDOWS GOES REAL-TIME
    RAJAMANI, K
    BHASKER, N
    GERBER, R
    SNYDER, S
    BYTE, 1992, 17 (04): : 119 - &
  • [10] WINDOWS GOES REAL-TIME
    NIEMI, MF
    BYTE, 1992, 17 (08): : 16 - 16