QuantTree: Histograms for Change Detection in Multivariate Data Streams

Cited by: 0
Authors
Boracchi, Giacomo [1]
Carrera, Diego [1]
Cervellera, Cristiano [2]
Maccio, Danilo [2]
Affiliations
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingegneria, Milan, Italy
[2] CNR, Inst Intelligent Syst Automat, Genoa, Italy
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We address the problem of detecting distribution changes in multivariate data streams by means of histograms. Histograms are very general and flexible models, but they have been relatively ignored in the change-detection literature because they often require a number of bins that grows infeasibly with the data dimension. We present QuantTree, a recursive binary splitting scheme that adaptively defines the histogram bins to ease the detection of any distribution change. Our design scheme implies that i) we can easily control the overall number of bins and ii) the bin probabilities do not depend on the distribution of stationary data. The latter is a very relevant aspect in change detection, since thresholds of test statistics based on these histograms (e.g., the Pearson statistic or the total variation) can be numerically computed from univariate, synthetically generated data while still guaranteeing a controlled false positive rate. Our experiments show that the proposed histograms are very effective in detecting changes in high-dimensional data streams, and that the resulting thresholds can effectively control the false positive rate, even when the number of training samples is relatively small.
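The abstract does not spell out the splitting rule, so the following Python sketch is only an illustration of the general idea, not the authors' implementation. It assumes that each split picks a random coordinate and a random tail, and cuts at the sample quantile that isolates roughly N/K of the remaining training points. The function names (build_histogram, bin_counts, pearson, mc_threshold) and all parameter values are hypothetical. The threshold is estimated by Monte Carlo on univariate uniform data, following the abstract's claim that the bin probabilities do not depend on the stationary distribution.

```python
import numpy as np


def build_histogram(train, n_bins, rng):
    """Greedy quantile splits (assumed rule): each cut isolates an equal
    share of the remaining training points along a random coordinate."""
    n, d = train.shape
    remaining = train
    cuts, sizes = [], []  # cuts: (dimension, threshold, lower_tail)
    for k in range(n_bins - 1):
        target = len(remaining) // (n_bins - k)
        dim = int(rng.integers(d))
        lower_tail = bool(rng.random() < 0.5)
        values = np.sort(remaining[:, dim])
        if lower_tail:
            thr = values[target - 1]
            inside = remaining[:, dim] <= thr
        else:
            thr = values[-target]
            inside = remaining[:, dim] >= thr
        cuts.append((dim, thr, lower_tail))
        sizes.append(int(inside.sum()))
        remaining = remaining[~inside]
    sizes.append(len(remaining))  # the last bin holds whatever is left
    return cuts, np.array(sizes) / n


def bin_counts(cuts, batch):
    """Assign each point to the first bin whose cut it satisfies."""
    counts = np.zeros(len(cuts) + 1, dtype=int)
    unassigned = np.ones(len(batch), dtype=bool)
    for k, (dim, thr, lower_tail) in enumerate(cuts):
        hit = batch[:, dim] <= thr if lower_tail else batch[:, dim] >= thr
        hit &= unassigned
        counts[k] = hit.sum()
        unassigned &= ~hit
    counts[-1] = unassigned.sum()
    return counts


def pearson(counts, probs):
    """Pearson statistic: sum over bins of (observed - expected)^2 / expected."""
    expected = counts.sum() * probs
    return float(((counts - expected) ** 2 / expected).sum())


def mc_threshold(n_train, n_bins, batch_size, alpha, rng, trials=1000):
    """Estimate the (1 - alpha) quantile of the statistic under no change,
    using only univariate uniform synthetic data."""
    stats = []
    for _ in range(trials):
        cuts, probs = build_histogram(rng.random((n_train, 1)), n_bins, rng)
        batch = rng.random((batch_size, 1))
        stats.append(pearson(bin_counts(cuts, batch), probs))
    return float(np.quantile(stats, 1 - alpha))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    train = rng.normal(size=(512, 4))  # stationary training data
    cuts, probs = build_histogram(train, n_bins=8, rng=rng)
    threshold = mc_threshold(512, 8, batch_size=256, alpha=0.05, rng=rng)
    shifted = rng.normal(loc=3.0, size=(256, 4))  # batch after a mean shift
    stat = pearson(bin_counts(cuts, shifted), probs)
    print("change detected:", stat > threshold)
```

Ties in the training data would make a bin absorb more points than intended; the sketch assumes continuous-valued features, where this does not happen.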
Pages: 10
Related papers
50 in total
  • [1] Uniform Histograms for Change Detection in Multivariate Data
    Boracchi, Giacomo
    Cervellera, Cristiano
    Maccio, Danilo
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1732 - 1739
  • [2] Change detection in learning histograms from data streams
    Sebastiao, Raquel
    Gama, Joao
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4874 : 112 - 123
  • [3] Nonparametric and Online Change Detection in Multivariate Datastreams Using QuantTree
    Frittoli, Luca
    Carrera, Diego
    Boracchi, Giacomo
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 8328 - 8342
  • [4] Change detection in parametric multivariate dynamic data streams using the ARMAX-GARCH model
    Yu, Miaomiao
    Wu, Chunjie
    Tsung, Fugee
    [J]. JOURNAL OF QUALITY TECHNOLOGY, 2022, 54 (03) : 303 - 323
  • [5] Visual and Dynamic Change Detection for Data Streams
    Boudjeloud-Assala, Lydia
    Pinheiro, Philippe
    Blansche, Alexandre
    Tamisier, Thomas
    Otjaques, Benoit
    [J]. NEURAL INFORMATION PROCESSING, PT III, 2015, 9491 : 402 - 410
  • [6] A Supervised Approach for Change Detection in Data Streams
    Bondu, A.
    Boulle, M.
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 519 - 526
  • [7] Constructing fading histograms from data streams
    Sebastiao, Raquel
    Gama, Joao
    Mendonca, Teresa
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, 2014, 3 (01) : 15 - 28
  • [8] Adaptive clusters and histograms over data streams
    Puttagunta, V
    Kalpakis, K
    [J]. IKE '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGINEERING, 2005, : 98 - 104
  • [9] MULTIVARIATE HISTOGRAMS WITH DATA-DEPENDENT PARTITIONS
    Klemela, Jussi
    [J]. STATISTICA SINICA, 2009, 19 (01) : 159 - 176
  • [10] Change point detection for compositional multivariate data
    K. J., Prabuchandran
    Singh, Nitin
    Dayama, Pankaj
    Agarwal, Ashutosh
    Pandit, Vinayaka
    [J]. APPLIED INTELLIGENCE, 2022, 52 (02) : 1930 - 1955