An adaptive approach for online monitoring of large-scale data streams

Cited by: 0
Authors
Cao, Shuchen [1]
Zhang, Ruizhi [2]
Affiliations
[1] Univ Nebraska Lincoln, Dept Stat, Lincoln, NE USA
[2] Univ Georgia, Dept Stat, Athens, GA USA
Keywords
False discovery rate; CUSUM; quickest change detection; process control; change-point detection; changepoint detection; schemes
DOI
10.1080/24725854.2023.2281580
Chinese Library Classification number
T [Industrial Technology]
Subject classification code
08
Abstract
In this article, we propose an adaptive top-r method to monitor large-scale data streams in which a change may affect an unknown subset of the streams at an unknown time. Motivated by parallel and distributed computing, we develop global monitoring schemes by running local detection procedures in parallel and then using the Benjamini-Hochberg false discovery rate control procedure to adaptively estimate the number of changed data streams. We illustrate the approach with two concrete examples. The first is a homogeneous case in which all data streams are independent and identically distributed with the same known pre-change and post-change distributions. The second is a case in which all data are normally distributed and the mean shifts are unknown and may be positive or negative. Theoretically, we show that when the pre-change and post-change distributions are completely specified, the proposed method can estimate the number of changed data streams in both the pre-change and post-change states. Moreover, we perform simulations and two case studies to demonstrate its detection efficiency.
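The general idea in the abstract (local CUSUM statistics, a Benjamini-Hochberg step to estimate the number of changed streams, then a top-r global statistic) can be sketched as follows. This is a minimal illustration under stated assumptions and is not taken from the article: the function names, the Gaussian mean-shift model with known means `mu0` and `mu1`, and the conservative p-value `exp(-W)` (a standard bound on the pre-change tail of a log-likelihood-ratio CUSUM statistic) are all choices made here for illustration. The article's own p-value construction and stopping rule may differ.

```python
import math
import random


def cusum_update(prev, x, mu0=0.0, mu1=1.0, sigma=1.0):
    """One CUSUM step W_n = max(0, W_{n-1} + LLR(x)) for a Gaussian mean shift.

    mu0, mu1 and sigma are assumed known (illustrative homogeneous case).
    """
    llr = (mu1 - mu0) * (x - (mu0 + mu1) / 2.0) / sigma ** 2
    return max(0.0, prev + llr)


def bh_count(p_values, alpha=0.05):
    """Number of rejections made by the Benjamini-Hochberg step-up procedure."""
    m = len(p_values)
    count = 0
    for k, p in enumerate(sorted(p_values), start=1):
        if p <= alpha * k / m:
            count = k  # keep the largest k that passes the BH threshold
    return count


def adaptive_top_r(stats, alpha=0.05):
    """Estimate the number of changed streams and sum the largest statistics.

    Assumption: exp(-W) is used as a conservative p-value for each local
    CUSUM statistic W; this is an illustrative choice, not the article's.
    """
    p_values = [math.exp(-w) for w in stats]
    r_hat = bh_count(p_values, alpha)
    top = sorted(stats, reverse=True)[:r_hat]
    return r_hat, sum(top)


if __name__ == "__main__":
    # Hypothetical demo: 100 streams, the first 5 shift from mean 0 to 1 at time 50.
    rng = random.Random(0)
    stats = [0.0] * 100
    for t in range(1, 101):
        for i in range(100):
            mean = 1.0 if (i < 5 and t > 50) else 0.0
            stats[i] = cusum_update(stats[i], rng.gauss(mean, 1.0))
    r_hat, global_stat = adaptive_top_r(stats)
    print("estimated changed streams:", r_hat, "global statistic:", global_stat)
```

In a monitoring loop, an alarm would be raised the first time the global statistic exceeds a threshold calibrated to the desired false alarm rate; that threshold is left out here because it depends on the specific design.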
Pages: 119-130
Page count: 12