An adaptive approach for online monitoring of large-scale data streams

被引:0
|
作者
Cao, Shuchen [1 ]
Zhang, Ruizhi [2 ]
机构
[1] Univ Nebraska Lincoln, Dept Stat, Lincoln, NE USA
[2] Univ Georgia, Dept Stat, Athens, GA USA
关键词
False discovery rate; CUSUM; quickest change detection; process control; FALSE DISCOVERY RATE; CHANGE-POINT DETECTION; CHANGEPOINT DETECTION; SCHEMES;
D O I
10.1080/24725854.2023.2281580
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this article, we propose an adaptive top-r method to monitor large-scale data streams where the change may affect a set of unknown data streams at some unknown time. Motivated by parallel and distributed computing, we propose to develop global monitoring schemes by parallel running local detection procedures and then use the Benjamin-Hochberg false discovery rate control procedure to estimate the number of changed data streams adaptively. Our approach is illustrated in two concrete examples: one is a homogeneous case when all data streams are independent and identically distributed with the same known pre-change and post-change distributions. The other is when all data are normally distributed, and the mean shifts are unknown and can be positive or negative. Theoretically, we show that when the pre-change and post-change distributions are completely specified, our proposed method can estimate the number of changed data streams for both the pre-change and post-change status. Moreover, we perform simulations and two case studies to show its detection efficiency.
引用
收藏
页码:119 / 130
页数:12
相关论文
共 50 条
  • [31] A Large-Scale, Multiagency Approach to Defining a Reference Network for Pacific Northwest Streams
    Miller, Stephanie
    Eldred, Peter
    Muldoon, Ariel
    Anlauf-Dunn, Kara
    Stein, Charlie
    Hubler, Shannon
    Merrick, Lesley
    Haxton, Nick
    Larson, Chad
    Rehn, Andrew
    Ode, Peter
    Vander Laan, Jake
    ENVIRONMENTAL MANAGEMENT, 2016, 58 (06) : 1091 - 1104
  • [32] An efficient algorithm for dense regions discovery from large-scale data streams
    Yip, AM
    Wu, EH
    Ng, MK
    Chan, TF
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2004, 3056 : 116 - 120
  • [33] Anomaly Detection for Data Streams in Large-Scale Distributed Heterogeneous Computing Environments
    Dang, Yue
    Wang, Bin
    Brant, Ryan
    Zhang, Zhiping
    Alqallaf, Maha
    Wu, Zhiqiang
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON CYBER WARFARE AND SECURITY (ICCWS 2017), 2017, : 121 - 130
  • [34] A Large-Scale, Multiagency Approach to Defining a Reference Network for Pacific Northwest Streams
    Stephanie Miller
    Peter Eldred
    Ariel Muldoon
    Kara Anlauf-Dunn
    Charlie Stein
    Shannon Hubler
    Lesley Merrick
    Nick Haxton
    Chad Larson
    Andrew Rehn
    Peter Ode
    Jake Vander Laan
    Environmental Management, 2016, 58 : 1091 - 1104
  • [35] Large-scale characterization of Java']Java streams
    Rosales, Eduardo
    Basso, Matteo
    Rosa, Andrea
    Binder, Walter
    SOFTWARE-PRACTICE & EXPERIENCE, 2023, 53 (09): : 1763 - 1792
  • [36] Design and implementation of an adaptive monitoring system for large-scale server clusters
    School of Electronics and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
    不详
    Hsi An Chiao Tung Ta Hsueh, 2008, 4 (399-403):
  • [37] Adaptive Real-time Monitoring for Large-scale Networked Systems
    Prieto, Alberto Gonzalez
    Stadler, Rolf
    2009 IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM 2009) VOLS 1 AND 2, 2009, : 790 - 795
  • [38] MANAGING DATA FROM LARGE-SCALE CONTINUOUS MONITORING PROJECTS
    MCMORRIS, RL
    GRAVLEY, RJ
    CHEMICAL ENGINEERING PROGRESS, 1993, 89 (03) : 111 - 115
  • [39] Efficient Data Collection for Large-Scale Mobile Monitoring Applications
    Shen, Haiying
    Li, Ze
    Yu, Lei
    Qiu, Chenxi
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (06) : 1424 - 1436
  • [40] Application of Remote Sensing Data in Large-Scale Monitoring of Wetlands
    S. S. Shinkarenko
    S. A. Bartalev
    Cosmic Research, 2024, 62 (Suppl 1) : S100 - S114