An adaptive approach for online monitoring of large-scale data streams

被引:0
|
作者
Cao, Shuchen [1 ]
Zhang, Ruizhi [2 ]
机构
[1] Univ Nebraska Lincoln, Dept Stat, Lincoln, NE USA
[2] Univ Georgia, Dept Stat, Athens, GA USA
关键词
False discovery rate; CUSUM; quickest change detection; process control; FALSE DISCOVERY RATE; CHANGE-POINT DETECTION; CHANGEPOINT DETECTION; SCHEMES;
D O I
10.1080/24725854.2023.2281580
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this article, we propose an adaptive top-r method to monitor large-scale data streams where the change may affect a set of unknown data streams at some unknown time. Motivated by parallel and distributed computing, we propose to develop global monitoring schemes by parallel running local detection procedures and then use the Benjamin-Hochberg false discovery rate control procedure to estimate the number of changed data streams adaptively. Our approach is illustrated in two concrete examples: one is a homogeneous case when all data streams are independent and identically distributed with the same known pre-change and post-change distributions. The other is when all data are normally distributed, and the mean shifts are unknown and can be positive or negative. Theoretically, we show that when the pre-change and post-change distributions are completely specified, our proposed method can estimate the number of changed data streams for both the pre-change and post-change status. Moreover, we perform simulations and two case studies to show its detection efficiency.
引用
收藏
页码:119 / 130
页数:12
相关论文
共 50 条
  • [21] A spatial-adaptive sampling procedure for online monitoring of big data streams
    Wang, Andi
    Xian, Xiaochen
    Tsung, Fugee
    Liu, Kaibo
    JOURNAL OF QUALITY TECHNOLOGY, 2018, 50 (04) : 329 - 343
  • [23] A Data-Centric Storage Approach for Monitoring System of Large-Scale Smart Grid
    Wang, Yan
    Deng, Qingxu
    Liu, Wei
    Song, Baoyan
    2012 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING (WICOM), 2012,
  • [24] DLLog: An Online Log Parsing Approach for Large-Scale System
    Cheng, Hailong
    Ying, Shi
    Duan, Xiaoyu
    Yuan, Wanli
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024
  • [25] Online Dictionary Learning from Large-Scale Binary Data
    Shen, Yanning
    Giannakis, Georgios B.
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1808 - 1812
  • [26] Distributed Large-Scale Data Collection in Online Social Networks
    Efstathiades, Hariton
    Antoniades, Demetris
    Pallis, George
    Dikaiakos, Marios D.
    2016 IEEE 2ND INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (IEEE CIC), 2016, : 373 - 380
  • [27] Novel approach to typical air-conditioning behavior pattern extraction based on large-scale VRF system online monitoring data
    Wu, Yi
    Zhou, Xin
    Qian, Mingyang
    Jin, Yuan
    Sun, Hongsan
    Yan, Da
    JOURNAL OF BUILDING ENGINEERING, 2023, 69
  • [28] An adaptive clustering algorithm by neighbourhood search for large-scale data
    Sevinc, Busra
    Gurler, Selma
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023, 93 (01) : 175 - 187
  • [29] The HaLoop approach to large-scale iterative data analysis
    Bu, Yingyi
    Howe, Bill
    Balazinska, Magdalena
    Ernst, Michael D.
    VLDB JOURNAL, 2012, 21 (02): : 169 - 190
  • [30] The HaLoop approach to large-scale iterative data analysis
    Yingyi Bu
    Bill Howe
    Magdalena Balazinska
    Michael D. Ernst
    The VLDB Journal, 2012, 21 : 169 - 190