CHANGE-POINT DETECTION IN MULTINOMIAL DATA WITH A LARGE NUMBER OF CATEGORIES

被引:20
|
作者
Wang, Guanghui [1 ,2 ]
Zou, Changliang [1 ,2 ]
Yin, Guosheng [3 ]
机构
[1] Nankai Univ, Inst Stat, Tianjin, Peoples R China
[2] Nankai Univ, LPMC, Tianjin, Peoples R China
[3] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Hong Kong, Peoples R China
来源
ANNALS OF STATISTICS | 2018年 / 46卷 / 05期
关键词
Asymptotic normality; categorical data; high-dimensional homogeneity test; multiple change-point detection; sparse contingency table; TIME-SERIES; MULTIPLE; MODELS;
D O I
10.1214/17-AOS1610
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider a sequence of multinomial data for which the probabilities associated with the categories are subject to abrupt changes of unknown magnitudes at unknown locations. When the number of categories is comparable to or even larger than the number of subjects allocated to these categories, conventional methods such as the classical Pearson's chi-squared test and the deviance test may not work well. Motivated by high-dimensional homogeneity tests, we propose a novel change-point detection procedure that allows the number of categories to tend to infinity. The null distribution of our test statistic is asymptotically normal and the test performs well with finite samples. The number of change-points is determined by minimizing a penalized objective function based on segmentation, and the locations of the change-points are estimated by minimizing the objective function with the dynamic programming algorithm. Under some mild conditions, the consistency of the estimators of multiple change-points is established. Simulation studies show that the proposed method performs satisfactorily for identifying change-points in terms of power and estimation accuracy, and it is illustrated with an analysis of a real data set.
引用
收藏
页码:2020 / 2044
页数:25
相关论文
共 50 条
  • [1] Outlier detection for multinomial data with a large number of categories
    Yang, Xiaona
    Wang, Zhaojun
    Zi, Xuemin
    [J]. RANDOM MATRICES-THEORY AND APPLICATIONS, 2020, 9 (03)
  • [2] MULTINOMIAL SAMPLING FOR HIERARCHICAL CHANGE-POINT DETECTION
    Romero-Medrano, Lorena
    Moreno-Munoz, Pablo
    Artes-Rodriguez, Antonio
    [J]. PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [3] Change-point detection in multinomial data using phi-divergence test statistics
    Batsidis, A.
    Horvath, L.
    Martin, N.
    Pardo, L.
    Zografos, K.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 118 : 53 - 66
  • [4] Multinomial Sampling of Latent Variables for Hierarchical Change-Point Detection
    Romero-Medrano, Lorena
    Moreno-Munoz, Pablo
    Artes-Rodriguez, Antonio
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2022, 94 (02): : 215 - 227
  • [5] Multinomial Sampling of Latent Variables for Hierarchical Change-Point Detection
    Lorena Romero-Medrano
    Pablo Moreno-Muñoz
    Antonio Artés-Rodríguez
    [J]. Journal of Signal Processing Systems, 2022, 94 : 215 - 227
  • [6] Sequential change-point detection in a multinomial logistic regression model
    Li, Fuxiao
    Chen, Zhanshou
    Xiao, Yanting
    [J]. OPEN MATHEMATICS, 2020, 18 : 807 - 819
  • [7] Change-Point Detection in Angular Data
    Irina Grabovsky
    Lajos Horváth
    [J]. Annals of the Institute of Statistical Mathematics, 2001, 53 : 552 - 566
  • [8] Change-point detection in panel data
    Horvath, Lajos
    Huskova, Marie
    [J]. JOURNAL OF TIME SERIES ANALYSIS, 2012, 33 (04) : 631 - 648
  • [9] Change-point detection in angular data
    Grabovsky, I
    Horváth, L
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2001, 53 (03) : 552 - 566
  • [10] Change-Point Detection in a High-Dimensional Multinomial Sequence Based on Mutual Information
    Xiang, Xinrong
    Jin, Baisuo
    Wu, Yuehua
    [J]. ENTROPY, 2023, 25 (02)