Cluster non-Gaussian functional data

被引:6
|
作者
Zhong, Qingzhi [1 ,2 ]
Lin, Huazhen [1 ,2 ]
Li, Yi [3 ]
机构
[1] Southwestern Univ Finance & Econ, Ctr Stat Res, Chengdu 611130, Peoples R China
[2] Southwestern Univ Finance & Econ, Sch Stat, Chengdu 611130, Peoples R China
[3] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
基金
美国国家卫生研究院; 中国国家自然科学基金;
关键词
clustering analysis; functional principal component analysis; non-Gaussian functional data; nonparametric transformation model; penalized EM algorithm; MAXIMUM-LIKELIHOOD; CONVERGENCE-RATES; MIXTURE MODEL; REGRESSION; TRANSFORMATION;
D O I
10.1111/biom.13349
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gaussian distributions have been commonly assumed when clustering functional data. When the normality condition fails, biased results will follow. Additional challenges occur as the number of the clusters is often unknowna priori. This paper focuses on clustering non-Gaussian functional data without the prior information of the number of clusters. We introduce a semiparametric mixed normal transformation model to accommodate non-Gaussian functional data, and propose a penalized approach to simultaneously estimate the parameters, transformation function, and the number of clusters. The estimators are shown to be consistent and asymptotically normal. The practical utility of the methods is confirmed via simulations as well as an application of the analysis of Alzheimer's disease study. The proposed method yields much less classification error than the existing methods. Data used in preparation of this paper were obtained from the Alzheimer's Disease Neuroimaging Initiative database.
引用
收藏
页码:852 / 865
页数:14
相关论文
共 50 条
  • [41] Information content in threshold data with non-Gaussian noise
    Greenwood, Priscilla E.
    Lansky, Petr
    FLUCTUATION AND NOISE LETTERS, 2007, 7 (01): : L79 - L89
  • [42] A Noncentral and Non-Gaussian Probability Model for SAR Data
    Cristea, Anca
    Doulgeris, Anthony P.
    Eltoft, Torbjorn
    IMAGE ANALYSIS, SCIA 2017, PT II, 2017, 10270 : 159 - 168
  • [43] A copula model for non-Gaussian multivariate spatial data
    Krupskii, Pavel
    Genton, Marc G.
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 169 : 264 - 277
  • [44] Non-Gaussian Penalized PARAFAC Analysis for fMRI Data
    Liang, Jingsai
    Zou, Jiancheng
    Hong, Don
    FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2019, 5
  • [45] Sampling the posterior: An approach to non-Gaussian data assimilation
    Apte, A.
    Hairer, M.
    Stuart, A. M.
    Voss, J.
    PHYSICA D-NONLINEAR PHENOMENA, 2007, 230 (1-2) : 50 - 64
  • [46] NON-GAUSSIAN DYNAMIC BAYESIAN MODELLING FOR PANEL DATA
    Juarez, Miguel A.
    Steel, Mark F. J.
    JOURNAL OF APPLIED ECONOMETRICS, 2010, 25 (07) : 1128 - 1154
  • [47] The 4 year COBE DMR data is non-Gaussian
    Ferreira, PG
    Górski, KM
    Magueijo, J
    3 K COSMOLOGY: EC-TMR CONFERENCE, 1999, 476 : 293 - 297
  • [48] Smoothing non-Gaussian data in two-dimensions
    Emserman, M
    Kafadar, K
    MINING AND MODELING MASSIVE DATA SETS IN SCIENCE, ENGINEERING, AND BUSINESS WITH A SUBTHEME IN ENVIRONMENTAL STATISTICS, 1997, 29 (01): : 178 - 183
  • [49] NONLINEAR DATA OBSERVABILITY AND NON-GAUSSIAN INFORMATION STRUCTURES
    MOHLER, RR
    HWANG, CS
    LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1986, 85 : 125 - 153
  • [50] Arbitrage chances and the non-Gaussian features of financial data
    Tanaka-Yamawaki, M
    Komaki, S
    Itabashi, T
    2003 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING, PROCEEDINGS, 2003, : 237 - 242