Optimal Linear Discriminant Analysis for High-Dimensional Functional Data

被引:6
|
作者
Xue, Kaijie [1 ]
Yang, Jin [2 ]
Yao, Fang [3 ]
机构
[1] Nankai Univ, Sch Stat & Data Sci, Tianjin, Peoples R China
[2] Eunice Kennedy Shriver Natl Inst Child Hlth & Huma, Biostat & Bioinformat Branch, NIH, Bethesda, MD USA
[3] Peking Univ, Ctr Stat Sci, Sch Math Sci, Dept Probabil & Stat, Beijing, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Discriminant set inclusion; Functional principal components; Penalized classifier; CLASSIFICATION; REGRESSION; SELECTION; CLASSIFIERS; MODELS;
D O I
10.1080/01621459.2022.2164288
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Most of existing methods of functional data classification deal with one or a few processes. In this work we tackle classification of high-dimensional functional data, in which each observation is potentially associated with a large number of functional processes, p, which is comparable to or even much larger than the sample size n. The challenge arises from the complex inter-correlation structures among multiple functional processes, instead of a diagonal correlation for a single process. Since truncation is often needed for approximation in functional data, another difficulty stems from the fact that the discriminant set of the infinite-dimensional optimal classifier may be different from that of the truncated optimal classifier, when multiple (especially a large number of) processes are involved. We bridge the gap by proposing a penalized classifier that achieves both near-perfect classification that is unique to functional data, and discriminant set inclusion consistency in the sense that the classification-responsible functional predictors include those of the underlying optimal classifier. Simulation study and real data application are carried out to demonstrate its favorable performance. for this article are available online.
引用
收藏
页码:1055 / 1064
页数:10
相关论文
共 50 条
  • [1] A modified linear discriminant analysis for high-dimensional data
    Hyodo, Masashi
    Yamada, Takayuki
    Himeno, Tetsuto
    Seo, Takashi
    HIROSHIMA MATHEMATICAL JOURNAL, 2012, 42 (02) : 209 - 231
  • [2] Generalized Linear Discriminant Analysis for High-Dimensional Genomic Data
    Li, Sisi
    Lewinger, Juan Pablo
    GENETIC EPIDEMIOLOGY, 2017, 41 (07) : 704 - 704
  • [3] Generalized linear discriminant analysis for high-dimensional genomic data
    Li, Sisi
    Lewinger, Juan Pablo
    GENETIC EPIDEMIOLOGY, 2018, 42 (07) : 713 - 713
  • [4] On sparse linear discriminant analysis algorithm for high-dimensional data classification
    Ng, Michael K.
    Liao, Li-Zhi
    Zhang, Leihong
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2011, 18 (02) : 223 - 235
  • [5] QUADRATIC DISCRIMINANT ANALYSIS FOR HIGH-DIMENSIONAL DATA
    Wu, Yilei
    Qin, Yingli
    Zhu, Mu
    STATISTICA SINICA, 2019, 29 (02) : 939 - 960
  • [6] Optimal Feature Selection in High-Dimensional Discriminant Analysis
    Kolar, Mladen
    Liu, Han
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (02) : 1063 - 1083
  • [7] Modified linear discriminant analysis approaches for classification of high-dimensional microarray data
    Xu, Ping
    Brock, Guy N.
    Parrish, Rudolph S.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (05) : 1674 - 1687
  • [8] MULTICATEGORY VERTEX DISCRIMINANT ANALYSIS FOR HIGH-DIMENSIONAL DATA
    Wu, Tong Tong
    Lange, Kenneth
    ANNALS OF APPLIED STATISTICS, 2010, 4 (04): : 1698 - 1721
  • [9] A review of quadratic discriminant analysis for high-dimensional data
    Qin, Yingli
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2018, 10 (04)
  • [10] High-dimensional discriminant analysis
    Bouveyron, Charles
    Girard, Stephane
    Schmid, Cordelia
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2007, 36 (13-16) : 2607 - 2623