Optimal Linear Discriminant Analysis for High-Dimensional Functional Data

被引:6
|
作者
Xue, Kaijie [1 ]
Yang, Jin [2 ]
Yao, Fang [3 ]
机构
[1] Nankai Univ, Sch Stat & Data Sci, Tianjin, Peoples R China
[2] Eunice Kennedy Shriver Natl Inst Child Hlth & Huma, Biostat & Bioinformat Branch, NIH, Bethesda, MD USA
[3] Peking Univ, Ctr Stat Sci, Sch Math Sci, Dept Probabil & Stat, Beijing, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Discriminant set inclusion; Functional principal components; Penalized classifier; CLASSIFICATION; REGRESSION; SELECTION; CLASSIFIERS; MODELS;
D O I
10.1080/01621459.2022.2164288
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Most of existing methods of functional data classification deal with one or a few processes. In this work we tackle classification of high-dimensional functional data, in which each observation is potentially associated with a large number of functional processes, p, which is comparable to or even much larger than the sample size n. The challenge arises from the complex inter-correlation structures among multiple functional processes, instead of a diagonal correlation for a single process. Since truncation is often needed for approximation in functional data, another difficulty stems from the fact that the discriminant set of the infinite-dimensional optimal classifier may be different from that of the truncated optimal classifier, when multiple (especially a large number of) processes are involved. We bridge the gap by proposing a penalized classifier that achieves both near-perfect classification that is unique to functional data, and discriminant set inclusion consistency in the sense that the classification-responsible functional predictors include those of the underlying optimal classifier. Simulation study and real data application are carried out to demonstrate its favorable performance. for this article are available online.
引用
收藏
页码:1055 / 1064
页数:10
相关论文
共 50 条
  • [21] High-dimensional integrative copula discriminant analysis for multiomics data
    He, Yong
    Chen, Hao
    Sun, Hao
    Ji, Jiadong
    Shi, Yufeng
    Zhang, Xinsheng
    Liu, Lei
    STATISTICS IN MEDICINE, 2020, 39 (30) : 4869 - 4884
  • [22] Diagonal Discriminant Analysis With Feature Selection for High-Dimensional Data
    Romanes, Sarah E.
    Ormerod, John T.
    Yang, Jean Y. H.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2020, 29 (01) : 114 - 127
  • [23] High-dimensional Linear Discriminant Analysis Classifier for Spiked Covariance Model
    Sifaou, Houssem
    Kammoun, Abla
    Alouini, Mohamed-Slim
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [24] Weighted linear programming discriminant analysis for high-dimensional binary classification
    Wu, Yufei
    Yu, Guan
    STATISTICAL ANALYSIS AND DATA MINING, 2020, 13 (05) : 437 - 450
  • [25] AN EFFICIENT GREEDY SEARCH ALGORITHM FOR HIGH-DIMENSIONAL LINEAR DISCRIMINANT ANALYSIS
    Yang, Hannan
    Lin, Danyu
    Li, Quefeng
    STATISTICA SINICA, 2023, 33 : 1343 - 1364
  • [26] High-dimensional linear discriminant analysis classifier for spiked covariance model ∗
    Sifaou, Houssem
    Kammoun, Abla
    Alouini, Mohamed-Slim
    Journal of Machine Learning Research, 2020, 21
  • [27] Stringing High-Dimensional Data for Functional Analysis
    Chen, Kun
    Chen, Kehui
    Mueller, Hans-Georg
    Wang, Jane-Ling
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (493) : 275 - 284
  • [28] Class-specific subspace discriminant analysis for high-dimensional data
    Bouveyron, Charles
    Girard, Stephane
    Schmid, Cordelia
    SUBSPACE, LATENT STRUCTURE AND FEATURE SELECTION, 2006, 3940 : 139 - 150
  • [29] Graph-based sparse linear discriminant analysis for high-dimensional classification
    Liu, Jianyu
    Yu, Guan
    Liu, Yufeng
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 171 : 250 - 269
  • [30] Simultaneous variable selection and class fusion for high-dimensional linear discriminant analysis
    Guo, Jian
    BIOSTATISTICS, 2010, 11 (04) : 599 - 608