Model-based clustering and classification of functional data

Cited by: 29
Authors
Chamroukhi, Faicel [1]
Nguyen, Hien D. [2]
Affiliations
[1] Normandie Univ, Dept Math & Comp Sci, UNICAEN, UMR CNRS LMNO, F-14000 Caen, France
[2] La Trobe Univ, Dept Math & Stat, Melbourne, Vic, Australia
Funding
Australian Research Council
Keywords
algorithms; classification; clustering; EM; functional data analysis; mixture models; HIDDEN MARKOV MODEL; DISCRIMINANT-ANALYSIS; MAXIMUM-LIKELIHOOD; MIXTURE MODEL; EM ALGORITHM; REGRESSION; INFERENCE; TUTORIAL;
DOI
10.1002/widm.1298
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Complex data analysis is a central topic of modern statistics and learning systems, and it is attracting broader interest with the increasing prevalence of high-dimensional data. The challenge is to develop statistical models and autonomous algorithms that can discern knowledge from raw data, which can be achieved through clustering techniques, or make predictions of future data via classification techniques. Latent data models, including mixture model-based approaches, are among the most popular and successful approaches in both supervised and unsupervised learning. Although they are traditional tools in multivariate analysis, they are growing in popularity when considered in the framework of functional data analysis (FDA). FDA is the data analysis paradigm in which each datum is a function, rather than a real vector. In many areas of application, including signal and image processing, functional imaging, and bioinformatics, the analyzed data are often available in the form of discretized values of functions, curves, or surfaces. This functional aspect of the data adds difficulties when compared to classical multivariate data analysis. We review and present approaches for model-based clustering and classification of functional data. We present well-grounded statistical models along with efficient algorithmic tools to address problems regarding the clustering and the classification of these functional data, including their heterogeneity, missing information, and dynamical hidden structures. The presented models and algorithms are illustrated via real-world functional data analysis problems from several areas of application. This article is categorized under: Fundamental Concepts of Data and Knowledge > Data Concepts; Structure Discovery and Clustering
Pages: 36
Related papers
50 in total
  • [21] On model-based clustering of skewed matrix data
    Melnykov, Volodymyr
    Zhu, Xuwen
    JOURNAL OF MULTIVARIATE ANALYSIS, 2018, 167 : 181 - 194
  • [22] Model-based clustering of array CGH data
    Shah, Sohrab P.
    Cheung, K-John, Jr.
    Johnson, Nathalie A.
    Alain, Guillaume
    Gascoyne, Randy D.
    Horsman, Douglas E.
    Ng, Raymond T.
    Murphy, Kevin P.
    BIOINFORMATICS, 2009, 25 (12) : I30 - I38
  • [23] Model-based multidimensional clustering of categorical data
    Chen, Tao
    Zhang, Nevin L.
    Liu, Tengfei
    Poon, Kin Man
    Wang, Yi
    ARTIFICIAL INTELLIGENCE, 2012, 176 (01) : 2246 - 2269
  • [24] Model-Based Hierarchical Clustering for Categorical Data
    Alalyan, Fahdah
    Zamzami, Nuha
    Bouguila, Nizar
    2019 IEEE 28TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2019 : 1424 - 1429
  • [25] Penalized model-based clustering of fMRI data
    Dilernia, Andrew
    Quevedo, Karina
    Camchong, Jazmin
    Lim, Kelvin
    Pan, Wei
    Zhang, Lin
    BIOSTATISTICS, 2022, 23 (03) : 825 - 843
  • [26] Model-based clustering and data transformations for gene expression data
    Yeung, KY
    Fraley, C
    Murua, A
    Raftery, AE
    Ruzzo, WL
    BIOINFORMATICS, 2001, 17 (10) : 977 - 987
  • [27] Estimation and model selection for model-based clustering with the conditional classification likelihood
    Baudry, Jean-Patrick
    ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (01) : 1041 - 1077
  • [28] Adaptive Model-Based Classification of PolSAR Data
    Li, Dong
    Zhang, Yunhua
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (12) : 6940 - 6955
  • [29] A simple model-based approach to variable selection in classification and clustering
    Partovi Nia, Vahid
    Davison, Anthony C.
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2015, 43 (02) : 157 - 175
  • [30] Parsimonious skew mixture models for model-based clustering and classification
    Vrbik, Irene
    McNicholas, Paul D.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 196 - 210