Model-based clustering and classification of functional data

被引:29
|
作者
Chamroukhi, Faicel [1 ]
Nguyen, Hien D. [2 ]
机构
[1] Normandie Univ, Dept Math & Comp Sci, UNICAEN, UMR CNRS LMNO, F-14000 Caen, France
[2] La Trobe Univ, Dept Math & Stat, Melbourne, Vic, Australia
基金
澳大利亚研究理事会;
关键词
algorithms; classification; clustering; EM; functional data analysis; mixture models; HIDDEN MARKOV MODEL; DISCRIMINANT-ANALYSIS; MAXIMUM-LIKELIHOOD; MIXTURE MODEL; EM ALGORITHM; REGRESSION; INFERENCE; TUTORIAL;
D O I
10.1002/widm.1298
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Complex data analysis is a central topic of modern statistics and learning systems which is becoming of broader interest with the increasing prevalence of high-dimensional data. The challenge is to develop statistical models and autonomous algorithms that are able to discern knowledge from raw data, which can be achieved through clustering techniques, or to make predictions of future data via classification techniques. Latent data models, including mixture model-based approaches, are among the most popular and successful approaches in both supervised and unsupervised learning. Although being traditional tools in multivariate analysis, they are growing in popularity when considered in the framework of functional data analysis (FDA). FDA is the data analysis paradigm in which each datum is a function, rather than a real vector. In many areas of application, including signal and image processing, functional imaging, bioinformatics, etc., the analyzed data are indeed often available in the form of discretized values of functions, curves, or surfaces. This functional aspect of the data adds additional difficulties when compared to classical multivariate data analysis. We review and present approaches for model-based clustering and classification of functional data. We present well-grounded statistical models along with efficient algorithmic tools to address problems regarding the clustering and the classification of these functional data, including their heterogeneity, missing information, and dynamical hidden structures. The presented models and algorithms are illustrated via real-world functional data analysis problems from several areas of application. This article is categorized under: Fundamental Concepts of Data and Knowledge > Data Concepts Structure Discovery and Clustering
引用
收藏
页数:36
相关论文
共 50 条
  • [31] Ecoregion Classification Using a Bayesian Approach and Model-based Clustering
    Pullar, D.
    Choy, S. Low
    Rochester, W.
    MODSIM 2005: INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION: ADVANCES AND APPLICATIONS FOR MANAGEMENT AND DECISION MAKING: ADVANCES AND APPLICATIONS FOR MANAGEMENT AND DECISION MAKING, 2005, : 1560 - 1566
  • [32] Special issue on "New trends on model-based clustering and classification"
    Ingrassia, Salvatore
    McLachlan, Geoffrey J.
    Govaert, Gerard
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2015, 9 (04) : 367 - 369
  • [33] Model-based clustering for RNA-seq data
    Si, Yaqing
    Liu, Peng
    Li, Pinghua
    Brutnell, Thomas P.
    BIOINFORMATICS, 2014, 30 (02) : 197 - 205
  • [34] Model-Based Clustering for Conditionally Correlated Categorical Data
    Marbac, Matthieu
    Biernacki, Christophe
    Vandewalle, Vincent
    JOURNAL OF CLASSIFICATION, 2015, 32 (02) : 145 - 175
  • [35] Model-based clustering and outlier detection with missing data
    Hung Tong
    Cristina Tortora
    Advances in Data Analysis and Classification, 2022, 16 : 5 - 30
  • [36] Model-Based Clustering for Conditionally Correlated Categorical Data
    Matthieu Marbac
    Christophe Biernacki
    Vincent Vandewalle
    Journal of Classification, 2015, 32 : 145 - 175
  • [37] Model-based clustering and analysis of life history data
    Scott, Marc A.
    Mohan, Kaushik
    Gauthier, Jacques-Antoine
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2020, 183 (03) : 1231 - 1251
  • [38] Model-based clustering and outlier detection with missing data
    Tong, Hung
    Tortora, Cristina
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2022, 16 (01) : 5 - 30
  • [39] On Model-Based Clustering of Directional Data with Heavy Tails
    Yingying Zhang
    Volodymyr Melnykov
    Igor Melnykov
    Journal of Classification, 2023, 40 (3) : 527 - 551
  • [40] Bayesian model-based clustering for longitudinal ordinal data
    Roy Costilla
    Ivy Liu
    Richard Arnold
    Daniel Fernández
    Computational Statistics, 2019, 34 : 1015 - 1038