Covariance-based soft clustering of functional data based on the Wasserstein-Procrustes metric

被引:1
|
作者
Masarotto, Valentina [1 ,3 ]
Masarotto, Guido [2 ]
机构
[1] Leiden Univ, Math Inst, Leiden, Netherlands
[2] Univ Padua, Dept Stat Sci, Padua, Italy
[3] Leiden Univ, Math Inst, NL-2333 Leiden, Netherlands
关键词
covariance operators; functional data; fuzzy clustering; procrustes distance; soft clustering; trimmed average silhouette width; Wasserstein distance; STATISTICAL-ANALYSIS; OPERATORS; EQUALITY; TESTS; INFERENCE; GEOMETRY;
D O I
10.1111/sjos.12692
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the problem of clustering functional data according to their covariance structure. We contribute a soft clustering methodology based on the Wasserstein-Procrustes distance, where the in-between cluster variability is penalized by a term proportional to the entropy of the partition matrix. In this way, each covariance operator can be partially classified into more than one group. Such soft classification allows for clusters to overlap, and arises naturally in situations where the separation between all or some of the clusters is not well-defined. We also discuss how to estimate the number of groups and to test for the presence of any cluster structure. The algorithm is illustrated using simulated and real data. An R implementation is available in the Appendix S1.
引用
收藏
页码:485 / 512
页数:28
相关论文
共 50 条
  • [1] Covariance-based Clustering in Multivariate and Functional Data Analysis
    Ieva, Francesca
    Paganoni, Anna Maria
    Tarabelloni, Nicholas
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17 : 1 - 21
  • [2] Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
    Hallac, David
    Vare, Sagar
    Boyd, Stephen
    Leskovec, Jure
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 215 - 223
  • [3] Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data
    Hallac, David
    Vare, Sagar
    Boyd, Stephen
    Leskovec, Jure
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5254 - 5258
  • [4] Covariance-Based Variable Selection for Compositional Data
    Hron, Karel
    Filzmoser, Peter
    Donevska, Sandra
    Fiserova, Eva
    MATHEMATICAL GEOSCIENCES, 2013, 45 (04) : 487 - 498
  • [5] Covariance-Based Variable Selection for Compositional Data
    Karel Hron
    Peter Filzmoser
    Sandra Donevska
    Eva Fišerová
    Mathematical Geosciences, 2013, 45 : 487 - 498
  • [6] A Sample Covariance-Based Approach For Spatial Binary Data
    Zarmehri, Sahar
    Hanks, Ephraim M.
    Lin, Lin
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2021, 26 (02) : 220 - 249
  • [7] Brainwave Classification Using Covariance-Based Data Augmentation
    Yang, Wonseok
    Nam, Woochul
    IEEE ACCESS, 2020, 8 : 211714 - 211722
  • [8] Covariance-Based PCA for Multi-Size Data
    Zhai, Menghua
    Shi, Feiyu
    Duncan, Drew
    Jacobs, Nathan
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1603 - 1608
  • [9] A Sample Covariance-Based Approach For Spatial Binary Data
    Sahar Zarmehri
    Ephraim M. Hanks
    Lin Lin
    Journal of Agricultural, Biological and Environmental Statistics, 2021, 26 : 220 - 249
  • [10] Covariance-Based MIMO Beamforming
    Sharifabad, Farnaz Karimdady
    Jensen, Michael A.
    2012 IEEE INTERNATIONAL CONFERENCE ON WIRELESS INFORMATION TECHNOLOGY AND SYSTEMS (ICWITS), 2012,