Model-based co-clustering for functional data

被引:27
|
作者
Ben Slimen, Yosra [1 ,2 ]
Allio, Sylvain [1 ]
Jacques, Julien [2 ]
机构
[1] Orange Labs, Belfort, France
[2] Univ Lyon, Univ Lyon 2, ERIC EA3083, Lyon, France
关键词
Co-clustering; Functional data; SEM-Gibbs algorithm; Latent block model; ICL-BIC criterion; Mobile network; Key performance indicators; APPROXIMATION; DENSITY;
D O I
10.1016/j.neucom.2018.02.055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to provide a simplified representation of key performance indicators for an easier analysis by mobile network maintainers, a model-based co-clustering algorithm for functional data is proposed. Co-clustering aims to identify block patterns in a data set from a simultaneous clustering of rows and columns. The algorithm relies on the latent block model in which each curve is identified by its functional principal components that are modeled by a multivariate Gaussian distribution whose parameters are block-specific. These latter are estimated by a stochastic EM algorithm embedding a Gibbs sampling. In order to select the numbers of row-and column-clusters, an ICL-BIC criterion is introduced. In addition to be the first co-clustering algorithm for functional data, the advantage of the proposed model is its ability to extract the hidden double structure induced by the data and its ability to deal with missing values. The model has proven its efficiency on simulated data and on a real data application that helps to optimize the topology of 4G mobile networks. (c) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:97 / 108
页数:12
相关论文
共 50 条
  • [31] Model-based clustering of meta-analytic functional Imaging data
    Neumann, Jane
    von Cramon, D. Yves
    Lohmann, Gabriele
    [J]. HUMAN BRAIN MAPPING, 2008, 29 (02) : 177 - 192
  • [32] CO-CLUSTERING OF MULTIVARIATE FUNCTIONAL DATA FOR THE ANALYSIS OF AIR POLLUTION IN THE SOUTH OF FRANCE
    Bouveyron, Charles
    Jacques, Julien
    Schmutz, Amandine
    Simoes, Fanny
    Bottini, Silvia
    [J]. ANNALS OF APPLIED STATISTICS, 2022, 16 (03): : 1400 - 1422
  • [33] The latent topic block model for the co-clustering of textual interaction data
    Berge, Laurent R.
    Bouveyron, Charles
    Corneli, Marco
    Latouche, Pierre
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 137 : 247 - 270
  • [34] Model-based clustering of longitudinal data
    McNicholas, Paul D.
    Murphy, T. Brendan
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (01): : 153 - 168
  • [35] Boosting for model-based data clustering
    Saffari, Amir
    Bischof, Horst
    [J]. PATTERN RECOGNITION, 2008, 5096 : 51 - 60
  • [36] Model-based clustering for longitudinal data
    De la Cruz-Mesia, Rolando
    Quintanab, Fernando A.
    Marshall, Guillermo
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (03) : 1441 - 1457
  • [37] Model-Based Clustering of Temporal Data
    El Assaad, Hani
    Same, Allou
    Govaert, Gerard
    Aknin, Patrice
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2013, 2013, 8131 : 9 - 16
  • [38] High-Order Co-clustering Text Data on Semantics-Based Representation Model
    Jing, Liping
    Yun, Jiali
    Yu, Jian
    Huang, Joshua
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6634 : 171 - 182
  • [39] Co-clustering for Binary Data with Maximum Modularity
    Labiod, Lazhar
    Nadif, Mohamed
    [J]. NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 700 - 708
  • [40] CO-CLUSTERING SEPARATELY EXCHANGEABLE NETWORK DATA
    Choi, David
    Wolfe, Patrick J.
    [J]. ANNALS OF STATISTICS, 2014, 42 (01): : 29 - 63