Model-based co-clustering for functional data

被引:27
|
作者
Ben Slimen, Yosra [1 ,2 ]
Allio, Sylvain [1 ]
Jacques, Julien [2 ]
机构
[1] Orange Labs, Belfort, France
[2] Univ Lyon, Univ Lyon 2, ERIC EA3083, Lyon, France
关键词
Co-clustering; Functional data; SEM-Gibbs algorithm; Latent block model; ICL-BIC criterion; Mobile network; Key performance indicators; APPROXIMATION; DENSITY;
D O I
10.1016/j.neucom.2018.02.055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to provide a simplified representation of key performance indicators for an easier analysis by mobile network maintainers, a model-based co-clustering algorithm for functional data is proposed. Co-clustering aims to identify block patterns in a data set from a simultaneous clustering of rows and columns. The algorithm relies on the latent block model in which each curve is identified by its functional principal components that are modeled by a multivariate Gaussian distribution whose parameters are block-specific. These latter are estimated by a stochastic EM algorithm embedding a Gibbs sampling. In order to select the numbers of row-and column-clusters, an ICL-BIC criterion is introduced. In addition to be the first co-clustering algorithm for functional data, the advantage of the proposed model is its ability to extract the hidden double structure induced by the data and its ability to deal with missing values. The model has proven its efficiency on simulated data and on a real data application that helps to optimize the topology of 4G mobile networks. (c) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:97 / 108
页数:12
相关论文
共 50 条
  • [1] Model-based co-clustering for ordinal data
    Jacques, Julien
    Biernacki, Christophe
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 123 : 101 - 115
  • [2] Model-based co-clustering for mixed type data
    Selosse, Margot
    Jacques, Julien
    Biernacki, Christophe
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 144
  • [3] Co-clustering contaminated data: a robust model-based approach
    Fibbi, Edoardo
    Perrotta, Domenico
    Torti, Francesca
    Van Aelst, Stefan
    Verdonck, Tim
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2024, 18 (01) : 121 - 161
  • [4] Co-clustering contaminated data: a robust model-based approach
    Edoardo Fibbi
    Domenico Perrotta
    Francesca Torti
    Stefan Van Aelst
    Tim Verdonck
    [J]. Advances in Data Analysis and Classification, 2024, 18 : 121 - 161
  • [5] Model-based co-clustering for the effective handling of sparse data
    Ailem, Melissa
    Role, Francois
    Nadif, Mohamed
    [J]. PATTERN RECOGNITION, 2017, 72 : 108 - 122
  • [6] Model-based Co-clustering for High Dimensional Sparse Data
    Salah, Aghiles
    Rogovschi, Nicoleta
    Nadif, Mohamed
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 866 - 874
  • [7] Model-based Poisson co-clustering for Attributed Networks
    Riverain, Paul
    Fossier, Simon
    Nadif, Mohamed
    [J]. 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 703 - 710
  • [8] A Hierarchical Model-based Approach to Co-Clustering High-Dimensional Data
    Costa, Gianni
    Manco, Giuseppe
    Ortale, Riccardo
    [J]. APPLIED COMPUTING 2008, VOLS 1-3, 2008, : 886 - 890
  • [9] blockcluster: An R Package for Model-Based Co-Clustering
    Bhatia, Parmeet Singh
    Iovleff, Serge
    Govaert, Gerard
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2017, 76 (09): : 1 - 24
  • [10] Co-clustering for binary and functional data
    Ben Slimen, Yosra
    Jacques, Julien
    Allio, Sylvain
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (09) : 4845 - 4866