Co-clustering for binary and functional data

被引:1
|
作者
Ben Slimen, Yosra [1 ,2 ]
Jacques, Julien [1 ]
Allio, Sylvain [2 ]
机构
[1] Univ Lyon, ERIC EA3083, Lyon, France
[2] Orange Labs, Rech & Dev, Belfort, France
关键词
Co-clustering; EM algorithm; functional data; ICL-BIC criterion; Latent block model; Mixed data; Mobile network; MODEL;
D O I
10.1080/03610918.2020.1764033
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Due to the diversity of mobile network technologies, the volume of data that has to be observed by mobile operators in a daily basis has become enormous. This huge volume has become an obstacle to mobile networks management. This paper aims to provide a simplified representation of these data for an easier analysis. A model-based co-clustering algorithm for mixed data, functional and binary, is therefore proposed. Co-clustering aims to identify block patterns in a dataset from a simultaneous clustering of rows and columns. The proposed approach relies on the latent block model, and three algorithms are compared for its inference: stochastic EM within Gibbs sampling, classification EM and variational EM. The proposed model is the first co-clustering algorithm for mixed data that deals with functional and binary features. The model has proven its efficiency on simulated data and on real data extracted from live 4G mobile networks.
引用
收藏
页码:4845 / 4866
页数:22
相关论文
共 50 条
  • [1] Co-clustering for Binary Data with Maximum Modularity
    Labiod, Lazhar
    Nadif, Mohamed
    [J]. NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 700 - 708
  • [2] Model-based co-clustering for functional data
    Ben Slimen, Yosra
    Allio, Sylvain
    Jacques, Julien
    [J]. NEUROCOMPUTING, 2018, 291 : 97 - 108
  • [3] Joint co-clustering: Co-clustering of genomic and clinical bioimaging data
    Ficarra, Elisa
    De Micheli, Giovanni
    Yoon, Sungroh
    Benini, Luca
    Macii, Enrico
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2008, 55 (05) : 938 - 949
  • [4] Co-clustering from Tensor Data
    Boutalbi, Rafika
    Labiod, Lazhar
    Nadif, Mohamed
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT I, 2019, 11439 : 370 - 383
  • [5] Sleeved co-clustering of lagged data
    Shaham, Eran
    Sarne, David
    Ben-Moshe, Boaz
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 31 (02) : 251 - 279
  • [6] Sleeved co-clustering of lagged data
    Eran Shaham
    David Sarne
    Boaz Ben-Moshe
    [J]. Knowledge and Information Systems, 2012, 31 : 251 - 279
  • [7] Co-clustering of fuzzy lagged data
    Shaham, Eran
    Sarne, David
    Ben-Moshe, Boaz
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 44 (01) : 217 - 252
  • [8] Co-clustering of fuzzy lagged data
    Eran Shaham
    David Sarne
    Boaz Ben-Moshe
    [J]. Knowledge and Information Systems, 2015, 44 : 217 - 252
  • [9] CO-CLUSTERING OF MULTIVARIATE FUNCTIONAL DATA FOR THE ANALYSIS OF AIR POLLUTION IN THE SOUTH OF FRANCE
    Bouveyron, Charles
    Jacques, Julien
    Schmutz, Amandine
    Simoes, Fanny
    Bottini, Silvia
    [J]. ANNALS OF APPLIED STATISTICS, 2022, 16 (03): : 1400 - 1422
  • [10] CO-CLUSTERING SEPARATELY EXCHANGEABLE NETWORK DATA
    Choi, David
    Wolfe, Patrick J.
    [J]. ANNALS OF STATISTICS, 2014, 42 (01): : 29 - 63