Co-clustering for binary and functional data

被引:1
|
作者
Ben Slimen, Yosra [1 ,2 ]
Jacques, Julien [1 ]
Allio, Sylvain [2 ]
机构
[1] Univ Lyon, ERIC EA3083, Lyon, France
[2] Orange Labs, Rech & Dev, Belfort, France
关键词
Co-clustering; EM algorithm; functional data; ICL-BIC criterion; Latent block model; Mixed data; Mobile network; MODEL;
D O I
10.1080/03610918.2020.1764033
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Due to the diversity of mobile network technologies, the volume of data that has to be observed by mobile operators in a daily basis has become enormous. This huge volume has become an obstacle to mobile networks management. This paper aims to provide a simplified representation of these data for an easier analysis. A model-based co-clustering algorithm for mixed data, functional and binary, is therefore proposed. Co-clustering aims to identify block patterns in a dataset from a simultaneous clustering of rows and columns. The proposed approach relies on the latent block model, and three algorithms are compared for its inference: stochastic EM within Gibbs sampling, classification EM and variational EM. The proposed model is the first co-clustering algorithm for mixed data that deals with functional and binary features. The model has proven its efficiency on simulated data and on real data extracted from live 4G mobile networks.
引用
收藏
页码:4845 / 4866
页数:22
相关论文
共 50 条
  • [21] Model-based co-clustering for ordinal data
    Jacques, Julien
    Biernacki, Christophe
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 123 : 101 - 115
  • [22] Bipartite isoperimetric graph partitioning for data co-clustering
    Rege, Manjeet
    Dong, Ming
    Fotouhi, Farshad
    DATA MINING AND KNOWLEDGE DISCOVERY, 2008, 16 (03) : 276 - 312
  • [23] CFOND: Consensus Factorization for Co-Clustering Networked Data
    Guo, Ting
    Pan, Shirui
    Zhu, Xingquan
    Zhang, Chengqi
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (04) : 706 - 719
  • [24] A New Framework for Co-clustering of Gene Expression Data
    Zhang, Shuzhong
    Wang, Kun
    Chen, Bilian
    Huang, Xiuzhen
    PATTERN RECOGNITION IN BIOINFORMATICS, 2011, 7036 : 1 - +
  • [25] Bipartite isoperimetric graph partitioning for data co-clustering
    Manjeet Rege
    Ming Dong
    Farshad Fotouhi
    Data Mining and Knowledge Discovery, 2008, 16 : 276 - 312
  • [26] Subspace Weighting Co-Clustering of Gene Expression Data
    Chen, Xiaojun
    Huang, Joshua Z.
    Wu, Qingyao
    Yang, Min
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (02) : 352 - 364
  • [27] Multitask possibilistic and fuzzy co-clustering algorithm for clustering data with multisource features
    Jiaqi Ren
    Youlong Yang
    Neural Computing and Applications, 2020, 32 : 4785 - 4804
  • [28] Multitask fuzzy Bregman co-clustering approach for clustering data with multisource features
    Sokhandan, Alireza
    Adibi, Peyman
    Sajadi, Mohammadreza
    NEUROCOMPUTING, 2017, 247 : 102 - 114
  • [29] HetFCM: functional co-module discovery by heterogeneous network co-clustering
    Tan, Haojiang
    Guo, Maozu
    Chen, Jian
    Wang, Jun
    Yu, Guoxian
    NUCLEIC ACIDS RESEARCH, 2024, 52 (03) : E16
  • [30] Binary Clustering of Color Images by Fuzzy Co-Clustering with Non-Extensive Entropy Regularization
    Susan, Seba
    Agarwal, Meetu
    Agarwal, Seetu
    Kartikeya, Anand
    Meena, Ritu
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2016, : 512 - 517