Model-Based Edge Clustering

被引:3
|
作者
Sewell, Daniel K. [1 ]
机构
[1] Univ Iowa, Dept Biostat, 145 N Riverside Dr,100 CPHB, Iowa City, IA 52242 USA
关键词
Community detection; Latent space models; Network analysis; Social networks; COMMUNITY DETECTION; INFERENCE; SPREAD;
D O I
10.1080/10618600.2020.1811104
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Relational data can be studied using network analytic techniques which define the network as a set of actors and a set of edges connecting these actors. One important facet of network analysis that receives significant attention is community detection. However, while most community detection algorithms focus on clustering the actors of the network, it is very intuitive to cluster the edges. Connections exist because they were formed within some latent environment such as, in the case of a social network, a workplace or religious group, and hence by clustering the edges of a network we may gain some insight into these latent environments. We propose a model-based approach to clustering the edges of a network using a latent space model describing the features of both actors and latent environments. We derive a generalized EM algorithm for estimation and gradient-based Monte Carlo algorithms, and we demonstrate that the computational cost grows linearly in the number of actors for sparse networks rather than quadratically. We demonstrate the potential impact of our proposed approach on a patient transfer network, verifying these results by running simple epidemic simulations, and on a real friendship network among faculty members at a university in the United Kingdom.
引用
收藏
页码:390 / 405
页数:16
相关论文
共 50 条
  • [41] MODEL-BASED CLUSTERING OF LARGE NETWORKS
    Vu, Duy Q.
    Hunter, David R.
    Schweinberger, Michael
    [J]. ANNALS OF APPLIED STATISTICS, 2013, 7 (02): : 1010 - 1039
  • [42] Model-based clustering for longitudinal data
    De la Cruz-Mesia, Rolando
    Quintanab, Fernando A.
    Marshall, Guillermo
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (03) : 1441 - 1457
  • [43] Model-based Bayesian clustering (MBBC)
    Joo, Yongsung
    Booth, James G.
    Namkoong, Younghwan
    Casella, George
    [J]. BIOINFORMATICS, 2008, 24 (06) : 874 - 875
  • [44] Finite mixture models and model-based clusteringFinite mixture models and model-based clustering
    Melnykov, Volodymyr
    Maitra, Ranjan
    [J]. STATISTICS SURVEYS, 2010, 4 : 80 - 116
  • [45] Unsupervised fuzzy model-based Gaussian clustering
    Yang, Miin-Shen
    Chang-Chien, Shou-Jen
    Nataliani, Yessica
    [J]. INFORMATION SCIENCES, 2019, 481 : 1 - 23
  • [46] Finding Outliers in Gaussian Model-based Clustering
    Clark, Katharine M.
    Mcnicholas, Paul D.
    [J]. JOURNAL OF CLASSIFICATION, 2024, 41 (02) : 313 - 337
  • [47] SELECTING CATEGORICAL FEATURES IN MODEL-BASED CLUSTERING
    Silvestre, Claudia M. V.
    Cardoso, Margarida M. G.
    Figueiredo, Mario A. T.
    [J]. KDIR 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2009, : 303 - +
  • [48] Conditional mixture modeling and model-based clustering
    Melnykov, Volodymyr
    Wang, Yang
    [J]. PATTERN RECOGNITION, 2023, 133
  • [49] Model-based clustering with missing not at random data
    Sportisse, Aude
    Marbac, Matthieu
    Laporte, Fabien
    Celeux, Gilles
    Boyer, Claire
    Josse, Julie
    Biernacki, Christophe
    [J]. STATISTICS AND COMPUTING, 2024, 34 (04)
  • [50] Latent Model-Based Clustering for Biological Discovery
    Bing, Xin
    Bunea, Florentina
    Royer, Martin
    Das, Jishnu
    [J]. ISCIENCE, 2019, 14 : 125 - +