Model-Based Edge Clustering

被引:3
|
作者
Sewell, Daniel K. [1 ]
机构
[1] Univ Iowa, Dept Biostat, 145 N Riverside Dr,100 CPHB, Iowa City, IA 52242 USA
关键词
Community detection; Latent space models; Network analysis; Social networks; COMMUNITY DETECTION; INFERENCE; SPREAD;
D O I
10.1080/10618600.2020.1811104
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Relational data can be studied using network analytic techniques which define the network as a set of actors and a set of edges connecting these actors. One important facet of network analysis that receives significant attention is community detection. However, while most community detection algorithms focus on clustering the actors of the network, it is very intuitive to cluster the edges. Connections exist because they were formed within some latent environment such as, in the case of a social network, a workplace or religious group, and hence by clustering the edges of a network we may gain some insight into these latent environments. We propose a model-based approach to clustering the edges of a network using a latent space model describing the features of both actors and latent environments. We derive a generalized EM algorithm for estimation and gradient-based Monte Carlo algorithms, and we demonstrate that the computational cost grows linearly in the number of actors for sparse networks rather than quadratically. We demonstrate the potential impact of our proposed approach on a patient transfer network, verifying these results by running simple epidemic simulations, and on a real friendship network among faculty members at a university in the United Kingdom.
引用
收藏
页码:390 / 405
页数:16
相关论文
共 50 条
  • [1] Model-Based Clustering
    Paul D. McNicholas
    [J]. Journal of Classification, 2016, 33 : 331 - 373
  • [2] Model-Based Clustering
    Gormley, Isobel Claire
    Murphy, Thomas Brendan
    Raftery, Adrian E.
    [J]. ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2023, 10 : 573 - 595
  • [3] Model-Based Clustering
    McNicholas, Paul D.
    [J]. JOURNAL OF CLASSIFICATION, 2016, 33 (03) : 331 - 373
  • [4] Model-Based Clustering and New Edge Modelling in Large Computer Networks
    Metelli, Silvia
    Heard, Nicholas
    [J]. IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS: CYBERSECURITY AND BIG DATA, 2016, : 91 - 96
  • [5] Model-based clustering with envelopes
    Wang, Wenjing
    Zhang, Xin
    Mai, Qing
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01): : 82 - 109
  • [6] Model-based linear clustering
    Yan, Guohua
    Welch, William J.
    Zamar, Ruben H.
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (04): : 716 - 737
  • [7] Challenges in model-based clustering
    Melnykov, Volodymyr
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2013, 5 (02): : 135 - 148
  • [8] Model-Based Clustering with HDBSCAN
    Strobl, Michael
    Sander, Joerg
    Campello, Ricardo J. G. B.
    Zaiane, Osmar
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2020, PT II, 2021, 12458 : 364 - 379
  • [9] A model-based distance for clustering
    Rattray, M
    [J]. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL IV, 2000, : 13 - 16
  • [10] Parametric model-based clustering
    Nikulin, V
    Smola, AJ
    [J]. DATA MINING, INTRUSION DETECTION, INFORMATION ASSURANCE, AND DATA NETWORKS SECURITY 2005, 2005, 5812 : 190 - 201