Overlapping Community Detection via Semi-Binary Matrix Factorization: Identifiability and Algorithms

Cited by: 2
Authors
Sorensen, Mikael [1]
Sidiropoulos, Nicholas D. [1]
Swami, Ananthram [2]
Affiliations
[1] Univ Virginia, Dept ECE, Charlottesville, VA 22904 USA
[2] DEVCOM Army Res Lab, Adelphi, MD 20783 USA
Keywords
K-means clustering; combinatorial optimization; community detection; coupled matrix-tensor factorization; identifiability; matrix factorization; nonnegative matrix factorization; unsupervised learning; CANONICAL POLYADIC DECOMPOSITIONS; MULTILINEAR RANK-(LR; N; COUPLED DECOMPOSITIONS; PART I; SET; SIGNAL; UNIQUENESS; LR;
DOI
10.1109/TSP.2022.3200215
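Chinese Library Classification (CLC)
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809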
Abstract
Community detection is a fundamental problem in knowledge discovery and data mining. In this paper we propose a semi-binary matrix factorization (SBMF) model for community detection, which can be understood as a marriage between K-means clustering and (semi-)nonnegative matrix factorization. This leads to an easy-to-interpret factorization that can naturally handle overlapping communities. Unlike K-means, the proposed approach does not restrict each individual to belong to only a single community, nor does it restrict the sum of "soft membership" values to add up to one. We derive relatively easy-to-check uniqueness conditions suggesting that meaningful communities can be obtained via SBMF. Computing a (least-squares) optimal SBMF is a hard mixed integer nonconvex optimization problem. We bypass this challenge by converting the problem into a coupled matrix-tensor factorization form, which only involves continuous variables and can be tackled using tensor decomposition tools, and can also be used to initialize optimization based methods. We present experiments with real data to demonstrate the effectiveness of the proposed approach for community detection in coauthorship networks and in financial stock market data.
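To make the model in the abstract concrete, the following is a minimal sketch of a least-squares semi-binary factorization X ≈ W Sᵀ, where S is a binary membership matrix (rows may contain several ones, so communities can overlap) and W is unconstrained. It is a naive alternating baseline, not the coupled matrix-tensor factorization method of the paper; the names `sbmf_naive`, `W`, and `S`, and the choice of which factor is binary, are assumptions made for illustration. The exhaustive search over all 2^K membership patterns is only practical for small K.

```python
import itertools
import numpy as np

def sbmf_naive(X, K, n_iter=20, seed=0):
    """Naive alternating minimization of ||X - W @ S.T||_F^2 with S binary.

    X: (M, N) data matrix, W: (M, K) real factor, S: (N, K) with entries in {0, 1}.
    Hypothetical helper for illustration only; not the paper's algorithm.
    """
    rng = np.random.default_rng(seed)
    N = X.shape[1]
    # Random binary initialization of the membership matrix.
    S = rng.integers(0, 2, size=(N, K)).astype(float)
    # Enumerate all 2^K candidate membership patterns (feasible for small K only).
    patterns = np.array(list(itertools.product([0.0, 1.0], repeat=K)))
    for _ in range(n_iter):
        # Real factor: solve S @ W.T ~= X.T in the least-squares sense.
        W = np.linalg.lstsq(S, X.T, rcond=None)[0].T
        # Binary factor: for each column of X pick the pattern with smallest residual.
        recon = W @ patterns.T                                  # (M, 2^K)
        dists = ((X[:, :, None] - recon[:, None, :]) ** 2).sum(axis=0)  # (N, 2^K)
        S = patterns[dists.argmin(axis=1)]
    return W, S
```

Each half-step cannot increase the objective, but the procedure can stall in a local minimum, which is consistent with the abstract's remark that the exact problem is a hard mixed-integer nonconvex one and that a good initialization matters.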
Pages: 4321-4336
Number of pages: 16
Related papers
50 in total
  • [41] LINEAR SPECTRAL UNMIXING VIA MATRIX FACTORIZATION: IDENTIFIABILITY CRITERIA FOR SPARSE ABUNDANCES
    Lin, Chia-Hsiang
    Bioucas Dias, Jose M.
    [J]. IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018: 6155 - 6158
  • [42] Binary Matrix Factorization and Completion via Integer Programming
    Gunluk, Oktay
    Hauser, Raphael Andreas
    Kovacs, Reka Agnes
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2023, 49 (02) : 1278 - 1302
  • [43] Mining Discrete Patterns via Binary Matrix Factorization
    Jiang, Peng
    Heath, Michael T.
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013: 1129 - 1136
  • [44] Mining Discrete Patterns via Binary Matrix Factorization
    Shen, Bao-Hong
    Ji, Shuiwang
    Ye, Jieping
    [J]. KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009: 757 - 765
  • [45] Binary matrix factorization via collaborative neurodynamic optimization
    Li, Hongzong
    Wang, Jun
    Zhang, Nian
    Zhang, Wei
    [J]. NEURAL NETWORKS, 2024, 176
  • [46] Nonnegative Residual Matrix Factorization for Community Detection
    Pei, Yulong
    Liu, Cong
    Zheng, Chuanyang
    Cheng, Long
    [J]. WEB INFORMATION SYSTEMS ENGINEERING, WISE 2020, PT I, 2020, 12342 : 196 - 209
  • [47] Simplex-Structured Matrix Factorization: Sparsity-Based Identifiability and Provably Correct Algorithms
    Abdolali, Maryam
    Gillis, Nicolas
    [J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2021, 3 (02): 593 - 623
  • [48] COMMUNITY DETECTION APPROACH VIA GRAPH REGULARIZED NON-NEGATIVE MATRIX FACTORIZATION
    Ul Haq, Amin
    Li, Jian Ping
    Khan, Ghufran Ahmad
    Khan, Jalaluddin
    Ishrat, Mohammad
    Guru, Abhishek
    Agbley, Bless Lord Y.
    [J]. 2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022
  • [49] Quantitative and Qualitative Analysis of Overlapping Community Detection Algorithms
    Orman, Günce Keziban
    [J]. IAENG International Journal of Computer Science, 2021, 48 (04)
  • [50] Instability of clustering metrics in overlapping community detection algorithms
    Kiedanski, Diego
    Rodriguez-Bocca, Pablo
    [J]. 2021 XLVII LATIN AMERICAN COMPUTING CONFERENCE (CLEI 2021), 2021