Social Influence Based Clustering and Optimization over Heterogeneous Information Networks

被引:34
|
作者
Zhou, Yang [1 ]
Liu, Ling [1 ]
机构
[1] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
基金
美国国家科学基金会;
关键词
Algorithms; Experimentation; Performance; Graph clustering; heterogeneous information network; social influence; SIMILARITY;
D O I
10.1145/2717314
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social influence analysis has shown great potential for strategic marketing decision. It is well known that people influence one another based on both their social connections and the social activities that they have engaged in the past. In this article, we develop an innovative and high-performance social influence based graph clustering framework with four unique features. First, we explicitly distinguish social connection based influence (self-influence) and social activity based influence (co-influence). We compute the self-influence similarity between two members based on their social connections within a single collaboration network, and compute the co-influence similarity by taking into account not only the set of activities that people participate but also the semantic association between these activities. Second, we define the concept of influence-based similarity by introducing a unified influence-based similarity matrix that employs an iterative weight update method to integrate self-influence and co-influence similarities. Third, we design a dynamic learning algorithm, called SI-CLUSTER, for social influence based graph clustering. It iteratively partitions a large social collaboration network into K clusters based on both the social network itself and the multiple associated activity information networks, each representing a category of activities that people have engaged. To make the SI-CLUSTER algorithm converge fast, we transform sophisticated nonlinear fractional programming problem with respect to multiple weights into a straightforward nonlinear parametric programming problem of single variable. Finally, we develop an optimization technique of diagonalizable-matrix approximation to speed up the computation of self-influence similarity and co-influence similarities. Our SI-Cluster-Opt significantly improves the efficiency of SI-Cluster on large graphs while maintaining high quality of clustering results. Extensive experimental evaluation on three real-world graphs shows that, compared to existing representative graph clustering algorithms, our SI-CLUSTER-OPT approach not only achieves a very good balance between self-influence and co-influence similarities but also scales extremely well for clustering large graphs in terms of time complexity while meeting the guarantee of high density, low entropy and low Davies-Bouldin Index.
引用
收藏
页数:53
相关论文
共 50 条
  • [1] Social Influence Based Clustering of Heterogeneous Information Networks
    Zhou, Yang
    Liu, Ling
    [J]. 19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 338 - 346
  • [2] Improving Collaborative Filtering with Social Influence over Heterogeneous Information Networks
    Zhou, Yang
    Liu, Ling
    Lee, Kisung
    Palanisamy, Balaji
    Zhang, Qi
    [J]. ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2020, 20 (04)
  • [3] Evolutionary Information Diffusion Over Heterogeneous Social Networks
    Cao, Xuanyu
    Chen, Yan
    Jiang, Chunxiao
    Liu, K. J. Ray
    [J]. IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2016, 2 (04): : 595 - 610
  • [4] Meta path-based information entropy for modeling social influence in heterogeneous information networks
    Yang, Yudi
    Zhou, Lihua
    Jin, Zhao
    Yang, Jinhua
    [J]. 2019 20TH INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2019), 2019, : 557 - 562
  • [5] Optimal Control for Heterogeneous Node-Based Information Epidemics Over Social Networks
    Liu, Fangzhou
    Buss, Martin
    [J]. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2020, 7 (03): : 1115 - 1126
  • [6] Spectral Clustering in Heterogeneous Information Networks
    Li, Xiang
    Kao, Ben
    Ren, Zhaochun
    Yin, Dawei
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 4221 - 4228
  • [7] Tensor decomposition based clustering method for heterogeneous information in networks
    Wu J.
    Huang H.
    Deng S.
    [J]. Deng, Su (sudeng@sohu.com), 2018, National University of Defense Technology (40): : 146 - 152and170
  • [8] A fast clustering algorithm based on embedding technology for heterogeneous information networks
    Chen, Li-Min
    Yang, Jing
    Zhang, Jian-Pei
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2015, 37 (11): : 2634 - 2641
  • [9] A node clustering algorithm for heterogeneous information networks based on node embeddings
    Liu, Dongjiang
    Li, Leixiao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (2) : 3745 - 3766
  • [10] A node clustering algorithm for heterogeneous information networks based on node embeddings
    Dongjiang Liu
    Leixiao Li
    [J]. Multimedia Tools and Applications, 2024, 83 : 3745 - 3766