PathSelClus: Integrating Meta-Path Selection with User-Guided Object Clustering in Heterogeneous Information Networks

被引:128
|
作者
Sun, Yizhou [1 ]
Norick, Brandon [2 ]
Han, Jiawei [2 ]
Yan, Xifeng [3 ]
Yu, Philip S. [4 ,5 ]
Yu, Xiao [2 ]
机构
[1] Univ Illinois, Urbana, IL USA
[2] Univ Illinois, Dept Comp Sci, Urbana, IL USA
[3] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
[4] Univ Illinois, Dept Comp Sci, Chicago, IL USA
[5] King Abdulaziz Univ, Dept Comp Sci, Jeddah 21413, Saudi Arabia
基金
美国国家科学基金会;
关键词
Algorithms; Heterogeneous information networks; meta-path selection; user-guided clustering;
D O I
10.1145/2500492
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-world, multiple-typed objects are often interconnected, forming heterogeneous information networks. A major challenge for link-based clustering in such networks is their potential to generate many different results, carrying rather diverse semantic meanings. In order to generate desired clustering, we propose to use meta-path, a path that connects object types via a sequence of relations, to control clustering with distinct semantics. Nevertheless, it is easier for a user to provide a few examples (seeds) than a weighted combination of sophisticated meta-paths to specify her clustering preference. Thus, we propose to integrate meta-path selection with user-guided clustering to cluster objects in networks, where a user first provides a small set of object seeds for each cluster as guidance. Then the system learns the weight for each meta-path that is consistent with the clustering result implied by the guidance, and generates clusters under the learned weights of meta-paths. A probabilistic approach is proposed to solve the problem, and an effective and efficient iterative algorithm, PathSelClus, is proposed to learn the model, where the clustering quality and the meta-path weights mutually enhance each other. Our experiments with several clustering tasks in two real networks and one synthetic network demonstrate the power of the algorithm in comparison with the baselines.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] DPRel: A Meta-Path Based Relevance Measure for Mining Heterogeneous Networks
    Gupta, Mukul
    Kumar, Pradeep
    Bhasker, Bharat
    INFORMATION SYSTEMS FRONTIERS, 2019, 21 (05) : 979 - 995
  • [32] Meta-path Reduction with Transition Probability Preserving in Heterogeneous Information Network
    Wei, Xiaokai
    Liu, Zhiwei
    Sun, Lichao
    Yu, Philip S.
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 1245 - 1250
  • [33] Fast User-Guided Video Object Segmentation by Interaction-and-Propagation Networks
    Oh, Seoung Wug
    Lee, Joon-Young
    Xu, Ning
    Kim, Seon Joo
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5242 - 5251
  • [34] Meta-path automatically extracted from heterogeneous information network for recommendation
    Zhang, Yihao
    Liao, Weiwen
    Wang, Yulin
    Zhu, Junlin
    Chen, Ruizhen
    Zhang, Yunjia
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2024, 27 (03):
  • [35] DDI Prediction With Heterogeneous Information Network - Meta-Path Based Approach
    Tanvir, Farhan
    Saifuddin, Khaled Mohammed
    Islam, Muhammad Ifte Khairul
    Akbas, Esra
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (05) : 1168 - 1179
  • [36] Dynamic Heterogeneous Information Network Embedding With Meta-Path Based Proximity
    Wang, Xiao
    Lu, Yuanfu
    Shi, Chuan
    Wang, Ruijia
    Cui, Peng
    Mou, Shuai
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (03) : 1117 - 1132
  • [37] DPRel: A Meta-Path Based Relevance Measure for Mining Heterogeneous Networks
    Mukul Gupta
    Pradeep Kumar
    Bharat Bhasker
    Information Systems Frontiers, 2019, 21 : 979 - 995
  • [38] Effective Similarity Search on Heterogeneous Networks: A Meta-Path Free Approach
    Wang, Yue
    Wang, Zhe
    Zhao, Ziyuan
    Li, Zijian
    Jian, Xun
    Xin, Hao
    Chen, Lei
    Song, Jianchun
    Chen, Zhenhong
    Zhao, Meng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (07) : 3225 - 3240
  • [39] A k-NN-Based Approach Using MapReduce for Meta-path Classification in Heterogeneous Information Networks
    Kodali, Sadhana
    Dabbiru, Madhavi
    Rao, B. Thirumala
    Patnaik, U. Kartheek Chandra
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 277 - 284
  • [40] MEGA: identifying influential nodes in heterogeneous networks based on meta-path and attention
    Xie, Jinfang
    Yu, Jianyong
    Chen, Zijuan
    JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2025, 2025 (02):