PathSelClus: Integrating Meta-Path Selection with User-Guided Object Clustering in Heterogeneous Information Networks

被引:128
|
作者
Sun, Yizhou [1 ]
Norick, Brandon [2 ]
Han, Jiawei [2 ]
Yan, Xifeng [3 ]
Yu, Philip S. [4 ,5 ]
Yu, Xiao [2 ]
机构
[1] Univ Illinois, Urbana, IL USA
[2] Univ Illinois, Dept Comp Sci, Urbana, IL USA
[3] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
[4] Univ Illinois, Dept Comp Sci, Chicago, IL USA
[5] King Abdulaziz Univ, Dept Comp Sci, Jeddah 21413, Saudi Arabia
基金
美国国家科学基金会;
关键词
Algorithms; Heterogeneous information networks; meta-path selection; user-guided clustering;
D O I
10.1145/2500492
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-world, multiple-typed objects are often interconnected, forming heterogeneous information networks. A major challenge for link-based clustering in such networks is their potential to generate many different results, carrying rather diverse semantic meanings. In order to generate desired clustering, we propose to use meta-path, a path that connects object types via a sequence of relations, to control clustering with distinct semantics. Nevertheless, it is easier for a user to provide a few examples (seeds) than a weighted combination of sophisticated meta-paths to specify her clustering preference. Thus, we propose to integrate meta-path selection with user-guided clustering to cluster objects in networks, where a user first provides a small set of object seeds for each cluster as guidance. Then the system learns the weight for each meta-path that is consistent with the clustering result implied by the guidance, and generates clusters under the learned weights of meta-paths. A probabilistic approach is proposed to solve the problem, and an effective and efficient iterative algorithm, PathSelClus, is proposed to learn the model, where the clustering quality and the meta-path weights mutually enhance each other. Our experiments with several clustering tasks in two real networks and one synthetic network demonstrate the power of the algorithm in comparison with the baselines.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Clustering via Meta-path Embedding for Heterogeneous Information Networks
    Zhang, Yongjun
    Yang, Xiaoping
    Wang, Liang
    11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 188 - 194
  • [2] Integrating Meta-Path Selection with User-Preference for Top-k Relevant Search in Heterogeneous Information Networks
    Bu, Shaoli
    Hong, Xiaoguang
    Peng, Zhaohui
    Li, Qingzhong
    PROCEEDINGS OF THE 2014 IEEE 18TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2014, : 301 - 306
  • [3] WMPEClus: Clustering via Weighted Meta-Path Embedding for Heterogeneous Information Networks
    Zhang, Yongjun
    Yang, Xiaoping
    Wang, Liang
    Li, Kede
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 799 - 806
  • [4] ABLE: Meta-Path Prediction in Heterogeneous Information Networks
    Huang, Chenji
    Fang, Yixiang
    Lin, Xuemin
    Cao, Xin
    Zhang, Wenjie
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (04)
  • [5] Unsupervised meta-path selection for text similarity measure based on heterogeneous information networks
    Wang, Chenguang
    Song, Yangqiu
    Li, Haoran
    Zhang, Ming
    Han, Jiawei
    DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (06) : 1735 - 1767
  • [6] User-Guided Clustering in Heterogeneous Information Networks via Motif-Based Comprehensive Transcription
    Shi, Yu
    He, Xinwei
    Zhang, Naijing
    Yang, Carl
    Han, Jiawei
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT I, 2020, 11906 : 361 - 377
  • [7] Unsupervised meta-path selection for text similarity measure based on heterogeneous information networks
    Chenguang Wang
    Yangqiu Song
    Haoran Li
    Ming Zhang
    Jiawei Han
    Data Mining and Knowledge Discovery, 2018, 32 : 1735 - 1767
  • [8] Weighted Meta-Path Embedding Learning for Heterogeneous Information Networks
    Zhang, Yongjun
    Yang, Xiaoping
    Wang, Liang
    WEB INFORMATION SYSTEMS ENGINEERING, WISE 2020, PT I, 2020, 12342 : 29 - 40
  • [9] Leveraging Meta-path Contexts for Classification in Heterogeneous Information Networks
    Li, Xiang
    Ding, Danhao
    Kao, Ben
    Sun, Yizhou
    Mamoulis, Nikos
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 912 - 923
  • [10] Meta-Path Based Service Recommendation in Heterogeneous Information Networks
    Liang, Tingting
    Chen, Liang
    Wu, Jian
    Dong, Hai
    Bouguettaya, Athman
    SERVICE-ORIENTED COMPUTING, (ICSOC 2016), 2016, 9936 : 371 - 386