Multiple clusterings of heterogeneous information networks

被引:3
|
作者
Wei, Shaowei [1 ]
Yu, Guoxian [1 ,2 ]
Wang, Jun [3 ]
Domeniconi, Carlotta [4 ]
Zhang, Xiangliang [5 ]
机构
[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
[2] Shandong Univ, Sch Software, Jinan, Peoples R China
[3] Shandong Univ, Joint SDU NTU Ctr Artificial Intelligence Res, Jinan, Peoples R China
[4] George Mason Univ, Dept Comp Sci, Fairfax, VA 22030 USA
[5] King Abdullah Univ Sci & Technol, Comp Elect & Math Sci & Engn Div, Thuwal, Saudi Arabia
基金
中国国家自然科学基金;
关键词
Multiple clusterings; Heterogeneous information networks; Meta-path; Quality and diversity; Network embedding;
D O I
10.1007/s10994-021-06000-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional clustering algorithms focus on a single clustering result; as such, they cannot explore potential diverse patterns of complex real world data. To deal with this problem, approaches that exploit meaningful alternative clusterings in data have been developed in recent years. Existing algorithms, including single view/multi-view multiple clustering methods, are designed for applications with i.i.d. data samples, and cannot handle the data samples with dependency presented in networks, especially in heterogeneous information networks (HIN). In this paper, we propose a framework (NetMCs) that can explore multiple clusterings in HIN. Specifically, NetMCs adopts a set of meta-path schemes with different semantics on HIN, and considers each meta-path scheme as a base clustering aspect. Guided by the meta-path schemes, NetMCs then introduces a variation of the skip-gram framework that can jointly optimize multiple clustering aspects, and simultaneously obtain the respective embedding representations and individual clusterings therein. To reduce redundancy between alternative clusterings, NetMCs utilizes an explicit regularization term to control the embedding diversity of the same nodes among different clustering aspects. Experiments on benchmark HIN datasets confirm the performance of NetMCs in generating multiple clusterings with high quality and diversity.
引用
收藏
页码:1505 / 1526
页数:22
相关论文
共 50 条
  • [1] Multiple clusterings of heterogeneous information networks
    Shaowei Wei
    Guoxian Yu
    Jun Wang
    Carlotta Domeniconi
    Xiangliang Zhang
    [J]. Machine Learning, 2021, 110 : 1505 - 1526
  • [2] Link Trustworthiness Evaluation over Multiple Heterogeneous Information Networks
    Wang, Meng
    Qin, Xu
    Jiang, Wei
    Li, Chunshu
    Qi, Guilin
    [J]. COMPLEXITY, 2021, 2021
  • [3] Collective Prediction of Multiple Types of Links in Heterogeneous Information Networks
    Cao, Bokai
    Kong, Xiangnan
    Yu, Philip S.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 50 - 59
  • [4] Combining multiple clusterings using information theory based genetic algorithm
    Luo, Huilan
    Jing, Furong
    Xie, Xiaobing
    [J]. 2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 84 - 89
  • [5] Finding multiple stable clusterings
    Hu, Juhua
    Qian, Qi
    Pei, Jian
    Jin, Rong
    Zhu, Shenghuo
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (03) : 991 - 1021
  • [6] Measuring Disease Similarity Based on Multiple Heterogeneous Disease Information Networks
    Tian, Ling
    Gao, Jianliang
    Wang, Jianxin
    Wang, Ying
    Song, Bo
    Hu, Xiaohua
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 228 - 231
  • [7] CHRS: Cold Start Recommendation Across Multiple Heterogeneous Information Networks
    Zhu, Junxing
    Zhang, Jiawei
    Zhang, Chenwei
    Wu, Quanyuan
    Jia, Yan
    Zhou, Bin
    Yu, Philip S.
    [J]. IEEE ACCESS, 2017, 5 : 15283 - 15299
  • [8] Learning Multiple Nonredundant Clusterings
    Cui, Ying
    Fern, Xiaoli Z.
    Dy, Jennifer G.
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 4 (03)
  • [9] Comparing clusterings by the variation of information
    Meila, M
    [J]. LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 173 - 187
  • [10] Multiple Independent Subspace Clusterings
    Wang, Xing
    Wang, Jun
    Domeniconi, Carlotta
    Yu, Guoxian
    Xiao, Guoqiang
    Guo, Maozu
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5353 - 5360