DeHIN: A Decentralized Framework for Embedding Large-Scale Heterogeneous Information Networks

Cited by: 3
Authors
Imran, Mubashir [1]
Yin, Hongzhi [1]
Chen, Tong [1]
Huang, Zi [1]
Zheng, Kai [2]
Affiliations
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 610056, Sichuan, Peoples R China
Funding
National Natural Science Foundation of China; Australian Research Council
Keywords
Heterogeneous networks; Task analysis; Parallel processing; Data models; Pipelines; Computational modeling; Training; Decentralized network embedding; heterogeneous networks; link prediction; node classification;
DOI
10.1109/TKDE.2022.3141951
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Modeling heterogeneity by extracting and exploiting high-order information from heterogeneous information networks (HINs) has attracted immense research attention in recent years. Such heterogeneous network embedding (HNE) methods effectively harness the heterogeneity of small-scale HINs. However, in the real world, HINs grow exponentially with the continuous introduction of new nodes and different types of links, turning them into billion-scale networks. Learning node embeddings on such HINs creates a performance bottleneck for existing HNE methods, which are commonly centralized, i.e., the complete data and the model both reside on a single machine. To address large-scale HNE tasks with strong efficiency and effectiveness guarantees, we present the Decentralized Embedding Framework for Heterogeneous Information Network (DeHIN) in this paper. In DeHIN, we build a distributed parallel pipeline that uses hypergraphs to infuse parallelization into the HNE task. DeHIN presents a context-preserving partition mechanism that innovatively formulates a large HIN as a hypergraph whose hyperedges connect semantically similar nodes. Our framework then adopts a decentralized strategy to efficiently partition HINs through a tree-like pipeline. Each resulting subnetwork is then assigned to a distributed worker, which employs the deep information maximization theorem to locally learn node embeddings from the partition it receives. We further devise a novel embedding alignment scheme to precisely project independently learned node embeddings from all subnetworks onto a common vector space, thus allowing for downstream tasks like link prediction and node classification. As shown by our experimental results, DeHIN significantly improves the efficiency and accuracy of existing HNE models and outperforms large-scale graph embedding frameworks by efficiently scaling up to large-scale HINs.
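The abstract does not spell out how the embedding alignment scheme works, so the following is only a toy sketch of the general idea of projecting independently learned embeddings onto a common vector space. It swaps in a plain 2-D rotation fitted by least squares over nodes shared by two workers (a rotation-only orthogonal Procrustes fit), which is an assumption and not DeHIN's actual scheme. The node names, the coordinates, and the helpers `fit_rotation` and `rotate` are all hypothetical.

```python
import math

def fit_rotation(src, dst):
    """Return the angle of the 2-D rotation that best maps src onto dst
    in the least-squares sense (closed form, rotations only)."""
    dot = sum(sx * dx + sy * dy for (sx, sy), (dx, dy) in zip(src, dst))
    cross = sum(sx * dy - sy * dx for (sx, sy), (dx, dy) in zip(src, dst))
    return math.atan2(cross, dot)

def rotate(points, theta):
    """Rotate every 2-D point counter-clockwise by theta radians."""
    c, s = math.cos(theta), math.sin(theta)
    return [(c * x - s * y, s * x + c * y) for x, y in points]

# Hypothetical toy data: worker A's space is taken as the common space.
emb_a = {"author1": (1.0, 0.0), "paper1": (0.0, 2.0), "venue1": (1.0, 1.0)}
# Worker B learned the same geometry rotated by 90 degrees,
# plus one node ("paper2") that only its partition contains.
emb_b = {"author1": (0.0, 1.0), "paper1": (-2.0, 0.0),
         "venue1": (-1.0, 1.0), "paper2": (-3.0, 0.0)}

# Nodes present in both partitions act as anchors for the fit.
anchors = sorted(set(emb_a) & set(emb_b))
theta = fit_rotation([emb_b[n] for n in anchors],
                     [emb_a[n] for n in anchors])

# Apply the fitted rotation to all of worker B's embeddings.
aligned_b = dict(zip(emb_b, rotate(list(emb_b.values()), theta)))
```

After the fit, `aligned_b["paper2"]` lies in worker A's coordinate system and can be compared with A's embeddings for tasks such as link prediction. Real embeddings are high-dimensional and may also differ by scale or translation, so a practical alignment would need a more general transform than this sketch.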
Pages: 3645-3657
Number of pages: 13