Degree-biased random walk for large-scale network embedding

被引:16
|
作者
Zhang, Yunyi [1 ,2 ]
Shi, Zhan [1 ,2 ]
Feng, Dan [1 ]
Zhan, Xiu-Xiu [3 ]
机构
[1] Huazhong Univ Sci & Technol, Minist Educ China, Engn Res Ctr Data Storage Syst & Technol, Wuhan Natl Lab Optoelect,Key Lab Informat Storage, Wuhan 430074, Hubei, Peoples R China
[2] Shenzhen Huazhong Univ Sci & Technol, Res Inst, Shenzhen 518000, Guangdong, Peoples R China
[3] Delft Univ Technol, Fac Elect Engn Math & Comp Sci, NL-2628 CD Delft, Netherlands
关键词
Network embedding; Scale-free; Random walks; SOCIAL NETWORKS; PREDICTION; CENTRALITY; FRAMEWORK;
D O I
10.1016/j.future.2019.05.033
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Network embedding aims at learning node representation by preserving the network topology. Previous embedding methods do not scale for large real-world networks which usually contain millions of nodes. They generally adopt a one-size-fits-all strategy to collect information, resulting in a large amount of redundancy. In this paper, we propose DiaRW, a scalable network embedding method based on a degree-biased random walk with variable length to sample context information for learning. Our walk strategy can well adapt to the scale-free feature of real-world networks and extract information from them with much less redundancy. In addition, our method can greatly reduce the size of context information, which is efficient for large-scale network embedding. Empirical experiments on node classification and link prediction prove not only the effectiveness but also the efficiency of DiaRW on a variety of real-world networks. Our algorithm is able to learn the network representations with millions of nodes and edges in hours on a single machine, which is tenfold faster than previous methods. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:198 / 209
页数:12
相关论文
共 50 条
  • [1] Comparison of mean hitting times for a degree-biased random walk
    Gerbaud, Antoine
    Altisen, Karine
    Devismes, Stephane
    Lafourcade, Pascal
    [J]. DISCRETE APPLIED MATHEMATICS, 2014, 170 : 104 - 109
  • [2] Network Embedding Based on Biased Random Walk for Community Detection in Attributed Networks
    Guo, Kun
    Zhao, Zizheng
    Yu, Zhiyong
    Guo, Wenzhong
    Lin, Ronghua
    Tang, Yong
    Wu, Ling
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (05) : 2279 - 2290
  • [3] Random walk with jumps in large-scale random geometric graphs
    Tzevelekas, Leonidas
    Oikonomou, Konstantinos
    Stavrakakis, Ioannis
    [J]. COMPUTER COMMUNICATIONS, 2010, 33 (13) : 1505 - 1514
  • [4] Large-Scale Network Embedding in Apache Spark
    Lin, Wenqing
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 3271 - 3279
  • [5] LINE: Large-scale Information Network Embedding
    Tang, Jian
    Qu, Meng
    Wang, Mingzhe
    Zhang, Ming
    Yan, Jun
    Mei, Qiaozhu
    [J]. PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 1067 - 1077
  • [6] Hyper2vec: Biased Random Walk for Hyper-network Embedding
    Huang, Jie
    Chen, Chuan
    Ye, Fanghua
    Wu, Jiajing
    Zheng, Zibin
    Ling, Guohui
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 273 - 277
  • [7] Symmetry in stochasticity: Random walk models of large-scale structure
    Sheth, Ravi K.
    [J]. PRAMANA-JOURNAL OF PHYSICS, 2011, 77 (01): : 169 - 184
  • [8] Symmetry in stochasticity: Random walk models of large-scale structure
    RAVI K SHETH
    [J]. Pramana, 2011, 77 : 169 - 184
  • [9] Community aware random walk for network embedding
    Keikha, Mohammad Mehdi
    Rahgozar, Maseud
    Asadpour, Masoud
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 148 : 47 - 54
  • [10] Deep random walk of unitary invariance for large-scale data representation
    Wang, Shiping
    Chen, Zhaoliang
    Zhu, William
    Wang, Fei-Yue
    [J]. INFORMATION SCIENCES, 2021, 554 : 1 - 14