COSINE: Compressive Network Embedding on Large-Scale Information Networks

被引:2
|
作者
Zhang, Zhengyan [1 ]
Yang, Cheng [2 ]
Liu, Zhiyuan [1 ]
Sun, Maosong [1 ]
Fang, Zhichong [3 ]
Zhang, Bo [4 ]
Lin, Leyu [4 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Comp Sci, Beijing 100876, Peoples R China
[3] Cornell Univ, Dept Comp Sci, Ithaca, NY 14850 USA
[4] Tencent, Search Prod Ctr, WeChat Search Applicat Dept, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Training; Partitioning algorithms; Predictive models; Sparse matrices; Knowledge engineering; Computational modeling; Memory management; Node classification; link prediction; large-scale real-world network; network embedding; model compression; GRAPH; SCHEME;
D O I
10.1109/TKDE.2020.3030539
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is recently a surge in approaches that learn low-dimensional embeddings of nodes in networks. However, for large-scale real-world networks, it's inefficient for existing approaches to store amounts of parameters in memory and update them edge by edge. With the knowledge that nodes having similar neighborhoods will be close to each other in the embedding space, we propose COSINE (COmpresSIve Network Embedding) algorithm, which reduces the memory footprint and accelerates the training process by parameter sharing among similar nodes. COSINE applies graph partitioning algorithms to networks and builds parameter sharing dependency of nodes based on the results of partitioning. In this way, COSINE injects prior knowledge about high-order structural information into models, which makes network embedding more efficient and effective. COSINE can be applied to any embedding lookup method and learn high-quality embeddings with limited memory and less training time. We conduct experiments on multi-label classification and link prediction, where baselines and our model have the same memory usage. Experimental results show that COSINE improves baselines by up to 23 percent on classification and 25 percent on link prediction. Moreover, the training time of all representation learning methods using COSINE decreases by 30 to 70 percent.
引用
收藏
页码:3655 / 3668
页数:14
相关论文
共 50 条
  • [1] LINE: Large-scale Information Network Embedding
    Tang, Jian
    Qu, Meng
    Wang, Mingzhe
    Zhang, Ming
    Yan, Jun
    Mei, Qiaozhu
    [J]. PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 1067 - 1077
  • [2] A General Embedding Framework for Heterogeneous Information Learning in Large-Scale Networks
    Huang, Xiao
    Li, Jundong
    Zou, Na
    Hu, Xia
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2018, 12 (06)
  • [3] DeHIN: A Decentralized Framework for Embedding Large-Scale Heterogeneous Information Networks
    Imran, Mubashir
    Yin, Hongzhi
    Chen, Tong
    Huang, Zi
    Zheng, Kai
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 3645 - 3657
  • [4] Decentralized Embedding Framework for Large-Scale Networks
    Imran, Mubashir
    Yin, Hongzhi
    Chen, Tong
    Shao, Yingxia
    Zhang, Xiangliang
    Zhou, Xiaofang
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT III, 2020, 12114 : 425 - 441
  • [5] Large-Scale Network Embedding in Apache Spark
    Lin, Wenqing
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 3271 - 3279
  • [6] Analysis of large-scale social and information networks
    Kleinberg, Jon
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2013, 371 (1987):
  • [7] NetSMF: Large-Scale Network Embedding as Sparse Matrix Factorization
    Qiu, Jiezhong
    Dong, Yuxiao
    Ma, Hao
    Li, Jian
    Wang, Chi
    Wang, Kuansan
    Tang, Jie
    [J]. WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 1509 - 1520
  • [8] Compressive Data Persistence in Large-Scale Wireless Sensor Networks
    Lin, Mu
    Luo, Chong
    Liu, Feng
    Wu, Feng
    [J]. 2010 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE GLOBECOM 2010, 2010,
  • [9] Compressive Data Gathering for Large-Scale Wireless Sensor Networks
    Luo, Chong
    Wu, Feng
    Sun, Jun
    Chen, Chang Wen
    [J]. FIFTEENTH ACM INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING (MOBICOM 2009), 2009, : 145 - 156
  • [10] RaRE: Social Rank Regulated Large-scale Network Embedding
    Gu, Yupeng
    Sun, Yizhou
    Li, Yanen
    Yang, Yang
    [J]. WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 359 - 368