Optimal Representation of Large-Scale Graph Data Based on K2-Tree

Authors
Liang Chang
Xiangxuan Zeng
Zhoubo Xu
Junyan Qian
Tianlong Gu
Houbing Song
Affiliations
[1] Guilin University of Electronic Technology, Guangxi Key Laboratory of Trusted Software
[2] West Virginia University, Department of Electrical and Computer Engineering
Source
Wireless Personal Communications, 2017, 95 (03)
Keywords
Graph data; Web graph; Optimal representation; Clustering mechanism; K2-tree
DOI
Not available
Abstract
Graphs are widely used to model data in various applications. With the rapid growth of emerging applications such as the Internet of Things, there is an urgent need for the capability to process large-scale graphs with billions of vertices. The Web graph is a typical case of graph data and is widely used for analyzing the structure, behavior, and evolution of the World Wide Web. In this paper, we focus on the optimal representation of large-scale Web graphs. Our work is motivated by the need to fit large-scale graphs into main memory and carry out analysis on them. By analyzing the adjacency matrices of Web graphs, we find two characteristics of the distribution of 1s in the matrix. First, only a very small proportion of the elements in the matrix are 1s. Second, the majority of 1s gather around the principal diagonal and form a small number of clusters in the matrix. Based on these characteristics, we first develop a clustering mechanism to locate the clusters of 1s in the adjacency matrix. Then, we combine this clustering mechanism with a structure named the K2-tree and propose an approach for representing large-scale Web graphs compactly. The basic idea of the approach is to compress a large number of zeros into a single zero. Experimental results show that our approach not only reduces the space needed to represent a Web graph but also reduces the time taken by operations such as retrieving the neighbors of any node in the graph; compared with existing approaches, our approach achieves the best space/time tradeoff.
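The idea of storing a large all-zero block as a single zero can be illustrated with a minimal sketch of a plain K2-tree. This is not the authors' implementation and it leaves out their clustering mechanism entirely. The function names `build_k2tree` and `neighbors`, the choice k = 2, and the use of ordinary Python lists in place of succinct bitmaps with constant-time rank support are all assumptions made for illustration.

```python
from collections import deque


def build_k2tree(edges, n, k=2):
    """Build a K2-tree for an n x n adjacency matrix given as (row, col) pairs.

    Returns (T, L, size): T holds the bits of the internal levels in
    level order, L holds the bits of the last level (individual cells),
    and size is n rounded up to a power of k (at least k).
    """
    size = k
    while size < n:
        size *= k
    T, L = [], []
    # Each queue entry is a submatrix: (top row, left col, side, its edges).
    queue = deque([(0, 0, size, list(edges))])
    while queue:
        r0, c0, side, cell_edges = queue.popleft()
        sub = side // k
        # Distribute the edges of this submatrix over its k*k children.
        buckets = [[] for _ in range(k * k)]
        for r, c in cell_edges:
            buckets[((r - r0) // sub) * k + (c - c0) // sub].append((r, c))
        for i, bucket in enumerate(buckets):
            bit = 1 if bucket else 0
            if sub == 1:
                L.append(bit)  # last level: one bit per matrix cell
            else:
                T.append(bit)  # an all-zero block costs a single 0 bit
                if bit:
                    queue.append(
                        (r0 + (i // k) * sub, c0 + (i % k) * sub, sub, bucket)
                    )
    return T, L, size


def neighbors(T, L, size, row, k=2):
    """Return the sorted column indices j with an edge (row, j)."""
    bits = T + L
    # prefix[i] = number of 1s in T[:i]; stands in for a rank structure.
    prefix = [0]
    for b in T:
        prefix.append(prefix[-1] + b)
    result = []
    # Stack entries: (start of child block, node side, row inside node, left col).
    stack = [(0, size, row, 0)]
    while stack:
        start, side, r, c0 = stack.pop()
        sub = side // k
        for j in range(k):
            pos = start + (r // sub) * k + j
            if bits[pos]:
                col = c0 + j * sub
                if sub == 1:
                    result.append(col)
                else:
                    # Children of the 1 at pos start at rank1(T, pos) * k^2.
                    stack.append((prefix[pos + 1] * k * k, sub, r % sub, col))
    return sorted(result)


# Hypothetical 4 x 4 example graph.
demo_edges = [(0, 1), (1, 0), (1, 2), (3, 3)]
demo_T, demo_L, demo_size = build_k2tree(demo_edges, 4)
print(demo_T)                                   # [1, 1, 0, 1]
print(neighbors(demo_T, demo_L, demo_size, 1))  # [0, 2]
```

In this example the empty lower-left quadrant of the matrix is represented by the single 0 in `T`, and none of its four cells appear in `L`. A practical implementation would typically replace the `prefix` list with a compact rank structure over the bitmap.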
Pages: 2271 - 2284
Page count: 13
Related papers
  • [1] Optimal Representation of Large-Scale Graph Data Based on K2-Tree
    Chang, Liang
    Zeng, Xiangxuan
    Xu, Zhoubo
    Qian, Junyan
    Gu, Tianlong
    Song, Houbing
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2017, 95 (03) : 2271 - 2284
  • [2] Optimal Representation of Large-Scale Graph Data Based on Grid Clustering and K2-Tree
    Li, Fengying
    Yang, Enyi
    Ma, Anqiao
    Dong, Rongsheng
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [3] Optimal Representation for Web and Social Network Graphs Based on K2-Tree
    Li, Fengying
    Zhang, Qi
    Gu, Tianlong
    Dong, Rongsheng
    [J]. IEEE ACCESS, 2019, 7 : 52945 - 52954
  • [4] Large-scale Graph Representation Learning
    Leskovec, Jure
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017 : 4 - 4
  • [5] Large-scale knowledge graph representation learning
    Badrouni, Marwa
    Katar, Chaker
    Inoubli, Wissem
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (09) : 5479 - 5499
  • [6] Large-Scale Graph Alignment Based on Topological Structure Representation Learning
    Wang, Chen-Xu
    Zhou, Jun-Ming
    Jiang, Pei-Jing
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (07) : 1350 - 1365
  • [7] GNNVis: Visualize Large-Scale Data by Learning a Graph Neural Network Representation
    Huang, Yajun
    Zhang, Jingbin
    Yang, Yiyang
    Gong, Zhiguo
    Hao, Zhifeng
    [J]. CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020 : 545 - 554
  • [8] Dating the arthropod tree based on large-scale transcriptome data
    Rehm, Peter
    Borner, Janus
    Meusemann, Karen
    von Reumont, Bjoern M.
    Simon, Sabrina
    Hadrys, Heike
    Misof, Bernhard
    Burmester, Thorsten
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2011, 61 (03) : 880 - 887
  • [9] Large-Scale Clustering With Structured Optimal Bipartite Graph
    Zhang, Han
    Nie, Feiping
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9950 - 9963
  • [10] A Twig-Based Algorithm for Top-k Subgraph Matching in Large-Scale Graph Data
    Zhang, Haiwei
    Bai, Qijie
    Lian, Yining
    Wen, Yanlong
    [J]. BIG DATA RESEARCH, 2022, 30