Understanding Coarsening for Embedding Large-Scale Graphs

被引:2
|
作者
Akyildiz, Taha Atahan [1 ]
Aljundi, Amro Alabsi [1 ]
Kaya, Kamer [1 ]
机构
[1] Sabanci Univ, Fac Engn & Nat Sci, Istanbul, Turkey
关键词
Graph coarsening; graph embedding; multi-level approach; SCHEME;
D O I
10.1109/BigData50022.2020.9377898
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A significant portion of the data today, e.g, social networks, web connections, etc., can be modeled by graphs. A proper analysis of graphs with Machine Learning (ML) algorithms has the potential to yield far-reaching insights into many areas of research and industry. However, the irregular structure of graph data constitutes an obstacle for running ML tasks on graphs such as link prediction, node classification, and anomaly detection. Graph embedding is a compute-intensive process of representing graphs as a set of vectors in a d-dimensional space, which in turn makes it amenable to ML tasks. Many approaches have been proposed in the literature to improve the performance of graph embedding, e.g., using distributed algorithms, accelerators, and pre-processing techniques. Graph coarsening, which can be considered a pre-processing step, is a structural approximation of a given, large graph with a smaller one. As the literature suggests, the cost of embedding significantly decreases when coarsening is employed. In this work, we thoroughly analyze the impact of the coarsening quality on the embedding performance both in terms of speed and accuracy. Our experiments with a state-of-the-art, fast graph embedding tool show that there is an interplay between the coarsening decisions taken and the embedding quality.
引用
收藏
页码:2937 / 2946
页数:10
相关论文
共 50 条
  • [41] Fast graph clustering in large-scale systems based on spectral coarsening
    Sun, Dasong
    INTERNATIONAL JOURNAL OF MODERN PHYSICS B, 2021, 35 (09):
  • [42] Large-scale Parking Data Prediction: From A Graph Coarsening Perspective
    Wang, Yixuan
    Ku, Yixuan
    Liu, Qi
    Yang, Yang
    Peng, Lei
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 1410 - 1415
  • [43] A Large-Scale Cross-Architecture Evaluation of Thread-Coarsening
    Magni, Alberto
    Dubach, Christophe
    O'Boyle, Michael F. P.
    2013 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2013,
  • [44] Flash Embedding: Storing Embedding Tables in SSD for Large-Scale Recommender Systems
    Wan, Hu
    Sun, Xuan
    Cui, Yufei
    Yang, Chia-Lin
    Kuo, Tei-Wei
    Xue, Chun Jason
    APSYS '21: PROCEEDINGS OF THE 12TH ACM SIGOPS ASIA-PACIFIC WORKSHOP ON SYSTEMS, 2021, : 9 - 16
  • [45] Understanding Large-Scale Software - A Hierarchical View
    Levy, Omer
    Feitelson, Dror G.
    2019 IEEE/ACM 27TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION (ICPC 2019), 2019, : 283 - 293
  • [46] Advances in the understanding of the large-scale gap test
    Burley, S. J.
    Bourne, N. K.
    Fung, V.
    Hollands, R.
    Millett, J. C. F.
    Milne, A. M.
    Wood, A.
    Shock Compression of Condensed Matter - 2005, Pts 1 and 2, 2006, 845 : 944 - 947
  • [47] Understanding Source Code Comments at Large-Scale
    He, Hao
    ESEC/FSE'2019: PROCEEDINGS OF THE 2019 27TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, 2019, : 1217 - 1219
  • [48] Understanding the Context of Large-Scale IT Project Failures
    Rich, Eliot
    Nelson, Mark R.
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGIES AND SYSTEMS APPROACH, 2012, 5 (02) : 1 - 24
  • [49] Understanding Large-Scale Dynamic Purchase Behavior
    Jacobs, Bruno
    Fok, Dennis
    Donkers, Bas
    MARKETING SCIENCE, 2021, 40 (05) : 844 - 870
  • [50] The Use of Weighted Graphs for Large-Scale Genome Analysis
    Zhou, Fang
    Toivonen, Hannu
    King, Ross D.
    PLOS ONE, 2014, 9 (03):