Understanding Coarsening for Embedding Large-Scale Graphs

被引:2
|
作者
Akyildiz, Taha Atahan [1 ]
Aljundi, Amro Alabsi [1 ]
Kaya, Kamer [1 ]
机构
[1] Sabanci Univ, Fac Engn & Nat Sci, Istanbul, Turkey
关键词
Graph coarsening; graph embedding; multi-level approach; SCHEME;
D O I
10.1109/BigData50022.2020.9377898
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A significant portion of the data today, e.g, social networks, web connections, etc., can be modeled by graphs. A proper analysis of graphs with Machine Learning (ML) algorithms has the potential to yield far-reaching insights into many areas of research and industry. However, the irregular structure of graph data constitutes an obstacle for running ML tasks on graphs such as link prediction, node classification, and anomaly detection. Graph embedding is a compute-intensive process of representing graphs as a set of vectors in a d-dimensional space, which in turn makes it amenable to ML tasks. Many approaches have been proposed in the literature to improve the performance of graph embedding, e.g., using distributed algorithms, accelerators, and pre-processing techniques. Graph coarsening, which can be considered a pre-processing step, is a structural approximation of a given, large graph with a smaller one. As the literature suggests, the cost of embedding significantly decreases when coarsening is employed. In this work, we thoroughly analyze the impact of the coarsening quality on the embedding performance both in terms of speed and accuracy. Our experiments with a state-of-the-art, fast graph embedding tool show that there is an interplay between the coarsening decisions taken and the embedding quality.
引用
收藏
页码:2937 / 2946
页数:10
相关论文
共 50 条
  • [31] Probabilistic Belief Embedding for Large-Scale Knowledge Population
    Fan, Miao
    Zhou, Qiang
    Abel, Andrew
    Zheng, Thomas Fang
    Grishman, Ralph
    COGNITIVE COMPUTATION, 2016, 8 (06) : 1087 - 1102
  • [32] Large-Scale Embedding Learning in Heterogeneous Event Data
    Gui, Huan
    Liu, Jialu
    Tao, Fangbo
    Jiang, Meng
    Norick, Brandon
    Han, Jiawei
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 907 - 912
  • [33] Coupled Binary Embedding for Large-Scale Image Retrieval
    Zheng, Liang
    Wang, Shengjin
    Tian, Qi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (08) : 3368 - 3380
  • [34] Embedding Feature Selection for Large-scale Hierarchical Classification
    Naik, Azad
    Rangwala, Huzefa
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 1212 - 1221
  • [35] Probabilistic Belief Embedding for Large-Scale Knowledge Population
    Miao Fan
    Qiang Zhou
    Andrew Abel
    Thomas Fang Zheng
    Ralph Grishman
    Cognitive Computation, 2016, 8 : 1087 - 1102
  • [36] AdaEmbed: Adaptive Embedding for Large-Scale Recommendation Models
    Lai, Fan
    Zhang, Wei
    Liu, Rui
    Tsai, William
    Wei, Xiaohan
    Hu, Yuxi
    Devkota, Sabin
    Huang, Jianyu
    Park, Jongsoo
    Liu, Xing
    Chen, Zeliang
    Wen, Ellie
    Rivera, Paul
    You, Jie
    Chen, Chun-Cheng Jason
    Chowdhury, Mosharaf
    PROCEEDINGS OF THE 17TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, OSDI 2023, 2023, : 817 - 831
  • [37] Gaussian embedding for large-scale gene set analysis
    Sheng Wang
    Emily R. Flynn
    Russ B. Altman
    Nature Machine Intelligence, 2020, 2 : 387 - 395
  • [38] InvVis: Large-Scale Data Embedding for Invertible Visualization
    Ye, Huayuan
    Li, Chenhui
    Li, Yang
    Wang, Changbo
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (01) : 1139 - 1149
  • [39] Product Embedding for Large-Scale Disaggregated Sales Data
    Li, Yinxing
    Terui, Nobuhiko
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KDIR), VOL 1:, 2021, : 69 - 75
  • [40] Gaussian embedding for large-scale gene set analysis
    Wang, Sheng
    Flynn, Emily R.
    Altman, Russ B.
    NATURE MACHINE INTELLIGENCE, 2020, 2 (07) : 387 - 395