Efficient Machine Learning On Large-Scale Graphs

被引:0
|
作者
Erickson, Parker [1 ]
Lee, Victor E. [1 ]
Shi, Feng [1 ]
Tang, Jiliang [2 ]
机构
[1] TigerGraph, Redwood City, CA 94065 USA
[2] Michigan State Univ, E Lansing, MI 48824 USA
关键词
D O I
10.1145/3534678.3542623
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning on graph data has become a common area of interest across academia and industry. However, due to the size of real-world industry graphs (hundreds of millions of vertices and billions of edges) and the special architecture of graph neural networks, it is still a challenge for practitioners and researchers to perform machine learning tasks on large-scale graph data. It typically takes a powerful and expensive GPU machine to train a graph neural network on a million-vertex scale graph, let alone doing deep learning on real enterprise graphs. In this tutorial, we will cover how to develop and run performant graph algorithms and graph neural network models with TigerGraph [3], a massively parallel platform for graph analytics, and its Machine Learning Workbench with PyTorch Geometric [4] and DGL [8] support. Using an NFT transaction dataset [6], we will first investigate transactions using graph algorithms by themselves as methods of graph traversing, clustering, classification, and determining similarities between data. Secondly, we will show how to use those graph-derived features such as PageRank and embeddings to empower traditional machine learning models. Finally, we will demonstrate how to train common graph neural networks with TigerGraph and how to implement novel graph neural network models. Participants will use the TigerGraph ML Workbench Cloud to perform graph feature engineering and train their machine learning algorithms during the session.
引用
收藏
页码:4788 / 4789
页数:2
相关论文
共 50 条
  • [41] 21 000 birds in 4.5 h: efficient large-scale seabird detection with machine learning
    Kellenberger, Benjamin
    Veen, Thor
    Folmer, Eelke
    Tuia, Devis
    [J]. REMOTE SENSING IN ECOLOGY AND CONSERVATION, 2021, 7 (03) : 445 - 460
  • [42] On Efficient Training of Large-Scale Deep Learning Models
    Shen, Li
    Sun, Yan
    Yu, Zhiyuan
    Ding, Liang
    Tian, Xinmei
    Tao, Dacheng
    [J]. ACM Computing Surveys, 57 (03):
  • [43] Efficient Large-Scale Machine Learning Techniques for Rapid Motif Discovery in Energy Data Streams
    Lykothanasi, K. K.
    Sioutas, S.
    Tsichlas, K.
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2022, PART I, 2022, 646 : 331 - 342
  • [44] An Efficient and Scalable Algorithmic Method for Generating Large-Scale Random Graphs
    Alam, Maksudul
    Khan, Maleq
    Vullikanti, Anil
    Marathe, Madhav
    [J]. SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 372 - 383
  • [45] Towards efficient solutions of bitruss decomposition for large-scale bipartite graphs
    Wang, Kai
    Lin, Xuemin
    Qin, Lu
    Zhang, Wenjie
    Zhang, Ying
    [J]. VLDB JOURNAL, 2022, 31 (02): : 203 - 226
  • [46] Efficient and Clean Production Practice of Large-Scale Sintering Machine
    Wang, Daijun
    Wu, Shengli
    Li, Changxing
    Zhu, Juan
    [J]. ISIJ INTERNATIONAL, 2013, 53 (09) : 1665 - 1672
  • [47] Towards efficient solutions of bitruss decomposition for large-scale bipartite graphs
    Kai Wang
    Xuemin Lin
    Lu Qin
    Wenjie Zhang
    Ying Zhang
    [J]. The VLDB Journal, 2022, 31 : 203 - 226
  • [48] An Efficient Subgraph-Inferring Framework for Large-Scale Heterogeneous Graphs
    Zhou, Wei
    Huang, Hong
    Shi, Ruize
    Yin, Kehan
    Jin, Hai
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 8, 2024, : 9431 - 9439
  • [49] Towards an Optimized GROUP BY Abstraction for Large-Scale Machine Learning
    Li, Side
    Kumar, Arun
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (11): : 2327 - 2340
  • [50] Toward Large-Scale Vulnerability Discovery using Machine Learning
    Grieco, Gustavo
    Grinblat, Guillermo Luis
    Uzal, Lucas
    Rawat, Sanjay
    Feist, Josselin
    Mounier, Laurent
    [J]. CODASPY'16: PROCEEDINGS OF THE SIXTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY, 2016, : 85 - 96