Asynchronous Distributed-Memory Parallel Algorithms for Influence Maximization

Authors
Singhal, Shubhendra Pal [1]
Hati, Souvadra [1]
Young, Jeffrey [1]
Sarkar, Vivek [1]
Hayashi, Akihiro [1]
Vuduc, Richard [1]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
Keywords
Influence maximization; FA-BSP; PGAS; IMM; NETWORKS
DOI
10.1109/SC41406.2024.00108
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Influence maximization (IM) is the problem of finding the k most influential nodes in a graph. We propose distributed-memory parallel algorithms for the two main kernels of a state-of-the-art implementation of one IM algorithm, influence maximization via martingales (IMM). The baseline relies on a bulk-synchronous parallel approach and uses replication to reduce communication and achieve approximate load balance, at the cost of synchronization and high memory requirements. By contrast, our method fully distributes the data, thereby improving memory scalability, and uses fine-grained asynchronous parallelism to improve network utilization and offset the cost of the additional communication. We show that our design and implementation can achieve up to 29.6x speedup over the MPI-based state of the art on synthetic and real-world network graphs. Moreover, ours is the first implementation that can run IMM to find influencers in the 'twitter' graph (41M nodes and 1.4B edges) in 200 seconds using 8K CPU cores of the NERSC Perlmutter supercomputer.
Pages: 19