A machine learning-based approach for vital node identification in complex networks

被引:23
|
作者
Rezaei, Ahmad Asgharian [1 ]
Munoz, Justin [1 ]
Jalili, Mahdi [1 ]
Khayyam, Hamid [1 ]
机构
[1] RMIT Univ, Sch Engn, Melbourne, Australia
关键词
Influential node ranking; Complex networks; Machine learning; Vital node identification; Support vector machines; Influence maximization; Epidemic analysis; Complex Systems; IDENTIFYING INFLUENTIAL NODES; ONLINE SOCIAL NETWORKS; RANKING; SPREADERS; CENTRALITY; ALGORITHM;
D O I
10.1016/j.eswa.2022.119086
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vital node identification is the problem of finding nodes of highest importance in complex networks. This problem has crucial applications in various contexts such as viral marketing or controlling the propagation of virus or rumours in real-world networks. Existing approaches for vital node identification mainly focus on capturing the importance of a node through a mathematical expression which directly relates structural properties of the node to its vitality. Although these heuristic approaches have achieved good performance in practice, they have weak adaptability, and their performance is limited to specific settings and certain dynamics. Inspired by the power of machine learning models for efficiently capturing different types of patterns and relations, we propose a machine learning-based, data driven approach for vital node identification. The main idea is to train the model with a small portion of the graph, say 0.5% of the nodes, and do the prediction on the rest of the nodes. The ground-truth vitality for the train data is computed by simulating the SIR diffusion method starting from the train nodes. We use collective feature engineering where each node in the network is represented by incorporating elements of its connectivity, degree and extended coreness. Several machine learning models are trained on the node representations, but the best results are achieved by a Support Vector Regression machine with RBF kernel. The empirical results confirms that the proposed model outperforms state-of-the-art models on a selection of datasets, while it also shows more adaptability to changes in the dynamics parameters. With respect to correlation of ranking of the nodes with the ground-truth ranking, the proposed model outperforms other models with a margin as high as 4.63%, while it maintains the lowest variation in performance, with a performance difference as low as 5% across different influence probabilities. The proposed model also obtains the highest uniqueness of ranking, achieving almost unique ranking with a monotonicity relation score of more than 0.9997 on four datasets.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Machine Learning-Based Link Fault Identification and Localization in Complex Networks
    Srinivasan, Srinikethan Madapuzi
    Tram Truong-Huu
    Gurusamy, Mohan
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (04): : 6556 - 6566
  • [2] Machine Learning-Based Source Identification in Sewer Networks
    Salem, Aly K.
    Abokifa, Ahmed A.
    [J]. JOURNAL OF WATER RESOURCES PLANNING AND MANAGEMENT, 2023, 149 (08)
  • [3] Sensor Placement Optimization in Sewer Networks: Machine Learning-Based Source Identification Approach
    Salem, Aly K.
    Abokifa, Ahmed A.
    [J]. JOURNAL OF WATER RESOURCES PLANNING AND MANAGEMENT, 2024, 150 (11)
  • [4] Vital node identification in complex networks based on autoencoder and graph neural network
    Xiong, You
    Hu, Zheng
    Su, Chang
    Cai, Shi-Min
    Zhou, Tao
    [J]. APPLIED SOFT COMPUTING, 2024, 163
  • [5] Auto Machine Learning-Based Approach for Source Printer Identification
    Phu-Qui Vo
    Nhan Tam Dang
    Phu Nguyen, Q.
    An Mai
    Nguyen, Loan T. T.
    Quoc-Thong Nguyen
    Ngoc-Thanh Nguyen
    [J]. RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, 2022, 1716 : 668 - 680
  • [6] An insight into topological, machine and Deep Learning-based approaches for influential node identification in social media networks: a systematic review
    Rashid, Yasir
    Bhat, Javaid Iqbal
    [J]. MULTIMEDIA SYSTEMS, 2024, 30 (01)
  • [7] A MACHINE LEARNING-BASED APPROACH TO LOAD BALANCING IN COMPUTER-NETWORKS
    KUBAT, M
    [J]. CYBERNETICS AND SYSTEMS, 1992, 23 (3-4) : 389 - 400
  • [8] An insight into topological, machine and Deep Learning-based approaches for influential node identification in social media networks: a systematic review
    Yasir Rashid
    Javaid Iqbal Bhat
    [J]. Multimedia Systems, 2024, 30
  • [9] Machine learning-based prediction of Q-voter model in complex networks
    Pineda, Aruane M.
    Kent, Paul
    Connaughton, Colm
    Rodrigues, Francisco A.
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2023, 2023 (12):
  • [10] Machine Learning-Based Identification of Lithic Microdebitage
    Eberl, Markus
    Bell, Charreau S.
    Spencer-Smith, Jesse
    Raj, Mark
    Sarubbi, Amanda
    Johnson, Phyllis S.
    Rieth, Amy E.
    Chaudhry, Umang
    Estrada Aguila, Rebecca
    McBride, Michael
    [J]. ADVANCES IN ARCHAEOLOGICAL PRACTICE, 2023, 11 (02): : 152 - 163