A novel similarity measure for mining missing links in long-path networks

被引:6
|
作者
Ran, Yijun [1 ]
Liu, Tianyu [1 ]
Jia, Tao [1 ]
Xu, Xiao-Ke [2 ]
机构
[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing 400715, Peoples R China
[2] Dalian Minzu Univ, Coll Informat & Commun Engn, Dalian 116600, Peoples R China
基金
中国国家自然科学基金;
关键词
structural equivalence; shortest path length; long-path networks; missing links; PREDICTION;
D O I
10.1088/1674-1056/ac4483
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Network information mining is the study of the network topology, which may answer a large number of application-based questions towards the structural evolution and the function of a real system. The question can be related to how the real system evolves or how individuals interact with each other in social networks. Although the evolution of the real system may seem to be found regularly, capturing patterns on the whole process of evolution is not trivial. Link prediction is one of the most important technologies in network information mining, which can help us understand the evolution mechanism of real-life network. Link prediction aims to uncover missing links or quantify the likelihood of the emergence of nonexistent links from known network structures. Currently, widely existing methods of link prediction almost focus on short-path networks that usually have a myriad of close triangular structures. However, these algorithms on highly sparse or long-path networks have poor performance. Here, we propose a new index that is associated with the principles of structural equivalence and shortest path length (SESPL) to estimate the likelihood of link existence in long-path networks. Through a test of 548 real networks, we find that SESPL is more effective and efficient than other similarity-based predictors in long-path networks. Meanwhile, we also exploit the performance of SESPL predictor and of embedding-based approaches via machine learning techniques. The results show that the performance of SESPL can achieve a gain of 44.09% over GraphWave and 7.93% over Node2vec. Finally, according to the matrix of maximal information coefficient (MIC) between all the similarity-based predictors, SESPL is a new independent feature in the space of traditional similarity features.
引用
收藏
页数:9
相关论文
共 22 条
  • [1] A novel similarity measure for mining missing links in long-path networks
    冉义军
    刘天宇
    贾韬
    许小可
    Chinese Physics B, 2022, 31 (06) : 75 - 83
  • [2] DESIGN AND CONSTRUCTION OF A NOVEL LONG-PATH SPECTROPHOTOMETER
    OBRIEN, GE
    HORNSTEIN, JV
    FLASCHKA, HA
    MICROCHEMICAL JOURNAL, 1977, 22 (04) : 548 - 556
  • [3] Precision as a measure of predictability of missing links in real networks
    Garcia-Perez, Guillermo
    Aliakbarisani, Roya
    Ghasemi, Abdorasoul
    Serrano, M. Angeles
    PHYSICAL REVIEW E, 2020, 101 (05)
  • [4] Studies of free space optical links through simulated boundary layer and long-path turbulence
    Wasiczko, L
    Smolyaninov, II
    Milner, SD
    Davis, CC
    OPTICS IN ATMOSPHERIC PROPAGATION AND ADAPTIVE SYSTEMS VI, 2004, 5237 : 127 - 135
  • [5] Mining Missing Links in Directed Social Networks based on Significant Motifs
    Li, Jinsong
    Peng, Jianhua
    Liu, Shuxin
    Li, Zhicheng
    PROCEEDINGS OF 2020 IEEE 10TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2020), 2020, : 31 - 38
  • [6] Investigation of the role of similarity measure and ranking algorithm in mining social networks
    Alguliev, Rasim
    Aliguliyev, Ramiz
    Ganjaliyev, Fadai
    JOURNAL OF INFORMATION SCIENCE, 2011, 37 (03) : 229 - 234
  • [7] Precision as a measure of predictability of missing links in real networks (vol 101, 052318, 2020)
    Garcia-Perez, Guillermo
    Aliakbarisani, Roya
    Ghasemi, Abdorasoul
    Serrano, M. Angeles
    PHYSICAL REVIEW E, 2022, 106 (06)
  • [8] A novel dual-LED based long-path DOAS instrument for the measurement of aromatic hydrocarbons
    Stutz, Jochen
    Hurlock, Stephen C.
    Colosimo, Santo F.
    Tsai, Catalina
    Cheung, Ross
    Festa, James
    Pikelnaya, Olga
    Alvarez, Sergio
    Flynn, James H.
    Erickson, Matthew H.
    Olaguer, Eduardo P.
    ATMOSPHERIC ENVIRONMENT, 2016, 147 : 121 - 132
  • [9] A Semantic Path-Based Similarity Measure for Weighted Heterogeneous Information Networks
    Yang, Chunxue
    Zhao, Chenfei
    Wang, Hengliang
    Qiu, Riming
    Li, Yuan
    Mu, Kedian
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2018), PT I, 2018, 11061 : 311 - 323
  • [10] A novel similarity measure for the link prediction in unipartite and bipartite networks
    Purushottam Kumar
    Dolly Sharma
    Social Network Analysis and Mining, 2021, 11