A comparative analysis of local similarity metrics and machine learning approaches: application to link prediction in author citation networks

被引:7
|
作者
Vital, Adilson [1 ]
Amancio, Diego R. [1 ]
机构
[1] Univ Sao Paulo, Inst Math & Comp Sci, Dept Comp Sci, Sao Carlos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
Link prediction; Citation networks; Network similarity; Science of science; Authors citation networks; COMPLEX NETWORKS; COLLABORATION; EVOLUTION; SCIENCE;
D O I
10.1007/s11192-022-04484-6
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Understanding the evolution of paper and author citations is of paramount importance for the design of research policies and evaluation criteria that can promote and accelerate scientific discoveries. Recently many studies on the evolution of science have been conducted in the context of the emergent Science of Science field. While many studies have probed the link problem in citation networks, only a few works have analyzed the temporal nature of link prediction in author citation networks. In this study we compared the performance of 10 well-known local network similarity measurements with four machine learning models to predict future links in author citations networks. Differently from traditional link prediction methods, the temporal nature of the predict links is relevant for our approach. Our analysis revealed that the Jaccard coefficient was found to be among the most relevant measurements. The preferential attachment measurement, conversely, displayed the worst performance. We also found that the extension of local measurements to their weighted version do not significantly improved the performance of predicting citations. Finally, we also found that a XGBoost and neural network approach summarizing the information from all 10 considered similarity measurements was able to provide the highest AUC performance and competitive precision values.
引用
收藏
页码:6011 / 6028
页数:18
相关论文
共 50 条
  • [31] Supervised Machine Learning applied to Link Prediction in Bipartite Social Networks
    Benchettara, Nesserine
    Kanawati, Rushed
    Rouveirol, Celine
    2010 INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2010), 2010, : 326 - 330
  • [32] Link Prediction Algorithms for Social Networks Based on Machine Learning and HARP
    Shao, Hao
    Wang, Lunwen
    Ji, Yufan
    IEEE ACCESS, 2019, 7 : 122722 - 122729
  • [33] Citation network analysis of geostatistical and machine learning based spatial prediction
    Radhakrishnan Thanu Iyer
    Manojkumar Thananthu Krishnan
    Spatial Information Research, 2023, 31 : 625 - 636
  • [34] Hybrid Approach for Link Prediction using Supervised Machine Learning in Social Networks: Combining Global and Local Features
    Kumar, Shambhu
    Jain, Arti
    Bisht, Dinesh
    ACM International Conference Proceeding Series, 2023, : 591 - 597
  • [35] A Review and Analysis of Machine Learning and Statistical Approaches for Prediction
    Nisha, K. G.
    Sreekumar, K.
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2017, : 135 - 139
  • [36] A comparative study of Sentiment Analysis Machine Learning Approaches
    Maada, Loukmane
    Al Fararni, Khalid
    Aghoutane, Badraddine
    Fattah, Mohammed
    Farhaoui, Yousef
    2022 2ND INTERNATIONAL CONFERENCE ON INNOVATIVE RESEARCH IN APPLIED SCIENCE, ENGINEERING AND TECHNOLOGY (IRASET'2022), 2022, : 526 - 530
  • [37] Comparative analysis of flexural strength prediction in SFRC using frequentist, Bayesian, and Machine Learning approaches
    De La Rosa, Angel
    Sainz-Aja, Jose
    Rivas, Isaac
    Ruiz, Gonzalo
    Ferreno, Diego
    CASE STUDIES IN CONSTRUCTION MATERIALS, 2024, 21
  • [38] Comparative Analysis of Machine Learning Approaches of Prediction of Diabetes Consequences in Pregnancy with Implications of Data Matrices
    Kumar, A. Aruna
    Henge, Santosh Kumar
    SOFT COMPUTING FOR SECURITY APPLICATIONS, ICSCS 2022, 2023, 1428 : 613 - 626
  • [39] A comparative Analysis of Machine Learning Classification Approaches for Fountain Data Estimation in Wireless Sensor Networks
    Belabed, Fatma
    Bouallegue, Ridha
    2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2019, : 1251 - 1254
  • [40] Comparative Analysis of Machine Learning Algorithms for Rainfall Prediction
    Patil, Rudragoud
    Bedekar, Gayatri
    INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, ICIDCA 2021, 2022, 96 : 833 - 842