A comparative analysis of local similarity metrics and machine learning approaches: application to link prediction in author citation networks

Cited by: 7
Authors
Vital, Adilson [1]
Amancio, Diego R. [1]
Affiliations
[1] Univ Sao Paulo, Inst Math & Comp Sci, Dept Comp Sci, Sao Carlos, SP, Brazil
Funding
São Paulo Research Foundation (FAPESP), Brazil;
Keywords
Link prediction; Citation networks; Network similarity; Science of science; Authors citation networks; COMPLEX NETWORKS; COLLABORATION; EVOLUTION; SCIENCE;
DOI
10.1007/s11192-022-04484-6
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Understanding the evolution of paper and author citations is of paramount importance for the design of research policies and evaluation criteria that can promote and accelerate scientific discoveries. Many recent studies on the evolution of science have been conducted in the context of the emergent Science of Science field. While many studies have probed the link prediction problem in citation networks, only a few works have analyzed the temporal nature of link prediction in author citation networks. In this study we compared the performance of 10 well-known local network similarity measurements with four machine learning models to predict future links in author citation networks. Unlike traditional link prediction methods, the temporal nature of the predicted links is relevant to our approach. Our analysis revealed that the Jaccard coefficient was among the most relevant measurements. The preferential attachment measurement, conversely, displayed the worst performance. We also found that extending the local measurements to their weighted versions did not significantly improve the performance of predicting citations. Finally, we found that XGBoost and neural network approaches summarizing the information from all 10 considered similarity measurements provided the highest AUC performance and competitive precision values.
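As a rough illustration of the two local measurements the abstract singles out, the sketch below scores unconnected author pairs with the Jaccard coefficient and with preferential attachment. It is not the authors' implementation: the toy edge list, the function names, and the choice to treat the network as undirected and unweighted are all assumptions made here for brevity.

```python
from itertools import combinations

# Hypothetical toy author citation network, treated as undirected
# and unweighted (an assumption; the paper's data is not reproduced here).
edges = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D"), ("D", "E")]


def build_neighbors(edge_list):
    """Map each node to the set of its neighbors."""
    neighbors = {}
    for u, v in edge_list:
        neighbors.setdefault(u, set()).add(v)
        neighbors.setdefault(v, set()).add(u)
    return neighbors


def jaccard(neighbors, u, v):
    """Shared neighbors divided by the size of the combined neighborhood."""
    union = neighbors[u] | neighbors[v]
    if not union:
        return 0.0
    return len(neighbors[u] & neighbors[v]) / len(union)


def preferential_attachment(neighbors, u, v):
    """Product of the two node degrees."""
    return len(neighbors[u]) * len(neighbors[v])


def score_non_edges(edge_list):
    """Score every currently unconnected pair with both measurements."""
    neighbors = build_neighbors(edge_list)
    existing = {frozenset(e) for e in edge_list}
    scores = {}
    for u, v in combinations(sorted(neighbors), 2):
        if frozenset((u, v)) not in existing:
            scores[(u, v)] = (
                jaccard(neighbors, u, v),
                preferential_attachment(neighbors, u, v),
            )
    return scores


scores = score_non_edges(edges)
for pair, (jac, pa) in sorted(scores.items()):
    print(pair, round(jac, 3), pa)
```

In a temporal setting such as the one the abstract describes, scores like these would be computed on an earlier snapshot of the network and compared against the links that appear later. A per-pair vector of several such scores could also serve as input features for a classifier.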
Pages: 6011 - 6028
Number of pages: 18
Related papers
50 in total
  • [1] A comparative analysis of local similarity metrics and machine learning approaches: application to link prediction in author citation networks
    Adilson Vital
    Diego R. Amancio
    Scientometrics, 2022, 127 : 6011 - 6028
  • [2] Similarity metric induced metrics with application in machine learning and bioinformatics
    Zhang, Kaizhong
    2016 IEEE 15TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2016, : 283 - 287
  • [3] Comparative Analysis of Various Machine Learning Approaches for Bitcoin Price Prediction
    Muvvala, Abhishek
    Chivukula, Rohit
    Lakshmi, T. Jaya
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN SIGNAL PROCESSING AND ARTIFICIAL INTELLIGENCE, ASPAI' 2020, 2020, : 161 - 164
  • [4] Link Prediction in Dynamic Networks Based on Machine Learning
    Liu, Jiachen
    Jiang, Yinan
    Wang, Yashen
    Xie, Haiyong
    Ni, Jie
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 836 - 841
  • [5] Similarity index based on local paths for link prediction of complex networks
    Lue, Linyuan
    Jin, Ci-Hang
    Zhou, Tao
    PHYSICAL REVIEW E, 2009, 80 (04)
  • [6] Link Prediction of Knowledge Diffusion in Disciplinary Citation Networks based on Local Information
    Yue, Zenghui
    Xu, Haiyun
    Yuan, Guoting
    Wang, Qianfei
    17TH INTERNATIONAL CONFERENCE ON SCIENTOMETRICS & INFORMETRICS (ISSI2019), VOL II, 2019, : 2526 - 2527
  • [7] Performance Metrics for the Comparative Analysis of Clinical Risk Prediction Models Employing Machine Learning
    Huang, Chenxi
    Li, Shu-Xia
    Caraballo, Cesar
    Masoudi, Frederick A.
    Rumsfeld, John S.
    Spertus, John A.
    Normand, Sharon-Lise T.
    Mortazavi, Bobak J.
    Krumholz, Harlan M.
    CIRCULATION-CARDIOVASCULAR QUALITY AND OUTCOMES, 2021, 14 (10): 1076 - 1086
  • [8] Enhance Link Prediction in Online Social Networks Using Similarity Metrics, Sampling, and Classification
    Pham Minh Chuan
    Cu Nguyen Giap
    Le Hoang Son
    Bhatt, Chintan
    Tran Dinh Khang
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, INDIA 2017, 2018, 672 : 823 - 833
  • [9] Application of Machine Learning on Process Metrics for Defect Prediction in Mobile Application
    Kaur, Arvinder
    Kaur, Kamaldeep
    Kaur, Harguneet
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 1, INDIA 2016, 2016, 433 : 81 - 98
  • [10] Comparative Study of Machine Learning Approaches in Diabetes Prediction
    Parameswari, P.
    Rajathi, N.
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (11): 42 - 46