Predicting scientific impact based on h-index

被引:31
|
作者
Ayaz, Samreen [1 ]
Masood, Nayyer [1 ]
Islam, Muhammad Arshad [1 ]
机构
[1] Capital Univ Sci &Technol, Dept Comp Sci, Islamabad, Pakistan
关键词
h-Index prediction; Regression; Career age; R-2; RESEARCHERS; VARIANTS; SCIENCE; POWER;
D O I
10.1007/s11192-017-2618-1
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Predicting the future impact of a scientist/researcher is a critical task. The objective of this work is to evaluate different h-index prediction models for the field of Computer Science. Different combinations of parameters have been identified to build the model and applied on a large data set taken from Arnetminer comprised of almost 1.8 million authors and 2.1 million publications' record of Computer Science. Machine learning prediction technique, regression, is used to find the best set of parameters suitable for h-index prediction for the scientists from all career ages, without enforcing any constraint on their current h-index values with R-2 as a metric to measure the accuracy. Further, these parameters are evaluated for different career ages and different thresholds for h-index values. Prediction results for 1 year are really good, having R-2 0.93 but for 5 years R-2 declines to 0.82 on average. Hence inferred that prediction of h-index is difficult for longer periods. Predictions for the researchers having 1 year experience are not precise, having R-2 0.60 for 1 year and 0.33 for 5 years. Considering scientists of different career ages, average R-2 values for researchers having 20-36 years of experience were 0.99. For the researches having different h-index values, researchers having low h-index were difficult to predict. Parameters set comprising of current h-index, average citations per paper, number of coauthors, years since publishing first article, number of publications, number of impact factor publications, and number of publications in distinct journals performed better than all other combinations.
引用
收藏
页码:993 / 1010
页数:18
相关论文
共 50 条
  • [31] The H-index
    Brähler, E
    Decker, O
    [J]. PSYCHOTHERAPIE PSYCHOSOMATIK MEDIZINISCHE PSYCHOLOGIE, 2005, 55 (11) : 451 - 451
  • [32] h-Index of high-impact hospitals
    Zhao, Star X.
    Ye, Fred Y.
    [J]. CURRENT SCIENCE, 2011, 101 (08): : 984 - 985
  • [33] Bibliometric indicator based on the h-index
    Dorta-Gonzalez, Pablo
    Isabel Dorta-Gonzalez, Maria
    [J]. REVISTA ESPANOLA DE DOCUMENTACION CIENTIFICA, 2010, 33 (02): : 225 - 245
  • [34] The H-index
    不详
    [J]. SCIENTIST, 2005, 19 (20): : 24 - 24
  • [35] Cost-sensitive selective naive Bayes classifiers for predicting the increase of the h-index for scientific journals
    Ibanez, Alfonso
    Bielza, Concha
    Larranaga, Pedro
    [J]. NEUROCOMPUTING, 2014, 135 : 42 - 52
  • [36] h-index sequence and h-index matrix:: Constructions and applications
    Liang, Liming
    [J]. SCIENTOMETRICS, 2006, 69 (01) : 153 - 159
  • [37] h-index sequence and h-index matrix: Constructions and applications
    Liming Liang
    [J]. Scientometrics, 2006, 69 : 153 - 159
  • [38] The single publication H-index and the indirect H-index of a researcher
    L. Egghe
    [J]. Scientometrics, 2011, 88 : 1003 - 1004
  • [39] The single publication H-index and the indirect H-index of a researcher
    Egghe, L.
    [J]. SCIENTOMETRICS, 2011, 88 (03) : 1003 - 1004
  • [40] Combination of Eigenfactor TM and h-index to evaluate scientific journals
    Yin, Chun-Yang
    Aris, Mohd Jindra
    Chen, Xi
    [J]. SCIENTOMETRICS, 2010, 84 (03) : 639 - 648