AdaWIRL: A Novel Bayesian Ranking Approach for Personal Big-Hit Paper Prediction

被引:3
|
作者
Zhang, Chuxu [1 ,2 ]
Yu, Lu [3 ]
Lu, Jie [4 ]
Zhou, Tao [5 ]
Zhang, Zi-Ke [1 ]
机构
[1] Hangzhou Normal Univ, Alibaba Res Ctr Complex Sci, Hangzhou, Zhejiang, Peoples R China
[2] Rutgers State Univ, Dept Comp Sci, New Brunswick, NJ USA
[3] Alibaba Grp, Hangzhou, Zhejiang, Peoples R China
[4] IBM Thomas J Watson Res Ctr, Yorktown Hts, NY USA
[5] Univ Elect Sci & Technol China, Big Data Res Ctr, Chengdu, Peoples R China
来源
WEB-AGE INFORMATION MANAGEMENT, PT II | 2016年 / 9659卷
关键词
INFORMATION; INDEX;
D O I
10.1007/978-3-319-39958-4_27
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Predicting the most impactful (big-hit) paper among a researcher's publications so it can be well disseminated in advance not only has a large impact on individual academic success, but also provides useful guidance to the research community. In this work, we tackle the problem of given the corpus of a researcher's publications in previous few years, how to effectively predict which paper will become the big-hit in the future. We explore a series of features that can drive a paper to become the big-hit, and design a novel Bayesian ranking algorithm AdaWIRL (Adaptive Weighted Impact Ranking Learning) that leverages a weighted training schema and an adaptive timely false correction strategy to predict big-hit papers. Experimental results on the large ArnetMiner dataset with over 1.7 million authors and 2 million papers demonstrate the effectiveness of AdaWIRL. Specifically, it correctly predicts over 78.3% of all researchers' big-hit papers and outperforms the compared regression and ranking algorithms, with an average of 5.8% and 2.9% improvement respectively. Further analysis shows that temporal features are the best indicator for personal big-hit papers, while authorship and social features are less relevant. We also demonstrate that there is a high correlation between the impact of a researcher's future works and their similarity to the predicted big-hit paper.
引用
收藏
页码:342 / 355
页数:14
相关论文
共 16 条
  • [1] Drug-target interaction prediction: A Bayesian ranking approach
    Peska, Ladislav
    Buza, Krisztian
    Koller, Julia
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2017, 152 : 15 - 21
  • [2] A Novel Approach for QoS Prediction Based on Bayesian Combinational Model
    Zhang, Pengcheng
    Sun, Yingtao
    Leung, Hareton
    Xu, Meijun
    Li, Wenrui
    CHINA COMMUNICATIONS, 2016, 13 (11) : 269 - 280
  • [3] A Novel Approach for QoS Prediction Based on Bayesian Combinational Model
    Pengcheng Zhang
    Yingtao Sun
    Hareton Leung
    Meijun Xu
    Wenrui Li
    中国通信, 2016, 13 (11) : 269 - 280
  • [4] A novel disease severity prediction scheme via big pair-wise ranking and learning techniques using image-based personal clinical data
    Huang, Wei
    SIGNAL PROCESSING, 2016, 124 : 233 - 245
  • [5] A Novel Approach to Big Data Veracity using Crowdsourcing Techniques and Bayesian Predictors
    Agarwal, Bhoomika
    Ravikumar, Abhiram
    Saha, Snehanshu
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 1020 - 1023
  • [6] A Novel Approach to Big Data Veracity using Crowdsourcing Techniques and Bayesian Predictors
    Agarwal, Bhoomika
    Ravikumar, Abhiram
    Saha, Snehanshu
    COMPUTE 2016, 2016, : 153 - 160
  • [7] A Novel QoS Prediction Approach for Cloud Service Based on Bayesian Networks Model
    Zhang, Pengcheng
    Han, Qing
    Li, Wenrui
    Leung, Hareton
    Song, Wei
    2016 5TH IEEE INTERNATIONAL CONFERENCE ON MOBILE SERVICES (MS 2016), 2016, : 111 - 118
  • [8] A Novel QoS Prediction Approach for Cloud Services Using Bayesian Network Model
    Li, Wenrui
    Zhang, Pengcheng
    Leung, Hareton
    Ji, Shunhui
    IEEE ACCESS, 2018, 6 : 1391 - 1406
  • [9] A novel Bayesian optimizable ensemble bagged trees model for cryptocurrency fraud prediction approach
    Norouzi, Monire
    SECURITY AND PRIVACY, 2024, 7 (06):
  • [10] ENHANCING PREDICTION ACCURACY IN THE EVALUATION OF POWER PLANT UPRATES UTILIZING A NOVEL 'BIG DATA' APPROACH
    Pechtl, Peter
    Scheinecker, Christian
    Petek, Josef
    PROCEEDINGS OF THE ASME POWER CONFERENCE, 2018, VOL 2, 2018,