Identifying Mis-Configured Author Profiles on Google Scholar Using Deep Learning

Cited by: 3
Authors
Tang, Jiaxin [1 ,2 ]
Chen, Yang [1 ,2 ]
She, Guozhen [1 ,2 ]
Xu, Yang [1 ]
Sha, Kewei [3 ]
Wang, Xin [1 ,2 ]
Wang, Yi [4 ,5 ]
Zhang, Zhenhua [6 ]
Hui, Pan [7 ,8 ]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
[3] Univ Houston Clear Lake, Dept Comp Sci, Houston, TX 77058 USA
[4] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[5] Southern Univ Sci & Technol, Inst Future Networks, Shenzhen 518055, Peoples R China
[6] Meituan, Beijing 100102, Peoples R China
[7] Univ Helsinki, Dept Comp Sci, Helsinki 00014, Finland
[8] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Clear Water Bay, Hong Kong, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2021 / Volume 11 / Issue 15
Funding
National Natural Science Foundation of China;
Keywords
Google Scholar; author profiles; mis-configuration; machine learning; neural network; node embedding; NETWORKS; DEFENSE; INDEX;
DOI
10.3390/app11156912
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703 ;
Abstract
Google Scholar is widely used for academic performance evaluation and citation analysis. Mis-configured author profiles can seriously damage the reliability of its data and thus the accuracy of any analysis built on it, so detecting them is important. Detection is challenging because the dataset is large and manual annotation is time-consuming and relatively subjective. In this paper, we first collect a dataset of Google Scholar author profiles in the field of computer science and compare the mis-configured profiles with the reliable ones. We then propose an integrated model that uses machine learning and node embedding to detect mis-configured author profiles automatically. Additionally, we conduct two application case studies based on Google Scholar data, namely outstanding scholar searching and university ranking, to demonstrate how filtering out the mis-configured author profiles changes the results. The two case studies confirm the importance of detecting mis-configured author profiles.
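The university-ranking case study mentioned in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the toy records, the university names, the `flagged` field standing in for the detector's output, and the choice of summed citations as the ranking metric.

```python
from collections import defaultdict

# Hypothetical toy records: (university, citation count, flagged as mis-configured).
# The flag stands in for the output of a detection model; values are invented.
PROFILES = [
    ("Univ A", 1200, False),
    ("Univ A", 9000, True),   # inflated profile, e.g. papers by other authors merged in
    ("Univ B", 3000, False),
    ("Univ B", 2500, False),
    ("Univ C", 4000, False),
]


def rank_universities(profiles, drop_flagged):
    """Rank universities by total citations (an assumed metric), highest first."""
    totals = defaultdict(int)
    for university, citations, flagged in profiles:
        if drop_flagged and flagged:
            continue  # skip profiles flagged as mis-configured
        totals[university] += citations
    # Sort by descending total, breaking ties alphabetically for determinism.
    return sorted(totals, key=lambda u: (-totals[u], u))


before = rank_universities(PROFILES, drop_flagged=False)
after = rank_universities(PROFILES, drop_flagged=True)
print("before filtering:", before)
print("after filtering: ", after)
```

In this toy data a single flagged profile is enough to move one university from first to last place, which is the kind of shift the case study is meant to expose.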
Pages: 22