Identifying Mis-Configured Author Profiles on Google Scholar Using Deep Learning

被引:3
|
作者
Tang, Jiaxin [1 ,2 ]
Chen, Yang [1 ,2 ]
She, Guozhen [1 ,2 ]
Xu, Yang [1 ]
Sha, Kewei [3 ]
Wang, Xin [1 ,2 ]
Wang, Yi [4 ,5 ]
Zhang, Zhenhua [6 ]
Hui, Pan [7 ,8 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
[3] Univ Houston Clear Lake, Dept Comp Sci, Houston, TX 77058 USA
[4] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[5] Southern Univ Sci & Technol, Inst Future Networks, Shenzhen 518055, Peoples R China
[6] Meituan, Beijing 100102, Peoples R China
[7] Univ Helsinki, Dept Comp Sci, Helsinki 00014, Finland
[8] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Clear Water Bay, Hong Kong, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 15期
基金
中国国家自然科学基金;
关键词
Google Scholar; author profiles; mis-configuration; machine learning; neural network; node embedding; NETWORKS; DEFENSE; INDEX;
D O I
10.3390/app11156912
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Google Scholar has been a widely used platform for academic performance evaluation and citation analysis. The issue about the mis-configuration of author profiles may seriously damage the reliability of the data, and thus affect the accuracy of analysis. Therefore, it is important to detect the mis-configured author profiles. Dealing with this issue is challenging because the scale of the dataset is large and manual annotation is time-consuming and relatively subjective. In this paper, we first collect a dataset of Google Scholar's author profiles in the field of computer science and compare the mis-configured author profiles with the reliable ones. Then, we propose an integrated model that utilizes machine learning and node embedding to automatically detect mis-configured author profiles. Additionally, we conduct two application case studies based on the data of Google Scholar, i.e., outstanding scholar searching and university ranking, to demonstrate how the improved dataset after filtering out the mis-configured author profiles will change the results. The two case studies validate the importance and meaningfulness of the detection of mis-configured author profiles.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Video Forensics: Identifying Colorized Images Using Deep Learning
    Ulloa, Carlos
    Ballesteros, Dora M.
    Renza, Diego
    APPLIED SCIENCES-BASEL, 2021, 11 (02): : 1 - 14
  • [42] Identifying viruses from metagenomic data using deep learning
    Jie Ren
    Kai Song
    Chao Deng
    Nathan AAhlgren
    Jed AFuhrman
    Yi Li
    Xiaohui Xie
    Ryan Poplin
    Fengzhu Sun
    Quantitative Biology, 2020, 8 (01) : 64 - 77
  • [43] Identifying keystone species in microbial communities using deep learning
    Wang, Xu-Wen
    Sun, Zheng
    Jia, Huijue
    Michel-Mata, Sebastian
    Angulo, Marco Tulio
    Dai, Lei
    He, Xuesong
    Weiss, Scott T.
    Liu, Yang-Yu
    NATURE ECOLOGY & EVOLUTION, 2024, 8 (01) : 22 - 31
  • [44] Identifying Alpine Lakes in the Eastern Himalayas Using Deep Learning
    Xu, Jinhao
    Feng, Min
    Sui, Yijie
    Yan, Dezhao
    Zhang, Kuo
    Shi, Kaidan
    WATER, 2023, 15 (02)
  • [45] Identifying viruses from metagenomic data using deep learning
    Ren, Jie
    Song, Kai
    Deng, Chao
    Ahlgren, Nathan A.
    Fuhrman, Jed A.
    Li, Yi
    Xie, Xiaohui
    Poplin, Ryan
    Sun, Fengzhu
    QUANTITATIVE BIOLOGY, 2020, 8 (01) : 64 - 77
  • [46] Identifying Selected Diseases of Leaves using Deep Learning and Transfer Learning Models
    Mimi A.
    Zohura S.F.T.
    Ibrahim M.
    Haque R.R.
    Farrok O.
    Jabid T.
    Ali M.S.
    Machine Graphics and Vision, 2023, 32 (01): : 55 - 71
  • [47] Identifying Schizophrenia Using Structural MRI With a Deep Learning Algorithm
    Oh, Jihoon
    Oh, Baek-Lok
    Lee, Kyong-Uk
    Chae, Jeong-Ho
    Yun, Kyongsik
    FRONTIERS IN PSYCHIATRY, 2020, 11
  • [48] Identifying Informal Settlements Using Contourlet Assisted Deep Learning
    Ansari, Rizwan Ahmed
    Malhotra, Rakesh
    Buddhiraju, Krishna Mohan
    SENSORS, 2020, 20 (09)
  • [49] Identifying Bikers Without Helmets Using Deep Learning Models
    Hossain, Md Iqbal
    Muhib, Raghib Barkat
    Chakrabarty, Amitabha
    2021 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA 2021), 2021, : 510 - 517
  • [50] Identifying Diabetics Retinopathy using Deep Learning based Classification
    Umamageswari, A.
    Duela, J. Shiny
    Raja, K.
    2021 22ND INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2021, : 158 - 163