Identifying Mis-Configured Author Profiles on Google Scholar Using Deep Learning

被引:3
|
作者
Tang, Jiaxin [1 ,2 ]
Chen, Yang [1 ,2 ]
She, Guozhen [1 ,2 ]
Xu, Yang [1 ]
Sha, Kewei [3 ]
Wang, Xin [1 ,2 ]
Wang, Yi [4 ,5 ]
Zhang, Zhenhua [6 ]
Hui, Pan [7 ,8 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[2] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China
[3] Univ Houston Clear Lake, Dept Comp Sci, Houston, TX 77058 USA
[4] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[5] Southern Univ Sci & Technol, Inst Future Networks, Shenzhen 518055, Peoples R China
[6] Meituan, Beijing 100102, Peoples R China
[7] Univ Helsinki, Dept Comp Sci, Helsinki 00014, Finland
[8] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Clear Water Bay, Hong Kong, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 15期
基金
中国国家自然科学基金;
关键词
Google Scholar; author profiles; mis-configuration; machine learning; neural network; node embedding; NETWORKS; DEFENSE; INDEX;
D O I
10.3390/app11156912
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Google Scholar has been a widely used platform for academic performance evaluation and citation analysis. The issue about the mis-configuration of author profiles may seriously damage the reliability of the data, and thus affect the accuracy of analysis. Therefore, it is important to detect the mis-configured author profiles. Dealing with this issue is challenging because the scale of the dataset is large and manual annotation is time-consuming and relatively subjective. In this paper, we first collect a dataset of Google Scholar's author profiles in the field of computer science and compare the mis-configured author profiles with the reliable ones. Then, we propose an integrated model that utilizes machine learning and node embedding to automatically detect mis-configured author profiles. Additionally, we conduct two application case studies based on the data of Google Scholar, i.e., outstanding scholar searching and university ranking, to demonstrate how the improved dataset after filtering out the mis-configured author profiles will change the results. The two case studies validate the importance and meaningfulness of the detection of mis-configured author profiles.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] An approach for identifying historic village using deep learning
    Tao, Jin
    Li, Geng
    Sun, Qiwei
    Chen, Youjia
    Xiao, Dawei
    Feng, Huicheng
    SN APPLIED SCIENCES, 2023, 5 (01):
  • [22] Identifying Pauli spin blockade using deep learning
    Schuff, Jonas
    Lennon, Dominic T.
    Geyer, Simon
    Craig, David L.
    Fedele, Federico
    Vigneau, Florian
    Camenzind, Leon C.
    Kuhlmann, Andreas V.
    Briggs, Andrew D.
    Zumbuhl, Dominik M.
    Sejdinovic, Dino
    Ares, Natalia
    QUANTUM, 2023, 7
  • [23] Identifying mental workload using EEG and deep learning
    Zhang, Qiankun
    Yuan, Ziqian
    Chen, He
    Li, Xiaoli
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1138 - 1142
  • [24] Identifying Vulnerable IoT Applications using Deep Learning
    Naeem, Hajra
    Alalfi, Manar H.
    PROCEEDINGS OF THE 2020 IEEE 27TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER '20), 2020, : 582 - 586
  • [25] DroidDeepLearner: Identifying Android Malware Using Deep Learning
    Wang, Zi
    Cai, Juecong
    Cheng, Sihua
    Li, Wenjia
    2016 IEEE 37TH SARNOFF SYMPOSIUM, 2016, : 160 - 165
  • [26] Identifying Engineering Undergraduates' Learning Style Profiles Using Machine Learning Techniques
    Ramirez-Correa, Patricio
    Alfaro-Perez, Jorge
    Gallardo, Mauricio
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [27] Author Identification Using Chaos Game Representation and Deep Learning
    Stoean, Catalin
    Lichtblau, Daniel
    MATHEMATICS, 2020, 8 (11) : 1 - 19
  • [28] Identifying High Risk of Atherosclerosis Using Deep Learning and Ensemble Learning
    Olhosseiny, Hedieh Hashem
    Mirzaloo, Mohammadsalar
    Bolic, Miodrag
    Dajani, Hilmi R.
    Groza, Voicu
    Yoshida, Masayoshi
    2021 IEEE INTERNATIONAL SYMPOSIUM ON MEDICAL MEASUREMENTS AND APPLICATIONS (IEEE MEMEA 2021), 2021,
  • [29] Comparing Manually Added Research Labels and Automatically Extracted Research Keywords to Identify Specialist Researchers in Learning Analytics: A Case Study Using Google Scholar Researcher Profiles
    Aljohani, Naif Radi
    APPLIED SCIENCES-BASEL, 2023, 13 (12):
  • [30] Deep mapping gentrification in a large Canadian city using deep learning and Google Street View
    Ilic, Lazar
    Sawada, M.
    Zarzelli, Amaury
    PLOS ONE, 2019, 14 (03):