Identifying Mis-Configured Author Profiles on Google Scholar Using Deep Learning

被引：3

作者：

Tang, Jiaxin ^{[1
,2
]}

Chen, Yang ^{[1
,2
]}

She, Guozhen ^{[1
,2
]}

Xu, Yang ^{[1
]}

Sha, Kewei ^{[3
]}

Wang, Xin ^{[1
,2
]}

Wang, Yi ^{[4
,5
]}

Zhang, Zhenhua ^{[6
]}

Hui, Pan ^{[7
,8
]}

机构：

[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China

[2] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Shanghai 200433, Peoples R China

[3] Univ Houston Clear Lake, Dept Comp Sci, Houston, TX 77058 USA

[4] Peng Cheng Lab, Shenzhen 518055, Peoples R China

[5] Southern Univ Sci & Technol, Inst Future Networks, Shenzhen 518055, Peoples R China

[6] Meituan, Beijing 100102, Peoples R China

[7] Univ Helsinki, Dept Comp Sci, Helsinki 00014, Finland

[8] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Clear Water Bay, Hong Kong, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 15期

基金：

中国国家自然科学基金;

关键词：

Google Scholar; author profiles; mis-configuration; machine learning; neural network; node embedding; NETWORKS; DEFENSE; INDEX;

D O I：

10.3390/app11156912

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Google Scholar has been a widely used platform for academic performance evaluation and citation analysis. The issue about the mis-configuration of author profiles may seriously damage the reliability of the data, and thus affect the accuracy of analysis. Therefore, it is important to detect the mis-configured author profiles. Dealing with this issue is challenging because the scale of the dataset is large and manual annotation is time-consuming and relatively subjective. In this paper, we first collect a dataset of Google Scholar's author profiles in the field of computer science and compare the mis-configured author profiles with the reliable ones. Then, we propose an integrated model that utilizes machine learning and node embedding to automatically detect mis-configured author profiles. Additionally, we conduct two application case studies based on the data of Google Scholar, i.e., outstanding scholar searching and university ranking, to demonstrate how the improved dataset after filtering out the mis-configured author profiles will change the results. The two case studies validate the importance and meaningfulness of the detection of mis-configured author profiles.

引用

页数：22

共 50 条

[21] An approach for identifying historic village using deep learning
Tao, Jin
Li, Geng
Sun, Qiwei
Chen, Youjia
Xiao, Dawei
Feng, Huicheng
SN APPLIED SCIENCES, 2023, 5 (01):
[22] Identifying Pauli spin blockade using deep learning
Schuff, Jonas
Lennon, Dominic T.
Geyer, Simon
Craig, David L.
Fedele, Federico
Vigneau, Florian
Camenzind, Leon C.
Kuhlmann, Andreas V.
Briggs, Andrew D.
Zumbuhl, Dominik M.
Sejdinovic, Dino
Ares, Natalia
QUANTUM, 2023, 7
[23] Identifying mental workload using EEG and deep learning
Zhang, Qiankun
Yuan, Ziqian
Chen, He
Li, Xiaoli
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1138 - 1142
[24] Identifying Vulnerable IoT Applications using Deep Learning
Naeem, Hajra
Alalfi, Manar H.
PROCEEDINGS OF THE 2020 IEEE 27TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER '20), 2020, : 582 - 586
[25] DroidDeepLearner: Identifying Android Malware Using Deep Learning
Wang, Zi
Cai, Juecong
Cheng, Sihua
Li, Wenjia
2016 IEEE 37TH SARNOFF SYMPOSIUM, 2016, : 160 - 165
[26] Identifying Engineering Undergraduates' Learning Style Profiles Using Machine Learning Techniques
Ramirez-Correa, Patricio
Alfaro-Perez, Jorge
Gallardo, Mauricio
APPLIED SCIENCES-BASEL, 2021, 11 (22):
[27] Author Identification Using Chaos Game Representation and Deep Learning
Stoean, Catalin
Lichtblau, Daniel
MATHEMATICS, 2020, 8 (11) : 1 - 19
[28] Identifying High Risk of Atherosclerosis Using Deep Learning and Ensemble Learning
Olhosseiny, Hedieh Hashem
Mirzaloo, Mohammadsalar
Bolic, Miodrag
Dajani, Hilmi R.
Groza, Voicu
Yoshida, Masayoshi
2021 IEEE INTERNATIONAL SYMPOSIUM ON MEDICAL MEASUREMENTS AND APPLICATIONS (IEEE MEMEA 2021), 2021,
[29] Comparing Manually Added Research Labels and Automatically Extracted Research Keywords to Identify Specialist Researchers in Learning Analytics: A Case Study Using Google Scholar Researcher Profiles
Aljohani, Naif Radi
APPLIED SCIENCES-BASEL, 2023, 13 (12):
[30] Deep mapping gentrification in a large Canadian city using deep learning and Google Street View
Ilic, Lazar
Sawada, M.
Zarzelli, Amaury
PLOS ONE, 2019, 14 (03):

← 1 2 3 4 5 →