Whois? Deep Author Name Disambiguation Using Bibliographic Data

被引:7
|
作者
Boukhers, Zeyd [1 ,2 ]
Asundi, Nagaraj Bahubali [1 ]
机构
[1] Univ Koblenz Landau, Inst Web Sci & Technol WeST, Koblenz, Germany
[2] Fraunhofer Inst Appl Informat Technol, St Augustin, Germany
关键词
Author name disambiguation; Entity linkage; Bibliographic data; Neural networks; Classification;
D O I
10.1007/978-3-031-16802-4_16
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the number of authors is increasing exponentially over years, the number of authors sharing the same names is increasing proportionally. This makes it challenging to assign newly published papers to their adequate authors. Therefore, Author Name Ambiguity (ANA) is considered a critical open problem in digital libraries. This paper proposes an Author Name Disambiguation (AND) approach that links author names to their real-world entities by leveraging their co-authors and domain of research. To this end, we use a collection from the DBLP repository that contains more than 5 million bibliographic records authored by around 2.6 million co-authors. Our approach first groups authors who share the same last names and same first name initials. The author within each group is identified by capturing the relation with his/her co-authors and area of research, which is represented by the titles of the validated publications of the corresponding author. To this end, we train a neural network model that learns from the representations of the co-authors and titles. We validated the effectiveness of our approach by conducting extensive experiments on a large dataset.
引用
收藏
页码:201 / 215
页数:15
相关论文
共 50 条
  • [21] Automatic Method for Author Name Disambiguation using Social Networks
    Shin, Dongwook
    Kim, Taehwan
    Jung, Hana
    Choi, Joongmin
    2010 24TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2010, : 1263 - 1270
  • [22] AuthCrowd: Author Name Disambiguation and Entity Matching using Crowdsourcing
    Correia, Antonio
    Guimaraes, Diogo
    Paulino, Dennis
    Jameel, Shoaib
    Schneider, Daniel
    Fonseca, Benjamim
    Paredes, Hugo
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 150 - 155
  • [23] A Novel Approach for Author Name Disambiguation Using Ranking Confidence
    Lin, Xueqin
    Zhu, Jia
    Tang, Yong
    Yang, Fen
    Peng, Bo
    Li, Weiling
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), 2017, 10179 : 169 - 182
  • [24] Using Co-authorship Networks for Author Name Disambiguation
    Momeni, Fakhri
    Mayr, Philipp
    2016 IEEE/ACM JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL), 2016, : 261 - 262
  • [25] Author name disambiguation using a new categorical distribution similarity
    Nanyang Technological University, Singapore
    Lect. Notes Comput. Sci., PART 1 (569-584):
  • [26] Author Name Disambiguation Using Multiple Graph Attention Networks
    Zhang, Zhiqiang
    Wu, Chunqi
    Li, Zhao
    Peng, Juanjuan
    Wu, Haiyan
    Song, Haiyu
    Deng, Shengchun
    Wang, Biao
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [27] A novel approach for author name disambiguation using ranking confidence
    Lin, Xueqin
    Zhu, Jia
    Tang, Yong
    Yang, Fen
    Peng, Bo
    Li, Weiling
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2017, 10179 LNCS : 169 - 182
  • [28] Author Name Disambiguation for Citations Using Topic and Web Correlation
    Yang, Kai-Hsiang
    Peng, Hsin-Tsung
    Jiang, Jian-Yi
    Lee, Hahn-Ming
    Ho, Jan-Ming
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 2008, 5173 : 185 - +
  • [29] Author Name Disambiguation Using Graph Node Embedding Method
    Zhang, Wenjing
    Yan, Zhongmin
    Zheng, Yongqing
    PROCEEDINGS OF THE 2019 IEEE 23RD INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2019, : 410 - 415
  • [30] Data sets for author name disambiguation: an empirical analysis and a new resource
    Mueller, Mark-Christoph
    Reitz, Florian
    Roy, Nicolas
    SCIENTOMETRICS, 2017, 111 (03) : 1467 - 1500