Towards Effective Author Name Disambiguation by Hybrid Attention

Cited by: 1
Authors
Zhou, Qian [1]
Chen, Wei [1]
Zhao, Peng-Peng [1]
Liu, An [1]
Xu, Jia-Jie [1]
Qu, Jian-Feng [1]
Zhao, Lei [1]
Affiliation
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
author name disambiguation; multiple-feature information; hybrid attention; pruning strategy; structural information loss of vector space;
DOI
10.1007/s11390-023-2070-z
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Author name disambiguation (AND) is a central task in academic search, and it has received growing attention as the number of authors and academic publications increases. To tackle the AND problem, existing studies have proposed various approaches based on different types of information, such as raw document features (e.g., co-authors, titles, and keywords), the fusion feature (e.g., a hybrid publication embedding based on multiple raw document features), the local structural information (e.g., a publication's neighborhood information on a graph), and the global structural information (e.g., interactive information between a node and others on a graph). However, no existing work takes all of the above information into account while also fully exploiting the contribution of each raw document feature. To fill this gap, we propose a novel framework named EAND (Towards Effective Author Name Disambiguation by Hybrid Attention). Specifically, we design a novel feature extraction model, consisting of three hybrid attention mechanism layers, to extract key information from the global and local structural information generated from six similarity graphs, which are constructed based on different similarity coefficients, raw document features, and the fusion feature. Each hybrid attention mechanism layer contains three key modules: a local structural perception module, a global structural perception module, and a feature extractor. Additionally, the mean absolute error function in the joint loss function is used to introduce the structural information loss of the vector space. Experimental results on two real-world datasets demonstrate that EAND achieves superior performance, outperforming state-of-the-art methods by at least +2.74% in terms of the micro-F1 score and +3.31% in terms of the macro-F1 score.
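The abstract mentions similarity graphs built from raw document features and similarity coefficients, but does not say which coefficients are used. As a minimal sketch only, the snippet below assumes the Jaccard coefficient over co-author sets and a hypothetical edge threshold of 0.3; the function names, the threshold, and the toy data are illustrative assumptions, not the EAND implementation.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard coefficient |a & b| / |a | b|; 0.0 when both sets are empty."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def build_similarity_graph(features, threshold=0.3):
    """Return weighted edges {(i, j): weight} for publication pairs whose
    Jaccard similarity is at or above the (assumed) threshold."""
    edges = {}
    for i, j in combinations(sorted(features), 2):
        weight = jaccard(features[i], features[j])
        if weight >= threshold:
            edges[(i, j)] = weight
    return edges


# Hypothetical toy data: publication id -> set of co-author names.
coauthors = {
    "p1": {"alice", "bob"},
    "p2": {"alice", "bob", "carol"},
    "p3": {"dave"},
}
graph = build_similarity_graph(coauthors)
```

With this toy data only `p1` and `p2` share co-authors, so they are the only pair connected by an edge. Under the same assumptions, one such graph could be built per raw document feature (co-authors, titles, keywords) by swapping in the corresponding feature sets.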
Pages: 929-950
Number of pages: 22
Related papers
50 in total
  • [1] Towards a Flexible Author Name Disambiguation Framework
    Bolikowski, Lukasz
    Dendek, Piotr Jan
    DML 2011: TOWARDS A DIGITAL MATHEMATICS LIBRARY, 2011, : 27 - 37
  • [2] Author Name Disambiguation
    Smalheiser, Neil R.
    Torvik, Vetle I.
    ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 2009, 43 : 287 - 313
  • [3] Hybrid Deep Pairwise Classification for Author Name Disambiguation
    Kim, Kunho
    Rohatgi, Shaurya
    Giles, C. Lee
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2369 - 2372
  • [4] Author Name Disambiguation Using Multiple Graph Attention Networks
    Zhang, Zhiqiang
    Wu, Chunqi
    Li, Zhao
    Peng, Juanjuan
    Wu, Haiyan
    Song, Haiyu
    Deng, Shengchun
    Wang, Biao
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [5] A hybrid knowledge-based framework for author name disambiguation
    Protasiewicz, Jaroslaw
    Dadas, Slawomir
    2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 594 - 600
  • [6] Co-attention-Based Pairwise Learning for Author Name Disambiguation
    Wang, Shenghui
    Li, Qiuke
    Koopman, Rob
    LEVERAGING GENERATIVE INTELLIGENCE IN DIGITAL LIBRARIES: TOWARDS HUMAN-MACHINE COLLABORATION, ICADL 2023, PT II, 2023, 14458 : 240 - 249
  • [7] Author Name Disambiguation for PubMed
    Liu, Wanli
    Dogan, Rezarta Islamaj
    Kim, Sun
    Comeau, Donald C.
    Kim, Won
    Yeganova, Lana
    Lu, Zhiyong
    Wilbur, W. John
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2014, 65 (04) : 765 - 781
  • [8] Author Name Disambiguation in MEDLINE
    Torvik, Vetle I.
    Smalheiser, Neil R.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2009, 3 (03)
  • [9] An Effective Author Name Disambiguation Framework for Large-Scale Publications
    Zhou, Anji
    Shi, Minghui
    Yuan, Rui
    IEEE ACCESS, 2024, 12 : 182086 - 182100
  • [10] Cost-effective on-demand associative author name disambiguation
    Veloso, Adriano
    Ferreira, Anderson A.
    Goncalves, Marcos Andre
    Laender, Alberto H. F.
    Meira, Wagner, Jr.
    INFORMATION PROCESSING & MANAGEMENT, 2012, 48 (04) : 680 - 697