Towards Effective Author Name Disambiguation by Hybrid Attention

被引:1
|
作者
Zhou, Qian [1 ]
Chen, Wei [1 ]
Zhao, Peng-Peng [1 ]
Liu, An [1 ]
Xu, Jia-Jie [1 ]
Qu, Jian-Feng [1 ]
Zhao, Lei [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
基金
中国国家自然科学基金;
关键词
author name disambiguation; multiple-feature information; hybrid attention; pruning strategy; structural information loss of vector space;
D O I
10.1007/s11390-023-2070-z
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Author name disambiguation (AND) is a central task in academic search, which has received more attention recently accompanied by the increase of authors and academic publications. To tackle the AND problem, existing studies have proposed various approaches based on different types of information, such as raw document features (e.g., co-authors, titles, and keywords), the fusion feature (e.g., a hybrid publication embedding based on multiple raw document features), the local structural information (e.g., a publication's neighborhood information on a graph), and the global structural information (e.g., interactive information between a node and others on a graph). However, there has been no work taking all the above-mentioned information into account and taking full advantage of the contributions of each raw document feature for the AND problem so far. To fill the gap, we propose a novel framework named EAND (Towards Effective Author Name Disambiguation by Hybrid Attention). Specifically, we design a novel feature extraction model, which consists of three hybrid attention mechanism layers, to extract key information from the global structural information and the local structural information that are generated from six similarity graphs constructed based on different similarity coefficients, raw document features, and the fusion feature. Each hybrid attention mechanism layer contains three key modules: a local structural perception, a global structural perception, and a feature extractor. Additionally, the mean absolute error function in the joint loss function is used to introduce the structural information loss of the vector space. Experimental results on two real-world datasets demonstrate that EAND achieves superior performance, outperforming state-of-the-art methods by at least +2.74% in terms of the micro-F1 score and +3.31% in terms of the macro-F1 score.
引用
收藏
页码:929 / 950
页数:22
相关论文
共 50 条
  • [41] An Unsupervised Heuristic Based Approach for Author Name Disambiguation
    Pooja, K. M.
    Mondal, Samrat
    Chandra, Joydeep
    2018 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2018, : 540 - 542
  • [42] Completing features for author name disambiguation (AND): an empirical analysis
    Waqas, Humaira
    Qadir, Abdul
    SCIENTOMETRICS, 2022, 127 (02) : 1039 - 1063
  • [43] Dynamic author name disambiguation for growing digital libraries
    Yanan Qian
    Qinghua Zheng
    Tetsuya Sakai
    Junting Ye
    Jun Liu
    Information Retrieval Journal, 2015, 18 : 379 - 412
  • [44] Author Name Disambiguation Based on Heterogeneous Information Network
    Qiping D.
    Weijing C.
    Ling J.
    Yu’e Z.
    Data Analysis and Knowledge Discovery, 2022, 6 (04) : 60 - 68
  • [45] NameClarifier: A Visual Analytics System for Author Name Disambiguation
    Shen, Qiaomu
    Wu, Tongshuang
    Yang, Haiyan
    Wu, Yanhong
    Qu, Huamin
    Cui, Weiwei
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2017, 23 (01) : 141 - 150
  • [46] A Relevance Feedback Approach for the Author Name Disambiguation Problem
    Godoi, Thiago A.
    Torres, Ricardo da S.
    Carvalho, Ariadne M. B. R.
    Goncalves, Marcos Andre
    Ferreira, Anderson A.
    Fan, Weiguo
    Fox, Edward A.
    JCDL'13: PROCEEDINGS OF THE 13TH ACM/IEEE-CS JOINT CONFERENCE ON DIGITAL LIBRARIES, 2013, : 209 - 218
  • [47] A Web Service for Author Name Disambiguation in Scholarly Databases
    Kim, Kunho
    Sefid, Athar
    Weinberg, Bruce A.
    Giles, C. Lee
    2018 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2018), 2018, : 265 - 273
  • [48] Large Scale Author Name Disambiguation in Digital Libraries
    Khabsa, Madian
    Treeratpituk, Pucktada
    Giles, C. Lee
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014,
  • [49] A Multi-Level Author Name Disambiguation Algorithm
    Zhang, Siyang
    Xinhua, E.
    Pan, Tian
    IEEE ACCESS, 2019, 7 : 104250 - 104257
  • [50] A Brief Survey of Automatic Methods for Author Name Disambiguation
    Ferreira, Anderson A.
    Goncalves, Marcos Andre
    Laender, Alberto H. F.
    SIGMOD RECORD, 2012, 41 (02) : 15 - 26