Unsupervised Author Disambiguation using Heterogeneous Graph Convolutional Network Embedding

被引:0
|
作者
Qiao, Ziyue [1 ]
Du, Yi [1 ]
Fu, Yanjie [2 ]
Wang, Pengfei [3 ]
Zhou, Yuanchun [1 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing, Peoples R China
[2] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA
[3] Alibaba Grp, Alibaba DAMO Acad, Hangzhou, Zhejiang, Peoples R China
关键词
Name Disambiguation; Network Embedding; Clustering; Graph Convolutional Network; Meta Path; NAME DISAMBIGUATION;
D O I
10.1109/bigdata47090.2019.9005458
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
People share same names in real world. When a digital library user searches for an author name, he may see a mixture of publications by different authors who have the same name. Making distinctions between them is an important prerequisite to improve the quality of services and contents in digital libraries. The general task of author disambiguation is to associate publications which belong to an identical name or names with highly similar spellings to different people entities. In recent years, many researches have been conducted to solve this challenging task. However, some works rely heavily on external knowledge bases and manually annotated data. Some unsupervised learning based works require complex feature engineering. In this paper, we propose a novel and efficient author disambiguation framework which needs no labeled data. We first construct a publication heterogeneous network for each ambiguous name. Then, we use our proposed heterogeneous graph convolutional network embedding method that encodes both graph structure and node attribute information to learn publication representations. After that, we propose a graph enhanced clustering method for name disambiguation that can greatly accelerate the clustering process and need not require the number of distinct persons. Our framework can be continually retrained and applied on incremental disambiguation task when new publications arc put in. Experimental results on two datasets show that our framework clearly performs better than several state-of-the-art methods for author disambiguation.
引用
收藏
页码:910 / 919
页数:10
相关论文
共 50 条
  • [1] Author Name Disambiguation Using Graph Node Embedding Method
    Zhang, Wenjing
    Yan, Zhongmin
    Zheng, Yongqing
    [J]. PROCEEDINGS OF THE 2019 IEEE 23RD INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2019, : 410 - 415
  • [2] Heterogeneous Attributed Network Embedding with Graph Convolutional Networks
    Wang, Yueyang
    Duan, Ziheng
    Liao, Binbing
    Wu, Fei
    Zhuang, Yueting
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 10061 - 10062
  • [3] Dual-Channel Heterogeneous Graph Network for Author Name Disambiguation
    Zheng, Xin
    Zhang, Pengyu
    Cui, Yanjie
    Du, Rong
    Zhang, Yong
    [J]. INFORMATION, 2021, 12 (09)
  • [4] Author Name Disambiguation Based on Heterogeneous Graph
    Ma, Chuang
    Xia, Helong
    [J]. Journal of Computers (Taiwan), 2023, 34 (04) : 41 - 52
  • [5] Heterogeneous Information Network Embedding with Convolutional Graph Attention Networks
    Cao, Meng
    Ma, Xiying
    Zhu, Kai
    Xu, Ming
    Wang, Chongjun
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [6] Author Name Disambiguation via Heterogeneous Network Embedding from Structural and Semantic Perspectives
    Xie, Wenjin
    Liu, Siyuan
    Wang, Xiaomeng
    Jia, Tao
    [J]. 2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 245 - 250
  • [7] Author Name Disambiguation Based on Semi-supervised Learning with Graph Convolutional Network
    Sheng Xiaoguang
    Wang Ying
    Qian Li
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (12) : 3442 - 3450
  • [8] A Network-embedding Based Method for Author Disambiguation
    Xu, Jun
    Shen, Siqi
    Li, Dongsheng
    Fu, Yongquan
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 1735 - 1738
  • [9] Bibliographic Name Disambiguation with Graph Convolutional Network
    Yan, Hao
    Peng, Hao
    Li, Chen
    Li, Jianxin
    Wang, Lihong
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2019, 2019, 11881 : 538 - 551
  • [10] Graph Convolutional Network for Word Sense Disambiguation
    Zhang, Chun-Xiang
    Liu, Rui
    Gao, Xue-Yao
    Yu, Bo
    [J]. DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2021, 2021