Semantic fingerprints-based author name disambiguation in Chinese documents

被引:3
|
作者
Hongqi Han
Changqing Yao
Yuan Fu
Yongsheng Yu
Yunliang Zhang
Shuo Xu
机构
[1] Institute of Scientific and Technical Information of China,
来源
Scientometrics | 2017年 / 111卷
关键词
Name disambiguation; Simhash; Semantic fingerprint;
D O I
暂无
中图分类号
学科分类号
摘要
Author name disambiguation is an important problem that needs to be resolved in bibliometric analysis or tech mining. Many techniques have been presented; however, most of them require a long run time or additional information. A new method based on semantic fingerprints was presented to disambiguate author names without external data. A manually annotated dataset was built to testify on the efficiency of the presented method. Experiments using co-author features, institution features, and text fingerprints were conducted respectively. We found that the first two methods had higher precision, but their recall was low, and the text fingerprint method had higher recall and satisfied precision. Based on these results, we integrated co-author features, institution features, and text fingerprints to provide semantic fingerprints for disambiguating author names and achieving better performance on the F-measure.
引用
收藏
页码:1879 / 1896
页数:17
相关论文
共 50 条
  • [31] Author Name Disambiguation for Citations on the Deep Web
    Zhang, Rui
    Shen, Derong
    Kou, Yue
    Nie, Tiezheng
    WEB-AGE INFORMATION MANAGEMENT, 2010, 6185 : 198 - 209
  • [32] Toward a New Paradigm for Author Name Disambiguation
    Manzoor, Ayesha
    Asghar, Sohail
    Amjad, Tehmina
    IEEE ACCESS, 2022, 10 : 76055 - 76068
  • [33] Towards a Flexible Author Name Disambiguation Framework
    Bolikowski, Lukasz
    Dendek, Piotr Jan
    DML 2011: TOWARDS A DIGITAL MATHEMATICS LIBRARY, 2011, : 27 - 37
  • [34] A Visual Analytics Approach to Author Name Disambiguation
    Muelder, Chris W.
    Faris, Robert
    Ma, Kwan-Liu
    2016 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES (BDCAT), 2016, : 52 - 60
  • [35] Effect of forename string on author name disambiguation
    Kim, Jinseok
    Kim, Jenna
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2020, 71 (07) : 839 - 855
  • [36] Co-attention-Based Pairwise Learning for Author Name Disambiguation
    Wang, Shenghui
    Li, Qiuke
    Koopman, Rob
    LEVERAGING GENERATIVE INTELLIGENCE IN DIGITAL LIBRARIES: TOWARDS HUMAN-MACHINE COLLABORATION, ICADL 2023, PT II, 2023, 14458 : 240 - 249
  • [37] ANDMC: An Algorithm for Author Name Disambiguation Based on Molecular Cross Clustering
    Zhang, Siyang
    E, Xinhua
    Huang, Tao
    Yang, Fan
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 173 - 185
  • [38] Accuracy of simple, initials-based methods for author name disambiguation
    Milojevic, Stasa
    JOURNAL OF INFORMETRICS, 2013, 7 (04) : 767 - 773
  • [39] Multilayer heuristics based clustering framework (MHCF) for author name disambiguation
    Humaira Waqas
    Muhammad Abdul Qadir
    Scientometrics, 2021, 126 : 7637 - 7678
  • [40] NDFMF: An Author Name Disambiguation Algorithm based on the Fusion of Multiple Features
    Xu, Xiaolong
    Li, Yongping
    Liptrott, Mark
    Bessis, Nik
    2018 IEEE 42ND ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC 2018), VOL 2, 2018, : 187 - 190