Semantic fingerprints-based author name disambiguation in Chinese documents

被引:3
|
作者
Hongqi Han
Changqing Yao
Yuan Fu
Yongsheng Yu
Yunliang Zhang
Shuo Xu
机构
[1] Institute of Scientific and Technical Information of China,
来源
Scientometrics | 2017年 / 111卷
关键词
Name disambiguation; Simhash; Semantic fingerprint;
D O I
暂无
中图分类号
学科分类号
摘要
Author name disambiguation is an important problem that needs to be resolved in bibliometric analysis or tech mining. Many techniques have been presented; however, most of them require a long run time or additional information. A new method based on semantic fingerprints was presented to disambiguate author names without external data. A manually annotated dataset was built to testify on the efficiency of the presented method. Experiments using co-author features, institution features, and text fingerprints were conducted respectively. We found that the first two methods had higher precision, but their recall was low, and the text fingerprint method had higher recall and satisfied precision. Based on these results, we integrated co-author features, institution features, and text fingerprints to provide semantic fingerprints for disambiguating author names and achieving better performance on the F-measure.
引用
收藏
页码:1879 / 1896
页数:17
相关论文
共 50 条
  • [41] Ethnicity-based name partitioning for author name disambiguation using supervised machine learning
    Kim, Jinseok
    Kim, Jenna
    Owen-Smith, Jason
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2021, 72 (08) : 979 - 994
  • [42] Author name disambiguation: What difference does it make in author-based citation analysis?
    Strotmann, Andreas
    Zhao, Dangzhi
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2012, 63 (09): : 1820 - 1833
  • [43] Name Disambiguation using Semantic Association Clustering
    Jin, Hai
    Huang, Li
    Yuan, Pingpeng
    ICEBE 2009: IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING, PROCEEDINGS, 2009, : 42 - 48
  • [44] Molecular Fingerprints-Based Machine Learning for Metabolic Profiling
    Sirocchi, Christel
    Biancucci, Federica
    Suffian, Muhammad
    Benedetti, Riccardo
    Donati, Matteo
    Ferretti, Stefano
    Bogliolo, Alessandro
    Magnani, Mauro
    Menotta, Michele
    Montagna, Sara
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2023, PT IV, 2025, 2136 : 103 - 111
  • [45] A Clock Fingerprints-Based Approach for Wireless Transmitter Identification
    Zhao, Caidan
    Xie, Liang
    Huang, Lianfen
    Yao, Yan
    ADVANCED RESEARCH ON ELECTRONIC COMMERCE, WEB APPLICATION, AND COMMUNICATION, PT 2, 2011, 144 : 236 - 240
  • [46] Deep author name disambiguation using DBLP data
    Boukhers, Zeyd
    Asundi, Nagaraj Bahubali
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2024, 25 (03) : 431 - 441
  • [47] Dynamic author name disambiguation for growing digital libraries
    Qian, Yanan
    Zheng, Qinghua
    Sakai, Tetsuya
    Ye, Junting
    Liu, Jun
    INFORMATION RETRIEVAL JOURNAL, 2015, 18 (05): : 379 - 412
  • [48] Author Name Disambiguation by Using Deep Neural Network
    Hung Nghiep Tran
    Tin Huynh
    Tien Do
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT 1, 2014, 8397 : 123 - 132
  • [49] A Graph Combination With Edge Pruning-Based Approach for Author Name Disambiguation
    Pooja, K. M.
    Mondal, Samrat
    Chandra, Joydeep
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2020, 71 (01) : 69 - 83
  • [50] A knowledge graph embeddings based approach for author name disambiguation using literals
    Cristian Santini
    Genet Asefa Gesese
    Silvio Peroni
    Aldo Gangemi
    Harald Sack
    Mehwish Alam
    Scientometrics, 2022, 127 : 4887 - 4912