Identifying named entities in academic biographies with supervised learning

被引:0
|
作者
Patrick Kenekayoro
机构
[1] Niger Delta University,Mathematics/Computer Science Department
来源
Scientometrics | 2018年 / 116卷
关键词
Named entity recognition; Supervised learning; Natural language processing; Support vector machines; Random forests; Conditional random fields; 68U15; 62H30; C63; C80;
D O I
暂无
中图分类号
学科分类号
摘要
Personal webpages of researchers or faculty members make up a percentage of the academic web. These webpages contain semi-structured or plain text information, and research has shown the importance of combining information extracted from multiple academic websites to create a unified database that can help in expert finding, and thus improve information retrieval for end users. This research identifies the kind of named entities that could be present in academic biographies by manually examining the biographies extracted from ORCID public profiles, and describes a method that uses natural language processing techniques and supervised machine learning to automatically extract these named entities from the plain text biographies. Up to 86% accuracy was achieved with support vector machines, demonstrating that the method used in this research can be suitable for creating a reusable trained model that extracts useful academic information from researchers’ personal profiles in webpages or other data sources.
引用
收藏
页码:751 / 765
页数:14
相关论文
共 50 条
  • [1] Identifying named entities in academic biographies with supervised learning
    Kenekayoro, Patrick
    [J]. SCIENTOMETRICS, 2018, 116 (02) : 751 - 765
  • [2] Identifying Named Entities as they are Typed
    Arora, Ravneet Singh
    Tsai, Chen-Tse
    Preotiuc-Pietro, Daniel
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 976 - 988
  • [3] Disambiguating named entities with deep supervised learning via crowd labels
    Le-kui Zhou
    Si-liang Tang
    Jun Xiao
    Fei Wu
    Yue-ting Zhuang
    [J]. Frontiers of Information Technology & Electronic Engineering, 2017, 18 : 97 - 106
  • [4] Disambiguating named entities with deep supervised learning via crowd labels
    Zhou, Le-kui
    Tang, Si-liang
    Xiao, Jun
    Wu, Fei
    Zhuang, Yue-ting
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (01) : 97 - 106
  • [5] Identifying Named Entities of Adverse Drug Reaction with Adversarial Transfer Learning
    Han, Pu
    Zhong, Yule
    Lu, Haojie
    Ma, Shiwen
    [J]. Data Analysis and Knowledge Discovery, 2023, 7 (03) : 131 - 141
  • [6] Identifying Medical Named Entities with Word Information
    Ben, Yanyan
    Pang, Xueqin
    [J]. Data Analysis and Knowledge Discovery, 2023, 7 (05) : 123 - 132
  • [7] Crime Pattern Analysis by Identifying Named Entities and Relation Among Entities
    Das, Priyanka
    Das, Asit Kumar
    [J]. ADVANCED COMPUTATIONAL AND COMMUNICATION PARADIGMS, VOL 2, 2018, 706 : 75 - 84
  • [8] Identifying named entities from PubMed® for enriching semantic categories
    Kim, Sun
    Lu, Zhiyong
    Wilbur, John
    [J]. BMC BIOINFORMATICS, 2015, 16
  • [9] Identifying Named Entities in Biomedical Text Based on Stacked Generalization
    Wang, Haochang
    Zhao, Tiejun
    [J]. 2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 160 - +
  • [10] Identifying named entities from PubMed® for enriching semantic categories
    Sun Kim
    Zhiyong Lu
    W John Wilbur
    [J]. BMC Bioinformatics, 16