Name Disambiguation Using Semi-supervised Topic Model

被引:1
|
作者
Fu, JinLan [1 ]
Qiu, Jie [1 ]
Wang, Jing [1 ]
Li, Li [1 ]
机构
[1] Southwest Univ, Sch Comp & Informat Sci, Chongqing 400715, Peoples R China
关键词
Semi-supervised topic models; Name disambiguation; ENTITY DISAMBIGUATION; AUTHORSHIP;
D O I
10.1007/978-3-319-22053-6_50
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Name ambiguity is increasingly attracting more attention. With the development of information available on the Web, name disambiguation is becoming one of the most challenging tasks. For example, some persons may share the same personal name. In order to address this problem, topic coherence principle is used to eliminate ambiguity of the name entity. A semi-supervised topic model (STM) is proposed. When we search online, many irrelevant documents always return to users. Wikipedia hierarchical structure information enrich the semantics of the name entity. Information extracted from Wikipedia is sorted out and put in the knowledge base. It is used to match the query entity. By utilizing the context of the given query entity, we attempt to disambiguate various meanings with the proposed model. Experiments on two real-life datasets, show that STM is more superior than baselines (ETM and WPAM) with accuracy 84.75 %. The result shows that our method is promising in name disambiguation as well. Our work can provide invaluable insights into entity disambiguation.
引用
收藏
页码:471 / 480
页数:10
相关论文
共 50 条
  • [1] A Hybrid Semi-supervised Topic Model
    Zhang, Yanning
    Wei, Wei
    [J]. INTELLIGENT SCIENCE AND INTELLIGENT DATA ENGINEERING, ISCIDE 2011, 2012, 7202 : 309 - 317
  • [2] Semi-Supervised Multiple Disambiguation
    Ghoorchian, Kambiz
    Rahimian, Fatemeh
    Girdzijauskas, Sarunas
    [J]. 2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 2, 2015, : 88 - 95
  • [3] SEMI-SUPERVISED LEARNING OF LANGUAGE MODEL USING UNSUPERVISED TOPIC MODEL
    Bai, Shuanhu
    Huang, Chien-Lin
    Ma, Bin
    Li, Haizhou
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5382 - 5385
  • [4] A jointly distributed semi-supervised topic model
    Zhang, Yanning
    Wei, Wei
    [J]. NEUROCOMPUTING, 2014, 134 : 38 - 45
  • [5] Author Name Disambiguation Based on Semi-supervised Learning with Graph Convolutional Network
    Sheng Xiaoguang
    Wang Ying
    Qian Li
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (12) : 3442 - 3450
  • [6] Efficient supervised and semi-supervised approaches for affiliations disambiguation
    Pascal Cuxac
    Jean-Charles Lamirel
    Valerie Bonvallot
    [J]. Scientometrics, 2013, 97 : 47 - 58
  • [7] Efficient supervised and semi-supervised approaches for affiliations disambiguation
    Cuxac, Pascal
    Lamirel, Jean-Charles
    Bonvallot, Valerie
    [J]. SCIENTOMETRICS, 2013, 97 (01) : 47 - 58
  • [8] Semi-supervised Word Sense Disambiguation Using the Web as Corpus
    Guzman-Cabrera, Rafael
    Rosso, Paolo
    Montes-y-Gomez, Manuel
    Villasenor-Pineda, Luis
    Pinto-Avendano, David
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2009, 5449 : 256 - +
  • [9] Ethnicity Sensitive Author Disambiguation Using Semi-supervised Learning
    Louppe, Gilles
    Al-Natsheh, Hussein T.
    Susik, Mateusz
    Maguire, Eamonn James
    [J]. KNOWLEDGE ENGINEERING AND SEMANTIC WEB, KESW 2016, 2016, 649 : 272 - 287
  • [10] Personal name disambiguation in web search results based on a semi-supervised clustering approach
    Sugiyama, Kazunari
    Okumura, Manabu
    [J]. ASIAN DIGITAL LIBRARIES: LOOKING BACK 10 YEARS AND FORGING NEW FRONTIERS, PROCEEDINGS, 2007, 4822 : 250 - 256