A Latent Topic Model for Complete Entity Resolution

被引:0
|
作者
Shu, Liangcai [1 ]
Long, Bo [1 ]
Meng, Weiyi [1 ]
机构
[1] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA
关键词
DISTRIBUTIONS;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In bibliographies like DBLP and Citeseer, there are three kinds of entity-name problems that need to be solved. First, multiple entities share one name, which is called the name sharing problem. Second, one entity has different names, which is called the name variant problem. Third, multiple entities share multiple names, which is called the name mixing problem. We aim to solve these problems based on one model in this paper. We call this task complete entity resolution. Different from previous work, our work use global information based on data with two types of information, words and author names. We propose a generative latent topic model that involves both author names and words - the LDA-dual model, by extending the LDA (Latent Dirichlet Allocation) model. We also propose a method to obtain model parameters that is global information. Based on obtained model parameters, we propose two algorithms to solve the three problems mentioned above. Experimental results demonstrate the effectiveness and great potential of the proposed model and algorithms.
引用
收藏
页码:880 / 891
页数:12
相关论文
共 50 条
  • [31] An Improved Latent Dirichlet Allocation Model for Hot Topic Extraction
    Liu, Guolong
    Xu, Xiaofei
    Zhu, Ying
    Li, Li
    2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 470 - 476
  • [32] Automatic drive annotation via multimodal latent topic model
    Bando, Takashi
    Takenaka, Kazuhito
    Nagasaka, Shogo
    Taniguchi, Tadairo
    2013 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2013, : 2744 - 2749
  • [33] A multi-collection latent topic model for federated search
    Mark Baillie
    Mark Carman
    Fabio Crestani
    Information Retrieval, 2011, 14 : 390 - 412
  • [34] A Lifelong Learning Topic Model Structured Using Latent Embeddings
    Xu, Mingyang
    Yang, Ruixin
    Harenberg, Steve
    Samatova, Nagiza F.
    2017 11TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2017, : 260 - 261
  • [35] Robust topic inference for latent semantic language model adaptation
    Heidel, Aaron
    Lee, Lin-shan
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 177 - 182
  • [36] A multi-collection latent topic model for federated search
    Baillie, Mark
    Carman, Mark
    Crestani, Fabio
    INFORMATION RETRIEVAL, 2011, 14 (04): : 390 - 412
  • [37] Scene and place recognition using a hierarchical latent topic model
    Yang, Jinfu
    Zhang, Shanshan
    Wang, Guanghui
    Li, Mingai
    NEUROCOMPUTING, 2015, 148 : 578 - 586
  • [38] Latent topic model-based group activity discovery
    Faruquie, T. A.
    Banerjee, S.
    Kalra, P.
    VISUAL COMPUTER, 2011, 27 (12): : 1071 - 1082
  • [39] Latent topic model-based group activity discovery
    T. A. Faruquie
    S. Banerjee
    P. Kalra
    The Visual Computer, 2011, 27 : 1071 - 1082
  • [40] Discriminative Action Recognition using Supervised Latent Topic Model
    Zou Huan-xin
    Sun Hao
    Ji Ke-feng
    DIGITAL MANUFACTURING & AUTOMATION III, PTS 1 AND 2, 2012, 190-191 : 1125 - 1128