Incorporating Entities in News Topic Modeling

被引:0
|
作者
Hu, Linmei [1 ]
Li, Juanzi [1 ]
Li, Zhihui [2 ]
Shao, Chao [1 ]
Li, Zhixing [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Tech, Beijing, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Dept Comp Sci and Tech, Beijing, Peoples R China
关键词
news; named entity; generative entity topic models;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
News articles express information by concentrating on named entities like who, when, and where in news. Whereas, extracting the relationships among entities, words and topics through a large amount of news articles is nontrivial. Topic modeling like Latent Dirichlet Allocation has been applied a lot to mine hidden topics in text analysis, which have achieved considerable performance. However, it cannot explicitly show relationship between words and entities. In this paper, we propose a generative model, Entity-Centered Topic Model(ECTM) to summarize the correlation among entities, words and topics by taking entity topic as a mixture of word topics. Experiments on real news data sets show our model of a lower perplexity and better in clustering of entities than state-of-the-art entity topic model(CorrLDA2). We also present analysis for results of ECTM and further compare it with CorrLDA2.
引用
收藏
页码:139 / 150
页数:12
相关论文
共 50 条
  • [41] Unsupervised Topic Modeling with BERTopic for Coarse and Fine-Grained News Classification
    Al Sayed, Mohamad
    Brasoveanu, Adrian M. P.
    Nixon, Lyndon J. B.
    Scharl, Arno
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2023, PT I, 2023, 14134 : 162 - 174
  • [42] Integrating collaborative topic modeling and diversity for movie recommendations during news browsing
    Liu, Duen-Ren
    Chou, Yun-Cheng
    Jian, Ciao-Ting
    KYBERNETES, 2020, 49 (11) : 2633 - 2649
  • [43] Topic and style-adapted language modeling for Thai broadcast news ASR
    Jongtaveesataporn, Markpong
    Furui, Sadaoki
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1828 - 1831
  • [44] A Full-Cycle Methodology for News Topic Modeling and User Feedback Research
    Koltsov, Sergei
    Pashakhin, Sergei
    Dokuka, Sofia
    SOCIAL INFORMATICS, SOCINFO 2018, PT I, 2018, 11185 : 308 - 321
  • [45] On the Long-Tail Entities in News
    Esquivel, Jose
    Albakour, Dyaa
    Martinez, Miguel
    Corney, David
    Moussa, Samir
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 691 - 697
  • [46] Contextualizing Trending Entities in News Stories
    Ponza, Marco
    Ceccarelli, Diego
    Ferragina, Paolo
    Meij, Edgar
    Kothari, Sambhav
    WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2021, : 346 - 354
  • [47] The Automatic Retrieval of News Entities Based on the Structure of a News Cluster
    Alekseev, A. A.
    Loukachevitch, N. V.
    SCIENTIFIC AND TECHNICAL INFORMATION PROCESSING, 2012, 39 (06) : 303 - 309
  • [48] News recommendations based on collaborative topic modeling and collaborative filtering with generative adversarial networks
    Liu, Duen-Ren
    Huang, Yang
    Jhao, Jhen-Jie
    Lee, Shin-Jye
    DATA TECHNOLOGIES AND APPLICATIONS, 2024, 58 (01) : 24 - 41
  • [49] NELasso: Group-Sparse Modeling for Characterizing Relations Among Named Entities in News Articles
    Tariq, Amara
    Karim, Asim
    Foroosh, Hassan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (10) : 2000 - 2014
  • [50] Analysis of issues related to nursing law: Examination of news articles using topic modeling
    Lee, Joohyun
    Chang, Hyoung Eun
    Cho, Jaehyuk
    Yoo, Seohyun
    Hyeon, Joonseo
    PLOS ONE, 2024, 19 (08):