Incorporating Entities in News Topic Modeling

被引:0
|
作者
Hu, Linmei [1 ]
Li, Juanzi [1 ]
Li, Zhihui [2 ]
Shao, Chao [1 ]
Li, Zhixing [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Tech, Beijing, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Dept Comp Sci and Tech, Beijing, Peoples R China
关键词
news; named entity; generative entity topic models;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
News articles express information by concentrating on named entities like who, when, and where in news. Whereas, extracting the relationships among entities, words and topics through a large amount of news articles is nontrivial. Topic modeling like Latent Dirichlet Allocation has been applied a lot to mine hidden topics in text analysis, which have achieved considerable performance. However, it cannot explicitly show relationship between words and entities. In this paper, we propose a generative model, Entity-Centered Topic Model(ECTM) to summarize the correlation among entities, words and topics by taking entity topic as a mixture of word topics. Experiments on real news data sets show our model of a lower perplexity and better in clustering of entities than state-of-the-art entity topic model(CorrLDA2). We also present analysis for results of ECTM and further compare it with CorrLDA2.
引用
收藏
页码:139 / 150
页数:12
相关论文
共 50 条
  • [21] Incorporating Biterm Correlation Knowledge into Topic Modeling for Short Texts
    Zhang, Kai
    Zhou, Yuan
    Chen, Zheng
    Liu, Yufei
    Tang, Zhuo
    Yin, Li
    Chen, Jihong
    COMPUTER JOURNAL, 2022, 65 (03): : 537 - 553
  • [22] Trending Topic Aggregation by News-Based Context Modeling
    Fuchs, Sebastian
    Borth, Damian
    Ulges, Adrian
    KI 2016: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2016, 9904 : 162 - 168
  • [23] Topic modeling three decades of climate change news in Denmark
    Meier, Florian
    Eskjaer, Mikkel Fugl
    FRONTIERS IN COMMUNICATION, 2024, 8
  • [24] Wikipedia Based News Video Topic Modeling for Information Extraction
    Roy, Sujoy
    Mak, Mun-Thye
    Wan, Kong Wah
    ADVANCES IN MULTIMEDIA MODELING, PT II, 2011, 6524 : 411 - 420
  • [25] Automatic social media news classification: a topic modeling approach
    Amador, Daniel
    Gamboa-Venegas, Carlos
    Garcia, Ernesto
    Segura-Castillo, Andres
    TECNOLOGIA EN MARCHA, 2022, 35
  • [26] Investigating Cybersecurity News Articles by Applying Topic Modeling Method
    Ghasiya, Piyush
    Okamura, Koji
    35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 432 - 438
  • [27] A survey on news text visualization via probabilistic topic modeling
    Tang, Siliang
    Cheng, Lu
    Shao, Jian
    Wu, Fei
    Lu, Weiming
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2015, 27 (05): : 771 - 782
  • [28] Analyzing a Decade of Wind Turbine Accident News with Topic Modeling
    Ertek, Gurdal
    Kailas, Lakshmi
    SUSTAINABILITY, 2021, 13 (22)
  • [29] Comparison of Topic Modeling Methods for Type Detection of Turkish News
    Guven, Zekeriya Anil
    Diri, Banu
    Cakaloglu, Tolgahan
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 150 - 154
  • [30] Robust Multi-view Topic Modeling by Incorporating Detecting Anomalies
    Zhang, Guoxi
    Iwata, Tomoharu
    Kashima, Hisashi
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT II, 2017, 10535 : 238 - 250