Incorporating Entities in News Topic Modeling

被引:0
|
作者
Hu, Linmei [1 ]
Li, Juanzi [1 ]
Li, Zhihui [2 ]
Shao, Chao [1 ]
Li, Zhixing [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Tech, Beijing, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Dept Comp Sci and Tech, Beijing, Peoples R China
关键词
news; named entity; generative entity topic models;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
News articles express information by concentrating on named entities like who, when, and where in news. Whereas, extracting the relationships among entities, words and topics through a large amount of news articles is nontrivial. Topic modeling like Latent Dirichlet Allocation has been applied a lot to mine hidden topics in text analysis, which have achieved considerable performance. However, it cannot explicitly show relationship between words and entities. In this paper, we propose a generative model, Entity-Centered Topic Model(ECTM) to summarize the correlation among entities, words and topics by taking entity topic as a mixture of word topics. Experiments on real news data sets show our model of a lower perplexity and better in clustering of entities than state-of-the-art entity topic model(CorrLDA2). We also present analysis for results of ECTM and further compare it with CorrLDA2.
引用
收藏
页码:139 / 150
页数:12
相关论文
共 50 条
  • [1] Improving Topic Quality by Promoting Named Entities in Topic Modeling
    Krasnashchok, Katsiaryna
    Jouili, Salim
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 247 - 253
  • [2] Incorporating Deep Learning and News Topic Modeling for Forecasting Pork Prices: The Case of South Korea
    Chuluunsaikhan, Tserenpurev
    Ryu, Ga-Ae
    Yoo, Kwan-Hee
    Rah, HyungChul
    Nasridinov, Aziz
    AGRICULTURE-BASEL, 2020, 10 (11): : 1 - 22
  • [3] Twitter Topic Modeling on Football News
    Hidayatullah, Ahmad Fathan
    Pembrani, Elang Cergas
    Kurniawan, Wisnu
    Akbar, Gilang
    Pranata, Ridwan
    PROCEEDINGS OF 2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS), 2018, : 467 - 471
  • [4] Analyzing entities and topics in news articles using statistical topic models
    Newman, David
    Chemudugunta, Chaitanya
    Smyth, Padhraic
    Steyvers, Mark
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2006, 3975 : 93 - 104
  • [5] Incorporating Entity Correlation Knowledge into Topic Modeling
    Wang, Qilin
    Song, Dandan
    Li, Xiuquan
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (IEEE ICBK 2017), 2017, : 254 - 258
  • [6] Incorporating Knowledge Graph Embeddings into Topic Modeling
    Yao, Liang
    Zhang, Yin
    Wei, Baogang
    Jin, Zhe
    Zhang, Rui
    Zhang, Yangyang
    Chen, Qinfei
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3119 - 3126
  • [7] Twitter Topic Modeling for Breaking News Detection
    Wold, Henning M.
    Vikre, Linn
    Gulla, Jon Atle
    Ozgobek, Ozlem
    Su, Xiaomeng
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 2 (WEBIST), 2016, : 211 - 218
  • [8] TOPIC MODELING OF NEWS BASED ON SPARK MLLIB
    Gui, Jing
    Wang, Qi
    2017 14TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2017, : 224 - 228
  • [9] Legal public opinion news abstractive summarization by incorporating topic information
    Huang, Yuxin
    Yu, Zhengtao
    Guo, Junjun
    Yu, Zhiqiang
    Xian, Yantuan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (09) : 2039 - 2050
  • [10] Legal public opinion news abstractive summarization by incorporating topic information
    Yuxin Huang
    Zhengtao Yu
    Junjun Guo
    Zhiqiang Yu
    Yantuan Xian
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 2039 - 2050