Classification of New Titles by Two Stage Latent Dirichlet Allocation

被引:0
|
作者
Guven, Zekeriya Anil [1 ]
Diri, Banu [2 ]
Cakaloglu, Tolgahan [3 ]
机构
[1] Recep Tayyip Erdogan Univ, Bilisim Sistemleri Muhendisligi, Rize, Turkey
[2] Yildiz Tekn Univ, Bilgisayar Muhendisligi, Istanbul, Turkey
[3] Univ Arkansas, Dept Comp Sci, Fayetteville, AR 72701 USA
关键词
Topic Modelling; Latent Dirichlet Allocation; Natural Language Processing; New Analysis; Machine Learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of the Internet, thousands of different news reports from different channels are presented to us. So much news, particularly in the media sector, is an important question to be categorized and archived without human effort. In this study, it is aimed to be able to determine which news item belongs to large news headlines collected from news sites. For this, a two stage method is proposed, which is based on the classical Latent Dirichlet Allocation (LDA) algorithm used in the model. With the developed two stage LDA method, comparison of the conventional LDA was made. Then, by creating a file with an arff extension from the word weights of the topics, the success of the machine learning methods in Weka was measured.
引用
收藏
页码:99 / 103
页数:5
相关论文
共 50 条
  • [31] Biologically-aware Latent Dirichlet Allocation (BaLDA) for the Classification of Expression Microarray
    Perina, Alessandro
    Lovato, Pietro
    Murino, Vittorio
    Bicego, Manuele
    PATTERN RECOGNITION IN BIOINFORMATICS, 2010, 6282 : 230 - 241
  • [32] Analysing Android Apps Classification and Categories Validation by Using Latent Dirichlet Allocation
    Flondor, Elena
    Frincu, Marc
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2023, 2023, 14162 : 282 - 297
  • [33] Cardiology record multi-label classification using latent Dirichlet allocation
    Perez, Jorge
    Perez, Alicia
    Casillas, Arantza
    Gojenola, Koldo
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2018, 164 : 111 - 119
  • [34] Topic Extraction and Sentiment Classification by using Latent Dirichlet Markov Allocation and SentiWordNet
    Kaur, Preet Chandan
    Ghorpade, Tushar
    Mane, Vanita
    INTERNATIONAL CONFERENCE ON ADVANCES IN INFORMATION COMMUNICATION TECHNOLOGY & COMPUTING, 2016, 2016,
  • [35] Aurora Image Classification Based on Multi-Feature Latent Dirichlet Allocation
    Zhong, Yanfei
    Huang, Rui
    Zhao, Ji
    Zhao, Bei
    Liu, Tingting
    REMOTE SENSING, 2018, 10 (02)
  • [36] Parallel Latent Dirichlet Allocation on GPUs
    Moon, Gordon E.
    Nisa, Israt
    Sukumaran-Rajam, Aravind
    Bandyopadhyay, Bortik
    Parthasarathy, Srinivasan
    Sadayappan, P.
    COMPUTATIONAL SCIENCE - ICCS 2018, PT II, 2018, 10861 : 259 - 272
  • [37] Distributed Latent Dirichlet Allocation on Streams
    Guo, Yunyan
    Li, Jianzhong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (01)
  • [38] Selecting Priors for Latent Dirichlet Allocation
    Syed, Shaheen
    Spruit, Marco
    2018 IEEE 12TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2018, : 194 - 202
  • [39] Latent IBP Compound Dirichlet Allocation
    Archambeau, Cedric
    Lakshminarayanan, Balaji
    Bouchard, Guillaume
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (02) : 321 - 333
  • [40] Crowd labeling latent Dirichlet allocation
    Luca Pion-Tonachini
    Scott Makeig
    Ken Kreutz-Delgado
    Knowledge and Information Systems, 2017, 53 : 749 - 765