Classification of New Titles by Two Stage Latent Dirichlet Allocation

被引:0
|
作者
Guven, Zekeriya Anil [1 ]
Diri, Banu [2 ]
Cakaloglu, Tolgahan [3 ]
机构
[1] Recep Tayyip Erdogan Univ, Bilisim Sistemleri Muhendisligi, Rize, Turkey
[2] Yildiz Tekn Univ, Bilgisayar Muhendisligi, Istanbul, Turkey
[3] Univ Arkansas, Dept Comp Sci, Fayetteville, AR 72701 USA
关键词
Topic Modelling; Latent Dirichlet Allocation; Natural Language Processing; New Analysis; Machine Learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of the Internet, thousands of different news reports from different channels are presented to us. So much news, particularly in the media sector, is an important question to be categorized and archived without human effort. In this study, it is aimed to be able to determine which news item belongs to large news headlines collected from news sites. For this, a two stage method is proposed, which is based on the classical Latent Dirichlet Allocation (LDA) algorithm used in the model. With the developed two stage LDA method, comparison of the conventional LDA was made. Then, by creating a file with an arff extension from the word weights of the topics, the success of the machine learning methods in Weka was measured.
引用
收藏
页码:99 / 103
页数:5
相关论文
共 50 条
  • [1] A New Latent generalized Dirichlet Allocation Model for Image Classification
    Ihou, Koffi Eddy
    Bouguila, Nizar
    PROCEEDINGS OF THE 2017 SEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA 2017), 2017,
  • [2] IMPACT OF N-STAGE LATENT DIRICHLET ALLOCATION ON ANALYSIS OF HEADLINE CLASSIFICATION
    Guven, Zekeriya Anil
    Diri, Banu
    Cakaloglu, Tolgahan
    COMPUTER SCIENCE-AGH, 2022, 23 (03): : 377 - 396
  • [3] Latent Dirichlet Allocation Models for Image Classification
    Rasiwasia, Nikhil
    Vasconcelos, Nuno
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (11) : 2665 - 2679
  • [4] Latent Dirichlet Allocation Based Multilevel Classification
    Bhutada, Sunil
    Balaram, V. V. S. S. S.
    Bulusu, Vishnu Vardhan
    2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2014, : 1020 - 1024
  • [5] Inference Algorithms in Latent Dirichlet Allocation for Semantic Classification
    Zubir, Wan Mohammad Aflah Mohammad
    Aziz, Izzatdin Abdul
    Jaafar, Jafreezal
    Hasan, Mohd Hilmi
    APPLIED COMPUTATIONAL INTELLIGENCE AND MATHEMATICAL METHODS: COMPUTATIONAL METHODS IN SYSTEMS AND SOFTWARE 2017, VOL. 2, 2018, 662 : 173 - 184
  • [6] THE VARIANT OF LATENT DIRICHLET ALLOCATION FOR NATURAL SCENE CLASSIFICATION
    Tang Yingjun
    COMPUTING AND INFORMATICS, 2011, 30 (02) : 311 - 319
  • [7] A Hybrid Latent Dirichlet Allocation Approach for Topic Classification
    Hsu, Chi-I
    Chiu, Chaochang
    2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2017, : 312 - 315
  • [9] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [10] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 601 - 608