Classification of New Titles by Two Stage Latent Dirichlet Allocation

被引:0
|
作者
Guven, Zekeriya Anil [1 ]
Diri, Banu [2 ]
Cakaloglu, Tolgahan [3 ]
机构
[1] Recep Tayyip Erdogan Univ, Bilisim Sistemleri Muhendisligi, Rize, Turkey
[2] Yildiz Tekn Univ, Bilgisayar Muhendisligi, Istanbul, Turkey
[3] Univ Arkansas, Dept Comp Sci, Fayetteville, AR 72701 USA
关键词
Topic Modelling; Latent Dirichlet Allocation; Natural Language Processing; New Analysis; Machine Learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of the Internet, thousands of different news reports from different channels are presented to us. So much news, particularly in the media sector, is an important question to be categorized and archived without human effort. In this study, it is aimed to be able to determine which news item belongs to large news headlines collected from news sites. For this, a two stage method is proposed, which is based on the classical Latent Dirichlet Allocation (LDA) algorithm used in the model. With the developed two stage LDA method, comparison of the conventional LDA was made. Then, by creating a file with an arff extension from the word weights of the topics, the success of the machine learning methods in Weka was measured.
引用
收藏
页码:99 / 103
页数:5
相关论文
共 50 条
  • [21] A text classification model constructed by Latent Dirichlet Allocation and Deep Learning
    Liu, Yu
    Jin, Zhengping
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MECHATRONICS, MATERIALS, CHEMISTRY AND COMPUTER ENGINEERING 2015 (ICMMCCE 2015), 2015, 39 : 2501 - 2504
  • [22] Max-Margin Latent Dirichlet Allocation for Image Classification and Annotation
    Wang, Yang
    Mori, Greg
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
  • [23] Sequential latent Dirichlet allocation
    Lan Du
    Wray Buntine
    Huidong Jin
    Changyou Chen
    Knowledge and Information Systems, 2012, 31 : 475 - 503
  • [24] Part of Speech Features for Sentiment Classification based on Latent Dirichlet Allocation
    Usop, Eka Surya
    Isnanto, R. Rizal
    Kusumaningrum, Retno
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, COMPUTER, AND ELECTRICAL ENGINEERING (ICITACEE), 2017, : 31 - 34
  • [25] Semi-Supervised Latent Dirichlet Allocation and its Application for Document Classification
    Wang, Di
    Thint, Marcus
    Al-Rubaie, Ahmad
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY WORKSHOPS (WI-IAT WORKSHOPS 2012), VOL 3, 2012, : 306 - 310
  • [26] Social Event Classification via Boosted Multimodal Supervised Latent Dirichlet Allocation
    Qian, Shengsheng
    Zhang, Tianzhu
    Xu, Changsheng
    Hossain, M. Shamim
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2014, 11 (02)
  • [27] Website Classification Using Latent Dirichlet Allocation and its Application for Internet Advertising
    Katsumata, Sotaro
    Motohashi, Eiji
    Nishimoto, Akihiro
    Toyosawa, Eiji
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 538 - 544
  • [28] Multimodal Semantics-Based Supervised Latent Dirichlet Allocation for Event Classification
    Miao, Naiyang
    Xue, Feng
    Hong, Richang
    IEEE MULTIMEDIA, 2021, 28 (04) : 8 - 17
  • [29] Local–class–shared–topic latent Dirichlet allocation based scene classification
    Chao Huang
    Wang Luo
    Yurui Xie
    Multimedia Tools and Applications, 2017, 76 : 15661 - 15679
  • [30] Class-Specific Simplex-Latent Dirichlet Allocation for Image Classification
    Dixit, Mandar
    Rasiwasia, Nikhil
    Vasconcelos, Nuno
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2672 - 2679