Bangla News Trend Observation using LDA Based Topic Modeling

被引:1
|
作者
Alam, Kazi Masudul [1 ]
Hemel, Md Tanvir Hussain [1 ]
Islam, S. M. Muhaiminul [1 ]
Akther, Avsha [1 ]
机构
[1] Khulna Univ, Comp Sci & Engn, DGTED Lab, Khulna, Bangladesh
关键词
Bangla News; Bangla Corpus; N-Grams; Topic Modelling; LDA;
D O I
10.1109/ICCIT51783.2020.9392719
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Topic Modelling is an essential field of natural language processing (NLP) that can be considered as a type of statistical model for extracting the abstract topics that have occurred in a collection of documents. Bangla is among the most popular and used languages around the world and nowadays innumerable Bangla texts are generated through digital and social media. So the significance of extracting knowledge from these data is invaluable for various sectors. However, the number of works in this field is inadequate because of the lack of proper datasets, tools, and applications. Therefore, preparing a convenient dataset in Bangla can be a great help for topic modeling as well as for other NLP related research. In this paper, we have addressed some of those complications by creating a proper dataset. Also, we have demonstrated a method of observing the Bangla media trend by applying Latent Dirichlet Allocation (LDA) on newspaper articles. The result of our experiment suggests that the proposed method can be an admissible way of utilizing news media data to observe media trends overtime properly.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Technology Topic Identification and Trend Prediction of New Energy Vehicle Using LDA Modeling
    Hu, Renjie
    Ma, Wencong
    Lin, Weiqiang
    Chen, Xiude
    Zhong, Zuchang
    Zeng, Chuhong
    [J]. COMPLEXITY, 2022, 2022
  • [2] Technology Topic Identification and Trend Prediction of New Energy Vehicle Using LDA Modeling
    Hu, Renjie
    Ma, Wencong
    Lin, Weiqiang
    Chen, Xiude
    Zhong, Zuchang
    Zeng, Chuhong
    [J]. COMPLEXITY, 2022, 2022
  • [3] LDA Based Topic Modeling of Journal Abstracts
    Anupriya, P.
    Karpagavalli, S.
    [J]. ICACCS 2015 PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS, 2015,
  • [4] Sparse Representation Based Query Classification Using LDA Topic Modeling
    Bhattacharya, Indrani
    Sil, Jaya
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2, 2017, 469 : 621 - 629
  • [5] News Hotspots Detection and Tracking Based on LDA Topic Model
    Hu, Xiao
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1, 2016, : 248 - 252
  • [6] Bilingual COVID-19 Fake News Detection Based on LDA Topic Modeling and BERT Transformer
    Omrani, Pouria
    Ebrahimian, Zahra
    Toosi, Ramin
    Akhaee, Mohammad Ali
    [J]. 2023 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS, IPRIA, 2023,
  • [7] LDA-Based Topic Modeling Sentiment Analysis Using Topic/Document/Sentence (TDS) Model
    Farkhod, Akhmedov
    Abdusalomov, Akmalbek
    Makhmudov, Fazliddin
    Cho, Young Im
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (23):
  • [8] An extractive text summarization approach using tagged-LDA based topic modeling
    Rani, Ruby
    Lobiyal, D. K.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (03) : 3275 - 3305
  • [9] An extractive text summarization approach using tagged-LDA based topic modeling
    Ruby Rani
    D. K. Lobiyal
    [J]. Multimedia Tools and Applications, 2021, 80 : 3275 - 3305
  • [10] Financial Topic Modeling Based on the BERT-LDA Embedding
    Zhou, Mei
    Kong, Ying
    Lin, Jianwu
    [J]. 2022 IEEE 20TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2022, : 495 - 500