Enhanced Sentiment Analysis and Topic Modeling During the Pandemic Using Automated Latent Dirichlet Allocation

Cited by: 4
Authors
Batool, Amreen [1]
Byun, Yung-Cheol [2]
Affiliations
[1] Jeju Natl Univ, Inst Informat Sci Technol, Dept Elect Engn, Jeju 63243, South Korea
[2] Jeju Natl Univ, Inst Informat Sci Technol, Dept Comp Engn, Major Elect Engn, Jeju 63243, South Korea
Source
IEEE ACCESS | 2024 / Vol. 12
Funding
National Research Foundation of Singapore;
Keywords
Pandemics; COVID-19; Sentiment analysis; Social networking (online); Natural language processing; Data models; Computer viruses; Machine learning; Feature extraction; Topic modeling; LDA; sentiment analysis; machine learning; deep learning; feature extraction; RESPIRATORY SYNDROME CORONAVIRUS; COVID-19; DEEP; CLASSIFICATION; DISCOVERY; QUALITY; TWEETS;
DOI
10.1109/ACCESS.2024.3411717
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
The COVID-19 pandemic has profoundly impacted human societies, resulting in the loss of millions of lives and slowing economic growth worldwide. This devastating pandemic underscores the gravity of viral threats and has led to multifaceted consequences, including loss of livelihoods, dynamic labor force migration, and significant ramifications for mental health. Furthermore, scientific institutions and companies are attempting to accelerate research and innovation by analyzing large data corpora to fight the pandemic. In this research study, an advanced approach based on automated Latent Dirichlet Allocation (LDA) is proposed for handling a large data corpus and efficiently visualizing sentiment analysis results and discovered topics. This approach interrogates a substantial pandemic corpus, examining public sentiment and discerning evolving trends pertinent to the pandemic. A 10-topic LDA model was implemented, revealing Topic 8 as the most prevalent, with a frequency peak of 22.29, higher than that of the other topics. We employ text-mining techniques such as WordCloud and Word2Vec to offer insights into specific terms relevant to the pandemic, such as "Origin," "Symptom," "Diagnostic," and "Transmission." Applying the t-SNE method enriches the analysis by visually revealing semantic clusters within the corpus. The subsequent phase models strategic topics within the corpus through an unsupervised LDA-based approach, using the proposed framework. By analyzing a large data corpus quickly and automatically and visualizing the discovered topics, this perspective contributes to a deeper understanding of the underlying dynamics and aims to aid front-line workers, healthcare practitioners, and community support in fighting the pandemic.
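To make the LDA step in the abstract concrete, the following is a minimal sketch of topic modeling with a collapsed Gibbs sampler, written in plain Python. It is not the authors' automated pipeline: the sampler, the hyperparameters `alpha` and `beta`, the toy corpus, and the helper names `fit_lda`, `topic_prevalence`, and `top_words` are all assumptions made for illustration. The abstract does not say how topic prevalence is measured; here it is assumed to be the percentage of documents whose dominant topic is a given topic.

```python
import random
from collections import Counter


def fit_lda(docs, num_topics=10, alpha=0.1, beta=0.01, iterations=200, seed=0):
    """Fit LDA by collapsed Gibbs sampling. `docs` is a list of token lists."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]                # tokens of doc d in topic k
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # count of word w in topic k
    topic_total = [0] * num_topics                              # tokens assigned to topic k
    assignments = []

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_d)

    topics = range(num_topics)
    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wid = word_id[w]
                k = assignments[d][i]
                # Remove the current token from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][wid] -= 1
                topic_total[k] -= 1
                # Conditional probability of each topic for this token (unnormalized).
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][wid] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in topics
                ]
                k = rng.choices(topics, weights=weights)[0]
                # Add the token back under its newly sampled topic.
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][wid] += 1
                topic_total[k] += 1
    return doc_topic, topic_word, vocab


def topic_prevalence(doc_topic):
    """Percentage of documents whose dominant topic is each topic (assumed metric)."""
    num_topics = len(doc_topic[0])
    dominant = Counter(row.index(max(row)) for row in doc_topic)
    return [100.0 * dominant[k] / len(doc_topic) for k in range(num_topics)]


def top_words(topic_word, vocab, k, n=3):
    """The n most frequent words assigned to topic k."""
    ranked = sorted(range(len(vocab)), key=lambda w: topic_word[k][w], reverse=True)
    return [vocab[w] for w in ranked[:n]]


# Toy corpus (invented for illustration; not the corpus used in the paper).
docs = [
    "virus origin bat market origin".split(),
    "fever cough symptom fatigue symptom".split(),
    "test diagnostic pcr antigen diagnostic".split(),
    "airborne transmission droplet contact transmission".split(),
    "symptom fever diagnostic test".split(),
    "origin virus transmission contact".split(),
]
doc_topic, topic_word, vocab = fit_lda(docs, num_topics=3, iterations=50)
prevalence = topic_prevalence(doc_topic)
for k, share in enumerate(prevalence):
    print(f"Topic {k}: {share:.2f}% of documents, top words: {top_words(topic_word, vocab, k)}")
```

The default `num_topics=10` mirrors the 10-topic model named in the abstract; the demo uses 3 topics only because the toy corpus is tiny. On a real corpus, a library implementation would normally replace this hand-written sampler.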
Pages: 81206 - 81220
Number of pages: 15
Related papers
50 in total
  • [21] The application of network agenda setting model during the COVID-19 pandemic based on latent dirichlet allocation topic modeling
    Liu, Kai
    Geng, Xiaoyu
    Liu, Xiaoyan
    FRONTIERS IN PSYCHOLOGY, 2022, 13
  • [22] Topic Modeling of Online Accommodation Reviews via Latent Dirichlet Allocation
    Sutherland, Ian
    Sim, Youngseok
    Lee, Seul Ki
    Byun, Jaemun
    Kiatkawsin, Kiattipoom
    SUSTAINABILITY, 2020, 12 (05) : 1 - 15
  • [23] An exploration of research trends on metaverse: topic modeling with latent dirichlet allocation
    Park H.
    Ahn B.
    Kim T.
    Quality & Quantity, 2025, 59 (1) : 233 - 252
  • [24] Latent Dirichlet allocation (LDA) for topic modeling of the CFPB consumer complaints
    Bastani, Kaveh
    Namavari, Hamed
    Shaffer, Jeffrey
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 127 : 256 - 271
  • [25] Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey
    Jelodar, Hamed
    Wang, Yongli
    Yuan, Chi
    Feng, Xia
    Jiang, Xiahui
    Li, Yanchao
    Zhao, Liang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (11) : 15169 - 15211
  • [26] Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey
    Hamed Jelodar
    Yongli Wang
    Chi Yuan
    Xia Feng
    Xiahui Jiang
    Yanchao Li
    Liang Zhao
    Multimedia Tools and Applications, 2019, 78 : 15169 - 15211
  • [27] Semantic similarity measure for topic modeling using latent Dirichlet allocation and collapsed Gibbs sampling
    Micheal Olalekan Ajinaja
    Adebayo Olusola Adetunmbi
    Chukwuemeka Christian Ugwu
    Olugbemiga Solomon Popoola
    Iran Journal of Computer Science, 2023, 6 (1) : 81 - 94
  • [28] Topic analysis of online reviews for two competitive products using latent Dirichlet allocation
    Wang, Wenxin
    Feng, Yi
    Dai, Wenqiang
    ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2018, 29 : 142 - 156
  • [29] Topic and Trend Detection in Text Collections Using Latent Dirichlet Allocation
    Bolelli, Levent
    Ertekin, Seyda
    Giles, C. Lee
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 776 - +
  • [30] Automated Topic Modeling and Sentiment Analysis of Tweets on SparkR
    Monish, Prema
    Kumari, Santoshi
    Babu, Narendra C.
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,