Enhanced Sentiment Analysis and Topic Modeling During the Pandemic Using Automated Latent Dirichlet Allocation

被引:4
|
作者
Batool, Amreen [1 ]
Byun, Yung-Cheol [2 ]
机构
[1] Jeju Natl Univ, Inst Informat Sci Technol, Dept Elect Engn, Jeju 63243, South Korea
[2] Jeju Natl Univ, Inst Informat Sci Technol, Dept Comp Engn, Major Elect Engn, Jeju 63243, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
基金
新加坡国家研究基金会;
关键词
Pandemics; COVID-19; Sentiment analysis; Social networking (online); Natural language processing; Data models; Computer viruses; Machine learning; Feature extraction; Topic modeling; LDA; sentiment analysis; machine learning; deep learning; feature extraction; RESPIRATORY SYNDROME CORONAVIRUS; COVID-19; DEEP; CLASSIFICATION; DISCOVERY; QUALITY; TWEETS;
D O I
10.1109/ACCESS.2024.3411717
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The COVID-19 pandemic has profoundly impacted human societies, resulting in the loss of millions of lives and slowing economic growth worldwide. This devastating pandemic underscores the gravity of viral threats and led to multifaceted consequences, including loss of livelihoods, dynamic labor force migration, and significant ramifications on mental health. Furthermore, different scientific institutions and companies are attempting to accelerate research and innovation by analyzing large data corpus for fighting against the pandemic. In this research study, an advanced approach based on automated Latent Dirichlet Allocation (LDA) is suggested dealing with a large data corpus for efficiently providing visualization of sentiment analysis and discovered topics. This innovative approach seeks to interrogate a substantial pandemic corpus, delving into the intricacies of public sentiment and discerning evolving trends pertinent to the pandemic. A sophisticated 10-topic LDA model was implemented, revealing Topic 8 as the most prevalent, with a frequency peak of 22.29, eclipsing other enumerated topics. We employ text-mining techniques like WordCloud and Word2Vec to offer insights into specific terms relevant to the pandemic, such as "Origin," "Symptom," "Diagnostic," and "Transmission." Applying the t-SNE method enriches the analysis by visually unraveling semantic clusters within the corpus. The subsequent phase involves modeling strategic topics within the corpus through an unsupervised LDA-based approach, leveraging our suggested framework. This novel perspective contributes to a deeper understanding of the underlying dynamics by analyzing a large data corpus quickly and automatically for providing visualization of discovered topics aiming to aid front-line workers, healthcare practitioners, and community support to fight against the pandemic.
引用
收藏
页码:81206 / 81220
页数:15
相关论文
共 50 条
  • [1] Public discourse and sentiment during the COVID 19 pandemic: Using Latent Dirichlet Allocation for topic modeling on Twitter
    Xue, Jia
    Chen, Junxiang
    Chen, Chen
    Zheng, Chengda
    Li, Sijia
    Zhu, Tingshao
    PLOS ONE, 2020, 15 (09):
  • [2] Sentiment Analysis Using Latent Dirichlet Allocation and Topic Polarity Wordcloud Visualization
    Bashri, Mohammad F. A.
    Kusumaningrum, Retno
    2017 5TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOIC7), 2017,
  • [3] Public perceptions of digital fashion: An analysis of sentiment and Latent Dirichlet Allocation topic modeling
    Zou, Yixin
    Luh, Ding-Bang
    Lu, Shizhu
    FRONTIERS IN PSYCHOLOGY, 2022, 13
  • [4] Topic Modeling Using Latent Dirichlet allocation: A Survey
    Chauhan, Uttam
    Shah, Apurva
    ACM COMPUTING SURVEYS, 2021, 54 (07)
  • [5] Topic Modeling Twitter Data Using Latent Dirichlet Allocation and Latent Semantic Analysis
    Qomariyah, Siti
    Iriawan, Nur
    Fithriasari, Kartika
    2ND INTERNATIONAL CONFERENCE ON SCIENCE, MATHEMATICS, ENVIRONMENT, AND EDUCATION, 2019, 2019, 2194
  • [6] Topic modeling for expert finding using latent Dirichlet allocation
    Momtazi, Saeedeh
    Naumann, Felix
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 3 (05) : 346 - 353
  • [7] Analysis of the impact of investor sentiment on stock price using the latent dirichlet allocation topic model
    Chen, Meilan
    Guo, Zhiying
    Abbass, Kashif
    Huang, Wenfeng
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2022, 10
  • [8] Topic Extraction and Sentiment Classification by using Latent Dirichlet Markov Allocation and SentiWordNet
    Kaur, Preet Chandan
    Ghorpade, Tushar
    Mane, Vanita
    INTERNATIONAL CONFERENCE ON ADVANCES IN INFORMATION COMMUNICATION TECHNOLOGY & COMPUTING, 2016, 2016,
  • [9] Road Traffic Topic Modeling on Twitter using Latent Dirichlet Allocation
    Hidayatullah, Ahmad Fathan
    Ma'arif, Muhammad Rifqi
    2017 INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET), 2017, : 47 - 52
  • [10] A FRAMEWORK OF URDU TOPIC MODELING USING LATENT DIRICHLET ALLOCATION (LDA)
    Shakeel, Khadija
    Tahir, Ghulam Rasool
    Tehseen, Irsha
    Ali, Mubashir
    2018 IEEE 8TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2018, : 117 - 123