Enhanced Sentiment Analysis and Topic Modeling During the Pandemic Using Automated Latent Dirichlet Allocation

被引:4
|
作者
Batool, Amreen [1 ]
Byun, Yung-Cheol [2 ]
机构
[1] Jeju Natl Univ, Inst Informat Sci Technol, Dept Elect Engn, Jeju 63243, South Korea
[2] Jeju Natl Univ, Inst Informat Sci Technol, Dept Comp Engn, Major Elect Engn, Jeju 63243, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
基金
新加坡国家研究基金会;
关键词
Pandemics; COVID-19; Sentiment analysis; Social networking (online); Natural language processing; Data models; Computer viruses; Machine learning; Feature extraction; Topic modeling; LDA; sentiment analysis; machine learning; deep learning; feature extraction; RESPIRATORY SYNDROME CORONAVIRUS; COVID-19; DEEP; CLASSIFICATION; DISCOVERY; QUALITY; TWEETS;
D O I
10.1109/ACCESS.2024.3411717
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The COVID-19 pandemic has profoundly impacted human societies, resulting in the loss of millions of lives and slowing economic growth worldwide. This devastating pandemic underscores the gravity of viral threats and led to multifaceted consequences, including loss of livelihoods, dynamic labor force migration, and significant ramifications on mental health. Furthermore, different scientific institutions and companies are attempting to accelerate research and innovation by analyzing large data corpus for fighting against the pandemic. In this research study, an advanced approach based on automated Latent Dirichlet Allocation (LDA) is suggested dealing with a large data corpus for efficiently providing visualization of sentiment analysis and discovered topics. This innovative approach seeks to interrogate a substantial pandemic corpus, delving into the intricacies of public sentiment and discerning evolving trends pertinent to the pandemic. A sophisticated 10-topic LDA model was implemented, revealing Topic 8 as the most prevalent, with a frequency peak of 22.29, eclipsing other enumerated topics. We employ text-mining techniques like WordCloud and Word2Vec to offer insights into specific terms relevant to the pandemic, such as "Origin," "Symptom," "Diagnostic," and "Transmission." Applying the t-SNE method enriches the analysis by visually unraveling semantic clusters within the corpus. The subsequent phase involves modeling strategic topics within the corpus through an unsupervised LDA-based approach, leveraging our suggested framework. This novel perspective contributes to a deeper understanding of the underlying dynamics by analyzing a large data corpus quickly and automatically for providing visualization of discovered topics aiming to aid front-line workers, healthcare practitioners, and community support to fight against the pandemic.
引用
收藏
页码:81206 / 81220
页数:15
相关论文
共 50 条
  • [41] Using Latent Dirichlet Allocation to Incorporate Domain Knowledge For Topic Transition Detection
    Zhu, Xiaodan
    He, Xuming
    Munteanu, Cosmin
    Penn, Gerald
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2442 - 2445
  • [42] The microblog sentiment analysis based on latent dirichlet allocation and deep learning approaches
    Ma, Xiaowen
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2024, 24 (4-5) : 3113 - 3135
  • [43] Integrating Labeled Latent Dirichlet Allocation into Sentiment Analysis of Movie and General Domains
    Coughlin, Ryan
    Coetsier, Jean-Charles
    Jiamthapthaksin, Rachsuda
    2017 9TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST), 2017, : 18 - 22
  • [44] Sentiment Analysis in Social Networks: A Methodology Based on the Latent Dirichlet Allocation Approach
    Clarizia, Fabio
    Colace, Francesco
    Pascale, Francesco
    Lombardi, Marco
    Santaniello, Domenico
    PROCEEDINGS OF THE 11TH CONFERENCE OF THE EUROPEAN SOCIETY FOR FUZZY LOGIC AND TECHNOLOGY (EUSFLAT 2019), 2019, 1 : 241 - 248
  • [45] Indonesian's Song Lyrics Topic Modelling using Latent Dirichlet Allocation
    Laoh, Enrico
    Surjandari, Isti
    Febirautami, Limisgy Ramadhina
    2018 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2018), 2018, : 270 - 274
  • [46] Topic Analysis of the Research Domain in Knowledge Organization: A Latent Dirichlet Allocation Approach
    Joo, Soohyung
    Choi, Inkyung
    Choi, Namjoo
    KNOWLEDGE ORGANIZATION, 2018, 45 (02): : 170 - 183
  • [47] Comparison of n-stage Latent Dirichlet Allocation versus other topic modeling methods for emotion analysis
    Guven, Zekeriya Anil
    Diri, Banu
    Cakaloglu, Tolgahan
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2020, 35 (04): : 2135 - 2145
  • [48] Modeling multi-topic information diffusion in social networks using latent Dirichlet allocation and Hawkes processes
    Pinto, Julio Cesar Louzada
    Chahed, Tijani
    10TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY AND INTERNET-BASED SYSTEMS SITIS 2014, 2014, : 339 - 346
  • [49] Sarcasmometer using Sentiment Analysis and Topic Modeling
    Bhan, Namrata
    D'silva, Mitchell
    2017 IEEE INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND CONTROL (ICAC3), 2017,
  • [50] Aspect Based Sentiment Analysis in E-Commerce User Reviews Using Latent Dirichlet Allocation (LDA) and Sentiment Lexicon
    Wahyudi, Eko
    Kusumaningrum, Retno
    2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019), 2019,