Enhanced Sentiment Analysis and Topic Modeling During the Pandemic Using Automated Latent Dirichlet Allocation

被引:4
|
作者
Batool, Amreen [1 ]
Byun, Yung-Cheol [2 ]
机构
[1] Jeju Natl Univ, Inst Informat Sci Technol, Dept Elect Engn, Jeju 63243, South Korea
[2] Jeju Natl Univ, Inst Informat Sci Technol, Dept Comp Engn, Major Elect Engn, Jeju 63243, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
基金
新加坡国家研究基金会;
关键词
Pandemics; COVID-19; Sentiment analysis; Social networking (online); Natural language processing; Data models; Computer viruses; Machine learning; Feature extraction; Topic modeling; LDA; sentiment analysis; machine learning; deep learning; feature extraction; RESPIRATORY SYNDROME CORONAVIRUS; COVID-19; DEEP; CLASSIFICATION; DISCOVERY; QUALITY; TWEETS;
D O I
10.1109/ACCESS.2024.3411717
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The COVID-19 pandemic has profoundly impacted human societies, resulting in the loss of millions of lives and slowing economic growth worldwide. This devastating pandemic underscores the gravity of viral threats and led to multifaceted consequences, including loss of livelihoods, dynamic labor force migration, and significant ramifications on mental health. Furthermore, different scientific institutions and companies are attempting to accelerate research and innovation by analyzing large data corpus for fighting against the pandemic. In this research study, an advanced approach based on automated Latent Dirichlet Allocation (LDA) is suggested dealing with a large data corpus for efficiently providing visualization of sentiment analysis and discovered topics. This innovative approach seeks to interrogate a substantial pandemic corpus, delving into the intricacies of public sentiment and discerning evolving trends pertinent to the pandemic. A sophisticated 10-topic LDA model was implemented, revealing Topic 8 as the most prevalent, with a frequency peak of 22.29, eclipsing other enumerated topics. We employ text-mining techniques like WordCloud and Word2Vec to offer insights into specific terms relevant to the pandemic, such as "Origin," "Symptom," "Diagnostic," and "Transmission." Applying the t-SNE method enriches the analysis by visually unraveling semantic clusters within the corpus. The subsequent phase involves modeling strategic topics within the corpus through an unsupervised LDA-based approach, leveraging our suggested framework. This novel perspective contributes to a deeper understanding of the underlying dynamics by analyzing a large data corpus quickly and automatically for providing visualization of discovered topics aiming to aid front-line workers, healthcare practitioners, and community support to fight against the pandemic.
引用
收藏
页码:81206 / 81220
页数:15
相关论文
共 50 条
  • [31] Attitudes Evaluation Toward COVID-19 Pandemic: An Application of Twitter Sentiment Analysis and Latent Dirichlet Allocation
    Shurrab, Saeed
    Shannak, Yazan
    Almshnanah, Abdulkarem
    Khazaleh, Huthaifa
    Najadat, Hassan
    2021 12TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2021, : 265 - 272
  • [32] Topic Modeling of Social Networking Service Data on Occupational Accidents in Korea: Latent Dirichlet Allocation Analysis
    Min, Kyoung-Bok
    Song, Sung-Hee
    Min, Jin-Young
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (08)
  • [33] Experimenting with Latent Semantic Analysis and Latent Dirichlet Allocation on Automated Essay Grading
    Hoblos, Jalaa
    2020 SEVENTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORK ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2020, : 153 - 159
  • [34] Analysis of Depression in News Articles Before and After the COVID-19 Pandemic Based on Unsupervised Learning and Latent Dirichlet Allocation Topic Modeling
    Been, Seonjae
    Byeon, Haewon
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 166 - 171
  • [35] A Hybrid Latent Dirichlet Allocation Approach for Topic Classification
    Hsu, Chi-I
    Chiu, Chaochang
    2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2017, : 312 - 315
  • [36] Semantic latent dirichlet allocation for automatic topic extraction
    Bhutada, Sunil
    Balaram, V. V. S. S. S.
    Bulusu, Vishnu Vardhan
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2016, 37 (03): : 449 - 469
  • [37] Topic Modeling of the Pakistani Economy in English Newspapers via Latent Dirichlet Allocation (LDA)
    Ahmed, Fasih
    Nawaz, Muhammad
    Jadoon, Aisha
    SAGE OPEN, 2022, 12 (01):
  • [38] Topic Model Allocation of Conversational Dialogue Records by Latent Dirichlet Allocation
    Yeh, Jui-Feng
    Lee, Chen-Hsien
    Tan, Yi-Shiuan
    Yu, Liang-Chih
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [39] DUET: Data-Driven Approach Based on Latent Dirichlet Allocation Topic Modeling
    Wang, Yan
    Taylor, John E.
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2019, 33 (03)
  • [40] Latent Dirichlet Allocation (LDA) for Sentiment Analysis Toward Tourism Review in Indonesia
    Putri, I. R.
    Kusumaningrum, R.
    1ST INTERNATIONAL CONFERENCE ON COMPUTING AND APPLIED INFORMATICS 2016 : APPLIED INFORMATICS TOWARD SMART ENVIRONMENT, PEOPLE, AND SOCIETY, 2017, 801