Evaluation of the Optimal Topic Classification for Social Media Data Combined with Text Semantics: A Case Study of Public Opinion Analysis Related to COVID-19 with Microblogs

Cited by: 6
Authors
Liang, Qin [1]
Hu, Chunchun [1]
Chen, Si [2]
Affiliations
[1] Wuhan Univ, Sch Geodesy & Geomat, Wuhan 430070, Peoples R China
[2] Wuhan Nat Resources & Planning Informat Ctr, Wuhan 430070, Peoples R China
Keywords
LDA; topic model; BERT; topic classification; public opinion analysis
DOI
10.3390/ijgi10120811
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Online public opinion reflects social conditions and public attitudes regarding special social events. Therefore, analyzing the temporal and spatial distributions of online public opinion topics can contribute to understanding issues of public concern, grasping and guiding the developing trend of public opinion. However, how to evaluate the validity of classification of online public opinion remains a challenging task in the topic mining field. By combining a Bidirectional Encoder Representations from Transformers (BERT) pre-training model with the Latent Dirichlet Allocation (LDA) topic model, we propose an evaluation method to determine the optimal classification number of topics from the perspective of semantic similarity. The effectiveness of the proposed method was verified based on the standard Chinese corpus THUCNews. Taking Coronavirus Disease 2019 (COVID-19)-related geotagged posts on Weibo in Wuhan city as an example, we used the proposed method to generate five categories of public opinion topics. Combining spatial and temporal information with the classification results, we analyze the spatial and temporal distribution patterns of the five optimal public opinion topics, which are found to be consistent with the epidemic development, demonstrating the feasibility of our method when applied to practical cases.
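The abstract does not spell out how semantic similarity is turned into a choice of topic number, so the following is only a minimal sketch under stated assumptions, not the authors' procedure. It assumes that each candidate topic number K has already produced one embedding vector per topic (for example, by encoding each LDA topic's top words with BERT), and that a good K is one whose topics are least similar to each other on average. The function names, the selection rule, and the toy vectors are all hypothetical.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mean_inter_topic_similarity(topic_vectors):
    """Average cosine similarity over all pairs of topic vectors."""
    pairs = list(combinations(topic_vectors, 2))
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)


def pick_topic_number(candidates):
    """Return the K whose topics overlap least, plus the score for every K.

    `candidates` maps a candidate topic number K to a list of K topic
    embedding vectors. Assumption: lower mean similarity means better
    separated topics; the paper's actual criterion may differ.
    """
    scores = {k: mean_inter_topic_similarity(vecs)
              for k, vecs in candidates.items()}
    best_k = min(scores, key=scores.get)
    return best_k, scores


# Toy stand-ins for BERT-derived topic embeddings (hypothetical values).
toy_candidates = {
    2: [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]],
    3: [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.9, 0.1, 0.0]],
}
best_k, scores = pick_topic_number(toy_candidates)
print(best_k, scores)
```

In the toy data the third topic at K = 3 nearly duplicates the first, which raises the mean similarity, so the sketch prefers K = 2. In practice the vectors would come from a real LDA model and a real sentence encoder, and the candidate range of K would be wider.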
Pages: 21
Related papers
50 in total
  • [31] Using Social Media in Tourist Sentiment Analysis: A Case Study of Andalusia during the Covid-19 Pandemic
    Flores-Ruiz, David
    Elizondo-Salto, Adolfo
    de la O Barroso-Gonzalez, Maria
    [J]. SUSTAINABILITY, 2021, 13 (07)
  • [32] Spatial-Temporal Pattern Evolution of Public Sentiment Responses to the COVID-19 Pandemic in Small Cities of China: A Case Study Based on Social Media Data Analysis
    Zhou, Yuye
    Xu, Jiangang
    Yin, Maosen
    Zeng, Jun
    Ming, Haolin
    Wang, Yiwen
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (18)
  • [33] Revealing public attitudes toward mobile cabin hospitals during Covid-19 pandemic: Sentiment and topic analyses using social media data in China
    Zhou, Shenghua
    Wang, Hongyu
    Li, Dezhi
    Ng, S. Thomas
    Wei, Ran
    Zhao, Yongheng
    Zhou, Yubo
    [J]. SUSTAINABLE CITIES AND SOCIETY, 2024, 107
  • [34] COVID-19 outbreak and integration of social media in public health crisis communication: a case study of UMMC, Kuala Lumpur
    Ibrahim, Mohamed Nabeeh
    Sarmiti, Nor Zaliza
    Syed, Md Azalanshah Md
    [J]. JOURNAL OF HOSPITAL MANAGEMENT AND HEALTH POLICY, 2024, 8
  • [35] Social Media Usage in Health Communication and Its Implications on Public Health Security: A Case Study of COVID-19 in Zanzibar
    Khamis, Rashid Maalim
    Geng, Yiqun
    [J]. ONLINE JOURNAL OF COMMUNICATION AND MEDIA TECHNOLOGIES, 2021, 11 (01)
  • [36] Evaluation of a Social Media Campaign in Saskatchewan to Promote Healthy Eating During the COVID-19 Pandemic: Social Media Analysis and Qualitative Interview Study
    Grantham, Jordyn L.
    Verishagen, Carrie L.
    Whiting, Susan J.
    Henry, Carol J.
    Lieffers, Jessica R. L.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (07)
  • [37] Public Opinion Analysis of the Transportation Policy Using Social Media Data: A Case Study on the Delhi Odd-Even Policy
    Chakraborty, Pranamesh
    Sharma, Anuj
    [J]. TRANSPORTATION IN DEVELOPING ECONOMIES, 2019, 5 (01)
  • [38] Will the Relaxation of COVID-19 Control Measures Have an Impact on the Chinese Internet-Using Public? Social Media-Based Topic and Sentiment Analysis
    Xin, Yu
    Tan, Xiaoshuang
    Ren, Xiaohui
    [J]. INTERNATIONAL JOURNAL OF PUBLIC HEALTH, 2023, 68
  • [39] Toward human-centric urban infrastructure: Text mining for social media data to identify the public perception of COVID-19 policy in transportation hubs
    Park, June Young
    Mistur, Evan
    Kim, Donghwan
    Mo, Yunjeong
    Hoefer, Richard
    [J]. SUSTAINABLE CITIES AND SOCIETY, 2022, 76
  • [40] A Transformer-Based Model for Evaluation of Information Relevance in Online Social-Media: A Case Study of Covid-19 Media Posts
    Sharma, Utkarsh
    Pandey, Prateek
    Kumar, Shishir
    [J]. NEW GENERATION COMPUTING, 2022, 40 (04) : 1029 - 1052