Evaluation of the Optimal Topic Classification for Social Media Data Combined with Text Semantics: A Case Study of Public Opinion Analysis Related to COVID-19 with Microblogs

Cited by: 6
Authors
Liang, Qin [1]
Hu, Chunchun [1]
Chen, Si [2]
Affiliations
[1] Wuhan Univ, Sch Geodesy & Geomat, Wuhan 430070, Peoples R China
[2] Wuhan Nat Resources & Planning Informat Ctr, Wuhan 430070, Peoples R China
Keywords
LDA; topic model; BERT; topic classification; public opinion analysis
DOI
10.3390/ijgi10120811
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Online public opinion reflects social conditions and public attitudes regarding special social events. Analyzing the temporal and spatial distributions of online public opinion topics can therefore help in understanding issues of public concern and in grasping and guiding trends in public opinion. However, evaluating the validity of a classification of online public opinion remains a challenging task in the topic mining field. By combining a pre-trained Bidirectional Encoder Representations from Transformers (BERT) model with the Latent Dirichlet Allocation (LDA) topic model, we propose an evaluation method to determine the optimal number of topics from the perspective of semantic similarity. The effectiveness of the proposed method was verified on the standard Chinese corpus THUCNews. Taking Coronavirus Disease 2019 (COVID-19)-related geotagged posts on Weibo in Wuhan city as an example, we used the proposed method to generate five categories of public opinion topics. Combining spatial and temporal information with the classification results, we analyzed the spatial and temporal distribution patterns of the five optimal public opinion topics, which were found to be consistent with the development of the epidemic, demonstrating the feasibility of our method when applied to practical cases.
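The idea of choosing the number of topics by semantic similarity can be illustrated with a minimal sketch. The abstract does not state the exact criterion, so everything below is an assumption made for illustration: each topic is taken to be already represented by one embedding vector (for example, an average of BERT embeddings of its top LDA words), and the candidate topic count whose topics are least similar to each other on average is preferred. The function names and the toy vectors are hypothetical and are not taken from the paper.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def mean_inter_topic_similarity(topic_vectors):
    """Average cosine similarity over all distinct pairs of topic vectors."""
    pairs = list(combinations(topic_vectors, 2))
    if not pairs:
        return 0.0
    return sum(cosine(u, v) for u, v in pairs) / len(pairs)


def pick_topic_count(candidates):
    """Return (best_k, scores) for a dict mapping k -> list of topic vectors.

    Assumed criterion (not confirmed by the source): the best k is the one
    with the lowest mean inter-topic similarity, i.e. the most distinct topics.
    """
    scores = {k: mean_inter_topic_similarity(vs) for k, vs in candidates.items()}
    best_k = min(scores, key=scores.get)
    return best_k, scores


# Toy, hand-made vectors standing in for real topic embeddings.
toy_candidates = {
    2: [[1.0, 0.0], [0.9, 0.1]],  # two nearly identical topics
    3: [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]],  # distinct topics
}
best_k, scores = pick_topic_count(toy_candidates)
```

In a real pipeline the vectors would come from an LDA model fitted for each candidate topic count plus a BERT encoder; both steps are left out here to keep the sketch self-contained.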
Pages: 21
Related papers
50 in total
  • [41] Toward human-centric urban infrastructure: Text mining for social media data to identify the public perception of COVID-19 policy in transportation hubs
    Park, June Young
    Mistur, Evan
    Kim, Donghwan
    Mo, Yunjeong
    Hoefer, Richard
    [J]. SUSTAINABLE CITIES AND SOCIETY, 2022, 76
  • [42] Association Between Public Opinion and Malaysian Government Communication Strategies About the COVID-19 Crisis: Content Analysis of Image Repair Strategies in Social Media
    Masngut, Nasaai
    Mohamad, Emma
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (08)
  • [43] New media platform's understanding of Chinese social workers' anti-epidemic actions: an analysis of network public opinion based on COVID-19
    Lin, Lin
    Jiang, Anqi
    Zheng, Yi
    Wang, Jingying
    Wang, Mengran
    [J]. SOCIAL WORK IN PUBLIC HEALTH, 2021, 36 (7-8) : 770 - 785
  • [44] Modified valence aware dictionary for sentiment reasoning classifier for detection and classification of Covid-19 related rumors from social media data streams
    Arora, Shruti
    Rani, Rinkle
    Saxena, Nitin
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (21)
  • [45] Utilising crowdsourcing and text mining to enhance information extraction from social media: A case study in handling COVID-19 supply requests in Thailand
    Rattanatamrong, Prapaporn
    Boonpalit, Yutthana
    Boonnavasin, Manassanan
    [J]. JOURNAL OF INFORMATION SCIENCE, 2024
  • [46] How do Canadian public health agencies respond to the COVID-19 emergency using social media: a protocol for a case study using content and sentiment analysis
    Kothari, Anita
    Foisey, Lyndsay
    Donelle, Lorie
    Bauer, Michael
    [J]. BMJ OPEN, 2021, 11 (04)
  • [47] Analysis of COVID-19 Impact on Public Transport Usage based on Smart Card Data - A Regional Case Study in Australia
    Qu, Tianyang
    Du, Bo
    Zhang, Cheng
    Wang, Qi
    Hu, Hao
    Perez, Pascal
    [J]. 2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022 : 3849 - 3854
  • [48] Public Engagement and Government Responsiveness in the Communications About COVID-19 During the Early Epidemic Stage in China: Infodemiology Study on Social Media Data
    Liao, Qiuyan
    Yuan, Jiehu
    Dong, Meihong
    Yang, Lin
    Fielding, Richard
    Lam, Wendy Wing Tak
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (05)
  • [49] Public Sentiment Analysis and Topic Modeling Regarding COVID-19's Three Waves of Total Lockdown: A Case Study on Movement Control Order in Malaysia
    Alamoodi, A. H.
    Baker, Mohammed Rashad
    Albahri, O. S.
    Zaidan, B. B.
    Zaidan, A. A.
    Wong, Wing-Kwong
    Garfan, Salem
    Albahri, A. S.
    Alonso, Miguel A.
    Jasim, Ali Najm
    Baqer, M. J.
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (07) : 2169 - 2190
  • [50] Sentiment Analysis of the Covid-19 Virus Infection in Indonesian Public Transportation on Twitter Data: A Case Study of Commuter Line Passengers
    Sari, Intania Cahya
    Ruldeviyani, Yova
    [J]. 2020 5TH INTERNATIONAL WORKSHOP ON BIG DATA AND INFORMATION SECURITY (IWBIS 2020), 2020 : 25 - 30