Revealing Public Opinion towards the COVID-19 Vaccine with Weibo Data in China: BertFDA-Based Model

被引:11
|
作者
Zhu, Jianping [1 ,2 ,3 ]
Weng, Futian [1 ,2 ,4 ]
Zhuang, Muni [1 ,2 ,4 ]
Lu, Xin [5 ]
Tan, Xu [6 ]
Lin, Songjie [6 ]
Zhang, Ruoyi [7 ]
机构
[1] Xiamen Univ, Natl Inst Data Sci Hlth & Med, Xiamen 361005, Peoples R China
[2] Xiamen Univ, Data Min Res Ctr, Xiamen 361005, Peoples R China
[3] Xiamen Univ, Sch Management, Xiamen 361005, Peoples R China
[4] Xiamen Univ, Sch Med, Xiamen 361005, Peoples R China
[5] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China
[6] Shenzhen Institiute Informat Technol, Career Oriented Multidisciplinary Educ Ctr, Shenzhen 518172, Peoples R China
[7] George Washington Univ, Columbia Coll Art & Sci, Washington, DC 20052 USA
关键词
COVID-19; vaccine; sentiment analysis; topic model; Bert; functional data analysis; SENTIMENT ANALYSIS; SOCIAL MEDIA;
D O I
10.3390/ijerph192013248
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The COVID-19 pandemic has created unprecedented burdens on people's health and subjective well-being. While countries around the world have established models to track and predict the affective states of COVID-19, identifying the topics of public discussion and sentiment evolution of the vaccine, particularly the differences in topics of concern between vaccine-support and vaccine-hesitant groups, remains scarce. Using social media data from the two years following the outbreak of COVID-19 (23 January 2020 to 23 January 2022), coupled with state-of-the-art natural language processing (NLP) techniques, we developed a public opinion analysis framework (BertFDA). First, using dynamic topic clustering on Weibo through the latent Dirichlet allocation (LDA) model, a total of 118 topics were generated in 24 months using 2,211,806 microblog posts. Second, by building an improved Bert pre-training model for sentiment classification, we provide evidence that public negative sentiment continued to decline in the early stages of COVID-19 vaccination. Third, by modeling and analyzing the microblog posts from the vaccine-support group and the vaccine-hesitant group, we discover that the vaccine-support group was more concerned about vaccine effectiveness and the reporting of news, reflecting greater group cohesion, whereas the vaccine-hesitant group was particularly concerned about the spread of coronavirus variants and vaccine side effects. Finally, we deployed different machine learning models to predict public opinion. Moreover, functional data analysis (FDA) is developed to build the functional sentiment curve, which can effectively capture the dynamic changes with the explicit function. This study can aid governments in developing effective interventions and education campaigns to boost vaccination rates.
引用
收藏
页数:26
相关论文
共 50 条
  • [41] Early Warning Scheme of COVID-19 related Internet Public Opinion based on RVM-L Model
    Zhu, Rongbo
    Ding, Qianao
    Yu, Mai
    Wang, Jun
    Ma, Maode
    [J]. SUSTAINABLE CITIES AND SOCIETY, 2021, 74
  • [42] How To Convince People To Get A COVID-19 Vaccine: An Opinion Dynamics Model for REUNS 2022
    Payton, Micah
    Dragovic, Natasa
    [J]. 2022 IEEE 19TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2022), 2022, : 776 - 780
  • [43] The public information needs of COVID-19 vaccine: A study based on online Q&A communities and portals in China
    Wang, Lin
    Xian, Zuquan
    Du, Tianyu
    [J]. FRONTIERS IN PSYCHOLOGY, 2022, 13
  • [44] The Effect of the COVID-19 Vaccine on Daily Cases and Deaths Based on Global Vaccine Data
    Li, Zhiwei
    Liu, Xiangtong
    Liu, Mengyang
    Wu, Zhiyuan
    Liu, Yue
    Li, Weiming
    Liu, Mengmeng
    Wang, Xiaonan
    Gao, Bo
    Luo, Yanxia
    Li, Xia
    Tao, Lixin
    Wang, Wei
    Guo, Xiuhua
    [J]. VACCINES, 2021, 9 (11)
  • [45] COVID-19 Public Opinion: A Twitter Healthcare Data Processing Using Machine Learning Methodologies
    Agrawal, Shweta
    Jain, Sanjiv Kumar
    Sharma, Shruti
    Khatri, Ajay
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2023, 20 (01)
  • [46] Analysis of factors influencing public attention to masks during the COVID-19 epidemic-Data from Sina Weibo
    Hong, Wei
    Lu, Xinhang
    Wu, Linhai
    Pu, Xujin
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (07) : 6469 - 6488
  • [47] Data quality model for assessing public COVID-19 big datasets
    Alladoumbaye Ngueilbaye
    Joshua Zhexue Huang
    Mehak Khan
    Hongzhi Wang
    [J]. The Journal of Supercomputing, 2023, 79 : 19574 - 19606
  • [48] Data quality model for assessing public COVID-19 big datasets
    Ngueilbaye, Alladoumbaye
    Huang, Joshua Zhexue
    Khan, Mehak
    Wang, Hongzhi
    [J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (17): : 19574 - 19606
  • [49] Epidemiological Analysis of Coronavirus Disease 2019 (COVID-19) in 2 Cities in China Based on Public Data
    Zhou, Ning
    Zhang, Xiaomeng
    Zhang, Ying
    Gao, Lu
    Zhou, Penghui
    Liu, Hui
    [J]. DISASTER MEDICINE AND PUBLIC HEALTH PREPAREDNESS, 2022, 16 (03) : 1156 - 1160
  • [50] Time-Divided SIQR Model for COVID-19 Analysis and Prediction in Consideration of Public Opinion
    Bao, Yu-Bin
    Liu, Ji-Ting
    Li, Xiao-Yu
    [J]. Dongbei Daxue Xuebao/Journal of Northeastern University, 2022, 43 (08): : 1065 - 1072