A Semi-Supervised Approach to Sentiment Analysis of Tweets during the 2022 Philippine Presidential Election

被引:10
|
作者
Macrohon, Julio Jerison E. [1 ]
Villavicencio, Charlyn Nayve [1 ,2 ]
Inbaraj, X. Alphonse [1 ]
Jeng, Jyh-Horng [1 ]
机构
[1] I Shou Univ, Dept Informat Engn, Kaohsiung 84001, Taiwan
[2] Bulacan State Univ, Coll Informat & Commun Technol, Bulacan 3000, Philippines
关键词
2022 Philippine Presidential Election; semi-supervised learning; Natural Language Processing; sentiment analysis; !text type='Python']Python[!/text; social media; Twitter; tweets;
D O I
10.3390/info13100484
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the increasing popularity of Twitter as both a social media platform and a data source for companies, decision makers, advertisers, and even researchers alike, data have been so massive that manual labeling is no longer feasible. This research uses a semi-supervised approach to sentiment analysis of both English and Tagalog tweets using a base classifier. In this study involving the Philippines, where social media played a central role in the campaign of both candidates, the tweets during the widely contested race between the son of the Philippines' former President and Dictator, and the outgoing Vice President of the Philippines were used. Using Natural Language Processing techniques, these tweets were annotated, processed, and trained to classify both English and Tagalog tweets into three polarities: positive, neutral, and negative. Through the Self-Training with Multinomial Naive Bayes as base classifier with 30% unlabeled data, the results yielded an accuracy of 84.83%, which outweighs other studies using Twitter data from the Philippines.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] SSLMM: Semi-Supervised Learning with Missing Modalities for Multimodal Sentiment Analysis
    Wang, Yiyu
    Jian, Haifang
    Zhuang, Jian
    Guo, Huimin
    Leng, Yan
    INFORMATION FUSION, 2025, 120
  • [32] Mining User Intents in Twitter: A Semi-Supervised Approach to Inferring Intent Categories for Tweets
    Wang, Jinpeng
    Cong, Gao
    Zhao, Wayne Xin
    Li, Xiaoming
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 339 - 345
  • [33] Sentiment analysis using semi-supervised learning with few labeled data
    Pan, Yuhao
    Chen, Zhiqun
    Suzuki, Yoshimi
    Fukumoto, Fumiyo
    Nishizaki, Hiromitsu
    2020 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW 2020), 2020, : 231 - 234
  • [34] Semi-supervised topic representation through sentiment analysis and semantic networks
    Ortu, Marco
    Romano, Maurizio
    Carta, Andrea
    BIG DATA RESEARCH, 2024, 37
  • [35] Set-Similarity Joins Based Semi-supervised Sentiment Analysis
    Dong, Xishuang
    Zou, Qibo
    Guan, Yi
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT I, 2012, 7663 : 176 - 183
  • [36] A semi-supervised approach in detecting sentiment and emotion based on digital payment reviews
    Vimala Balakrishnan
    Pik Yin Lok
    Hajar Abdul Rahim
    The Journal of Supercomputing, 2021, 77 : 3795 - 3810
  • [37] A semi-supervised approach in detecting sentiment and emotion based on digital payment reviews
    Balakrishnan, Vimala
    Lok, Pik Yin
    Abdul Rahim, Hajar
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (04): : 3795 - 3810
  • [38] AraSenCorpus: A Semi-Supervised Approach for Sentiment Annotation of a Large Arabic Text Corpus
    Al-Laith, Ali
    Shahbaz, Muhammad
    Alaskar, Hind F.
    Rehmat, Asim
    APPLIED SCIENCES-BASEL, 2021, 11 (05):
  • [39] A Semi-supervised Approach for Sentiment Analysis of Arab(ic+izi) Messages: Application to the Algerian Dialect
    Guellil I.
    Adeel A.
    Azouaou F.
    Benali F.
    Hachani A.-E.
    Dashtipour K.
    Gogate M.
    Ieracitano C.
    Kashani R.
    Hussain A.
    SN Computer Science, 2021, 2 (2)
  • [40] Employing Personal/Impersonal Views in Supervised and Semi-supervised Sentiment Classification
    Li, Shoushan
    Huang, Chu-Ren
    Zhou, Guodong
    Lee, Sophia Yat Mei
    ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010, : 414 - 423