Sentiment Analysis and Comprehensive Evaluation of Supervised Machine Learning Models Using Twitter Data on Russia–Ukraine War

Cited by: 6
Authors
Wadhwani G.K. [1]
Varshney P.K. [1]
Gupta A. [1]
Kumar S. [2]
Affiliations
[1] Department of Computer Science, IITM, GGSIPU, New Delhi
[2] Department of Computer Science and Engineering, Shoolini University, Himachal Pradesh, Solan
Keywords
Feature engineering; Machine learning; Sentiment analysis; Supervised machine learning models; Text classification
DOI
10.1007/s42979-023-01790-5
Abstract
The Russia–Ukraine War refers to the ongoing hostilities between Russia and Ukraine. Russia started the conflict in February 2014, and it initially centred on whether Crimea and the Donbass were formally recognised as part of Ukraine. The conflict escalated dramatically when Russia began its invasion of Ukraine on February 24, 2022, following a military build-up on the Russian–Ukrainian border that started in late 2021. The goal of this article is to examine public perceptions of the crisis between Russia and Ukraine. Social media now plays a significant role in communication, and as a result, opinions can be found on platforms such as Facebook, Twitter, and Instagram. The study uses 11,250 tweets about the war between Russia and Ukraine drawn from Twitter. Machine learning has shown applicability, power, and potential in tasks including image processing, object identification, and natural language processing. The same applies to text analytics: machine learning techniques are frequently employed for text analysis, sentiment analysis, and entity annotation. Given the applicability and efficacy of machine learning models, a natural language processing toolkit in Python is used to examine the polarity and subjectivity scores of the tweets. Moreover, because machine learning models achieve a high degree of classification accuracy, they have been widely used to categorise emotions. We developed and tested models using three feature extraction techniques: TF-IDF (term frequency-inverse document frequency), BoW (bag of words), and N-gram. Each model was assessed using several key performance indicators: accuracy, precision, recall, and F1 score. Results show that the extra trees classifier (ETC) achieves the highest accuracy, 0.84, in combination with BoW features; accuracy here serves as a measure of the efficacy of a machine learning algorithm. Comparisons have also been made with logistic regression (LR), decision tree (DT), support vector machine (SVM), XGB, Gaussian naive Bayes (GNB), ADA, and K-nearest neighbours (KNN). © 2023, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
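The three feature representations and the four metrics named in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' pipeline: the whitespace tokenizer, the unsmoothed idf formula ln(N / df), the binary labels, and every function name below are assumptions made for illustration, and no classifier is trained.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def ngrams(tokens, n):
    """Return the contiguous n-grams of a token list as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def bag_of_words(docs, n=1):
    """Count n-gram occurrences per document (plain BoW when n == 1)."""
    return [Counter(ngrams(tokenize(d), n)) for d in docs]


def tf_idf(docs):
    """Weight each unigram by tf * idf, using the textbook idf = ln(N / df).

    Libraries often apply smoothing, so their values may differ.
    """
    counts = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for c in counts for term in c)
    weighted = []
    for c in counts:
        total = sum(c.values())
        weighted.append(
            {t: (k / total) * math.log(n_docs / df[t]) for t, k in c.items()}
        )
    return weighted


def classification_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall, and F1 for a binary labelling."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    correct = sum(1 for t, p in pairs if t == p)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {
        "accuracy": correct / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


if __name__ == "__main__":
    # Toy sentences, invented for this sketch; not taken from the dataset.
    docs = ["peace talks resume", "war talks stall", "peace now"]
    print(bag_of_words(docs)[0])        # unigram counts for the first text
    print(bag_of_words(docs, n=2)[0])   # bigram counts for the first text
    print(tf_idf(docs)[0])              # TF-IDF weights for the first text
    print(classification_metrics([1, 0, 1, 1, 0], [1, 0, 0, 1, 1]))
```

In practice, the resulting vectors would be passed to a classifier such as the extra trees model mentioned above, and the metrics would be computed on held-out predictions.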