Sentiment analysis of tweets through Altmetrics: A machine learning approach

被引:0
|
作者
Hassan, Saeed-Ul [1 ]
Saleem, Aneela [1 ]
Soroya, Saira Hanif [2 ]
Safder, Iqra [1 ]
Iqbal, Sehrish [1 ]
Jamil, Saqib [3 ]
Bukhari, Faisal [4 ]
Aljohani, Naif Radi [5 ]
Nawaz, Raheel [6 ]
机构
[1] Informat Technol Univ, Lahore, Pakistan
[2] Univ Punjab, Dept Informat Management, Lahore 54590, Pakistan
[3] Univ Okara, Dept Management Sci, Okara, Pakistan
[4] Univ Punjab, Punjab Univ Coll Informat Technol PUCIT, Lahore, Pakistan
[5] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
[6] Manchester Metropolitan Univ, Sch Comp Sci, Manchester, Lancs, England
关键词
Altmetrics; comparative analysis; machine learning; sentiment analysis; Twitter; SOCIAL MEDIA; TWITTER; NETWORK;
D O I
10.1177/0165551520930917
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The purpose of the study is to (a) contribute to annotating an Altmetrics dataset across five disciplines, (b) undertake sentiment analysis using various machine learning and natural language processing-based algorithms, (c) identify the best-performing model and (d) provide a Python library for sentiment analysis of an Altmetrics dataset. First, the researchers gave a set of guidelines to two human annotators familiar with the task of related tweet annotation of scientific literature. They duly labelled the sentiments, achieving an inter-annotator agreement (IAA) of 0.80 (Cohen's Kappa). Then, the same experiments were run on two versions of the dataset: one with tweets in English and the other with tweets in 23 languages, including English. Using 6388 tweets about 300 papers indexed in Web of Science, the effectiveness of employed machine learning and natural language processing models was measured by comparing with well-known sentiment analysis models, that is, SentiStrength and Sentiment140, as the baseline. It was proved that Support Vector Machine with uni-gram outperformed all the other classifiers and baseline methods employed, with an accuracy of over 85%, followed by Logistic Regression at 83% accuracy and Naive Bayes at 80%. The precision, recall and F1 scores for Support Vector Machine, Logistic Regression and Naive Bayes were (0.89, 0.86, 0.86), (0.86, 0.83, 0.80) and (0.85, 0.81, 0.76), respectively.
引用
收藏
页码:712 / 726
页数:15
相关论文
共 50 条
  • [1] Sentiment Analysis of Tweets using Machine Learning Approach
    Rathi, Megha
    Malik, Aditya
    Varshney, Daksh
    Sharma, Rachita
    Mendiratta, Sarthak
    [J]. 2018 ELEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2018, : 365 - 367
  • [2] Detecting Cyberbullying from Tweets Through Machine Learning Techniques with Sentiment Analysis
    Atoum, Jalal Omer
    [J]. ADVANCES IN INFORMATION AND COMMUNICATION, FICC, VOL 2, 2023, 652 : 25 - 38
  • [3] Machine Learning Approach to Analyze the Sentiment of Airline Passengers' Tweets
    Wu, Shengyang
    Gao, Yi
    [J]. TRANSPORTATION RESEARCH RECORD, 2024, 2678 (02) : 48 - 56
  • [4] A machine learning-based approach for sentiment analysis on distance learning from Arabic Tweets
    Almalki, Jameel
    [J]. PEERJ COMPUTER SCIENCE, 2022, 8
  • [5] A Deep Learning Approach for Sentiment Analysis in Spanish Tweets
    Vizcarra, Gerson
    Mauricio, Antoni
    Mauricio, Leonidas
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 622 - 629
  • [6] Sentiment analysis of malayalam tweets using machine learning techniques
    Soumya, S.
    Pramod, K., V
    [J]. ICT EXPRESS, 2020, 6 (04): : 300 - 305
  • [7] Sentiment Analysis of Tweets Using Various Machine Learning Techniques
    Tariyal, Ankit
    Goyal, Sachin
    Tantububay, Neeraj
    [J]. 2018 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATION AND TELECOMMUNICATION (ICACAT), 2018,
  • [8] Sentiment Analysis on Tweets Using Machine Learning and Combinatorial Fusion
    Ondusko, Dominik
    Ho, James
    Roy, Brandon
    Hsu, D. Frank
    [J]. IEEE 17TH INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP / IEEE 17TH INT CONF ON PERVAS INTELLIGENCE AND COMP / IEEE 5TH INT CONF ON CLOUD AND BIG DATA COMP / IEEE 4TH CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2019, : 1066 - 1071
  • [9] Evaluating sentiment analysis for Arabic Tweets using machine learning and deep learning
    Alshutayri, Areej
    Alamoudi, Huda
    Alshehri, Boushra
    Aldhahri, Eman
    Alsaleh, Iqbal
    Aljojo, Nahla
    Alghoson, Abdullah
    [J]. ROMANIAN JOURNAL OF INFORMATION TECHNOLOGY AND AUTOMATIC CONTROL-REVISTA ROMANA DE INFORMATICA SI AUTOMATICA, 2022, 32 (04): : 7 - 18
  • [10] Happy or grumpy? A Machine Learning Approach to Analyze the Sentiment of Airline Passengers' Tweets
    Wu, Shengyang
    Gao, Yi
    [J]. arXiv, 2022,