Improving Sentiment Analysis of Arabic Tweets by One-way ANOVA

被引:29
|
作者
Alassaf, Manar [1 ]
Qamar, Ali Mustafa [1 ]
机构
[1] Qassim Univ, Coll Comp, Dept Comp Sci, Buraydah, Saudi Arabia
关键词
Sentiment analysis; One-way ANOVA; Arabic tweets; Feature selection; Machine learning; High dimensionality; CLASSIFICATION; ALGORITHMS;
D O I
10.1016/j.jksuci.2020.10.023
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social media is an indispensable necessity for modern life. As a result, it is full of people's opinions, emotions, ideas, and attitudes, whether positive or negative. This abundance of views creates many opportunities for applying sentiment analysis to the education sector, which reflects how countries and cultures develop. In this research, a real-world Twitter dataset was collected, containing approximately 8144 tweets related to Qassim University, Saudi Arabia. The main aim of this experimental study was to explore the possibility of using a one-way analysis of variance (ANOVA) as a feature selection method to considerably reduce the number of features when classifying opinions conveyed through Arabic tweets. The primary motivation for this research was that no previous studies had examined one-way ANOVA comprehensively to tackle the curse of dimensionality and to enhance classification performance in sentiment analysis for Arabic tweets. Therefore, various experiments were conducted to investigate the effects of one-way ANOVA and to select important features concerning the performance of different supervised machine learning classifiers. Support Vector Machine and Naive Bayes achieved the best results with one-way ANOVA as compared to the baseline experimental results in the collected dataset. Furthermore, the differences between all results have been statistically analyzed in this study. As further evidence, one-way ANOVA with Support Vector Machine represented an excellent combination across different Arabic benchmark datasets, with its results outperforming other studies. (C) 2020 The Authors. Published by Elsevier B.V. on behalf of King Saud University.
引用
收藏
页码:2849 / 2859
页数:11
相关论文
共 50 条
  • [1] Sentiment Analysis in Arabic Tweets
    Duwairi, R. M.
    Marji, Raed
    Sha'ban, Narmeen
    Rushaidat, Sally
    [J]. 2014 5TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2014,
  • [2] The one-way ANOVA test explained
    Chatzi, Anna
    Doody, Owen
    [J]. NURSE RESEARCHER, 2023, 31 (03) : 8 - 14
  • [3] One-way ANOVA with Unequal Variances
    Sadooghi-Alvandi, S. M.
    Jafari, A. A.
    Mardani-Fard, H. A.
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2012, 41 (22) : 4200 - 4221
  • [4] Clustering Arabic Tweets for Sentiment Analysis
    Abuaiadah, Diab
    Rajendran, Dileep
    Jarrar, Mustafa
    [J]. 2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 449 - 456
  • [5] Analysis of One-Way ANOVA Model using Synthetic Data
    Biswajit Basak
    Bimal Sinha
    [J]. Sankhya B, 2024, 86 : 164 - 190
  • [6] Analysis of One-Way ANOVA Model using Synthetic Data
    Basak, Biswajit
    Sinha, Bimal
    [J]. SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 2024, 86 (01): : 164 - 190
  • [7] One-Way High-Dimensional ANOVA
    Chen, Tansheng
    Zheng, Lukun
    [J]. JOURNAL OF MATHEMATICS, 2023, 2023
  • [8] NONPARAMETRIC ONE-WAY ANOVA WITH MULTIPLE COVARIATES
    GOCKA, EF
    HANES, B
    [J]. BEHAVIOR RESEARCH METHODS & INSTRUMENTATION, 1975, 7 (05): : 484 - 484
  • [9] Beyond the one-way ANOVA for ’omics data
    Kirsty L. Hassall
    Andrew Mead
    [J]. BMC Bioinformatics, 19
  • [10] ONE-WAY ANOVA FROM SUMMARY STATISTICS
    ROSSI, JS
    [J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1987, 47 (01) : 37 - 38