Improving Sentiment Analysis of Arabic Tweets by One-way ANOVA

被引:29
|
作者
Alassaf, Manar [1 ]
Qamar, Ali Mustafa [1 ]
机构
[1] Qassim Univ, Coll Comp, Dept Comp Sci, Buraydah, Saudi Arabia
关键词
Sentiment analysis; One-way ANOVA; Arabic tweets; Feature selection; Machine learning; High dimensionality; CLASSIFICATION; ALGORITHMS;
D O I
10.1016/j.jksuci.2020.10.023
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social media is an indispensable necessity for modern life. As a result, it is full of people's opinions, emotions, ideas, and attitudes, whether positive or negative. This abundance of views creates many opportunities for applying sentiment analysis to the education sector, which reflects how countries and cultures develop. In this research, a real-world Twitter dataset was collected, containing approximately 8144 tweets related to Qassim University, Saudi Arabia. The main aim of this experimental study was to explore the possibility of using a one-way analysis of variance (ANOVA) as a feature selection method to considerably reduce the number of features when classifying opinions conveyed through Arabic tweets. The primary motivation for this research was that no previous studies had examined one-way ANOVA comprehensively to tackle the curse of dimensionality and to enhance classification performance in sentiment analysis for Arabic tweets. Therefore, various experiments were conducted to investigate the effects of one-way ANOVA and to select important features concerning the performance of different supervised machine learning classifiers. Support Vector Machine and Naive Bayes achieved the best results with one-way ANOVA as compared to the baseline experimental results in the collected dataset. Furthermore, the differences between all results have been statistically analyzed in this study. As further evidence, one-way ANOVA with Support Vector Machine represented an excellent combination across different Arabic benchmark datasets, with its results outperforming other studies. (C) 2020 The Authors. Published by Elsevier B.V. on behalf of King Saud University.
引用
收藏
页码:2849 / 2859
页数:11
相关论文
共 50 条
  • [31] A comparison of tests for the one-way ANOVA problem for functional data
    Gorecki, Tomasz
    Smaga, Lukasz
    [J]. COMPUTATIONAL STATISTICS, 2015, 30 (04) : 987 - 1010
  • [32] Bayesian estimates in a one-way ANOVA random effects model
    Bian, GR
    [J]. AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2002, 44 (01) : 99 - 108
  • [33] INVARIANT QUADRATIC ESTIMATORS IN RANDOM, ONE-WAY ANOVA MODEL
    LAMOTTE, LR
    [J]. BIOMETRICS, 1976, 32 (04) : 793 - 804
  • [34] Inference with Inducer Pivot Variables, an Application to the One-Way ANOVA
    Covas, Ricardo
    Mexia, Joao Tiago
    Fernandes, Celia
    Ramos, Paulo
    [J]. NUMERICAL ANALYSIS AND APPLIED MATHEMATICS ICNAAM 2011: INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS, VOLS A-C, 2011, 1389
  • [35] Some remarks on Bayesian inference for one-way ANOVA models
    Solari, Fabrizio
    Liseo, Brunero
    Sun, Dongchu
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2008, 60 (03) : 483 - 498
  • [36] Robust weighted one-way ANOVA: Improved approximation and efficiency
    Kulinskaya, Elena
    Dollinger, Michael B.
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2007, 137 (02) : 462 - 472
  • [37] ANOVA estimators under imbalance in the one-way random model
    Norell, L
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2003, 32 (03) : 601 - 623
  • [38] A comparison of tests for the one-way ANOVA problem for functional data
    Tomasz Górecki
    Łukasz Smaga
    [J]. Computational Statistics, 2015, 30 : 987 - 1010
  • [40] One-Way ANOVA Model with Fuzzy Data for Consumer Demand
    Lin, Pei Chun
    Arbaiy, Nureize
    Hamid, Isredza Rahmi Abd.
    [J]. RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING, 2017, 549 : 111 - 121