Text Mining-A Comparative Review of Twitter Sentiments Analysis

被引:0
|
作者
Patil S. [1 ]
Subil D. [1 ]
Nasar N. [1 ]
Kokatnoor S.A. [1 ]
Krishnan B. [1 ]
Kumar S. [1 ]
机构
[1] Department of Computer Science and Engineering, School of Engineering and Technology, CHRIST (Deemed to be University), Karnataka, 74, Bangalore
关键词
airline sentiments; decision trees; gaussian naive bayes; gini index; machine learning; multinomial naive bayes; multinomial naive bayes with bagging; Opinion mining; random forest;
D O I
10.2174/2666255816666230726140726
中图分类号
学科分类号
摘要
Background: Text mining derives information and patterns from textual data. Online social media platforms, which have recently acquired great interest, generate vast text data about human behaviors based on their interactions. This data is generally ambiguous and unstructured. The data includes typing errors and errors in grammar that cause lexical, syntactic, and semantic uncertainties. This results in incorrect pattern detection and analysis. Researchers are employing various text mining techniques that can aid in Topic Modeling, the detection of Trending Topics, the identification of Hate Speeches, and the growth of communities in online social media net-works. Objective: This review paper compares the performance of ten machine learning classification techniques on a Twitter data set for analyzing users' sentiments on posts related to airline usage. Methods: Review and comparative analysis of Gaussian Naive Bayes, Random Forest, Multinomial Naive Bayes, Multinomial Naive Bayes with Bagging, Adaptive Boosting (AdaBoost), Optimized AdaBoost, Support Vector Machine (SVM), Optimized SVM, Logistic Regression, and Long-Short Term Memory (LSTM) for sentiment analysis. Results: The results of the experimental study showed that the Optimized SVM performed better than the other classifiers, with a training accuracy of 99.73% and testing accuracy of 89.74% compared to other models. Conclusion: Optimized SVM uses the RBF kernel function and nonlinear hyperplanes to split the dataset into classes, correctly classifying the dataset into distinct polarity. This, together with Feature Engineering utilizing Forward Trigrams and Weighted TF-IDF, has improved Optimized SVM classifier performance regarding train and test accuracy. Therefore, the train and test accuracy of Optimized SVM are 99.73% and 89.74% respectively. When compared to Random Forest, a mar-ginal of 0.09% and 1.73% performance enhancement is observed in terms of train and test accuracy and 1.29% (train accuracy) and 3.63% (test accuracy) of improved performance when compared with LSTM. Likewise, Optimized SVM, gave more than 10% of enhanced performance in terms of train accuracy when compared with Gaussian Naïve Bayes, Multinomial Naïve Bayes, Multinomial Naïve Bayes with Bagging, Logistic Regression and a similar enhancement is observed with Ada-Boost and Optimized AdaBoost which are ensemble models during the experimental process. Optimized SVM also has outperformed all the classification models in terms of AUC-ROC train and test scores.. © 2024 Bentham Science Publishers.
引用
收藏
页码:21 / 37
页数:16
相关论文
共 50 条
  • [41] Health professionals' sentiments towards implemented information technologies in psychiatric hospitals: a text-mining analysis
    Golz, C.
    Aarts, S.
    Hacking, C.
    Hahn, S.
    Zwakhalen, S. M. G.
    BMC HEALTH SERVICES RESEARCH, 2022, 22 (01)
  • [42] Health professionals’ sentiments towards implemented information technologies in psychiatric hospitals: a text-mining analysis
    C. Golz
    S. Aarts
    C. Hacking
    S. Hahn
    S.M.G. Zwakhalen
    BMC Health Services Research, 22
  • [43] A Review of Opinion Mining in Twitter Streams
    Khan, Narmeen
    Khan, M. N. A.
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2018, 9 (01): : 66 - 72
  • [44] An Analysis of Twitter Data on E-cigarette Sentiments and Promotion
    Godea, Andreea Kamiana
    Caragea, Cornelia
    Bulgarov, Florin Adrian
    Ramisetty-Mikler, Suhasini
    ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2015), 2015, 9105 : 205 - 215
  • [45] An analysis of COVID-19 vaccine sentiments and opinions on Twitter
    Yousefinaghani, Samira
    Dara, Rozita
    Mubareka, Samira
    Papadopoulos, Andrew
    Sharif, Shayan
    INTERNATIONAL JOURNAL OF INFECTIOUS DISEASES, 2021, 108 : 256 - 262
  • [46] A review for comparative text mining: From data acquisition to practical application
    Wei, Na
    Zhao, Songzheng
    Liu, Jing
    Wang, Shenghui
    JOURNAL OF INFORMATION SCIENCE, 2023,
  • [47] Emotion Analysis for Opinion Mining From Text: A Comparative Study
    Mohsen, Amr Mansour
    Idrees, Amira M.
    Hassan, Hesham Ahmed
    INTERNATIONAL JOURNAL OF E-COLLABORATION, 2019, 15 (01) : 38 - 58
  • [48] A comparative analysis of the world’s constitutions: a text mining approach
    Tuncay Bayrak
    Social Network Analysis and Mining, 2022, 12
  • [49] A comparative analysis of the world's constitutions: a text mining approach
    Bayrak, Tuncay
    SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)
  • [50] Sentiment Analysis: Mining Opinions, Sentiments, and Emotions
    Zhao, Jun
    Liu, Kang
    Xu, Liheng
    COMPUTATIONAL LINGUISTICS, 2016, 42 (03) : 595 - 598