Tweet Sentiment Classification Using an Ensemble of Machine Learning Supervised Classifiers Employing Statistical Feature Selection Methods

Cited by: 10
Authors
Devi, K. Lakshmi [1]
Subathra, P. [1]
Kumar, P. N. [1]
Affiliation
[1] Amrita Vishwa Vidyapeetham, Dept CSE, Coimbatore 641112, Tamil Nadu, India
Keywords
Bagging; Boosting; Ensemble learners; Entropy; Naive Bayes; Sentiment classification; SVM
DOI
10.1007/978-3-319-27212-2_1
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Twitter is considered to be the most powerful tool for information dissemination among the micro-blogging websites. Every day, large volumes of user-generated content are posted on Twitter, and determining the sentiment of this content can be useful to individuals, businesses, government organisations, and others. Many machine learning approaches have been investigated for years, and there is no consensus as to which method is most suitable for any particular application. Recent research has revealed the potential of ensemble learners to provide improved accuracy in sentiment classification. In this work, we conducted a performance comparison of ensemble learners such as Bagging and Boosting against baseline methods such as Support Vector Machines, Naive Bayes, and Maximum Entropy classifiers. In contrast to the traditional method of using Bag of Words for feature selection, we incorporated statistical feature selection methods, namely Pointwise Mutual Information and Chi-square, which resulted in improved accuracy. We performed the evaluation on a Twitter dataset, and the empirical results revealed that the ensemble methods provided more accurate results than the baseline classifiers.
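To make the two statistical feature selection measures named in the abstract concrete, the following is a minimal sketch of how Chi-square and Pointwise Mutual Information scores can be computed for a term and a sentiment class from document counts. It is an illustration under stated assumptions, not the authors' implementation: the function names (`term_class_counts`, `chi_square`, `pmi`, `top_k_terms`), the toy tweets, and the document-level 2x2 contingency formulation are all assumptions introduced here.

```python
import math


def term_class_counts(docs, labels, term, cls):
    """Count documents by (term present?, class == cls?).

    Returns (n11, n10, n01, n00):
      n11: term present, in class      n10: term present, not in class
      n01: term absent,  in class      n00: term absent,  not in class
    """
    n11 = n10 = n01 = n00 = 0
    for tokens, label in zip(docs, labels):
        has_term = term in tokens
        in_class = label == cls
        if has_term and in_class:
            n11 += 1
        elif has_term:
            n10 += 1
        elif in_class:
            n01 += 1
        else:
            n00 += 1
    return n11, n10, n01, n00


def chi_square(n11, n10, n01, n00):
    """Chi-square statistic for a 2x2 term/class contingency table."""
    n = n11 + n10 + n01 + n00
    denom = (n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00)
    if denom == 0:
        return 0.0  # a row or column is empty, so the term carries no signal
    return n * (n11 * n00 - n10 * n01) ** 2 / denom


def pmi(n11, n10, n01, n00):
    """Pointwise mutual information log2(P(t, c) / (P(t) * P(c)))."""
    n = n11 + n10 + n01 + n00
    if n11 == 0:
        return float("-inf")  # term never co-occurs with the class
    return math.log2(n11 * n / ((n11 + n10) * (n11 + n01)))


def top_k_terms(docs, labels, cls, k):
    """Rank vocabulary terms by chi-square for one class; ties break alphabetically."""
    vocab = {tok for tokens in docs for tok in tokens}
    scored = [
        (chi_square(*term_class_counts(docs, labels, t, cls)), t) for t in vocab
    ]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [t for _, t in scored[:k]]


# Hypothetical toy data: four tokenized tweets with sentiment labels.
docs = [["good", "movie"], ["good", "day"], ["bad", "movie"], ["bad", "day"]]
labels = ["pos", "pos", "neg", "neg"]

print(top_k_terms(docs, labels, "pos", 2))
```

In this toy data, "good" and "bad" separate the classes perfectly while "movie" and "day" are evenly spread, so the first two score highest under Chi-square. Note that Chi-square is symmetric in this sense (a term strongly tied to the negative class also scores high for the positive class), whereas PMI is signed and rewards only positive association. The selected terms would then be fed to the classifiers in place of the full vocabulary.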
Pages: 1-13
Page count: 13