Tweet Sentiment Classification Using an Ensemble of Machine Learning Supervised Classifiers Employing Statistical Feature Selection Methods

被引:10
|
作者
Devi, K. Lakshmi [1 ]
Subathra, P. [1 ]
Kumar, P. N. [1 ]
机构
[1] Amrita Vishwa Vidyapeetham, Dept CSE, Coimbatore 641112, Tamil Nadu, India
关键词
Bagging; Boosting; Ensemble learners; Entropy; Naive bayes; Sentiment classification; SVM;
D O I
10.1007/978-3-319-27212-2_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Twitter is considered to be the most powerful tool of information dissemination among the micro-blogging websites. Everyday large user generated contents are being posted in Twitter and determining the sentiment of these contents can be useful to individuals, business companies, government organisations etc. Many Machine Learning approaches are being investigated for years and there is no consensus as to which method is most suitable for any particular application. Recent research has revealed the potential of ensemble learners to provide improved accuracy in sentiment classification. In this work, we conducted a performance comparison of ensemble learners like Bagging and Boosting with the baseline methods like Support Vector Machines, Naive Bayes and Maximum Entropy classifiers. As against the traditional method of using Bag of Words for feature selection, we have incorporated statistical methods of feature selection like Point wise Mutual Information and Chi-square methods, which resulted in improved accuracy. We performed the evaluation using Twitter dataset and the empirical results revealed that ensemble methods provided more accurate results than baseline classifiers.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [41] Solar Radiation Forecasting Using Machine Learning and Ensemble Feature Selection
    Solano, Edna S.
    Dehghanian, Payman
    Affonso, Carolina M.
    [J]. ENERGIES, 2022, 15 (19)
  • [42] Phishing Website Detection Using Machine Learning Classifiers Optimized by Feature Selection
    Mehanovic, Dzelila
    Kevric, Jasmin
    [J]. TRAITEMENT DU SIGNAL, 2020, 37 (04) : 563 - 569
  • [43] Feature Ensemble Plus Sample Selection: Domain Adaptation for Sentiment Classification
    Xia, Rui
    Zong, Chengqing
    Hu, Xuelei
    Cambria, Erik
    [J]. IEEE INTELLIGENT SYSTEMS, 2013, 28 (03) : 10 - 18
  • [44] An ensemble machine learning approach for classification tasks using feature generation
    Feng, Wenjuan
    Gou, Jin
    Fan, Zongwen
    Chen, Xiang
    [J]. CONNECTION SCIENCE, 2023, 35 (01)
  • [45] A Feature Selection Approach for Fall Detection Using Various Machine Learning Classifiers
    Tuan Minh Le
    Ly Van Tran
    Son Vu Truong Dao
    [J]. IEEE ACCESS, 2021, 9 : 115895 - 115908
  • [46] Feature Ensemble Plus Sample Selection: Domain Adaptation for Sentiment Classification
    Xia, Rui
    Zong, Chengqing
    Hu, Xuelei
    Cambria, Erik
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 4229 - 4233
  • [47] Feature Selection for Text Classification Using Machine Learning Approaches
    K. Thirumoorthy
    K. Muneeswaran
    [J]. National Academy Science Letters, 2022, 45 : 51 - 56
  • [48] Feature Selection for Text Classification Using Machine Learning Approaches
    Thirumoorthy, K.
    Muneeswaran, K.
    [J]. NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2022, 45 (01): : 51 - 56
  • [49] Connected Devices Classification using Feature Selection with Machine Learning
    Fagroud, Fatima Zahra
    Toumi, Hicham
    Lahmar, El Habib Ben
    Achtaich, Khadija
    El Filali, Sanaa
    Baddi, Youssef
    [J]. IAENG International Journal of Computer Science, 2022, 49 (02)
  • [50] Classification of Sentiment Reviews for Indian Railways Using Machine Learning Methods
    Bagga, Manju
    Aggarwa, Ritu
    Arora, Nitika
    [J]. INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 171 - 177