A novel classification approach based on Naive Bayes for Twitter sentiment analysis

被引:32
|
作者
Song, Junseok [1 ]
Kim, Kyung Tae [1 ]
Lee, Byungjun [1 ]
Kim, Sangyoung [1 ]
Youn, Hee Yong [1 ]
机构
[1] Sungkyunkwan Univ, Coll Software, Suwon, South Korea
基金
新加坡国家研究基金会;
关键词
Twitter sentiment analysis; Machine learning; Naive Bayes; Attribute weighting; Feature selection;
D O I
10.3837/tiis.2017.06.011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With rapid growth of web technology and dissemination of smart devices, social networking service(SNS) is widely used. As a result, huge amount of data are generated from SNS such as Twitter, and sentiment analysis of SNS data is very important for various applications and services. In the existing sentiment analysis based on the Naive Bayes algorithm, a same number of attributes is usually employed to estimate the weight of each class. Moreover, uncountable and meaningless attributes are included. This results in decreased accuracy of sentiment analysis. In this paper two methods are proposed to resolve these issues, which reflect the difference of the number of positive words and negative words in calculating the weights, and eliminate insignificant words in the feature selection step using Multinomial Naive Bayes(MNB) algorithm. Performance comparison demonstrates that the proposed scheme significantly increases the accuracy compared to the existing Multivariate Bernoulli Naive Bayes(BNB) algorithm and MNB scheme.
引用
收藏
页码:2996 / 3011
页数:16
相关论文
共 50 条
  • [31] A Novel Feature-Based Text Classification Improving the Accuracy of Twitter Sentiment Analysis
    Wang, Yili
    Sun, Le
    Wang, Jin
    Zheng, Yuhui
    Youn, Hee Yong
    [J]. ADVANCES IN COMPUTER SCIENCE AND UBIQUITOUS COMPUTING, 2018, 474 : 440 - 445
  • [32] Sentiment Analysis using Naive Bayes and Complement Naive Bayes Classifier Algorithms on Hadoop Framework
    Seref, Berna
    Bostanci, Erkan
    [J]. 2018 2ND INTERNATIONAL SYMPOSIUM ON MULTIDISCIPLINARY STUDIES AND INNOVATIVE TECHNOLOGIES (ISMSIT), 2018, : 555 - 561
  • [33] Adapting Naive Bayes to Domain Adaptation for Sentiment Analysis
    Tan, Songbo
    Cheng, Xueqi
    Wang, Yuefen
    Xu, Hongbo
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 337 - +
  • [34] A Topic based Approach for Sentiment Analysis on Twitter Data
    Ficamos, Pierre
    Liu, Yan
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (12) : 201 - 205
  • [35] Sentiment classification on Big Data using Naive Bayes and Logistic Regression
    Prabhat, Anjuman
    Khullar, Vikas
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2017,
  • [36] Fast and Accurate Sentiment Classification Using an Enhanced Naive Bayes Model
    Narayanan, Vivek
    Arora, Ishan
    Bhatia, Arjun
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2013, 2013, 8206 : 194 - 201
  • [37] Aspect-based Sentiment Analysis to Review Products Using Naive Bayes
    Mubarok, Mohamad Syahrul
    Adiwijaya
    Aldhi, Muhammad Dwi
    [J]. INTERNATIONAL CONFERENCE ON MATHEMATICS: PURE, APPLIED AND COMPUTATION: EMPOWERING ENGINEERING USING MATHEMATICS, 2017, 1867
  • [38] A Pairwise Naive Bayes Approach to Bayesian Classification
    Asafu-Adjei, Josephine K.
    Betensky, Rebecca A.
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2015, 29 (07)
  • [39] Comparison of SVM and Naive Bayes for Sentiment Classification using BERT data
    Rana, Shivani
    Kanji, Rakesh
    Jain, Shruti
    [J]. 2022 5TH INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2022,
  • [40] News Classification on Twitter Using Naive Bayes and Hypernym-Hyponym Based Feature Expansion
    Irfani, Fakhruddin Farid
    Fauzi, M. Ali
    Sari, Yuita Arum
    [J]. PROCEEDINGS OF 2018 3RD INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET 2018), 2018, : 317 - 321