Social Media Sentiment Analysis Using K-Means and Naive Bayes Algorithm

被引:0
|
作者
Zul, Muhammad Ihsan [1 ]
Yulia, Feoni [2 ]
Nurmalasari, Dini [3 ]
机构
[1] Politekn Caltex Riau, Informat Engn, Pekanbaru, Indonesia
[2] Politekn Caltex Riau, Informat Syst, Pekanbaru, Indonesia
[3] Politekn Caltex Riau, Comp Engn, Pekanbaru, Indonesia
来源
2018 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICON EEI): TOWARD THE MOST EFFICIENT WAY OF MAKING AND DEALING WITH FUTURE ELECTRICAL POWER SYSTEM AND BIG DATA ANALYSIS | 2018年
关键词
k-means; naive bayes; sentiwordnet; sentiment analysis; text mining; k-fold cross validation; confusion matrix;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Opinions are a major influence when making decisions for individuals or organizations. A collection of opinions can be extracted to gain useful knowledge. This knowledge is used as a source of information which can be used as a consideration in decision making. The extraction of knowledge from text has been known as text mining. Text mining has any kinds of algorithm to extract information from collected text, such as K-Means, K-Nearest Neighbors, Naive Bayes, and the others. One of the sources of opinion is from social media, especially Facebook and Twitter. On Facebook and Twitter, many people have been writing their opinions about many things. This very much data are difficult to analyze thoroughly. In this paper, K-Means and Naive Bayes algorithm are developed to analyze public opinions or sentiments. Outlier removal is also added to this analysis. Opinions are taken from Facebook and Twitter. The accuracy of the system is tested 10 times at k different points for each k value (k=6, 7, 8, 9 and 10). As the result, the combination of K-Means and Naive Bayes has lower accuracy than the accuracy produced by Naive Bayes without the combination of K-Means, but almost same accuracies. The accuracy of Naive Bayes algorithm is from 80.526%-82.500%, while the combination of Naive Bayes and K-Means has 80.323%-81.523% accuracy.
引用
收藏
页码:24 / 29
页数:6
相关论文
共 50 条
  • [1] Comparison of Naive Bayes and K-nearest neighbours for online transportation using sentiment analysis in social media
    Atmadja, A. R.
    Uriawan, W.
    Pritisen, F.
    Maylawati, D. S.
    Arbain, A.
    4TH ANNUAL APPLIED SCIENCE AND ENGINEERING CONFERENCE, 2019, 2019, 1402
  • [2] Development of Anti-Spam Technique using Modified K-Means & Naive Bayes Algorithm
    Tayal, Devendra K.
    Jain, Amita
    Meena, Kanak
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 2593 - 2597
  • [3] Sentiment Analysis Using Naive Bayes Algorithm With Case Study
    Akella, Jishnusri Ojaswy
    Akella, L. N. Yashaswy
    PROCEEDINGS OF THE 2018 3RD INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2018), 2018,
  • [4] Twitter Sentiment Analysis Using a Modified Naive Bayes Algorithm
    Masrani, Manav
    Poornalatha, G.
    INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, PT I, 2018, 655 : 171 - 181
  • [5] Social Media Analysis using Optimized K-Means Clustering
    Alsayat, Ahmed
    El-Sayed, Hoda
    2016 IEEE/ACIS 14TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATIONS (SERA), 2016, : 61 - 66
  • [6] Sentiment Analysis of Tweets using Unsupervised Learning Techniques and the K-Means Algorithm
    Iparraguirre-Villanueva, Orlando
    Guevara-Ponce, Victor
    Sierra-Linan, Fernando
    Beltozar-Clemente, Saul
    Cabanillas-Carbone, Michael
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (06) : 571 - 578
  • [7] Sentiment Analysis on Twitter Data-set using Naive Bayes Algorithm
    Parveen, Huma
    Pandey, Shikha
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2016, : 416 - 419
  • [8] Intrusion Detection based on K-Means Clustering and Naive Bayes Classification
    Muda, Z.
    Yassin, W.
    Sulaiman, M. N.
    Udzir, N. I.
    2011 7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN ASIA (CITA 11), 2011,
  • [9] An Integration of K-Means Clustering and Naive Bayes Classifier for Intrusion Detection
    Varuna, S.
    Natesan, P.
    2015 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2015,
  • [10] Understanding of Digital Learning Sources with the Heutagogy Approach using the K-Means and Naive Bayes Methods
    Praherdhiono, Henry
    Adi, Eka Pramono
    Devita, Riri Nada
    2018 4TH INTERNATIONAL CONFERENCE ON EDUCATION AND TECHNOLOGY (ICET), 2018, : 23 - 27