Sentiment classification of skewed shoppers' reviews using machine learning techniques, examining the textual features

被引:5
|
作者
Rezapour, Mahdi [1 ]
机构
[1] Wyoming Technol Transfer Ctr, 1000 E Univ Ave,Dept 3295, Laramie, WY 82071 USA
关键词
machine learning technique; opinion mining; polarity; opinion extraction; review classification; sentiment analysis; text classification|Natural language processing; ONLINE REVIEWS;
D O I
10.1002/eng2.12280
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the speedy growth of online shopping, it has become of crucial importance for product makers to analyze, and handle a wealth of products' reviews. However, such a high volume of reviews, along with a wide variety of opinions, makes it hard for manufacturers to know exactly how they can improve their products without having an efficient approach. For this purpose, the results of sentiment classification would help the customers to retrieve the necessary information to choose an appropriate product, and the sellers to effectively collect customer feedback in order to improve their products. Like most of the read-world problems, the shopping review data being used in this study were imbalanced, being predominately composed of positive with only a small percentage of negative reviews. Machine learning (ML) algorithms do not perform well when data are imbalanced, as they tend to get biased toward the overrepresented data category. The synthetic minority over-sampling technique (SMOTE) was used to address this class imbalance problem. In this study, three different ML-based algorithms, namely the Naive Bayes (NB), Support Vector Machine, and decision tree (DT) were employed. An extensive preprocessing procedure was taken to prepare the text datasets, and details are discussed in the manuscript. The performance analysis indicated that the DT algorithm outperforms the other two methods. As positive reviews account for the majority of the reviews, sparse words removal for the data resulted in the removal of almost all negative reviews' sentiments. Hence, the model training process is here performed on positive and negative reviews separately. A combination of the review titles with their contents, separate tokenization process, applications of various N-gram, and maintaining stops words (e.g. "not" or "but") were some other steps considered to improve the performance of the model.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Intensified Sentiment Analysis of Customer Product Reviews Using Acoustic and Textual Features
    Govindaraj, Sureshkumar
    Gopalakrishnan, Kumaravelan
    ETRI JOURNAL, 2016, 38 (03) : 494 - 501
  • [22] Experimental study on sentiment classification of Chinese review using machine learning techniques
    Li, Jun
    Sun, Maosong
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 393 - +
  • [23] Evaluating Machine Learning and Unsupervised Semantic Orientation Approaches for Sentiment Analysis of Textual Reviews
    Waila, P.
    Marisha
    Singh, V. K.
    Singh, M. K.
    2012 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2012, : 458 - 463
  • [24] Sentiment Classification based on Machine Learning Approaches in Amazon Product Reviews
    Abu Kausar, Mohammad
    Fageeri, Sallam Osman
    Soosaimanickam, Arockiasamy
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2023, 13 (03) : 10849 - 10855
  • [25] Implementation of Sentiment Classification of Movie Reviews by Supervised Machine Learning Approaches
    Untawale, Tejaswini M.
    Choudhari, G.
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 1197 - 1200
  • [26] Sentiment Classification of User Reviews Using Supervised Learning Techniques with Comparative Opinion Mining Perspective
    Khan, Aurangzeb Aurangzeb
    Younis, Umair
    Kundi, Alam Sher
    Asghar, Muhammad Zubair
    Ullah, Irfan
    Aslam, Nida
    Ahmed, Imran
    ADVANCES IN COMPUTER VISION, VOL 2, 2020, 944 : 23 - 29
  • [27] Tourist Reviews Sentiment Classification using Deep Learning Techniques: A Case Study in Saudi Arabia
    Alharbi, Banan A.
    Mezher, Mohammad A.
    Barakeh, Abdullah M.
    International Journal of Advanced Computer Science and Applications, 2022, 13 (06) : 717 - 726
  • [28] Sentiment Analysis and Fake Amazon Reviews Classification Using SVM Supervised Machine Learning Model
    Tabany, Myasar
    Gueffal, Meriem
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (01) : 49 - 58
  • [29] Classification of Sentiment Analysis Using Machine Learning
    Parikh, Satyen M.
    Shah, Mitali K.
    INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, 2020, 46 : 76 - 86
  • [30] Examining Machine Learning Techniques in Business News Headline Sentiment Analysis
    Lim, Seong Liang Ooi
    Lim, Hooi Mei
    Tan, Eng Kee
    Tan, Tien-Ping
    COMPUTATIONAL SCIENCE AND TECHNOLOGY (ICCST 2019), 2020, 603 : 363 - 372