Dimensionality Reduction for Sentiment Analysis using Pre-processing Techniques

Cited by: 0
Authors
Mhatre, Mayuri [1]
Phondekar, Dakshata [1]
Kadam, Pranali [1]
Chawathe, Anushka [1]
Ghag, Kranti [1]
Affiliations
[1] SAKEC, Informat Technol Dept, Bombay, Maharashtra, India
Keywords
Sentiment Analysis; Pre-processing; Slangs Handling; Stopwords Removal; Lemmatization
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic and Communication Technology]
Discipline classification code
0808; 0809
Abstract
Sentiment analysis is the study of people's opinions, sentiments, attitudes and emotions as expressed in written language. In a business context this process is time-consuming, inconsistent and costly, and pre-processing the data helps to ease the difficulty. Pre-processing is the process of cleaning and preparing text for analysis. The existing pre-processing techniques are expressive lengthening handling, emoticon handling, HTML tag removal, punctuation handling, slang handling, stopword removal, stemming and lemmatization. In this paper, the effect of these techniques and their combinations was analyzed on the Kaggle dataset Bag of Words Meets Bags of Popcorn. Every possible combination of pre-processing techniques was evaluated, with the aim of finding the one that gives the highest accuracy. A Random Forest classifier was used to predict sentiment, as it is known to give good accuracy, and the results were evaluated using 10-fold cross-validation. Accuracy increased from unprocessed data to pre-processed data. It was concluded that using pre-processing techniques gives higher accuracy than the traditional approach, i.e. no pre-processing.
Pages: 16-21
Number of pages: 6