A Hybrid CNN-LSTM Model for Improving Accuracy of Movie Reviews Sentiment Analysis

Cited by: 140
Authors
Rehman, Anwar Ur [1]
Malik, Ahmad Kamran [1]
Raza, Basit [1]
Ali, Waqar [1]
Affiliations
[1] COMSATS Univ Islamabad CUI, Dept Comp Sci, Islamabad, Pakistan
Keywords
Natural Language Processing (NLP); Sentiment Analysis; CNN; LSTM
DOI
10.1007/s11042-019-07788-7
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Social media has become a major source of user opinions. With advances in technology and the growing sophistication of the internet, a huge amount of data is generated from sources such as social blogs and websites. Blogs and websites have recently become real-time means of gathering product reviews. However, the excessive number of blogs on the cloud has produced a huge volume of information in different forms, such as attitudes, opinions, and reviews. There is therefore a pressing need for a method that extracts meaningful information from big data, classifies it into categories, and predicts end users' behaviors or sentiments. The Long Short-Term Memory (LSTM) model and the Convolutional Neural Network (CNN) model have been applied to a range of Natural Language Processing (NLP) tasks with remarkable results. The CNN model efficiently extracts higher-level features using convolutional and max-pooling layers. The LSTM model is capable of capturing long-term dependencies between word sequences. In this study, we propose a hybrid model that combines an LSTM with a very deep CNN, named the Hybrid CNN-LSTM Model, to address the sentiment analysis problem. First, we use the Word to Vector (Word2Vec) approach to train initial word embeddings. Word2Vec translates text strings into vectors of numeric values, computes distances between words, and groups similar words based on their meanings. After word embedding is performed, the proposed model combines the set of features extracted by convolution and global max-pooling layers with long-term dependencies. The proposed model also uses dropout, normalization, and a rectified linear unit to improve accuracy. Our results show that the proposed Hybrid CNN-LSTM Model outperforms traditional deep learning and machine learning techniques in terms of precision, recall, F-measure, and accuracy. Our approach achieved competitive results using state-of-the-art techniques on the IMDB movie review dataset and the Amazon movie reviews dataset.
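The four evaluation measures named in the abstract (precision, recall, F-measure, and accuracy) can be illustrated with a minimal Python sketch. It assumes binary sentiment labels where 1 marks a positive review and 0 a negative one; the function name `binary_classification_metrics` and the sample labels are hypothetical and are not taken from the paper, and the numbers it produces are not the paper's results.

```python
from typing import Dict, Sequence


def binary_classification_metrics(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> Dict[str, float]:
    """Compute accuracy, precision, recall and F-measure for labels in {0, 1}.

    Label 1 is treated as the positive class (assumed here to mean a
    positive review); label 0 is the negative class.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_true) == 0:
        raise ValueError("at least one example is required")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0

    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
    }


# Made-up labels for illustration only; these are not data from the paper.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]
metrics = binary_classification_metrics(y_true, y_pred)
```

The F-measure here is the harmonic mean of precision and recall (often written F1); whether the paper weights the two differently is not stated in the abstract.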
Pages: 26597-26613
Page count: 17
Journal: Multimedia Tools and Applications, 2019, vol. 78