A Hybrid CNN-LSTM Model for Improving Accuracy of Movie Reviews Sentiment Analysis

Cited by: 140
Authors
Rehman, Anwar Ur [1]
Malik, Ahmad Kamran [1]
Raza, Basit [1]
Ali, Waqar [1]
Affiliations
[1] COMSATS Univ Islamabad CUI, Dept Comp Sci, Islamabad, Pakistan
Keywords
Natural Language Processing (NLP); Sentiment Analysis; CNN; LSTM;
DOI
10.1007/s11042-019-07788-7
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Social media has become a major source of user opinions. With advances in technology and the growth of the internet, a huge amount of data is generated from sources such as social blogs and websites. Blogs and websites now serve as real-time means of gathering product reviews. However, the sheer number of blogs on the cloud generates a huge volume of information in different forms, such as attitudes, opinions, and reviews. There is therefore a pressing need for methods that extract meaningful information from big data, classify it into categories, and predict end users' behaviors or sentiments. The Long Short-Term Memory (LSTM) model and the Convolutional Neural Network (CNN) model have been applied to various Natural Language Processing (NLP) tasks with remarkable results. The CNN model efficiently extracts higher-level features using convolutional and max-pooling layers. The LSTM model is capable of capturing long-term dependencies between word sequences. In this study, we propose a hybrid model combining LSTM with a very deep CNN, named the Hybrid CNN-LSTM Model, to address the sentiment analysis problem. First, we use the Word to Vector (Word2Vec) approach to train initial word embeddings. Word2Vec translates text strings into vectors of numeric values, computes distances between words, and groups similar words based on their meanings. After word embedding is performed, the proposed model combines the set of features extracted by convolution and global max-pooling layers with long-term dependencies. The proposed model also uses dropout, normalization, and a rectified linear unit to improve accuracy. Our results show that the proposed Hybrid CNN-LSTM Model outperforms traditional deep learning and machine learning techniques in terms of precision, recall, F-measure, and accuracy. Our approach achieved competitive results using state-of-the-art techniques on the IMDB movie review dataset and the Amazon movie reviews dataset.
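The feature-extraction step described in the abstract (convolution, a rectified linear unit, then global max-pooling over word embeddings) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, the 2-dimensional toy embeddings, and the filter weights are all hypothetical, chosen only to show how one filter turns a sequence of word vectors into a single pooled feature.

```python
from typing import List

Vector = List[float]


def relu(x: float) -> float:
    """Rectified linear unit: negative activations are clipped to zero."""
    return x if x > 0.0 else 0.0


def conv1d_relu(embeddings: List[Vector], kernel: List[Vector],
                bias: float = 0.0) -> List[float]:
    """Slide one filter over a sequence of word vectors.

    Uses stride 1 and no padding, so a sequence of n words and a filter
    spanning k words yields n - k + 1 activations.
    """
    width = len(kernel)
    outputs = []
    for start in range(len(embeddings) - width + 1):
        total = bias
        for offset in range(width):
            word_vec = embeddings[start + offset]
            weights = kernel[offset]
            total += sum(w * x for w, x in zip(weights, word_vec))
        outputs.append(relu(total))
    return outputs


def global_max_pool(feature_map: List[float]) -> float:
    """Keep only the strongest activation of the filter over the sequence."""
    return max(feature_map)


# Hypothetical 2-dimensional embeddings for a 4-word review (made-up values;
# real Word2Vec vectors typically have hundreds of dimensions).
sentence = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 0.0]]

# One hypothetical filter spanning 2 consecutive words.
bigram_filter = [[1.0, 0.0], [0.0, 1.0]]

feature_map = conv1d_relu(sentence, bigram_filter)
pooled = global_max_pool(feature_map)
print(feature_map, pooled)
```

In a full model, many such filters would run in parallel, and their pooled outputs would be combined with the sequence representation produced by the LSTM; how exactly the paper merges the two is not specified in the abstract, so that step is left out here.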
Pages: 26597-26613
Number of pages: 17