Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis

被引:1
|
作者
Monika Arora
Vineet Kansal
机构
[1] IET,
[2] AKTU,undefined
来源
关键词
Opinion mining; Convolutional neural network; Phonetic algorithm; Soundex; SemEval dataset;
D O I
暂无
中图分类号
学科分类号
摘要
On social media platforms such as Twitter and Facebook, people express their views, arguments, and emotions of many events in daily life. Twitter is an international microblogging service featuring short messages called “tweets” from different languages. These texts often consist of noise in the form of incorrect grammar, abbreviations, freestyle, and typographical errors. Sentiment analysis (SA) aims to predict the actual emotions from the raw text expressed by the people through the field of natural language processing (NLP). The main aim of our work is to process the raw sentence from the Twitter dataset and find the actual polarity of the message. This paper proposes a text normalization with deep convolutional character level embedding (Conv-char-Emb) neural network model for SA of unstructured data. This model can tackle the problems: (1) processing the noisy sentence for sentiment detection (2) handling small memory space in word level embedded learning (3) accurate sentiment analysis of the unstructured data. The initial preprocessing stage for performing text normalization includes the following steps: tokenization, out of vocabulary (OOV) detection and its replacement, lemmatization and stemming. A character-based embedding in convolutional neural network (CNN) is an effective and efficient technique for SA that uses less learnable parameters in feature representation. Thus, the proposed method performs both the normalization and classification of sentiments for unstructured sentences. The experimental results are evaluated in the Twitter dataset by a different point polarity (positive, negative and neutral). As a result, our model performs well in normalization and sentiment analysis of the raw Twitter data enriched with hidden information.
引用
收藏
相关论文
共 50 条
  • [21] Deep neural network architecture for sentiment analysis and emotion identification of Twitter messages
    Stojanovski, Dario
    Strezoski, Gjorgji
    Madjarov, Gjorgji
    Dimitrovski, Ivica
    Chorbev, Ivan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (24) : 32213 - 32242
  • [22] Deep neural network architecture for sentiment analysis and emotion identification of Twitter messages
    Dario Stojanovski
    Gjorgji Strezoski
    Gjorgji Madjarov
    Ivica Dimitrovski
    Ivan Chorbev
    Multimedia Tools and Applications, 2018, 77 : 32213 - 32242
  • [23] Chinese Text Sentiment Analysis using Bilinear Character-Word Convolutional Neural Networks
    Wang, Xu
    Li, Jing
    Yang, Xi
    Wang, Yangxu
    Sang, Yongsheng
    INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE), 2017, 190 : 36 - 43
  • [24] Multi-level graph neural network for text sentiment analysis
    Liao, Wenxiong
    Zeng, Bi
    Liu, Jianqi
    Wei, Pengfei
    Cheng, Xiaochun
    Zhang, Weiwen
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 92
  • [25] Triplet Embedding Convolutional Recurrent Neural Network for Long Text Semantic Analysis
    Liu, Jingxuan
    Zhu, Ming
    Ouyang, Huajiang
    Sun, Guozi
    Li, Huakang
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2022, 2022, 13724 : 607 - 615
  • [26] Investigation on the Chinese Text Sentiment Analysis Based on Convolutional Neural Networks in Deep Learning
    Xu, Feng
    Zhang, Xuefen
    Xin, Zhanhong
    Yang, Alan
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 58 (03): : 697 - 709
  • [27] Deep Convolution Neural Networks for Twitter Sentiment Analysis
    Zhao Jianqiang
    Gui Xiaolin
    Zhang Xuejun
    IEEE ACCESS, 2018, 6 : 23253 - 23260
  • [28] Performance Comparison of Text-based Sentiment Analysis using Recurrent Neural Network and Convolutional Neural Network
    Purnamasari, Prima Dewi
    Taqiyuddin, Muhammad
    Ratna, Anak Agung Putri
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION PROCESSING (ICCIP 2017), 2017, : 19 - 23
  • [29] Text Classification and Transfer Learning Based on Character-Level Deep Convolutional Neural Networks
    Sato, Minato
    Orihara, Ryohei
    Sei, Yuichi
    Tahara, Yasuyuki
    Ohsuga, Akihiko
    AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART 2017), 2018, 10839 : 62 - 81
  • [30] Clause Sentiment Identification Based on Convolutional Neural Network With Context Embedding
    Chen, Peng
    Xu, Bing
    Yang, Muyun
    Li, Sheng
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1532 - 1538