Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis

Authors
Monika Arora
Vineet Kansal
Affiliations
[1] IET
[2] AKTU
Keywords
Opinion mining; Convolutional neural network; Phonetic algorithm; Soundex; SemEval dataset
DOI
Not available
Abstract
On social media platforms such as Twitter and Facebook, people express their views, arguments, and emotions about many events in daily life. Twitter is an international microblogging service featuring short messages, called “tweets”, written in many languages. These texts often contain noise in the form of incorrect grammar, abbreviations, freestyle writing, and typographical errors. Sentiment analysis (SA), a task within natural language processing (NLP), aims to predict the actual emotions that people express in raw text. The main aim of our work is to process raw sentences from a Twitter dataset and find the actual polarity of each message. This paper proposes text normalization combined with a deep convolutional character-level embedding (Conv-char-Emb) neural network model for SA of unstructured data. The model tackles three problems: (1) processing noisy sentences for sentiment detection; (2) handling the small memory space in word-level embedding learning; and (3) accurate sentiment analysis of unstructured data. The initial preprocessing stage performs text normalization in the following steps: tokenization, out-of-vocabulary (OOV) detection and replacement, lemmatization, and stemming. Character-based embedding in a convolutional neural network (CNN) is an effective and efficient technique for SA that uses fewer learnable parameters in feature representation. Thus, the proposed method performs both normalization and sentiment classification for unstructured sentences. The experimental results are evaluated on the Twitter dataset across different polarity points (positive, negative, and neutral). The results show that our model performs well in both normalization and sentiment analysis of raw Twitter data enriched with hidden information.
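The tokenization and OOV replacement steps named in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the phonetic algorithm is applied, so this sketch assumes that an OOV token is replaced by the first in-vocabulary word sharing its Soundex code. The names `soundex`, `normalize`, and `VOCAB`, the toy vocabulary, and the regex tokenizer are all hypothetical. Lemmatization and stemming are omitted.

```python
import re

# Standard American Soundex digit groups.
_CODES = {}
for digit, letters in enumerate(("bfpv", "cgjkqsxz", "dt", "l", "mn", "r"), start=1):
    for ch in letters:
        _CODES[ch] = str(digit)


def soundex(word):
    """Return the 4-character Soundex code of `word` ("" if it has no ASCII letters)."""
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    encoded = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate repeated codes
        code = _CODES.get(ch, "")
        if code and code != prev:
            encoded += code
        prev = code  # a vowel resets prev, so a repeated code after it is kept
    return (encoded + "000")[:4]


def normalize(text, vocab):
    """Tokenize `text` and replace OOV tokens by a phonetically matching vocab word.

    Assumption (not from the paper): the first vocabulary word with the same
    Soundex code wins; tokens with no phonetic match are left unchanged.
    """
    known = set(vocab)
    by_code = {}
    for w in vocab:
        by_code.setdefault(soundex(w), w)  # keep the first word per code
    tokens = re.findall(r"[a-z']+", text.lower())  # simplistic tokenizer
    return [t if t in known else by_code.get(soundex(t), t) for t in tokens]


# Toy vocabulary, purely illustrative.
VOCAB = ["the", "movie", "was", "great", "see", "you", "tomorrow"]

if __name__ == "__main__":
    print(normalize("The moovie was grate", VOCAB))
```

Soundex matching is coarse: unrelated words can share a code, and a misspelling can fall outside the code of its intended word. A practical system would probably rank candidates by an additional similarity measure before replacing a token.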