Character level embedding with deep convolutional neural network for text normalization of unstructured data for Twitter sentiment analysis

Cited by: 1
Authors
Monika Arora
Vineet Kansal
Affiliations
[1] IET
[2] AKTU
Keywords
Opinion mining; Convolutional neural network; Phonetic algorithm; Soundex; SemEval dataset;
Abstract
On social media platforms such as Twitter and Facebook, people express their views, arguments, and emotions about events in daily life. Twitter is an international microblogging service built around short messages called “tweets”, written in many languages. These texts often contain noise in the form of incorrect grammar, abbreviations, freestyle writing, and typographical errors. Sentiment analysis (SA), a task within natural language processing (NLP), aims to predict the emotions that people express in raw text. The main aim of our work is to process raw sentences from a Twitter dataset and determine the actual polarity of each message. This paper proposes a text normalization method combined with a deep convolutional character-level embedding (Conv-char-Emb) neural network model for SA of unstructured data. The model addresses three problems: (1) processing noisy sentences for sentiment detection, (2) handling the limited memory space available in word-level embedding learning, and (3) accurate sentiment analysis of unstructured data. The initial preprocessing stage, which performs text normalization, consists of tokenization, out-of-vocabulary (OOV) detection and replacement, lemmatization, and stemming. Character-based embedding in a convolutional neural network (CNN) is an effective and efficient technique for SA because it uses fewer learnable parameters in the feature representation. The proposed method therefore performs both normalization and sentiment classification of unstructured sentences. The experimental results are evaluated on the Twitter dataset by polarity class (positive, negative, and neutral). The results indicate that the model performs well in both normalization and sentiment analysis of raw Twitter data enriched with hidden information.
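The normalization steps named in the abstract (tokenization, OOV detection, and replacement) can be illustrated with a minimal sketch. It assumes the phonetic matching is done with standard American Soundex, which appears in the keywords; the abstract does not specify the authors' exact procedure. The function names `soundex`, `tokenize`, and `normalize`, the toy vocabulary, and the tie-breaking rule are all hypothetical choices made for illustration, not the paper's implementation.

```python
import re

# Standard American Soundex digit groups (assumed variant).
_CODES = {}
for digit, letters in enumerate(["bfpv", "cgjkqsxz", "dt", "l", "mn", "r"], start=1):
    for ch in letters:
        _CODES[ch] = str(digit)


def soundex(word):
    """Return the 4-character Soundex code of a word, or "" if it has no ASCII letters."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    first = letters[0]
    digits = []
    prev = _CODES.get(first, "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not break a run of equal codes
        code = _CODES.get(ch, "")
        if code and code != prev:
            digits.append(code)
        prev = code  # a vowel resets prev to "", so a repeated code is kept
    return (first.upper() + "".join(digits) + "000")[:4]


def tokenize(text):
    """Very simple tokenizer: lowercase runs of letters, digits, and apostrophes."""
    return re.findall(r"[a-z0-9']+", text.lower())


def normalize(text, vocabulary):
    """Replace each OOV token with an in-vocabulary word sharing its Soundex code."""
    index = {}
    for word in sorted(vocabulary):
        index.setdefault(soundex(word), []).append(word)
    result = []
    for token in tokenize(text):
        code = soundex(token)
        candidates = index.get(code, []) if code else []
        if token in vocabulary or not candidates:
            result.append(token)  # in vocabulary, or no phonetic match: keep as is
        else:
            # Hypothetical tie-break: closest length, then alphabetical order.
            result.append(min(candidates, key=lambda w: (abs(len(w) - len(token)), w)))
    return result


vocab = {"love", "you", "my", "friend", "see", "tomorrow"}
print(normalize("luv u my frend, c u tomoro", vocab))
```

In this sketch, "luv", "frend", and "tomoro" are mapped to in-vocabulary words, while "u" and "c" are left unchanged because Soundex keeps the first letter and so cannot match them to "you" or "see". Abbreviations of that kind would need a separate lookup table, which is outside what this example covers.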
Related papers
50 in total
  • [31] Impact of convolutional neural network and FastText embedding on text classification
    Umer, Muhammad
    Imtiaz, Zainab
    Ahmad, Muhammad
    Nappi, Michele
    Medaglia, Carlo
    Choi, Gyu Sang
    Mehmood, Arif
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (04) : 5569 - 5585
  • [32] Convolutional Neural Network with Contextualized Word Embedding for Text Classification
    Fan, Gaoyang
    Zhu, Cui
    Zhu, Wenjun
    2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2019, 11321
  • [34] A Deep Neural Network Approach using Convolutional Network and Long Short Term Memory for Text Sentiment Classification
    Shoryu, Teragawa
    Wang, Lei
    Ma, Ruixin
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 763 - 768
  • [35] Transformer based Deep Intelligent Contextual Embedding for Twitter sentiment analysis
    Naseem, Usman
    Razzak, Imran
    Musial, Katarzyna
    Imran, Muhammad
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 113 : 58 - 69
  • [36] Comparative Analysis of Convolutional Neural Network and LSTM in Text-Based Sentiment Classification
    Kalaivani, M. S.
    Jayalakshmi, S.
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 1205 - 1211
  • [37] Deep Convolutional Network For Arabic sentiment Analysis
    Omara, Eslam
    Mosa, Mervat
    Ismail, Nabil
    2018 PROCEEDINGS OF THE INTERNATIONAL JAPAN-AFRICA CONFERENCE ON ELECTRONICS, COMMUNICATIONS, AND COMPUTATIONS (JAC-ECC 2018), 2018, : 155 - 159
  • [38] Sentiment Analysis Using Convolutional Neural Network
    Ouyang, Xi
    Zhou, Pan
    Li, Cheng Hua
    Liu, Lijun
    CIT/IUCC/DASC/PICOM 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - UBIQUITOUS COMPUTING AND COMMUNICATIONS - DEPENDABLE, AUTONOMIC AND SECURE COMPUTING - PERVASIVE INTELLIGENCE AND COMPUTING, 2015, : 2363 - 2368
  • [39] Detecting Sensitive Information of Unstructured Text Using Convolutional Neural Network
    Xu, Guosheng
    Qi, Lanuthao
    Yu, Hai
    Xu, Shengwei
    Zhao, Chunlu
    Yuan, Jing
    2019 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2019, : 474 - 479
  • [40] Character Segmentation in Text Line via Convolutional Neural Network
    Li, Xiaohe
    Zhang, Xingming
    Yang, Bin
    Xia, Siyu
    2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 1175 - 1180