A semiautomatic annotation approach for sentiment analysis

被引:10
|
作者
Alahmary, Rahma [1 ,2 ]
Al-Dossari, Hmood [1 ]
机构
[1] King Saud Univ, Informat Syst Dept, POB 145111, Riyadh 4545, Saudi Arabia
[2] Al Imam Mohammad Ibn Saud Islamic Univ, Informat Syst Dept, Riyadh, Saudi Arabia
关键词
Annotation; deep learning; machine learning; Saudi dialect; sentiment analysis; OPINION;
D O I
10.1177/01655515211006594
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sentiment analysis (SA) aims to extract users' opinions automatically from their posts and comments. Almost all prior works have used machine learning algorithms. Recently, SA research has shown promising performance in using the deep learning approach. However, deep learning is greedy and requires large datasets to learn, so it takes more time for data annotation. In this research, we proposed a semiautomatic approach using Naive Bayes (NB) to annotate a new dataset in order to reduce the human effort and time spent on the annotation process. We created a dataset for the purpose of training and testing the classifier by collecting Saudi dialect tweets. The dataset produced from the semiautomatic model was then used to train and test deep learning classifiers to perform Saudi dialect SA. The accuracy achieved by the NB classifier was 83%. The trained semiautomatic model was used to annotate the new dataset before it was fed into the deep learning classifiers. The three deep learning classifiers tested in this research were convolutional neural network (CNN), long short-term memory (LSTM) and bidirectional long short-term memory (Bi-LSTM). Support vector machine (SVM) was used as the baseline for comparison. Overall, the performance of the deep learning classifiers exceeded that of SVM. The results showed that CNN reported the highest performance. On one hand, the performance of Bi-LSTM was higher than that of LSTM and SVM, and, on the other hand, the performance of LSTM was higher than that of SVM. The proposed semiautomatic annotation approach is usable and promising to increase speed and save time and effort in the annotation process.
引用
收藏
页码:398 / 410
页数:13
相关论文
共 50 条
  • [41] A sentiment analysis approach to increase authorship identification
    Martins, Ricardo
    Almeida, Jose Joao
    Henriques, Pedro
    Novais, Paulo
    [J]. EXPERT SYSTEMS, 2021, 38 (05)
  • [42] Sentiment Analysis of Twitter Data: A Hybrid Approach
    Srivastava, Ankit
    Singh, Vijendra
    Drall, Gurdeep Singh
    [J]. INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS, 2019, 14 (02) : 1 - 16
  • [43] A Deep Learning Approach to Sentiment Analysis in Turkish
    Ciftci, Basri
    Apaydin, Mehmet Serkan
    [J]. 2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [44] A Hybrid Approach to Sentiment Analysis of News Comments
    Mukwazvure, Addlight
    Supreethi, K. P.
    [J]. 2015 4TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2015,
  • [45] A Practical Approach to Sentiment Analysis of Hindi Tweets
    Sharma, Yakshi
    Mangat, Veenu
    Kaur, Mandeep
    [J]. 2015 1ST INTERNATIONAL CONFERENCE ON NEXT GENERATION COMPUTING TECHNOLOGIES (NGCT), 2015, : 677 - 680
  • [46] Sentiment Analysis Using Lexicon Based Approach
    Singh, Vijendra
    Singh, Gurdeep
    Rastogi, Priyanka
    Deswal, Devanshi
    [J]. 2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 13 - 18
  • [47] Detection of spam reviews: a sentiment analysis approach
    Sunil Saumya
    Jyoti Prakash Singh
    [J]. CSI Transactions on ICT, 2018, 6 (2) : 137 - 148
  • [48] Multimodal Sentiment Analysis: A Multitask Learning Approach
    Fortin, Mathieu Page
    Chaib-draa, Brahim
    [J]. ICPRAM: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2019, : 368 - 376
  • [49] A Sentiment Analysis Based Approach for Customer Segmentation
    Bhatnagar A.
    Bhatia M.
    [J]. Recent Patents on Engineering, 2022, 16 (02)
  • [50] A Stacked Ensemble Approach to Bengali Sentiment Analysis
    Sarkar, Kamal
    [J]. INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2019), 2020, 11886 : 102 - 111