Sentiment Analysis of Lithuanian Texts Using Traditional and Deep Learning Approaches

被引:35
|
作者
Kapociute-Dzikiene, Jurgita [1 ]
Damasevicius, Robertas [2 ]
Wozniak, Marcin [3 ]
机构
[1] Vytautas Magnus Univ, Fac Informat, K Donelaicio 58, LT-44248 Kaunas, Lithuania
[2] Kaunas Univ Technol, Dept Software Engn, K Donelaicio 73, LT-44249 Kaunas, Lithuania
[3] Silesian Tech Univ, Inst Math, Kaszubska 23, PL-44100 Gliwice, Poland
关键词
sentiment analysis; machine learning; deep learning; neural word embeddings; Internet comments; Lithuanian language; NETWORK;
D O I
10.3390/computers8010004
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We describe the sentiment analysis experiments that were performed on the Lithuanian Internet comment dataset using traditional machine learning (Naive Bayes Multinomial-NBM and Support Vector Machine-SVM) and deep learning (Long Short-Term Memory-LSTM and Convolutional Neural Network-CNN) approaches. The traditional machine learning techniques were used with the features based on the lexical, morphological, and character information. The deep learning approaches were applied on the top of two types of word embeddings (Vord2Vec continuous bag-of-words with negative sampling and FastText). Both traditional and deep learning approaches had to solve the positive/negative/neutral sentiment classification task on the balanced and full dataset versions. The best deep learning results (reaching 0.706 of accuracy) were achieved on the full dataset with CNN applied on top of the FastText embeddings, replaced emoticons, and eliminated diacritics. The traditional machine learning approaches demonstrated the best performance (0.735 of accuracy) on the full dataset with the NBM method, replaced emoticons, restored diacritics, and lemma unigrams as features. Although traditional machine learning approaches were superior when compared to the deep learning methods; deep learning demonstrated good results when applied on the small datasets.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Deep learning for sentiment analysis: successful approaches and future challenges
    Tang, Duyu
    Qin, Bing
    Liu, Ting
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 5 (06) : 292 - 303
  • [22] Sentiment Analysis of Tweets Using Deep Learning
    Ranganathan, Jaishree
    Tsahai, Tsega
    [J]. ADVANCED DATA MINING AND APPLICATIONS (ADMA 2022), PT I, 2022, 13725 : 106 - 117
  • [23] Sentiment Analysis using Deep Learning in Cloud
    Raza, Muhammad Raheel
    Hussain, Walayat
    Tanyildizi, Erkan
    Varol, Asaf
    [J]. 9TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSICS AND SECURITY (ISDFS'21), 2021,
  • [24] Image Sentiment Analysis using Deep Learning
    Mittal, Namita
    Sharma, Divya
    Joshi, Manju Lata
    [J]. 2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 684 - 687
  • [25] Multimodal Sentiment Analysis Using Deep Learning
    Sharma, Rakhee
    Le Ngoc Tan
    Sadat, Fatiha
    [J]. 2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 1475 - 1478
  • [26] Sentiment Classification for Financial Texts Based on Deep Learning
    Dong, Shanshan
    Liu, Chang
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [27] A Comparative Evaluation of Traditional Machine Learning and Deep Learning Classification Techniques for Sentiment Analysis
    Dhola, Kaushik
    Saradva, Mann
    [J]. 2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 932 - 936
  • [28] Sentiment Analysis Using Deep Learning Approaches on Multi-Domain Dataset in Telugu Language
    Chattu, Kannaiah
    Sumathi, D.
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2024, 23 (03)
  • [29] Aspect-oriented extraction and sentiment analysis using optimized hybrid deep learning approaches
    Kotagiri, Srividya
    Sowjanya, A. Mary
    Anilkumar, B.
    Devi, N Lakshmi
    [J]. Multimedia Tools and Applications, 2024, 83 (41) : 88613 - 88644
  • [30] Sentiment analysis on cross-domain textual data using classical and deep learning approaches
    K. Paramesha
    H. L. Gururaj
    Anand Nayyar
    K. C. Ravishankar
    [J]. Multimedia Tools and Applications, 2023, 82 : 30759 - 30782