Sentiment Analysis of Lithuanian Texts Using Traditional and Deep Learning Approaches

Cited by: 35
Authors
Kapociute-Dzikiene, Jurgita [1]
Damasevicius, Robertas [2]
Wozniak, Marcin [3]
Affiliations
[1] Vytautas Magnus Univ, Fac Informat, K Donelaicio 58, LT-44248 Kaunas, Lithuania
[2] Kaunas Univ Technol, Dept Software Engn, K Donelaicio 73, LT-44249 Kaunas, Lithuania
[3] Silesian Tech Univ, Inst Math, Kaszubska 23, PL-44100 Gliwice, Poland
Keywords
sentiment analysis; machine learning; deep learning; neural word embeddings; Internet comments; Lithuanian language; NETWORK;
DOI
10.3390/computers8010004
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
We describe sentiment analysis experiments performed on a Lithuanian Internet comment dataset using traditional machine learning approaches (Naive Bayes Multinomial, NBM, and Support Vector Machine, SVM) and deep learning approaches (Long Short-Term Memory, LSTM, and Convolutional Neural Network, CNN). The traditional machine learning techniques used features based on lexical, morphological, and character information. The deep learning approaches were applied on top of two types of word embeddings (Word2Vec continuous bag-of-words with negative sampling, and FastText). Both families of approaches were evaluated on a positive/negative/neutral sentiment classification task, on both the balanced and the full versions of the dataset. The best deep learning result (an accuracy of 0.706) was achieved on the full dataset with a CNN applied on top of FastText embeddings, with emoticons replaced and diacritics eliminated. The traditional machine learning approaches performed best (an accuracy of 0.735) on the full dataset with the NBM method, with emoticons replaced, diacritics restored, and lemma unigrams as features. Although the traditional machine learning approaches outperformed the deep learning methods, deep learning demonstrated good results when applied to small datasets.
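The two preprocessing steps named in the abstract, emoticon replacement and diacritics elimination, can be illustrated with a minimal sketch. The emoticon map and the placeholder token names below are hypothetical and are not taken from the paper; diacritics are stripped by Unicode decomposition, which is one common way to do it and not necessarily the authors' procedure.

```python
import unicodedata

# Hypothetical emoticon-to-token map (an assumption for illustration only;
# the replacement list used in the paper is not given in this record).
EMOTICON_TOKENS = {
    ":)": "emo_positive",
    ":-)": "emo_positive",
    ":D": "emo_positive",
    ":(": "emo_negative",
    ":-(": "emo_negative",
}


def replace_emoticons(text: str) -> str:
    """Replace known emoticons with placeholder word tokens."""
    # Process longer emoticons first so a shorter one can never
    # consume part of a longer one.
    for emoticon in sorted(EMOTICON_TOKENS, key=len, reverse=True):
        text = text.replace(emoticon, f" {EMOTICON_TOKENS[emoticon]} ")
    # Collapse the extra whitespace introduced by the padding above.
    return " ".join(text.split())


def strip_diacritics(text: str) -> str:
    """Eliminate diacritics, e.g. map Lithuanian 'č' to 'c' and 'ū' to 'u'."""
    # NFD splits each accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Drop the combining marks and keep the base letters.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return unicodedata.normalize("NFC", stripped)


def preprocess(text: str) -> str:
    """Apply emoticon replacement, then diacritics elimination, then lowercasing."""
    return strip_diacritics(replace_emoticons(text)).lower()
```

For example, `preprocess("Labai gražu :)")` yields a lowercase, diacritics-free string whose emoticon has become a word token, which a unigram feature extractor or an embedding lookup can then treat like any other word.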
Pages: 16
Related papers
  • [1] Sentiment Analysis of Lithuanian Texts Using Deep Learning Methods
    Kapociute-Dzikiene, Jurgita
    Damasevicius, Robertas
    Wozniak, Marcin
    [J]. INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2018, 2018, 920 : 521 - 532
  • [2] Comparison of Deep Learning Approaches for Lithuanian Sentiment Analysis
    Kapociute-Dzikiene, Jurgita
    Salimbajevs, Askars
    [J]. BALTIC JOURNAL OF MODERN COMPUTING, 2022, 10 (03): : 283 - 294
  • [3] Sentiment Analysis using Deep Learning on Persian Texts
    Roshanfekr, Behnam
    Khadivi, Shahram
    Rahmati, Mohammad
    [J]. 2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1503 - 1508
  • [4] Sentiment analysis using deep learning approaches: an overview
    Habimana, Olivier
    Li, Yuhua
    Li, Ruixuan
    Gu, Xiwu
    Yu, Ge
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (01)
  • [5] Sentiment analysis using deep learning approaches: an overview
    Olivier HABIMANA
    Yuhua LI
    Ruixuan LI
    Xiwu GU
    Ge YU
    [J]. Science China(Information Sciences), 2020, 63 (01) : 21 - 56
  • [6] Sentiment analysis using deep learning approaches: an overview
    Olivier Habimana
    Yuhua Li
    Ruixuan Li
    Xiwu Gu
    Ge Yu
    [J]. Science China Information Sciences, 2020, 63
  • [7] Amharic political sentiment analysis using deep learning approaches
    Fikirte Alemayehu
    Million Meshesha
    Jemal Abate
    [J]. Scientific Reports, 13
  • [8] Amharic political sentiment analysis using deep learning approaches
    Alemayehu, Fikirte
    Meshesha, Million
    Abate, Jemal
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [9] Deep Learning Approach for Sentiment Analysis of Short Texts
    Hassan, Abdalraouf
    Mahmood, Ausif
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2017, : 705 - 710
  • [10] Deep learning approaches for Arabic sentiment analysis
    Mohammed, Ammar
    Kora, Rania
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2019, 9 (01)