Tunisian Dialect Sentiment Analysis: A Natural Language Processing-based Approach

被引:0
|
作者
Mulki, Hala [1 ]
Haddad, Hatem [2 ]
Ali, Chedi Bechikh [2 ]
Babaoglu, Ismail [1 ]
机构
[1] Selcuk Univ, Dept Comp Engn, Konya, Turkey
[2] Univ Libre Bruxelles, Dept Comp & Decis Engn CoDE, Brussels, Belgium
来源
COMPUTACION Y SISTEMAS | 2018年 / 22卷 / 04期
关键词
Tunisian sentiment analysis; text preprocessing; named entities;
D O I
10.13053/CyS-22-4-3009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social media platforms have been witnessing a significant increase in posts written in the Tunisian dialect since the uprising in Tunisia at the end of 2010. Most of the posted tweets or comments reflect the impressions of the Tunisian public towards social, economical and political major events. These opinions have been tracked, analyzed and evaluated through sentiment analysis systems. In the current study, we investigate the impact of several preprocessing techniques on sentiment analysis using two sentiment classification models: Supervised and lexicon-based. These models were trained on three Tunisian datasets of different sizes and multiple domains. Our results emphasize the positive impact of preprocessing phase on the evaluation measures of both sentiment classifiers as the baseline was significantly outperformed when stemming, emoji recognition and negation detection tasks were applied. Moreover, integrating named entities with these tasks enhanced the lexicon-based classification performance in all datasets and that of the supervised model in medium and small sized datasets.
引用
收藏
页码:1223 / 1232
页数:10
相关论文
共 50 条
  • [31] Indic SentiReview: Natural Language Processing based Sentiment Analysis on major Indian Languages
    Hadiya, Nidhi
    Nanavati, Nirali
    [J]. PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 322 - 327
  • [32] The Application of Natural Language Processing Technology Based on Deep Learning in Japanese Sentiment Analysis
    Zhang, Xuanxuan
    [J]. IEEE 1st International Conference on Ambient Intelligence, Knowledge Informatics and Industrial Electronics, AIKIIE 2023, 2023,
  • [33] Sentiment analysis of Japanese text and vocabulary learning based on natural language processing and SVM
    Song, Gang
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021,
  • [34] Hybrid Natural Language Processing Model for Sentiment Analysis during Natural Crisis
    Horvat, Marko
    Gledec, Gordan
    Leontic, Fran
    [J]. ELECTRONICS, 2024, 13 (10)
  • [35] Review-Based Sentiment Prediction of Rating Using Natural Language Processing Sentence-Level Sentiment Analysis with Bag-of-Words Approach
    Raju, K. Venkata
    Sridhar, M.
    [J]. FIRST INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR COMPUTATIONAL INTELLIGENCE, 2020, 1045 : 807 - 821
  • [36] Natural language processing-based assessment of consistency in summaries of product characteristics of generic antimicrobials
    Shimazawa, Rumiko
    Kano, Yoshinobu
    Ikeda, Masayuki
    [J]. PHARMACOLOGY RESEARCH & PERSPECTIVES, 2018, 6 (06):
  • [37] Neuroimaging and natural language processing-based classification of suicidal thoughts in major depressive disorder
    Lee, Dong Yun
    Byeon, Gihwan
    Kim, Narae
    Son, Sang Joon
    Park, Rae Woong
    Park, Bumhee
    [J]. TRANSLATIONAL PSYCHIATRY, 2024, 14 (01):
  • [38] A COMPARATIVE STUDY OF STATISTICAL AND NATURAL LANGUAGE PROCESSING TECHNIQUES FOR SENTIMENT ANALYSIS
    Khong, Wai-Howe
    Soon, Lay-Ki
    Goh, Hui-Ngo
    [J]. JURNAL TEKNOLOGI, 2015, 77 (18): : 155 - 161
  • [39] Medical Needs Extraction for Breast Cancer Patients from Question and Answer Services: Natural Language Processing-Based Approach
    Kamba, Masaru
    Manabe, Masae
    Wakamiya, Shoko
    Yada, Shuntaro
    Aramaki, Eiji
    Odani, Satomi
    Miyashiro, Isao
    [J]. JMIR CANCER, 2021, 7 (04):
  • [40] Natural Language Processing for Arabic Sentiment Analysis: A Systematic Literature Review
    Al Katat, Souha
    Zaki, Chamseddine
    Hazimeh, Hussein
    El Bitar, Ibrahim
    Angarita, Rafael
    Trojman, Lionel
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (05) : 576 - 594