Sentiment Analysis of Iraqi Arabic Dialect on Facebook Based on Distributed Representations of Documents

Cited by: 22
Authors
Alnawas, Anwar [1 ,2 ]
Arici, Nursal [1 ]
Affiliations
[1] Gazi Univ, Fac Technol, Dept Comp Engn, TR-06500 Ankara, Turkey
[2] Southern Tech Univ, Nasiriyah Tech Inst, Basra, Iraq
Keywords
Doc2Vec; Iraqi Arabic Dialect; word embedding; sentiment analysis; Facebook; MEDIA
DOI
10.1145/3278605
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Nowadays, many people use social media to express their opinions on a variety of topics. Opinion Mining, or Sentiment Analysis, techniques extract opinions from user-generated content. Over the years, many Sentiment Analysis studies have addressed the English language, while research on other languages remains scarce. Arabic is one of the languages that lacks substantial research, despite the rapid growth of its use on social media outlets. Furthermore, specific Arabic dialects should be studied, not just Modern Standard Arabic. In this paper, we experiment with sentiment analysis of the Iraqi Arabic dialect using word embeddings. First, we built a large corpus from previous works to learn word representations. Second, we generated a word embedding model by training on the corpus using Doc2Vec representations based on the Paragraph and Distributed Memory Model of Paragraph Vectors (DM-PV) architecture. Lastly, the resulting feature representations were used to train four binary classifiers (Logistic Regression, Decision Tree, Support Vector Machine, and Naive Bayes) to detect sentiment. We also experimented with different values of the parameters (window size, dimension, and negative samples). The experiments show that our approach performs better with Logistic Regression and Support Vector Machine than with the other classifiers.
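The pipeline described in the abstract (a distributed-memory Doc2Vec model whose document vectors feed binary classifiers, evaluated over several window sizes, dimensions, and negative-sample counts) might be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the use of gensim and scikit-learn, the helper names parameter_grid and train_and_score, the linear SVM, and every numeric value (epochs, min_count, the grid values) are hypothetical. For brevity the embedding model is trained on the labelled training documents, whereas the paper trains it on a larger corpus assembled from previous works.

```python
from itertools import product


def parameter_grid(windows, dimensions, negatives):
    """Enumerate every (window, vector_size, negative) combination.

    The dictionary keys follow gensim's Doc2Vec keyword arguments.
    """
    return [
        {"window": w, "vector_size": d, "negative": n}
        for w, d, n in product(windows, dimensions, negatives)
    ]


def train_and_score(train_docs, train_labels, test_docs, test_labels, params):
    """Train a distributed-memory Doc2Vec model and score two classifiers.

    Each document is a list of tokens; labels are 0 (negative) or 1 (positive).
    """
    # Third-party imports are local so parameter_grid works without them.
    from gensim.models.doc2vec import Doc2Vec, TaggedDocument
    from sklearn.linear_model import LogisticRegression
    from sklearn.svm import LinearSVC

    tagged = [TaggedDocument(words=doc, tags=[i])
              for i, doc in enumerate(train_docs)]

    # dm=1 selects the distributed memory architecture.
    # epochs, min_count, and seed are illustrative values, not from the paper.
    model = Doc2Vec(dm=1, min_count=1, epochs=20, seed=1, workers=1, **params)
    model.build_vocab(tagged)
    model.train(tagged, total_examples=model.corpus_count, epochs=model.epochs)

    # Training vectors are looked up by tag; unseen documents are inferred.
    x_train = [model.dv[i] for i in range(len(train_docs))]
    x_test = [model.infer_vector(doc) for doc in test_docs]

    # A linear SVM is an assumption; the abstract does not name a kernel.
    classifiers = {
        "logistic_regression": LogisticRegression(max_iter=1000),
        "linear_svm": LinearSVC(),
    }
    scores = {}
    for name, clf in classifiers.items():
        clf.fit(x_train, train_labels)
        scores[name] = clf.score(x_test, test_labels)  # accuracy
    return scores
```

A grid such as parameter_grid(windows=[5, 10], dimensions=[100, 300], negatives=[5, 10]) yields eight configurations, each of which could be passed to train_and_score as params so that classifier accuracy is compared across settings.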
Pages: 17