Word Embeddings for Arabic Sentiment Analysis

被引:0
|
作者
Altowayan, A. Aziz [1 ]
Tao, Lixin [1 ]
机构
[1] Pace Univ, Dept Comp Sci, New York, NY 10038 USA
关键词
sentiment; word embeddings;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Manual feature extraction is a challenging and time consuming task, especially in a Morphologically Rich Language (MRL) such as Arabic. In this paper, we rely on word embeddings as the main source of features for opinion mining in Arabic text such as tweets, consumer reviews, and news articles. First, we compile a large Arabic corpus from various sources to learn word representations. Second, we train and generate word vectors (embeddings) from the corpus. Third, we use the embeddings in our feature representation for training several binary classifiers to detect subjectivity and sentiment in both Standard Arabic and Dialectal Arabic. We compare our results with other methods in literature; our approach-with no hand-crafted features-achieves a slightly better accuracy than the top hand-crafted methods. To reproduce our results and for further work, we publish the data and code used in our experiments.
引用
收藏
页码:3820 / 3825
页数:6
相关论文
共 50 条
  • [41] Fine-Tuning Word Embeddings for Aspect-Based Sentiment Analysis
    Duc-Hong Pham
    Thi-Thanh-Tan Nguyen
    Anh-Cuong Le
    TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 : 500 - 508
  • [42] Word Embeddings with Fuzzy Ontology Reasoning for Feature Learning in Aspect Sentiment Analysis
    Sweidan, Asmaa Hashem
    El-Bendary, Nashwa
    Al-Feel, Haytham
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 320 - 331
  • [43] Syntax-ignorant N-gram embeddings for dialectal Arabic sentiment analysis
    Mulki, Hala
    Haddad, Hatem
    Gridach, Mourad
    Babaoglu, Ismail
    NATURAL LANGUAGE ENGINEERING, 2021, 27 (03) : 315 - 338
  • [44] Deep Hybrid Neural Networks with Improved Weighted Word Embeddings for Sentiment Analysis
    Othman, Rania
    Faiz, Rim
    Abdelsadek, Youcef
    Chelghoum, Kamel
    Kacem, Imed
    ADVANCES IN INTELLIGENT DATA ANALYSIS XIX, IDA 2021, 2021, 12695 : 50 - 62
  • [45] Cross-Domain Sentiment Classification with Word Embeddings and Canonical Correlation Analysis
    Ngo Xuan Bach
    Vu Thanh Hai
    Tu Minh Phuong
    PROCEEDINGS OF THE SEVENTH SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2016), 2016, : 159 - 166
  • [46] Syntax-Ignorant N-gram Embeddings for Sentiment Analysis of Arabic Dialects
    Mulki, Hala
    Haddad, Hatem
    Gridach, Mourad
    Babaoglu, Ismail
    FOURTH ARABIC NATURAL LANGUAGE PROCESSING WORKSHOP (WANLP 2019), 2019, : 30 - 39
  • [47] Exploring the Effect of Word Embeddings and Bag-of-Words for Vietnamese Sentiment Analysis
    Pham, Duc-Hong
    UBIQUITOUS INTELLIGENT SYSTEMS, 2022, 302 : 595 - 605
  • [48] Twitter Sentiment Analysis Experiments Using Word Embeddings on Datasets of Various Scales
    Arslan, Yusuf
    Kucuk, Dilek
    Birturk, Aysenur
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2018), 2018, 10859 : 40 - 47
  • [49] Debiasing Word Embeddings from Sentiment Associations in Names
    Hube, Christoph
    Idahl, Maximilian
    Fetahu, Besnik
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 259 - 267
  • [50] Domain Adapted Word Embeddings for Improved Sentiment Classification
    Sarma, Prathusha K.
    Liang, Yingyu
    Sethares, William A.
    DEEP LEARNING APPROACHES FOR LOW-RESOURCE NATURAL LANGUAGE PROCESSING (DEEPLO), 2018, : 51 - 59