Pre-trained Word Embeddings for Arabic Aspect-Based Sentiment Analysis of Airline Tweets

被引:21
|
作者
Ashi, Mohammed Matuq [1 ]
Siddiqui, Muazzam Ahmed [1 ]
Nadeem, Farrukh [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Informat Syst, Jeddah, Saudi Arabia
关键词
Data mining; NLP; Machine learning; Word embeddings; Sentiment analysis; Aspect-based sentiment analysis;
D O I
10.1007/978-3-319-99010-1_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, the use of word embeddings has become one of the most significant advancements in natural language processing (NLP). In this paper, we compared two word embedding models for aspect-based sentiment analysis (ABSA) of Arabic tweets. The ABSA problem was formulated as a two step process of aspect detection followed by sentiment polarity classification of the detected aspects. The compared embeddings models include fastText Arabic Wikipedia and AraVec-Web, both available as pre-trained models. Our corpus consisted of 5K airline service related tweets in Arabic, manually labeled for ABSA with imbalanced aspect categories. For classification, we used a support vector machine classifier for both, aspect detection, and sentiment polarity classification. Our results indicated that fastText Arabic Wikipedia word embeddings performed slightly better than AraVec-Web.
引用
收藏
页码:241 / 251
页数:11
相关论文
共 50 条
  • [1] A Comparative Study of Pre-trained Word Embeddings for Arabic Sentiment Analysis
    Zouidine, Mohamed
    Khalil, Mohammed
    [J]. 2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 1243 - 1248
  • [2] Sentiment analysis based on improved pre-trained word embeddings
    Rezaeinia, Seyed Mahdi
    Rahmani, Rouhollah
    Ghodsi, Ali
    Veisi, Hadi
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 117 : 139 - 147
  • [3] Learning Word Embeddings for Aspect-Based Sentiment Analysis
    Duc-Hong Pham
    Anh-Cuong Le
    Thi-Kim-Chung Le
    [J]. COMPUTATIONAL LINGUISTICS, PACLING 2017, 2018, 781 : 28 - 40
  • [4] Evaluating Pre-trained Word Embeddings and Neural Network Architectures for Sentiment Analysis in Spanish Financial Tweets
    Antonio Garcia-Diaz, Jose
    Apolinario-Arzube, Oscar
    Valencia-Garcia, Rafael
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2020, PT II, 2020, 12469 : 167 - 178
  • [5] Aspect-Based Sentiment Analysis of Social Media Data With Pre-Trained Language Models
    Troya, Anina
    Pillai, Reshmi Gopalakrishna
    Rivero, Cristian Rodriguez
    Genc, Zulkuf
    Kayal, Subhradeep
    Araci, Dogu
    [J]. 2021 5TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2021, 2021, : 8 - 17
  • [6] Incorporating Dynamic Semantics into Pre-Trained Language Model for Aspect-based Sentiment Analysis
    Zhang, Kai
    Zhang, Kun
    Zhang, Mengdi
    Zhao, Hongke
    Liu, Qi
    Wu, Wei
    Chen, Enhong
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 3599 - 3610
  • [7] Aspect-Based Sentiment Analysis in Hindi Language by Ensembling Pre-Trained mBERT Models
    Pathak, Abhilash
    Kumar, Sudhanshu
    Roy, Partha Pratim
    Kim, Byung-Gyu
    [J]. ELECTRONICS, 2021, 10 (21)
  • [8] Aspect-based Sentiment Analysis and Location Detection for Arabic Language Tweets
    AlShammari, Norah
    AlMansour, Amal
    [J]. APPLIED COMPUTER SYSTEMS, 2022, 27 (02) : 119 - 127
  • [9] Aspect-based sentiment analysis with enhanced aspect-sensitive word embeddings
    Qi, Yusi
    Zheng, Xiaoqing
    Huang, Xuanjing
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (07) : 1845 - 1861
  • [10] Aspect-based sentiment analysis with enhanced aspect-sensitive word embeddings
    Yusi Qi
    Xiaoqing Zheng
    Xuanjing Huang
    [J]. Knowledge and Information Systems, 2022, 64 : 1845 - 1861