Improving sentiment domain adaptation for Arabic using an unsupervised self-labeling framework

被引:5
|
作者
Alqahtani, Yathrib [1 ,2 ,3 ]
Al-Twairesh, Nora [1 ,4 ]
Alsanad, Ahmed [1 ,3 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, STCs Artificial Intelligence Chair, Riyadh, Saudi Arabia
[2] Saudi Elect Univ, Coll Comp & Informat, Dept Informat Technol, Riyadh, Saudi Arabia
[3] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh, Saudi Arabia
[4] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Technol, Riyadh, Saudi Arabia
关键词
Domain adaptation; Sentiment classification; Arabic language; Self; -labeling; Lexicon -based classification;
D O I
10.1016/j.ipm.2023.103338
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Numerous domain adaptation methods have been proposed over the last decade, of which the most widely used methods have become popular owing to their generality in terms of tasks or language. While generality fails to consider language-specific issues, sentiment-specific adapta-tion methods rely on language-specific high-quality resources such as tagging tools or sentiment lexicons. This study proposes a resource-free unsupervised self-labeling adaptation framework for Arabic sentiment classification. By leveraging the sentiment-specific task of lexicon induction using a combination of feature selection methods and an improved hybrid word pairwise simi-larity technique, the proposed framework proved to be less sensitive to the issue of Arabic feature sparsity. A total of 12 traditional and 12 transformer-based experiments on two Arabic multi -domain datasets adapted in the proposed framework demonstrated that a simple yet effective unsupervised self-labeling approach outperformed complex representation learning adaptation approaches for the Arabic language. The proposed framework showed an improvement over the best-performing method by 2% on a dataset of reviews and competitive results on a dataset of tweets.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] AdaSL: An Unsupervised Domain Adaptation framework for Arabic multi-dialectal Sequence Labeling
    El Mekki, Abdellah
    El Mahdaouy, Abdelkader
    Berrada, Ismail
    Khoumsi, Ahmed
    [J]. Information Processing and Management, 2022, 59 (04):
  • [2] AdaSL: An Unsupervised Domain Adaptation framework for Arabic multi-dialectal Sequence Labeling
    El Mekki, Abdellah
    El Mahdaouy, Abdelkader
    Berrada, Ismail
    Khoumsi, Ahmed
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (04)
  • [3] Self-Labeling Framework for Open-Set Domain Adaptation With Few Labeled Samples
    Yu, Qing
    Irie, Go
    Aizawa, Kiyoharu
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1474 - 1487
  • [4] Self-labeling methods for unsupervised transfer ranking
    Li, Pengfei
    Sanderson, Mark
    Carman, Mark
    Scholer, Falk
    [J]. INFORMATION SCIENCES, 2020, 516 : 293 - 315
  • [5] ITERATIVE SELF-LABELING DOMAIN ADAPTATION FOR LINEAR STRUCTURED IMAGE CLASSIFICATION
    Habrard, Amaury
    Peyrache, Jean-Philippe
    Sebban, Marc
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2013, 22 (05)
  • [6] Renewing Iterative Self-Labeling Domain Adaptation With Application to the Spine Motion Prediction
    Chen, Gecheng
    Zhou, Yu
    Zhang, Xudong
    Tuo, Rui
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 21 (03) : 1 - 11
  • [7] Domain adaptation of weighted majority votes via perturbed variation-based self-labeling
    Morvant, Emilie
    [J]. PATTERN RECOGNITION LETTERS, 2015, 51 : 37 - 43
  • [8] Joint Feature and Labeling Function Adaptation for Unsupervised Domain Adaptation
    Cui, Fengli
    Chen, Yinghao
    Du, Yuntao
    Cao, Yikang
    Wang, Chongjun
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT I, 2022, 13280 : 432 - 446
  • [9] Unsupervised Unstained Cell Detection by SIFT Keypoint Clustering and Self-labeling Algorithm
    Mualla, Firas
    Schoell, Simon
    Sommerfeldt, Bjoern
    Maier, Andreas
    Steidl, Stefan
    Buchholz, Rainer
    Hornegger, Joachim
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2014, PT III, 2014, 8675 : 377 - 384
  • [10] Self-Labeling Framework for Novel Category Discovery over Domains
    Yu, Qing
    Ikami, Daiki
    Irie, Go
    Aizawa, Kiyoharu
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3161 - 3169