Foul at SemEval-2023 Task 12: MARBERT Language model and lexical filtering for sentiments analysis of tweets in Algerian Arabic

被引:0
|
作者
Belbachir, Faiza [1 ]
机构
[1] IPSA Ecole Ingenieurs Aeronaut & Spatiale Paris, 63 Bd Brandebourg Bis, F-94200 Ivry, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the system we designed for our participation in SemEval-2023 Task 12 Track 6 about Algerian dialect sentiment analysis. We propose a transformer language model approach combined with a lexicon mixing terms and emojis which is used in a postprocessing filtering stage. The Algerian sentiment lexicons were extracted manually from tweets. We report on our experiments on the Algerian dialect, where we compare the performance of MARBERT to the one of ArabicBERT and CAMeLBERT on the training and development datasets of Task 12. We also analyze the contribution of our post-processing lexical filtering for sentiment analysis. Our system obtained an F1 score equal to 70%, ranking 9th among 30 participants.
引用
收藏
页码:389 / 396
页数:8
相关论文
共 38 条
  • [21] Seals_Lab at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages, Hausa and Igbo
    Raychawdhary, Nilanjana
    Das, Amit
    Dozier, Gerry
    Seals, Cheryl D.
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1508 - 1517
  • [22] Howard University Computer Science at SemEval-2023 Task 12: A 2-Step System Design for Multilingual Sentiment Classification with Language Identification
    Aryal, Saurav
    Prioleau, Howard
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2153 - 2159
  • [23] UMUTeam and SINAI at SemEval-2023 Task 9: Multilingual Tweet Intimacy Analysis using Multilingual Large Language Models and Data Augmentation
    Garcia-Diaz, Jose Antonio
    Pan, Ronghao
    Jimenez Zafra, Salud Maria
    Martin-Valdivia, Maria-Teresa
    Urena-Lopez, L. Alfonso
    Valencia-Garcia, Rafael
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 293 - 299
  • [24] Bhattacharya_Lab at SemEval-2023 Task 12: A Transformer-based Language Model for Sentiment Classification for Low Resource African Languages: Nigerian Pidgin and Yoruba
    Hughes, Nathaniel
    Baker, Kevan
    Singh, Aditya
    Singh, Aryavardhan
    Dauda, Tharalillah
    Bhattacharya, Sutanu
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1502 - 1507
  • [25] Team ISCL_WINTER at SemEval-2023 Task 12:AfriSenti-SemEval: Sentiment Analysis for Low-resource African Languages using Twitter Dataset
    Hancharova, Alina
    Wang, John
    Kumar, Mayank
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1085 - 1089
  • [26] Masakhane-Afrisenti at SemEval-2023 Task 12: Sentiment Analysis using Afro-centric Language Models and Adapters for Low-resource African Languages
    Azime, Israel Abebe
    Al-Azzawi, Sana Sabah
    Lambebo Tonja, Atnafu
    Shode, Iyanuoluwa
    Alabi, Jesujoba
    Awokoya, Ayodele
    Oduwole, Mardiyyah
    Adewumi, Tosin
    Fanijo, Samuel
    Oyinkansola, Awosan
    Yousuf, Oreen
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1311 - 1316
  • [27] UCAS-IIE-NLP at SemEval-2023 Task 12: Enhancing Generalization of Multilingual BERT for Low-resource Sentiment Analysis
    Hu, Dou
    Wei, Lingwei
    Liu, Yaxin
    Zhou, Wei
    Hu, Songlin
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1849 - 1857
  • [28] GunadarmaXBRIN at SemEval-2023 Task 12: Utilization of SVM and AfriBERTa for Monolingual, Multilingual, and Zero-shot Sentiment Analysis in African Languages
    Arlim, Novitasari
    Riyanto, Slamet
    Rodiah, Rodiah
    Siagian, Al Hafiz Akbar Maulana
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 869 - 877
  • [29] DCU at SemEval-2023 Task 10: A Comparative Analysis of Encoder-only and Decoder-only Language Models with Insights into Interpretability
    Verma, Kanishk
    Adebayo, Kolawole
    Wagner, Joachim
    Davis, Brian
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1736 - 1750
  • [30] HW-TSC at SemEval-2023 Task 7: Exploring the Natural Language Inference Capabilities of ChatGPT and Pre-trained Language Model for Clinical Trial
    Zhao, Xiaofeng
    Zhang, Min
    Ma, MiaoMiao
    Su, Chang
    Liu, Yilun
    Wang, Minghan
    Qiao, Xiaosong
    Guo, Jiaxin
    Li, Yinglu
    Ma, Wenbing
    Tao, Shimin
    Yang, Hao
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1603 - 1608