Foul at SemEval-2023 Task 12: MARBERT Language model and lexical filtering for sentiments analysis of tweets in Algerian Arabic

被引:0
|
作者
Belbachir, Faiza [1 ]
机构
[1] IPSA Ecole Ingenieurs Aeronaut & Spatiale Paris, 63 Bd Brandebourg Bis, F-94200 Ivry, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the system we designed for our participation in SemEval-2023 Task 12 Track 6 about Algerian dialect sentiment analysis. We propose a transformer language model approach combined with a lexicon mixing terms and emojis which is used in a postprocessing filtering stage. The Algerian sentiment lexicons were extracted manually from tweets. We report on our experiments on the Algerian dialect, where we compare the performance of MARBERT to the one of ArabicBERT and CAMeLBERT on the training and development datasets of Task 12. We also analyze the contribution of our post-processing lexical filtering for sentiment analysis. Our system obtained an F1 score equal to 70%, ranking 9th among 30 participants.
引用
收藏
页码:389 / 396
页数:8
相关论文
共 38 条
  • [31] UM6P at SemEval-2023 Task 12: Out-Of-Distribution Generalization Method for African Languages Sentiment Analysis
    El Mahdaouy, Abdelkader
    Alami, Hamza
    Lamsiyah, Salima
    Berrada, Ismail
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1004 - 1010
  • [32] SinaAI at SemEval-2023 Task 3: A Multilingual Transformer Language Model-based Approach for the Detection of News Genre, Framing and Persuasion Techniques
    Sadeghi, Aryan
    Alipour, Reza
    Taeb, Kamyar
    Morassafar, Parimehr
    Salemahim, Nima
    Asgari, Ehsaneddin
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2168 - 2173
  • [33] CPIC at SemEval-2023 Task 7: GPT2-based Model for Multi-evidence Natural Language Inference for Clinical Trial Data
    Huang, Mingtong
    Ren, Junxiang
    Liu, Lang
    Song, Ruilin
    Yin, Wenbo
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 397 - 401
  • [34] Azaad@BND at SemEval-2023 Task 2: How to Go from a Simple Transformer Model to a Better Model to Get Better Results in Natural Language Processing
    Ahmadi, Reza
    Arefi, Shiva
    Jafarabad, Mohammad
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2184 - 2187
  • [35] TAM of SCNU at SemEval-2023 Task 1: FCLL: A Fine-grained Contrastive Language-Image Learning Model for Cross-language VisualWord Sense Disambiguation
    Yang, Qihao
    Li, Yong
    Wang, Xuelin
    Li, Shunhao
    Hao, Tianyong
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 506 - 511
  • [36] ZBL2W at SemEval-2023 Task 9: A Multilingual Fine-tuning Model with Data Augmentation for Tweet Intimacy Analysis
    Zhang, Hao
    Wu, Youlin
    Lu, Junyu
    Bai, Zewen
    Wu, Jiangming
    Lin, Hongfei
    Zhang, Shaowu
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 770 - 775
  • [37] YNU-HPCC at SemEval-2023 Task7: Multi-evidence Natural Language Inference for Clinical Trial Data based on a BioBERT Model
    Feng, Chao
    Wang, Jin
    Zhang, Xuejie
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 664 - 670
  • [38] JCT at SemEval-2023 Tasks 12A and 12B: Sentiment Analysis for Tweets Written in Low-resource African Languages using Various Machine Learning and Deep Learning Methods, Resampling, and HyperParameter Tuning
    Keinan, Ron
    HaCohen-Kerner, Yaakov
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 365 - 378