PingAnLifeInsurance at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages with Multi-Model Fusion

Cited by: 0
Authors
Jin, MeiZhi [1]
Chen, Cheng [1]
Zhou, MengYuan [1]
Yuan, MengFei [1]
Hou, XiaoLong [1]
Du, XiYang [1]
Jiang, LianXin [1]
Li, JianYu [1]
Affiliations
[1] Ping An Life Insurance Co China Ltd, Shenzhen, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper describes our system for SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages using Twitter Dataset. The AfriSenti-SemEval shared task is based on a collection of Twitter datasets in 14 African languages for sentiment classification and consists of three subtasks. Task A is monolingual sentiment classification covering 12 African languages. Task B is multilingual sentiment classification that combines the training data from Task A (12 African languages). Task C is zero-shot sentiment classification. We used several strategies, including monolingual training, multilingual mixed training, and translation, and propose a weighted voting method that combines the results of the different strategies. In the monolingual subtask, our system achieved Top-1 in two languages (Yoruba and Twi) and Top-2 in four tracks (Nigerian Pidgin, Algerian Arabic, Swahili, and Multilingual). In the multilingual subtask, our system achieved Top-2 on the public leaderboard.
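The abstract does not give the details of the weighted voting method, so the following is only a minimal sketch of one common form of weighted hard voting over per-strategy label predictions. The function name `weighted_vote`, the strategy names, the weights, and the alphabetical tie-break are all illustrative assumptions, not the authors' implementation.

```python
from collections import defaultdict


def weighted_vote(predictions, weights):
    """Fuse per-strategy label predictions by weighted hard voting.

    predictions: dict mapping a strategy name to a list of labels,
                 one label per example (all lists the same length).
    weights:     dict mapping a strategy name to a non-negative weight.
    Both the strategy names and the weights are hypothetical here.
    """
    if not predictions:
        raise ValueError("at least one strategy is required")
    lengths = {len(labels) for labels in predictions.values()}
    if len(lengths) != 1:
        raise ValueError("all strategies must label the same examples")

    fused = []
    for i in range(lengths.pop()):
        scores = defaultdict(float)
        for name, labels in predictions.items():
            # Each strategy adds its weight to the label it predicted.
            scores[labels[i]] += weights[name]
        # Highest total weight wins; ties fall back to alphabetical order
        # (an arbitrary but deterministic choice made for this sketch).
        fused.append(max(sorted(scores), key=scores.get))
    return fused


# Illustrative inputs only: three strategies named after those in the abstract.
predictions = {
    "monolingual": ["positive", "negative", "neutral"],
    "multilingual": ["positive", "neutral", "neutral"],
    "translation": ["negative", "neutral", "positive"],
}
weights = {"monolingual": 0.4, "multilingual": 0.35, "translation": 0.25}

fused = weighted_vote(predictions, weights)
print(fused)
```

In practice the weights would typically be tuned on a development set, for example in proportion to each strategy's validation score; that choice is likewise an assumption rather than something the abstract states.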
Pages: 679-685
Page count: 7