Trinity at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages using Twitter Dataset

被引:0
|
作者
Rathi, Shashank [1 ]
Pande, Siddhesh [1 ]
Atkare, Harshwardhan [1 ]
Tangsali, Rahul [1 ]
Vyawahare, Aditya [1 ]
Kadam, Dipali [1 ]
机构
[1] PICT, Pune, Maharashtra, India
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a summary of our findings obtained on sentiment analysis of 3 African languages among the 17 languages mentioned in the shared task. We carried out a sentiment analysis on Hausa, Yoruba, and Swahili. The models used here for the mentioned task were logistic regression, SVM, RandomForest, and mBERT along with a few data-preprocessing and oversampling techniques. The performance of the models used was evaluated by considering weighted average and macro average F1 scores as metrics. The best set of scores obtained on the languages Hausa, Yoruba and Swahili are (76.53, 76.55), (74.83, 73.15) and (57.79, 48.59) respectively.
引用
收藏
页码:1161 / 1165
页数:5
相关论文
共 50 条
  • [31] UniSent: Universal Sentiment Analysis System for Low-Resource Languages
    Jabreel, Mohammed
    Maaroof, Najlaa
    Valls, Aida
    Moreno, Antonio
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2019, 319 : 387 - 396
  • [32] Comparative Analysis of Transformer Models for Sentiment Analysis in Low-Resource Languages
    Aliyu, Yusuf
    Sarlan, Aliza
    Danyaro, Kamaluddeen Usman
    Rahman, Abdulahi Sani B. A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (04) : 353 - 364
  • [33] Uppsala University at SemEval-2023 Task12: Zero-shot Sentiment Classification for Nigerian Pidgin Tweets
    Kniele, Annika
    Beloucif, Meriem
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1491 - 1497
  • [34] Examining Sentiment Analysis for Low-Resource Languages with Data Augmentation Techniques
    Thakkar, Gaurish
    Preradovic, Nives Mikelic
    Tadic, Marko
    ENG, 2024, 5 (04): : 2920 - 2942
  • [35] ChaPat at SemEval-2023 Task 9: Text Intimacy Analysis using Ensembles of Multilingual Transformers
    Chavan, Tanmay
    Patwardhan, Ved
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1304 - 1310
  • [36] Enhancing Sentiment Analysis in Amharic: Leveraging Transformer-Based Language Model for Low-Resource African Languages
    Raychawdhary, Nilanjana
    Das, Amit
    Bhattacharya, Sutanu
    Dozier, Gerry
    Seals, Cheryl D.
    SOUTHEASTCON 2024, 2024, : 50 - 55
  • [37] Building lexicon-based sentiment analysis model for low-resource languages
    Mohammed, Idi
    Prasad, Rajesh
    METHODSX, 2023, 11
  • [38] Howard University Computer Science at SemEval-2023 Task 12: A 2-Step System Design for Multilingual Sentiment Classification with Language Identification
    Aryal, Saurav
    Prioleau, Howard
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2153 - 2159
  • [39] Foul at SemEval-2023 Task 12: MARBERT Language model and lexical filtering for sentiments analysis of tweets in Algerian Arabic
    Belbachir, Faiza
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 389 - 396
  • [40] Deep Persian sentiment analysis: Cross-lingual training for low-resource languages
    Ghasemi, Rouzbeh
    Ashrafi Asli, Seyed Arad
    Momtazi, Saeedeh
    JOURNAL OF INFORMATION SCIENCE, 2022, 48 (04) : 449 - 462