Trinity at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages using Twitter Dataset

Cited by: 0
Authors
Rathi, Shashank [1]
Pande, Siddhesh [1]
Atkare, Harshwardhan [1]
Tangsali, Rahul [1]
Vyawahare, Aditya [1]
Kadam, Dipali [1]
Affiliation
[1] PICT, Pune, Maharashtra, India
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper summarizes our findings on sentiment analysis for three African languages among the 17 languages in the shared task: Hausa, Yoruba, and Swahili. We applied logistic regression, SVM, random forest, and mBERT models, together with several data-preprocessing and oversampling techniques. Model performance was evaluated using weighted-average and macro-average F1 scores. The best score pairs obtained for Hausa, Yoruba, and Swahili are (76.53, 76.55), (74.83, 73.15), and (57.79, 48.59), respectively.
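To make the two evaluation metrics concrete, the following is a minimal sketch of how macro-average and weighted-average F1 differ on a three-class sentiment task. It is not the authors' evaluation code: the function names and the toy labels are assumptions chosen for illustration, and the numbers it produces are unrelated to the scores reported in the abstract.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """Return a dict mapping each label to its F1 score."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = {}
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1: every class counts equally."""
    scores = per_class_f1(y_true, y_pred)
    return sum(scores.values()) / len(scores)


def weighted_f1(y_true, y_pred):
    """Mean of per-class F1 weighted by each class's count in y_true."""
    scores = per_class_f1(y_true, y_pred)
    support = Counter(y_true)
    return sum(scores[label] * support[label] for label in scores) / len(y_true)


# Hypothetical, imbalanced toy labels (illustration only).
y_true = ["positive", "positive", "positive", "positive", "neutral", "negative"]
y_pred = ["positive", "positive", "positive", "neutral", "neutral", "negative"]

print(f"macro F1:    {macro_f1(y_true, y_pred):.4f}")
print(f"weighted F1: {weighted_f1(y_true, y_pred):.4f}")
```

Because the weighted average scales each class by its frequency, it tends to track performance on the majority class, whereas the macro average penalizes poor performance on rare classes equally. A large gap between the two scores is therefore often a sign of class imbalance.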
Pages: 1161-1165
Page count: 5