Bhattacharya_Lab at SemEval-2023 Task 12: A Transformer-based Language Model for Sentiment Classification for Low Resource African Languages: Nigerian Pidgin and Yoruba

被引:0
|
作者
Hughes, Nathaniel [1 ]
Baker, Kevan [2 ]
Singh, Aditya [1 ]
Singh, Aryavardhan [1 ]
Dauda, Tharalillah [1 ]
Bhattacharya, Sutanu [1 ]
机构
[1] Auburn Univ, Dept Comp Sci & Comp Informat Syst, Montgomery, AL 36117 USA
[2] Florida Polytech Univ, Dept Comp Sci, Lakeland, FL USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment Analysis is an aspect of natural language processing (NLP) that has been a topic of research. While most studies focus on high-resource languages with an extensive amount of available data, the study on low-resource languages with insufficient data needs attention. To address this issue, we propose a transformer-based method for sentiment analysis for low-resources African languages, Nigerian Pidgin and Yoruba. To evaluate the effectiveness of our multilingual language models for monolingual sentiment classification, we participated in the AfriSenti SemEval shared task 2023 competition. On the official evaluation set, our group (named as Bhattacharya_Lab) ranked 1 out of 33 participating groups in the Monolingual Sentiment Classification task (i.e., Task A) for Nigerian Pidgin (i.e., Track 4), and in the Top 5 among 33 participating groups in the Monolingual Sentiment Classification task for Yoruba (i.e., Track 2) respectively, demonstrating the potential for our transformer-based language models to improve sentiment analysis in low-resource languages. Overall, our study highlights the importance of exploring the potential of NLP in low-resource languages and the impact of transformer-based multilingual language models in sentiment analysis for the low-resource African languages, Nigerian Pidgin and Yoruba.
引用
收藏
页码:1502 / 1507
页数:6
相关论文
共 24 条
  • [1] Seals_Lab at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages, Hausa and Igbo
    Raychawdhary, Nilanjana
    Das, Amit
    Dozier, Gerry
    Seals, Cheryl D.
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1508 - 1517
  • [2] ABCD Team at SemEval-2023 Task 12: An Ensemble Transformer-based System for African Sentiment Analysis
    Dang Van Thin
    Dai Ba Nguyen
    Dang Ba Qui
    Duong Ngoc Hao
    Ngan Luu-Thuy Nguyen
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 324 - 330
  • [3] SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)
    Muhammad, Shamsuddeen Hassan
    Abdulmumin, Idris
    Yimam, Seid Muhie
    Adelani, David Ifeoluwa
    Ahmad, Ibrahim Said
    Ousidhoum, Nedjma
    Ayele, Abinew Ali
    Mohammad, Saif M.
    Beloucif, Meriem
    Ruder, Sebastian
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 2319 - 2337
  • [4] NLP-LISAC at SemEval-2023 Task 12: Sentiment Analysis for Tweets expressed in African languages via Transformer-based Models
    Benlahbib, Abdessamad
    Boumhidi, Achraf
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 199 - 204
  • [5] PingAnLifeInsurance at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages with Multi-Model Fusion
    Jin, MeiZhi
    Chen, Cheng
    Zhou, MengYuan
    Yuan, MengFei
    Hou, XiaoLong
    Du, XiYang
    Jiang, LianXin
    Li, JianYu
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 679 - 685
  • [6] Uppsala University at SemEval-2023 Task12: Zero-shot Sentiment Classification for Nigerian Pidgin Tweets
    Kniele, Annika
    Beloucif, Meriem
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1491 - 1497
  • [7] Trinity at SemEval-2023 Task 12: Sentiment Analysis for Low-resource African Languages using Twitter Dataset
    Rathi, Shashank
    Pande, Siddhesh
    Atkare, Harshwardhan
    Tangsali, Rahul
    Vyawahare, Aditya
    Kadam, Dipali
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1161 - 1165
  • [8] UIO at SemEval-2023 Task 12: Multilingual fine-tuning for sentiment classification in low-resource languages
    Ronningstad, Egil
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1054 - 1060
  • [9] HausaNLP at SemEval-2023 Task 12: Leveraging African Low Resource TweetData for Sentiment Analysis
    Salahudeen, Saheed Abdullahi
    Lawan, Falalu Ibrahim
    Wali, Ahmad Mustapha
    Imam, Amina Abubakar
    Shuaibu, Aliyu Rabiu
    Yusuf, Aliyu
    Rabiu, Nur Bala
    Bello, Musa
    Adamu, Shamsuddeen Umaru
    Aliyu, Saminu Mohammad
    Gadanya, Murja Sani
    Muaz, Sanah Abdullahi
    Ahmad, Mahmoud Said
    Abdullahi, Abdulkadir
    Jamoh, Abdulmalik Yusuf
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 50 - 57
  • [10] DuluthNLP at SemEval-2023 Task 12: AfriSenti-SemEval: Sentiment Analysis for Low-resource African Languages using Twitter Dataset
    Akrah, Samuel
    Pedersen, Ted
    17TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2023, 2023, : 1697 - 1701