Evaluating Ensembled Transformers for Multilingual Code-Switched Sentiment Analysis

Authors
Aryal, Saurav K. [1]
Prioleau, Howard [1]
Washington, Gloria [1]
Burge, Legand [1]
Affiliations
[1] Howard Univ, Comp Sci, Washington, DC 20059 USA
Source
2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023 | 2023
Funding
National Institutes of Health (USA);
Keywords
Code Switching; Ensembling; BERT; Transformers;
DOI
10.1109/CSCI62032.2023.00032
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Sentiment analysis is essential for understanding human-authored texts, especially in multilingual communities where code-switching is common. Most existing research focuses on sentiment analysis for a single language pair. We introduce a three-step approach for sentiment analysis on code-switched data: translating the code-switched data into English at the word and sentence levels, training Transformer models on the translated data, and using a stacking classifier to ensemble the models for sentiment classification. We establish a performance benchmark for binary and ternary sentiment classification by applying this approach to five datasets featuring English mixed with Spanish, Tamil, Telugu, Hindi, and Malayalam. Our method highlights the potential of ensembled Transformer models in this domain, paving the way for future advancements.
Pages: 165-173
Page count: 9
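
The third step named in the abstract, stacking, can be illustrated with a minimal sketch. It is not the authors' implementation: the record does not say which meta-learner or features were used, so the logistic-regression meta-classifier, the function names, and the probability values below are all assumptions made for illustration. Each row stands for the positive-class probabilities that three hypothetical fine-tuned Transformer models assign to one held-out (already translated) sentence, and the meta-classifier learns how to combine them for binary sentiment.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real-valued score to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def fit_meta_classifier(features, labels, epochs=2000, lr=0.5):
    """Fit a logistic-regression meta-classifier by batch gradient descent.

    features: one row per sentence, one column per base model
              (that model's predicted positive-class probability).
    labels:   gold sentiment labels, 1 = positive, 0 = negative.
    """
    n_rows = len(features)
    n_cols = len(features[0])
    weights = [0.0] * n_cols
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_cols
        grad_b = 0.0
        for x, y in zip(features, labels):
            p = sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
            err = p - y  # gradient of the log loss w.r.t. the logit
            for j in range(n_cols):
                grad_w[j] += err * x[j]
            grad_b += err
        weights = [w - lr * g / n_rows for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n_rows
    return weights, bias


def predict(weights, bias, x):
    """Return the ensembled label (1 or 0) for one row of base-model outputs."""
    score = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if sigmoid(score) >= 0.5 else 0


# Made-up held-out predictions from three hypothetical base models.
# These numbers are illustrative only and do not come from the paper.
base_model_probs = [
    [0.9, 0.8, 0.6],
    [0.7, 0.9, 0.4],
    [0.8, 0.6, 0.7],
    [0.6, 0.7, 0.9],
    [0.2, 0.3, 0.4],
    [0.4, 0.1, 0.3],
    [0.3, 0.4, 0.1],
    [0.1, 0.2, 0.5],
]
gold_labels = [1, 1, 1, 1, 0, 0, 0, 0]

weights, bias = fit_meta_classifier(base_model_probs, gold_labels)

# Fraction of the held-out rows the stacked model labels correctly.
train_accuracy = sum(
    predict(weights, bias, x) == y
    for x, y in zip(base_model_probs, gold_labels)
) / len(gold_labels)
print("meta-classifier accuracy on the toy rows:", train_accuracy)
```

In a real pipeline the rows would come from base models evaluated on data they were not trained on, so that the meta-classifier does not simply learn to trust overfitted predictions. Ternary classification would need a multiclass meta-learner over per-class probabilities rather than the single positive-class column used here.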