Evaluating Ensembled Transformers for Multilingual Code-Switched Sentiment Analysis

被引:0
|
作者
Aryal, Saurav K. [1 ]
Prioleau, Howard [1 ]
Washington, Gloria [1 ]
Burge, Legand [1 ]
机构
[1] Howard Univ, Comp Sci, Washington, DC 20059 USA
基金
美国国家卫生研究院;
关键词
Code Switching; Ensembling; BERT; Transformers;
D O I
10.1109/CSCI62032.2023.00032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis is essential for understanding human-authored texts, especially in multilingual communities where code-switching is common. Most existing research focuses on single-language pair sentiment analysis. We introduce a three-step approach for sentiment analysis on code-switched data: translating the code-switched data into English at word and sentence levels, training on Transformer models, and utilizing a stacking classifier to ensemble the models for sentiment classification. We establish a performance benchmark for binary and ternary sentiment classification by applying this to five datasets featuring English mixed with Spanish, Tamil, Telugu, Hindi, and Malayalam. Our method emphasizes the potential of ensembled Transformer models in this domain, paving the way for future advancements.
引用
收藏
页码:165 / 173
页数:9
相关论文
共 50 条
  • [21] Normalization of code-switched text for speech synthesis
    Manghat, Sreeram
    Manghat, Sreeja
    Schultz, Tanja
    INTERSPEECH 2022, 2022, : 4297 - 4301
  • [22] Machine Translation on a Parallel Code-Switched Corpus
    Menacer, M. A.
    Langlois, D.
    Jouvet, D.
    Fohr, D.
    Mella, O.
    Smaili, K.
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 426 - 432
  • [23] A Deep Generative Model for Code-Switched Text
    Samanta, Bidisha
    Reddy, Sharmila
    Jagirdar, Hussain
    Ganguly, Niloy
    Chakrabarti, Soumen
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 5175 - 5181
  • [24] Improving Pretraining Techniques for Code-Switched NLP
    Das, Richeek
    Ranjan, Sahasra
    Pathak, Shreya
    Jyothi, Preethi
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 1176 - 1191
  • [25] CODE-SWITCHED INSERTIONS IN THE SPEECH OF A BILINGUAL CHILD
    Chirsheva, Galina N.
    Korovushkin, Petr, V
    PHILOLOGICAL CLASS, 2020, 25 (02): : 236 - 246
  • [26] Predicting vowel substitution in code-switched speech
    Modipa, Thipe I.
    Davel, Marelie H.
    PROCEEDINGS OF THE 2015 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS INTERNATIONAL CONFERENCE (PRASA-ROBMECH), 2015, : 154 - 159
  • [27] Tackling Code-Switched NER: Participation of CMU
    Geetha, Parvathy
    Chandu, Khyathi Raghavi
    Black, Alan W.
    COMPUTATIONAL APPROACHES TO LINGUISTIC CODE-SWITCHING, 2018, : 126 - 131
  • [28] Sentiment Analysis for Egyptian Arabic-English Code-Switched Data Using Traditional Neural Models and Advanced Language Models
    Sherif, Ahmed
    Sabty, Caroline
    SPEECH AND COMPUTER, SPECOM 2024, PT II, 2025, 15300 : 54 - 69
  • [29] Language Informed Modeling of Code-Switched Text
    Chandu, Khyathi Raghavi
    Manzini, Thomas
    Singh, Sumeet
    Black, Alan
    COMPUTATIONAL APPROACHES TO LINGUISTIC CODE-SWITCHING, 2018, : 92 - 97
  • [30] GLUECoS : An Evaluation Benchmark for Code-Switched NLP
    Khanuja, Simran
    Dandapat, Sandipan
    Srinivasan, Anirudh
    Sitaram, Sunayana
    Choudhury, Monojit
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3575 - 3585