Evaluating Ensembled Transformers for Multilingual Code-Switched Sentiment Analysis

Cited by: 0
Authors
Aryal, Saurav K. [1]
Prioleau, Howard [1]
Washington, Gloria [1]
Burge, Legand [1]
Affiliations
[1] Howard Univ, Comp Sci, Washington, DC 20059 USA
Funding
National Institutes of Health (NIH), USA
Keywords
Code Switching; Ensembling; BERT; Transformers
DOI
10.1109/CSCI62032.2023.00032
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Sentiment analysis is essential for understanding human-authored texts, especially in multilingual communities where code-switching is common. Most existing research focuses on single-language pair sentiment analysis. We introduce a three-step approach for sentiment analysis on code-switched data: translating the code-switched data into English at word and sentence levels, training on Transformer models, and utilizing a stacking classifier to ensemble the models for sentiment classification. We establish a performance benchmark for binary and ternary sentiment classification by applying this to five datasets featuring English mixed with Spanish, Tamil, Telugu, Hindi, and Malayalam. Our method emphasizes the potential of ensembled Transformer models in this domain, paving the way for future advancements.
Pages: 165-173
Page count: 9