Impact of Emojis in Emotion Analysis on Code-Mixed Text

被引:0
|
作者
Tang, Tianai [1 ]
Nongpong, Kwankamol [1 ]
机构
[1] Assumption Univ, Intelligent Syst Res Lab, Bangkok, Thailand
关键词
Code-mixed text; Emotion analysis; Emoji; BERT;
D O I
10.1145/3639233.3639342
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the maturity of sentiment analysis technology, researchers are gradually not satisfied with only excavating positive and negative attitudes from the text but hope to obtain more delicate emotions. In addition, the development of social media makes people from different backgrounds like to express their emotions and feelings on social platforms. However, textual communication from social media is too colloquial, leading to the code-mixed text phenomenon, where sentences contain different languages, which poses difficulties for text analysis research. We observed that emojis in text contain emotion content and are universal across various language texts. This paper proposes a deep learning method for multi-class code-mixed emotion analysis using emojis. The proposed method has achieved good results on media text datasets and attempts to solve the difficulty of mixed sentiment classification with different codes. In this work, experiments were carried out on Hindi-English and Chinese-English code-mixed datasets, and the results have shown that integrating emoji representation with multilingual language model yielded 4% improvement on emotion analysis. Emojis should be preserved in emotion analysis regardless of which language the text is in.
引用
收藏
页码:25 / 30
页数:6
相关论文
共 50 条
  • [1] Emotion Detection in Code-Mixed Roman Urdu - English Text
    Ilyas, Abdullah
    Shahzad, Khurram
    Malik, Muhammad Kamran
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (02)
  • [2] Sentiment Analysis of Code-Mixed Text: A Comprehensive Review
    Perera, Anne
    Caldera, Amitha
    [J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2024, 30 (02) : 242 - 261
  • [3] Multi-Label Emotion Classification on Code-Mixed Text: Data and Methods
    Ameer, Iqra
    Sidorov, Grigori
    Gomez-Adorno, Helena
    Nawab, Rao Muhammad Adeel
    [J]. IEEE ACCESS, 2022, 10 : 8779 - 8789
  • [4] Speech Synthesis of Code-Mixed Text
    Sitaram, Sunayana
    Black, Alan W.
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 3422 - 3428
  • [5] Text Normalization in Code-Mixed Social Media Text
    Dutta, Sukanya
    Saha, Tista
    Banerjee, Somnath
    Naskar, Sudip Kumar
    [J]. 2015 IEEE 2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION SYSTEMS (RETIS), 2015, : 378 - 382
  • [6] Normalisation of Indonesian-English Code-Mixed Text and its Effect on Emotion Classification
    Yulianti, Evi
    Kurnia, Ajmal
    Adriani, Mirna
    Duto, Yoppy Setyo
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (11) : 674 - 685
  • [7] Effective Distributed Representation of Code-Mixed Text
    Malte, Aditya
    Sonawane, Sheetal
    [J]. 2019 IEEE 16TH INDIA COUNCIL INTERNATIONAL CONFERENCE (IEEE INDICON 2019), 2019,
  • [8] Analysis of Part of Speech Tags in Language Identification of Code-Mixed Text
    Ansari, Mohd Zeeshan
    Khan, Shazia
    Amani, Tamsil
    Hamid, Aman
    Rizvi, Syed
    [J]. ADVANCES IN COMPUTING AND INTELLIGENT SYSTEMS, ICACM 2019, 2020, : 417 - 425
  • [9] Overview of the track on Sentiment Analysis for Dravidian Languages in Code-Mixed Text
    Chakravarthi, Bharathi Raja
    Priyadharshini, Ruba
    Muralidaran, Vigneshwaran
    Suryawanshi, Shardul
    Jose, Navya
    Sherly, Elizabeth
    [J]. PROCEEDINGS OF THE 12TH ANNUAL MEETING OF THE FORUM FOR INFORMATION RETRIEVAL EVALUATION (FIRE 2020), 2020, : 21 - 24
  • [10] Classification of Code-Mixed Bilingual Phonetic Text Using Sentiment Analysis
    Singh, Shailendra Kumar
    Sachan, Manoj Kumar
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2021, 17 (02) : 59 - 78