Sentiment Analysis of Code-Mixed Roman Urdu-English Social Media Text using Deep Learning Approaches

被引:12
|
作者
Younas, Aqsa [1 ]
Nasim, Raheela [1 ]
Ali, Saqib [1 ,2 ]
Wang, Guojun [2 ]
Qi, Fang [3 ]
机构
[1] Univ Agr Faisalabad, Dept Comp Sci, Faisalabad 38000, Pakistan
[2] Guangzhou Univ, Sch Comp Sci, Guangzhou 510006, Peoples R China
[3] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment analysis; Code-mixed text; Roman Urdu; Deep learning; XLM-RoBERTa; Multilingual BERT;
D O I
10.1109/CSE50738.2020.00017
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Sentiment analysis is the computational study of attitudes, opinions, and sentiments towards certain issues, products, individuals, and organizations. Companies and customers are making decisions by seeking opinions from social media text. Sentiment analysis is getting intelligent with the advancement of artificial intelligence and natural language processing. With a stunning increase in the use of social media, a huge volume of text available on these platforms is in imperfect and informal languages like Roman Urdu mixed with the English language. Present sentiment analysis techniques do not perform precisely on these code-mixed imperfect, informal, and poorly resourced languages. A promising solution is the use of deep learning models on these code-mixed Roman Urdu and English text. Therefore, the objective of this paper is to perform a sentiment analysis of code-mixed Roman Urdu and English social media text using state-of-the-art deep learning models. Our work is independent of lexical normalization, language dictionary, and code transfer indication. We perform sentiment analysis using Multilingual BERT (mBERT) and XLM-RoBERTa (XLM-R) models. The results reveal that performance of XLM-R model with tuned hyperparameters for code-mixed Roman Urdu and English social media text is better than the mBERT model with F1 score of 71%.
引用
收藏
页码:66 / 71
页数:6
相关论文
共 50 条
  • [21] Named Entity Recognition for Hindi-English Code-Mixed Social Media Text
    Singh, Vinay
    Shrivastava, Manish
    Akhtar, Syed Sarfaraz
    Vijay, Deepanshu
    [J]. NAMED ENTITIES, 2018, : 27 - 35
  • [22] Sentiment Analysis of Persian-English Code-mixed Texts
    Sabri, Nazanin
    Edalat, Ali
    Bahrak, Behnam
    [J]. 2021 26TH INTERNATIONAL COMPUTER CONFERENCE, COMPUTER SOCIETY OF IRAN (CSICC), 2021,
  • [23] Experimenting Language Identification for Sentiment Analysis of English Punjabi Code Mixed Social Media Text
    Bansal, Neetika
    Goyal, Vishal
    Rani, Simpel
    [J]. INTERNATIONAL JOURNAL OF E-ADOPTION, 2020, 12 (01) : 52 - 62
  • [24] UTSA: Urdu Text Sentiment Analysis Using Deep Learning Methods
    Naqvi, Uzma
    Majid, Abdul
    Abbas, Syed Ali
    [J]. IEEE ACCESS, 2021, 9 : 114085 - 114094
  • [25] UTSA: Urdu Text Sentiment Analysis Using Deep Learning Methods
    Naqvi, Uzma
    Majid, Abdul
    Abbas, Syed Ali
    [J]. IEEE Access, 2021, 9 : 114085 - 114094
  • [26] Transformer based multilingual joint learning framework for code-mixed and english sentiment analysis
    Mamta
    Ekbal, Asif
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, 62 (01) : 231 - 253
  • [27] An Effective Bi-LSTM Word Embedding System for Analysis and Identification of Language in Code-Mixed Social Media Text in English and Roman Hindi
    Shekhar, Shashi
    Sharma, Dilip Kumar
    Beg, M. M. Sufyan
    [J]. COMPUTACION Y SISTEMAS, 2020, 24 (04): : 1415 - 1427
  • [28] Transformer based multilingual joint learning framework for code-mixed and english sentiment analysis
    Asif Mamta
    [J]. Journal of Intelligent Information Systems, 2024, 62 (1) : 231 - 253
  • [29] Domain-specific Sentiment Analysis Approaches for Code-mixed Social Network Data
    Pravalika, A.
    Oza, Vishvesh
    Meghana, N. P.
    Kamath, Sowmya S.
    [J]. 2017 8TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2017,
  • [30] Automatic Language Identification system for code-mixed English-Kannada Social Media Text
    Lakshmi, Sowmya B. S.
    Shambhavi, B. R.
    [J]. 2017 2ND INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS AND INFORMATION TECHNOLOGY FOR SUSTAINABLE SOLUTION (CSITSS-2017), 2017, : 214 - 218