Urdu Sentiment Analysis

Cited by: 5
Authors
Rehman, Iffraah [1]
Soomro, Tariq Rahim [1]
Affiliations
[1] Inst Business Management IoBM, CCSIS, Karachi, Pakistan
Keywords
Machine learning algorithms; sentiment analysis; Tweepy; WEKA; TEXT
DOI
10.2478/acss-2022-0004
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The world is moving towards increasingly digitalized data, and the number of active social media users grows with each passing day. Each post and comment can offer valuable insight into a topic or issue, a product, or a brand. The process of uncovering the underlying information in the opinion a person holds about an entity is called sentiment analysis. The analysis can be carried out through two main approaches: lexicon-based methods or machine learning algorithms. A significant amount of sentiment analysis work has been done across domains in numerous languages, but minimal research has been conducted on Urdu, the national language of Pakistan. Twitter users who are familiar with Urdu post tweets in two textual formats: Urdu script (Nastaleeq) or Roman Urdu. This paper therefore attempts to perform sentiment analysis on the Urdu language by extracting tweets (both Nastaleeq and Roman Urdu) from Twitter using the Tweepy API. A machine learning-based approach was adopted for the study, and WEKA was the tool chosen for the purpose. The best algorithm was identified based on evaluation metrics comprising the number of correctly and incorrectly classified instances, accuracy, precision, and recall. SMO was found to be the most suitable machine learning algorithm for sentiment analysis of Urdu (Nastaleeq) tweets, while Random Forest was identified as the best algorithm for Roman Urdu.
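The evaluation metrics named in the abstract can be illustrated with a minimal sketch. It is not the authors' WEKA workflow: the function name `evaluate`, the label strings, and the toy data are assumptions made for illustration, and precision and recall are computed for a single class only, which is a simplification.

```python
def evaluate(y_true, y_pred, positive="positive"):
    """Compute the metrics listed in the abstract for one target class.

    y_true and y_pred are equal-length lists of label strings.
    The label names are hypothetical; they do not come from the paper.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    pairs = list(zip(y_true, y_pred))
    correct = sum(t == p for t, p in pairs)

    # Confusion counts for the chosen class.
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)

    return {
        "correct": correct,
        "incorrect": len(pairs) - correct,
        "accuracy": correct / len(pairs),
        # Guard against division by zero when the class is never predicted or never present.
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


# Toy labels, invented for this example only.
y_true = ["positive", "negative", "positive", "negative", "positive"]
y_pred = ["positive", "positive", "positive", "negative", "negative"]
print(evaluate(y_true, y_pred))
```

Comparing these values across classifiers trained on the same data is one way to select a best algorithm, which is the kind of comparison the abstract describes.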
Pages: 30-42
Page count: 13