A Review of Urdu Sentiment Analysis with Multilingual Perspective: A Case of Urdu and Roman Urdu Language

Cited by: 14
Authors
Khan, Ihsan Ullah [1]
Khan, Aurangzeb [1, 2]
Khan, Wahab [1]
Su'ud, Mazliham Mohd [2]
Alam, Muhammad Mansoor [3]
Subhan, Fazli [2, 4]
Asghar, Muhammad Zubair [5]
Affiliations
[1] Univ Sci & Technol, Dept Comp Sci, Bannu 28100, Pakistan
[2] Multimedia Univ, Fac Comp & Informat, Kuala Lumpur 50050, Malaysia
[3] Riphah Int Univ, Rawalpindi 74400, Pakistan
[4] Natl Univ Modern Languages NUML, Fac Engn & Comp Sci, Islamabad 44000, Pakistan
[5] Gomal Univ, Inst Comp & Informat Technol, Dera Ismail Khan 29050, Pakistan
Keywords
preprocessing; feature extraction; classification;
DOI
10.3390/computers11010003
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Research on sentiment analysis has grown rapidly in the last few years because of its applicability in areas such as online product purchasing, marketing, and reputation management. Social media and online shopping sites have become a rich source of user-generated data, and manufacturing, sales, and marketing organizations increasingly turn to this source for worldwide feedback on their activities and products. Millions of sentences in Urdu and Roman Urdu are posted daily on social sites such as Facebook, Instagram, Snapchat, and Twitter. Disregarding opinions expressed in Urdu and Roman Urdu and considering only the resource-rich English language means losing this vast amount of data. Our research focused on collecting research papers on Urdu and Roman Urdu and analyzing them in terms of preprocessing, feature extraction, and classification techniques. This paper presents a comprehensive study of research conducted on Roman Urdu and Urdu text for product reviews. The study is divided into categories: collection of relevant corpora, data preprocessing, feature extraction, classification platforms and approaches, limitations, and future work. The comparison was based on research factors such as corpus, lexicon, and opinions. Each reviewed paper was evaluated against a set of benchmarks and categorized accordingly. Based on the results obtained and the comparisons made, we suggest some helpful steps for future study.
Pages: 29
Related papers
50 in total
  • [1] Sentiment Analysis for Roman Urdu
    Rafique, Ayesha
    Malik, Muhammad Kamran
    Nawaz, Zubair
    Bukhari, Faisal
    Jalbani, Akhtar Hussain
    [J]. MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2019, 38 (02) : 463 - 470
  • [2] Roman-Urdu-Parl: Roman-Urdu and Urdu Parallel Corpus for Urdu Language Understanding
    Alam, Mehreen
    Ul Hussain, Sibt
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (01)
  • [3] Sentiment Analysis of Reviews in Natural Language: Roman Urdu as a Case Study
    Qureshi, Muhammad Aasim
    Asif, Muhammad
    Hassan, Mohd Fadzil
    Abid, Adnan
    Kamal, Asad
    Safdar, Sohail
    Akber, Rehan
    [J]. IEEE ACCESS, 2022, 10 : 24945 - 24954
  • [4] A Roman Urdu Corpus for sentiment analysis
    Khan, Marwa
    Naseer, Asma
    Wali, Aamir
    Tamoor, Maria
    [J]. COMPUTER JOURNAL, 2024,
  • [5] Sentiment Analysis System for Roman Urdu
    Mehmood, Khawar
    Essam, Daryl
    Shafi, Kamran
    [J]. INTELLIGENT COMPUTING, VOL 1, 2019, 858 : 29 - 42
  • [6] Automatic Detection of Offensive Language for Urdu and Roman Urdu
    Akhter, Muhammad Pervez
    Zheng Jiangbin
    Naqvi, Irfan Raza
    Abdelmajeed, Mohammed
    Sadiq, Muhammad Tariq
    [J]. IEEE ACCESS, 2020, 8 : 91213 - 91226
  • [7] Sentiment Analysis for a Resource Poor Language-Roman Urdu
    Mehmood, Khawar
    Essam, Daryl
    Shafi, Kamran
    Malik, Muhammad Kamran
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (01)
  • [8] RUSAS: Roman Urdu Sentiment Analysis System
    Jawad, Kazim
    Ahmad, Muhammad
    Alvi, Majdah
    Alvi, Muhammad Bux
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (01) : 1463 - 1480
  • [9] Urdu Sentiment Analysis
    Rehman, Iffraah
    Soomro, Tariq Rahim
    [J]. APPLIED COMPUTER SYSTEMS, 2022, 27 (01) : 30 - 42
  • [10] Urdu Sentiment Analysis
    Khan, Khairullah
    Rahman, Atta Ur
    Khan, Aurangzeb
    Khan, Ashraf Ullah
    Saqia, Bibi
    Khan, Wahab
    Khan, Asfandyar
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (09) : 646 - 651