Creating sentiment lexicon for sentiment analysis in Urdu: The case of a resource-poor language

Cited by: 35
Authors
Asghar, Muhammad Zubair [1]
Sattar, Anum [1]
Khan, Aurangzeb [2]
Ali, Amjad [3]
Kundi, Fazal Masud [1]
Ahmad, Shakeel [4]
Affiliations
[1] Gomal Univ, ICIT, Dera Ismail Khan, KP, Pakistan
[2] Univ Sci & Technol, Dept Comp Sci, Bannu, Pakistan
[3] Univ Swat, Dept Comp & Software Technol, Saidu Sharif, Pakistan
[4] King Abdul Aziz Univ KAU, FCITR, Jeddah, Saudi Arabia
Keywords
polarity lexicon; sentiment analysis; Urdu sentiment lexicon; Urdu SentiWordNet; FRAMEWORK;
DOI
10.1111/exsy.12397
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Sentiment analysis (SA) applications are becoming popular among individuals and organizations for gathering and analysing users' sentiments about products, services, policies, and current affairs. Due to the availability of a wide range of English lexical resources, such as part-of-speech taggers, parsers, and polarity lexicons, the development of sophisticated SA applications for the English language has attracted many researchers. Although there have been efforts to create polarity lexicons in non-English languages such as Urdu, they suffer from many deficiencies, such as the lack of publicly available sentiment lexicons with a proper scoring mechanism for opinion words and modifiers. In this work, we present a word-level translation scheme for creating a first comprehensive Urdu polarity resource, "Urdu Lexicon", using a merger of existing resources: a list of English opinion words, SentiWordNet, an English-Urdu bilingual dictionary, and a collection of Urdu modifiers. We assign two polarity scores, positive and negative, to each Urdu opinion word. Moreover, modifiers are collected, classified, and tagged with proper polarity scores. We also perform an extrinsic evaluation in terms of subjectivity detection and sentiment classification, and the evaluation results show that the polarity scores assigned by this technique are more accurate than those of the baseline methods.
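The word-level translation idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the word list, the scores, the dictionary entries, the function name `build_urdu_lexicon`, and the choice to average scores when several English words map to the same Urdu word are all assumptions made for illustration. Modifier handling is left out.

```python
from collections import defaultdict

# Hypothetical toy inputs; real resources would be loaded from files.
english_opinion_words = ["good", "fine", "bad", "happy"]

# SentiWordNet-style (positive, negative) scores per English word.
# The values are invented for this example.
english_scores = {
    "good": (0.75, 0.0),
    "fine": (0.5, 0.0),
    "bad": (0.0, 0.625),
    "happy": (0.875, 0.0),
}

# Toy English-to-Urdu bilingual dictionary (one word may have several translations).
en_ur = {
    "good": ["اچھا"],
    "fine": ["اچھا"],
    "bad": ["برا", "خراب"],
    "happy": ["خوش"],
}


def build_urdu_lexicon(words, scores, dictionary):
    """Project English polarity scores onto Urdu translations.

    Assumption: when several English words share an Urdu translation,
    their positive and negative scores are averaged.
    """
    collected = defaultdict(list)
    for word in words:
        if word not in scores:
            continue  # no polarity information for this English word
        for urdu_word in dictionary.get(word, []):
            collected[urdu_word].append(scores[word])

    lexicon = {}
    for urdu_word, pairs in collected.items():
        positive = sum(p for p, _ in pairs) / len(pairs)
        negative = sum(n for _, n in pairs) / len(pairs)
        lexicon[urdu_word] = (positive, negative)
    return lexicon


urdu_lexicon = build_urdu_lexicon(english_opinion_words, english_scores, en_ur)
for urdu_word, (positive, negative) in urdu_lexicon.items():
    print(urdu_word, positive, negative)
```

Each Urdu entry ends up with two scores, positive and negative, matching the two-score scheme the abstract describes. Averaging is only one possible merging rule.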
Pages: 19
Related papers
50 in total
  • [21] Sentiment Analysis Using XLM-R Transformer and Zero-shot Transfer Learning on Resource-poor Indian Language
    Kumar, Akshi
    Albuquerque, Victor Hugo C.
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (05)
  • [22] Sentiment lexicon for sentiment analysis of Saudi dialect tweets
    Al-Thubaity, Abdulmohsen
    Alqahtani, Qubayl
    Aljandal, Abdulaziz
    ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 : 301 - 307
  • [23] Financial Sentiment Lexicon Analysis
    Sohangir, Sahar
    Petty, Nicholas
    Wang, Dingding
    2018 IEEE 12TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2018, : 286 - 289
  • [24] Twitter Sentiment Analysis in Resource Limited Language
    Gupta, Riya
    Agarwal, Sandli
    Garg, Shreya
    Kaushal, Rishabh
    BIG DATA ANALYTICS IN ASTRONOMY, SCIENCE, AND ENGINEERING, BDA 2023, 2024, 14516 : 45 - 58
  • [25] A Roman Urdu Corpus for sentiment analysis
    Khan, Marwa
    Naseer, Asma
    Wali, Aamir
    Tamoor, Maria
    COMPUTER JOURNAL, 2024, 67 (09): : 2864 - 2876
  • [26] Sentiment Analysis of Urdu Language: Handling Phrase-Level Negation
    Syed, Afraz Zahra
    Aslam, Muhammad
    Maria Martinez-Enriquez, Ana
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PT I, 2011, 7094 : 382 - +
  • [27] Sentiment Analysis System for Roman Urdu
    Mehmood, Khawar
    Essam, Daryl
    Shafi, Kamran
    INTELLIGENT COMPUTING, VOL 1, 2019, 858 : 29 - 42
  • [28] Zero-shot Sentiment Analysis in Low-Resource Languages Using a Multilingual Sentiment Lexicon
    Koto, Fajri
    Beck, Tilman
    Talat, Zeerak
    Gurevych, Iryna
    Baldwin, Timothy
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 298 - 320
  • [29] SentiFul: A Lexicon for Sentiment Analysis
    Neviarouskaya, Alena
    Prendinger, Helmut
    Ishizuka, Mitsuru
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2011, 2 (01) : 22 - 36
  • [30] Extending Persian sentiment lexicon with idiomatic expressions for sentiment analysis
    Dashtipour, Kia
    Gogate, Mandar
    Gelbukh, Alexander
    Hussain, Amir
    SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)