Creating sentiment lexicon for sentiment analysis in Urdu: The case of a resource-poor language

被引:35
|
作者
Asghar, Muhammad Zubair [1 ]
Sattar, Anum [1 ]
Khan, Aurangzeb [2 ]
Ali, Amjad [3 ]
Kundi, Fazal Masud [1 ]
Ahmad, Shakeel [4 ]
机构
[1] Gomal Univ, ICIT, Dera Ismail Khan, KP, Pakistan
[2] Univ Sci & Technol, Dept Comp Sci, Bannu, Pakistan
[3] Univ Swat, Dept Comp & Software Technol, Saidu Sharif, Pakistan
[4] King Abdul Aziz Univ KAU, FCITR, Jeddah, Saudi Arabia
关键词
polarity lexicon; sentiment analysis; Urdu sentiment lexicon; Urdu SentiWordNet; FRAMEWORK;
D O I
10.1111/exsy.12397
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The sentiment analysis (SA) applications are becoming popular among the individuals and organizations for gathering and analysing user's sentiments about products, services, policies, and current affairs. Due to the availability of a wide range of English lexical resources, such as part-of-speech taggers, parsers, and polarity lexicons, development of sophisticated SA applications for the English language has attracted many researchers. Although there have been efforts for creating polarity lexicons in non-English languages such as Urdu, they suffer from many deficiencies, such as lack of publically available sentiment lexicons with a proper scoring mechanism of opinion words and modifiers. In this work, we present a word-level translation scheme for creating a first comprehensive Urdu polarity resource: "Urdu Lexicon" using a merger of existing resources: list of English opinion words, SentiWordNet, English-Urdu bilingual dictionary, and a collection of Urdu modifiers. We assign two polarity scores, positive and negative, to each Urdu opinion word. Moreover, modifiers are collected, classified, and tagged with proper polarity scores. We also perform an extrinsic evaluation in terms of subjectivity detection and sentiment classification, and the evaluation results show that the polarity scores assigned by this technique are more accurate than the baseline methods.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Words Are Important: Improving Sentiment Analysis in the Persian Language by Lexicon Refining
    Basiri, Mohammad Ehsan
    Kabiri, Arman
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2018, 17 (04)
  • [42] RUSAS: Roman Urdu Sentiment Analysis System
    Jawad, Kazim
    Ahmad, Muhammad
    Alvi, Majdah
    Alvi, Muhammad Bux
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (01): : 1463 - 1480
  • [43] Lexicon-Based Sentiment Analysis of Facebook Comments in Vietnamese Language
    Son Trinh
    Luu Nguyen
    Minh Vo
    Phuc Do
    RECENT DEVELOPMENTS IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2016, 642 : 263 - 276
  • [44] Urdu Sentiment Analysis With Deep Learning Methods
    Khan, Lal
    Amjad, Ammar
    Ashraf, Noman
    Chang, Hsien-Tsung
    Gelbukh, Alexander
    IEEE ACCESS, 2021, 9 : 97803 - 97812
  • [45] Is there a language of sentiment? An analysis of lexical resources for sentiment analysis
    Ann Devitt
    Khurshid Ahmad
    Language Resources and Evaluation, 2013, 47 : 475 - 511
  • [46] Automatic Indonesian Sentiment Lexicon Curation with Sentiment Valence Tuning for Social Media Sentiment Analysis
    Wijayanti, Rini
    Arisal, Andria
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (01)
  • [47] Is there a language of sentiment? An analysis of lexical resources for sentiment analysis
    Devitt, Ann
    Ahmad, Khurshid
    LANGUAGE RESOURCES AND EVALUATION, 2013, 47 (02) : 475 - 511
  • [48] Sentiment analysis Approach to adapt a shallow parsing based sentiment lexicon
    Desai, Jayraj M.
    Andhariya, Swapnil R.
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [49] Generate domain-specific sentiment lexicon for review sentiment analysis
    Hongyu Han
    Jianpei Zhang
    Jing Yang
    Yiran Shen
    Yongshi Zhang
    Multimedia Tools and Applications, 2018, 77 : 21265 - 21280
  • [50] Automatically Constructing a Fine-Grained Sentiment Lexicon for Sentiment Analysis
    Wang, Yabing
    Huang, Guimin
    Li, Maolin
    Li, Yiqun
    Zhang, Xiaowei
    Li, Hui
    COGNITIVE COMPUTATION, 2023, 15 (01) : 254 - 271