Exploiting Linguistic Features for Effective Sentence-Level Sentiment Analysis in Urdu Language

Cited by: 7
Authors
Altaf, Amna [1]
Anwar, Muhammad Waqas [1]
Jamal, Muhammad Hasan [1]
Bajwa, Usama Ijaz [1]
Affiliation
[1] COMSATS Univ Islamabad, Dept Comp Sci, Lahore Campus 1-5 Km Def Rd Raiwind Rd, Lahore, Punjab, Pakistan
Keywords
Supervised Machine Learning; Parts of Speech Tagging; Sentiment Analysis; Urdu Language; SELECTION;
DOI
10.1007/s11042-023-15216-0
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The rapid increase in social media use has led to the generation of gigabytes of information shared by billions of users worldwide. To analyze this information and determine how people respond to different events, researchers widely use sentiment analysis. Existing studies in Urdu sentiment analysis mostly use traditional n-gram features, which, unlike linguistic features, do not capture the contextual information being discussed. Moreover, no existing study classifies the sentiment of proverbs and idioms, which is challenging because they mostly contain no sentiment words yet carry strong sentiment. This study exploits linguistic features of the Urdu language for sentence-level sentiment analysis and classifies idioms and proverbs using classical machine learning techniques. We develop a dataset comprising idioms, proverbs, and sentences from the news domain, and extract part-of-speech tag-based features, Boolean features, and numeric features from the dataset after careful linguistic analysis of the Urdu language. Experimental results show that the J48 classifier performs best in sentiment classification, with an accuracy of 90% and an F-measure of 88%.
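As a rough illustration of the three feature families the abstract names (part-of-speech tag-based, Boolean, and numeric), the sketch below maps a POS-tagged sentence to a flat feature dictionary. Everything specific in it is an assumption for illustration: the tag names, the transliterated negation words, the function name `extract_features`, and the particular features chosen. The record does not list the paper's actual tag set or feature definitions.

```python
from collections import Counter

# Hypothetical values for illustration only; the paper's actual tag set,
# negation list, and feature definitions are not given in this record.
NEGATION_WORDS = {"nahin", "na", "mat"}  # Roman-Urdu transliterations (assumed)
TRACKED_TAGS = ("NOUN", "ADJ", "VERB", "ADV")


def extract_features(tagged_sentence):
    """Map a list of (token, pos_tag) pairs to a flat feature dict."""
    tokens = [tok for tok, _ in tagged_sentence]
    tag_counts = Counter(tag for _, tag in tagged_sentence)

    # POS tag-based features: how often each tracked tag occurs.
    features = {f"count_{tag}": tag_counts.get(tag, 0) for tag in TRACKED_TAGS}
    # Boolean feature: does the sentence contain a negation word?
    features["has_negation"] = any(tok in NEGATION_WORDS for tok in tokens)
    # Numeric feature: sentence length in tokens.
    features["num_tokens"] = len(tokens)
    return features


# "yeh film achi nahin hai" (roughly: "this film is not good"), hand-tagged.
example = [
    ("yeh", "PRON"),
    ("film", "NOUN"),
    ("achi", "ADJ"),
    ("nahin", "ADV"),
    ("hai", "VERB"),
]
features = extract_features(example)
```

A dictionary like this could then be passed to a decision-tree learner; J48 is Weka's implementation of the C4.5 algorithm, so any C4.5-style learner would be the closest stand-in outside Weka.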
Pages: 41813-41839
Page count: 27
Related papers (50 in total)
  • [1] Exploiting Linguistic Features for Effective Sentence-Level Sentiment Analysis in Urdu Language
    Amna Altaf
    Muhammad Waqas Anwar
    Muhammad Hasan Jamal
    Usama Ijaz Bajwa
    [J]. Multimedia Tools and Applications, 2023, 82 : 41813 - 41839
  • [2] Sentence-Level Sentiment Analysis in Persian
    Basiri, Mohammad Ehsan
    Kabiri, Arman
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS (IPRIA), 2017, : 84 - 89
  • [3] Sentence-Level Sentiment Polarity Classification Using a Linguistic Approach
    Tan, Luke Kien-Weng
    Na, Jin-Cheon
    Theng, Yin-Leng
    Chang, Kuiyu
    [J]. DIGITAL LIBRARIES: FOR CULTURAL HERITAGE, KNOWLEDGE DISSEMINATION, AND FUTURE CREATION: ICADL 2011, 2011, 7008 : 77 - +
  • [4] Sentence-Level Sentiment Analysis in the Presence of Modalities
    Liu, Yang
    Yu, Xiaohui
    Liu, Bing
    Chen, Zhongshuai
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2014, PART II, 2014, 8404 : 1 - 16
  • [5] Sentence-Level Sentiment Analysis via BERT and BiGRU
    Shen, Jianghong
    Liao, Xiaodong
    Tao, Zhuang
    [J]. 2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2019, 11321
  • [6] Sentence-Level Sentiment Analysis via Sequence Modeling
    Liu, Xiaohua
    Zhou, Ming
    [J]. APPLIED INFORMATICS AND COMMUNICATION, PT III, 2011, 226 : 337 - +
  • [7] Sentence-level Sentiment Analysis via Sequence Modeling
    Liu, Xiaohua
    Zhou, Ming
    [J]. 2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL III, 2010, : 176 - 179
  • [8] A complete framework for aspect-level and sentence-level sentiment analysis
    Chiha, Rim
    Ben Ayed, Mounir
    Pereira, Celia da Costa
    [J]. APPLIED INTELLIGENCE, 2022, 52 (15) : 17845 - 17863
  • [9] A complete framework for aspect-level and sentence-level sentiment analysis
    Rim Chiha
    Mounir Ben Ayed
    Célia da Costa Pereira
    [J]. Applied Intelligence, 2022, 52 : 17845 - 17863
  • [10] Exploiting Sentence-Level Features for Near-Duplicate Document Detection
    Wang, Jenq-Haur
    Chang, Hung-Chi
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2009, 5839 : 205 - +