Extending persian sentiment lexicon with idiomatic expressions for sentiment analysis

被引:0
|
作者
Kia Dashtipour
Mandar Gogate
Alexander Gelbukh
Amir Hussain
机构
[1] Edinburgh Napier University,School of Computing, Merchiston Campus
[2] Instituto Politécnico Nacional (IPN),Centro de Investigación en Computación (CIC)
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Nowadays, it is important for buyers to know other customer opinions to make informed decisions on buying a product or service. In addition, companies and organizations can exploit customer opinions to improve their products and services. However, the Quintilian bytes of the opinions generated every day cannot be manually read and summarized. Sentiment analysis and opinion mining techniques offer a solution to automatically classify and summarize user opinions. However, current sentiment analysis research is mostly focused on English, with much fewer resources available for other languages like Persian. In our previous work, we developed PerSent, a publicly available sentiment lexicon to facilitate lexicon-based sentiment analysis of texts in the Persian language. However, PerSent-based sentiment analysis approach fails to classify the real-world sentences consisting of idiomatic expressions. Therefore, in this paper, we describe an extension of the PerSent lexicon with more than 1000 idiomatic expressions, along with their polarity, and propose an algorithm to accurately classify Persian text. Comparative experimental results reveal the usefulness of the extended lexicon for sentiment analysis as compared to PerSent lexicon-based sentiment analysis as well as Persian-to-English translation-based approaches. The extended version of the lexicon will be made publicly available.
引用
收藏
相关论文
共 50 条
  • [41] Thai Sentiment Lexicon Construction
    Intasorn, Jeerawat
    Gertphol, Sethavidh
    Sammapun, Usa
    [J]. 2021 13TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST-2021), 2021, : 123 - 128
  • [42] Data Mining through Sentiment Analysis: Lexicon based Sentiment Analysis Model using Aspect Catalogue
    Mehto, Aman
    Indras, Karnika
    [J]. 2016 SYMPOSIUM ON COLOSSAL DATA ANALYSIS AND NETWORKING (CDAN), 2016,
  • [43] The Impact of Sentiment Features on the Sentiment Polarity Classification in Persian Reviews
    Ehsan Asgarian
    Mohsen Kahani
    Shahla Sharifi
    [J]. Cognitive Computation, 2018, 10 : 117 - 135
  • [44] Metaphorical Expressions in Automatic Arabic Sentiment Analysis
    Alsiyat, Israa
    Piao, Scott
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4911 - 4916
  • [45] A Lexicon-based Feature for Twitter Sentiment Analysis
    Limboi, Sergiu
    Diosan, Laura
    [J]. 2022 IEEE 18TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING, ICCP, 2022, : 95 - 102
  • [46] Annotated news corpora and a lexicon for sentiment analysis in Slovene
    Jože Bučar
    Martin Žnidaršič
    Janez Povh
    [J]. Language Resources and Evaluation, 2018, 52 : 895 - 919
  • [47] The Impact of Sentiment Features on the Sentiment Polarity Classification in Persian Reviews
    Asgarian, Ehsan
    Kahani, Mohsen
    Sharifi, Shahla
    [J]. COGNITIVE COMPUTATION, 2018, 10 (01) : 117 - 135
  • [48] Expanding Sentiment Lexicon with Multi-word Terms for Domain-Specific Sentiment Analysis
    Tan, Sang-Sang
    Na, Jin-Cheon
    [J]. DIGITAL LIBRARIES: KNOWLEDGE, INFORMATION, AND DATA IN AN OPEN ACCESS SOCIETY, 2016, 10075 : 285 - 296
  • [49] Automated measures of sentiment via transformer- and lexicon-based sentiment analysis (TLSA)
    Zhao, Xinyan
    Wong, Chau-Wai
    [J]. JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2024, 7 (01): : 145 - 170
  • [50] Sentiment Analysis of News Articles: A Lexicon based Approach
    Taj, Soonh
    Shaikh, Baby Bakhtawer
    Meghji, Areej Fatemah
    [J]. 2019 2ND INTERNATIONAL CONFERENCE ON COMPUTING, MATHEMATICS AND ENGINEERING TECHNOLOGIES (ICOMET), 2019,