WELMSD - word embedding and language model based sarcasm detection

被引:6
|
作者
Kumar, Pradeep [1 ]
Sarin, Gaurav [2 ]
机构
[1] Indian Inst Management Lucknow, IT & Syst, Lucknow, Uttar Pradesh, India
[2] Delhi Sch Business, VIPS TC, Delhi, India
基金
中国国家自然科学基金;
关键词
Sarcasm identification; Sentiment analysis; Language models; Word embeddings; SENTIMENT ANALYSIS; IRONY DETECTION; CLASSIFICATION;
D O I
10.1108/OIR-03-2021-0184
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose Sarcasm is a sentiment in which human beings convey messages with the opposite meanings to hurt someone emotionally or condemn something in a witty manner. The difference between the text's literal and its intended meaning makes it tough to identify. Mostly, researchers and practitioners only consider explicit information for text classification; however, considering implicit with explicit information will enhance the classifier's accuracy. Several sarcasm detection studies focus on syntactic, lexical or pragmatic features that are uttered using words, emoticons and exclamation marks. Discrete models, which are utilized by many existing works, require manual features that are costly to uncover. Design/methodology/approach In this research, word embeddings used for feature extraction are combined with context-aware language models to provide automatic feature engineering capabilities as well superior classification performance as compared to baseline models. Performance of the proposed models has been shown on three benchmark datasets over different evaluation metrics namely misclassification rate, receiver operating characteristic (ROC) curve and area under curve (AUC). Findings Experimental results suggest that FastText word embedding technique with BERT language model gives higher accuracy and helps to identify the sarcastic textual element correctly. Originality/value Sarcasm detection is a sub-task of sentiment analysis. To help in appropriate data-driven decision-making, the sentiment of the text that gets reversed due to sarcasm needs to be detected properly. In online social environments, it is critical for businesses and individuals to detect the correct sentiment polarity. This will aid in the right selling and buying of products and/or services, leading to higher sales and better market share for businesses, and meeting the quality requirements of customers.
引用
收藏
页码:1242 / 1256
页数:15
相关论文
共 50 条
  • [1] Transformer-Based Word Embedding With CNN Model to Detect Sarcasm and Irony
    Ravinder Ahuja
    S. C. Sharma
    [J]. Arabian Journal for Science and Engineering, 2022, 47 : 9379 - 9392
  • [2] Transformer-Based Word Embedding With CNN Model to Detect Sarcasm and Irony
    Ahuja, Ravinder
    Sharma, S. C.
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 9379 - 9392
  • [3] Multi-Modal Sarcasm Detection with Sentiment Word Embedding
    Fu, Hao
    Liu, Hao
    Wang, Hongling
    Xu, Linyan
    Lin, Jiali
    Jiang, Dazhi
    [J]. ELECTRONICS, 2024, 13 (05)
  • [4] ELMO Embedding for Sarcasm Detection
    Deka, Joon Jyoti
    Sarkar, Achyuth
    [J]. ADVANCES IN INFORMATION COMMUNICATION TECHNOLOGY AND COMPUTING, AICTC 2021, 2022, 392 : 357 - 366
  • [5] A Word Embedding based Generalized Language Model for Information Retrieval
    Ganguly, Debasis
    Roy, Dwaipayan
    Mitra, Mandar
    Jones, Gareth J. F.
    [J]. SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 795 - 798
  • [6] On the use of word embedding for cross language plagiarism detection
    Asghari, Habibollah
    Fatemi, Omid
    Mohtaj, Salar
    Faili, Heshaam
    Rosso, Paolo
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 (03) : 661 - 680
  • [7] Clickbait Detection Based on Word Embedding Models
    Vorakitphan, Vorakit
    Leu, Fang-Yie
    Fan, Yao-Chung
    [J]. INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING, IMIS-2018, 2019, 773 : 557 - 564
  • [8] Sarcasm detection in natural language processing
    Ashwitha, A.
    Shruthi, G.
    Shruthi, H. R.
    Upadhyaya, Makarand
    Ray, Abhra Pratip
    Manjunath, T. C.
    [J]. MATERIALS TODAY-PROCEEDINGS, 2021, 37 : 3324 - 3331
  • [9] Deep learning based sarcasm detection and classification model
    Bhukya, Raghuram
    Vodithala, Swathy
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (01) : 2053 - 2066
  • [10] Word Embedding Based Event Detection on Social Media
    Ertugrul, Ali Mert
    Velioglu, Burak
    Karagoz, Pinar
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2017, 2017, 10334 : 3 - 14