Topic-Enriched Word Embeddings for Sarcasm Identification

Cited by: 121
Authors
Onan, Aytug [1]
Affiliations
[1] Izmir Katip Celebi Univ, Fac Engn & Architecture, Dept Comp Engn, TR-35620 Izmir, Turkey
Keywords
Sarcasm detection; Word-embedding based features; Deep learning
DOI
10.1007/978-3-030-19807-7_29
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Sarcasm is a type of nonliteral language in which people may express negative sentiment using words with a positive literal meaning and, conversely, may use words with a negative meaning to indicate positive sentiment. User-generated text messages on social platforms may contain sarcasm. A sarcastic utterance may change the sentiment orientation of a text document from positive to negative, or vice versa. Hence, the predictive performance of sentiment classification schemes may be degraded if sarcasm is not properly handled. In this paper, we present a deep learning-based approach to sarcasm identification. The predictive performance of a topic-enriched word embedding scheme is compared to that of conventional word-embedding schemes (such as word2vec, fastText and GloVe). In addition to word-embedding based feature sets, conventional lexical, pragmatic, implicit incongruity and explicit incongruity based feature sets are considered. In the experimental analysis, six subsets of Twitter messages are taken into account, ranging in size from 5,000 to 30,000 messages. The experimental analysis indicates that topic-enriched word embedding schemes used in conjunction with conventional feature sets can yield promising results for sarcasm identification.
Pages: 293-304
Page count: 12
Related papers
50 in total
  • [21] Incorporating word embeddings into topic modeling of short text
    Wang Gao
    Min Peng
    Hua Wang
    Yanchun Zhang
    Qianqian Xie
    Gang Tian
    Knowledge and Information Systems, 2019, 61 : 1123 - 1145
  • [22] Topic Discovery for Short Texts Using Word Embeddings
    Xun, Guangxu
    Gopalakrishnan, Vishrawas
    Ma, Fenglong
    Li, Yaliang
    Gao, Jing
    Zhang, Aidong
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1299 - 1304
  • [23] Topic Modeling for Short Texts with Auxiliary Word Embeddings
    Li, Chenliang
    Wang, Haoran
    Zhang, Zhiqian
    Sun, Aixin
    Ma, Zongyang
    SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 165 - 174
  • [24] Deep CNN-LSTM with Word Embeddings for News Headline Sarcasm Detection
    Mandal, Paul K.
    Mahto, Rakeshkumar
    16TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY-NEW GENERATIONS (ITNG 2019), 2019, 800 : 495 - 498
  • [25] A Latent Concept Topic Model for Robust Topic Inference Using Word Embeddings
    Hu, Weihua
    Tsujii, Jun'ichi
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 380 - 386
  • [27] A clustering-based topic model using word networks and word embeddings
    Mu, Wenchuan
    Lim, Kwan Hui
    Liu, Junhua
    Karunasekera, Shanika
    Falzon, Lucia
    Harwood, Aaron
    JOURNAL OF BIG DATA, 2022, 9 (01)
  • [28] Intent Detection Using Semantically Enriched Word Embeddings
    Kim, Joo-Kyung
    Tur, Gokhan
    Celikyilmaz, Asli
    Cao, Bin
    Wang, Ye-Yi
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 414 - 419
  • [29] Semantics-assisted Wasserstein Learning for Topic and Word Embeddings
    Li, Changchun
    Li, Ximing
    Ouyang, Jihong
    Wang, Yiming
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 292 - 301
  • [30] Enhancing Topic Modeling for Short Texts with Auxiliary Word Embeddings
    Li, Chenliang
    Duan, Yu
    Wang, Haoran
    Zhang, Zhiqian
    Sun, Aixin
    Ma, Zongyang
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2017, 36 (02)