Text Length Adaptation in Sentiment Classification

被引:0
|
作者
Amplayo, Reinald Kim [1 ,3 ]
Lim, Seonjae [2 ]
Hwang, Seung-won [3 ]
机构
[1] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[2] Samsung Elect, Seoul, South Korea
[3] Yonsei Univ, Seoul, South Korea
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Can a text classifier generalize well for datasets where the text length is different? For example, when short reviews are sentiment-labeled, can these transfer to predict the sentiment of long reviews (i.e., short to long transfer), or vice versa? While unsupervised transfer learning has been well-studied for cross domain/lingual transfer tasks, Cross Length Transfer (CLT) has not yet been explored. One reason is the assumption that length difference is trivially transferable in classification. We show that it is not, because short/long texts differ in context richness and word intensity. We devise new benchmark datasets from diverse domains and languages, and show that existing models from similar tasks cannot deal with the unique challenge of transferring across text lengths. We introduce a strong baseline model called BaggedCNN that treats long texts as bags containing short texts. We propose a state-of-the-art CLT model called Length Transfer Networks (LeTraNets) that introduces a two-way encoding scheme for short and long texts using multiple training mechanisms. We test our models and find that existing models perform worse than the BaggedCNN baseline, while LeTraNets outperforms all models.
引用
收藏
页码:646 / 661
页数:16
相关论文
共 50 条
  • [21] Interactive Dual Attention Network for Text Sentiment Classification
    Zhu, Yinglin
    Zheng, Wenbin
    Tang, Hong
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2020, 2020
  • [22] Short text sentiment classification based on context reconstruction
    Yang, Zhen
    Lai, Ying-Xu
    Duan, Li-Juan
    Li, Yu-Jian
    Zidonghua Xuebao/Acta Automatica Sinica, 2012, 38 (01): : 55 - 67
  • [23] Chinese Text Sentiment Classification based on Granule Network
    Zhang Xia
    Wang Suzhen
    Xu Mingzhu
    Yin Yixin
    2009 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING ( GRC 2009), 2009, : 775 - +
  • [24] Sentiment-based Classification of Radical Text on the Web
    Scrivens, Ryan
    Frank, Richard
    2016 EUROPEAN INTELLIGENCE AND SECURITY INFORMATICS CONFERENCE (EISIC), 2016, : 104 - 107
  • [25] Standard and Dialectal Arabic Text Classification for Sentiment Analysis
    Maghfour, Mohcine
    Elouardighi, Abdeljalil
    MODEL AND DATA ENGINEERING, MEDI 2018, 2018, 11163 : 282 - 291
  • [26] Sentiment and intent classification of in-text citations usinBERT
    Visser, Ruan
    Dunaiski, Marcel
    EPiC Series in Computing, 2022, 85 : 129 - 145
  • [27] A Survey on Text Classification Techniques for Sentiment Polarity Detection
    Arunachalam, N.
    Sneka, Josephine S.
    MadhuMathi, G.
    2017 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2017,
  • [28] Text Mining Facebook Status Updates for Sentiment Classification
    Akaichi, Jalel
    Dhouioui, Zeineb
    Lopez-Huertas Perez, Maria Jose
    2013 17TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2013, : 640 - 645
  • [29] Statistical Text Analysis and Sentiment Classification in Social Media
    Cho, Sang-Hyun
    Kang, Hang-Bong
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 1112 - 1117
  • [30] Sentiment Classification of Short Text Using Sentimental Context
    Zheng, Wenjie
    Xu, Zenan
    Rao, Yanghui
    Xie, Haoran
    Wang, Fu Lee
    Kwan, Reggie
    PROCEEDINGS OF 4TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC ADVANCE IN BEHAVIORAL, ECONOMIC, SOCIOCULTURAL COMPUTING (BESC), 2017,